Re: A multiple regression stumper



rick.deshon@xxxxxxxxx wrote:
> Hi All.
>
> I can't figure out the solution to what should be a fairly
> straightforward regression problem.
>
> Assume you have a set of variables (X) that you use to predict a single
> variable (Y) in a standard multiple regression model. X is nxp and Y
> is nx1.
>
> In this model, Y = Xb + e, where e is a nx1 vector of residuals.
>
> The OLS estimate of b is inv(X`X)*X'Y. Consider b to be a (px1) vector
> of optimal weights that minimize the variance of e.
>
> One way to examine the quality of the fitted regression is to compute
> R^2 (the coefficient of variation or determination). R2 = (b'*
> cov_XY)/var(Y) where cov_XY is a px1 vector of covariances between the
> columns of X with Y (cov_XY = (X'Y)/(n-1)) and var(Y) is the variance
> of the vector Y. Conceptually, R2 is the ratio of predictable variance
> in Y to total variance in Y.
>
> I would like to compute R2 for non-optimal sets of weights. What
> happens to R2 as you use less and less optimal weights?
>
> This would be simple under normal circumstances but I'd like to do it
> for a special case where you don't know Y. In other words, you have X,
> b, and you know the R2 for the optimal model. Further, using knowledge
> of X, b, and the optimal R2 you can compute the variance of Y so you
> know that quantity also.
>
> Is it possible to estimate R2 for non-optimal weights if you know b, X,
> and var(y)? The only missing quantity is cov_XY but b clearly has
> information on these missing covariances of X and Y. I have not been
> able to determine a unique solution to this apparently simple problem.
> Perhaps an orthogonal projection?
>
> Thanks for any insights you can provide!
>
> Rick

Given X, b, R^2.

y = X.b + e for some n-vector e that is orthogonal to the columns of X,

where "." denotes matrix multiplication. Then

var[y] = var[X.b] / R^2 = b'.cov[X,X].b / R^2

and cov[X,y] = cov[X,X].b .

For any arbitrary weight vector a ,

corr[X.a,y] = cov[X.a,y] / sqrt[ var[X.a] * var[y] ] ,

where cov[X.a,y] = a'.cov[X,y] = a'.cov[X,X].b

and var[X.a] = a'.cov[X,X].a .

.



Relevant Pages

  • A multiple regression stumper
    ... straightforward regression problem. ... variable in a standard multiple regression model. ... of optimal weights that minimize the variance of e. ...
    (sci.stat.math)
  • Re: A multiple regression stumper
    ... > variable in a standard multiple regression model. ... > of optimal weights that minimize the variance of e. ... > I would like to compute R2 for non-optimal sets of weights. ...
    (sci.stat.math)
  • Re: Linear regression w/ confidence intervals
    ... outputs confidence intervals for the best-fit parameters that it comes up with. ... Any book on linear regression will describe these. ... I see that the nonlinear fitting routines accept a 'weight' option--how do I translate variances into weights? ... "I also know the variance of each of the data points " might mean several things. ...
    (comp.soft-sys.matlab)
  • Re: nonlinear regression for dummies
    ... > variance, chi-squared, and so forth. ... But the output of the regression ... is easier and simpler, ordinary least squares regression. ... you would learn that the ANOVA model ordinarily ...
    (sci.stat.edu)
  • Re: Need help understanding Homogeneity of Variance please
    ... What are the consequences of not having homogeneity of variance. ... regression, the assumption of homogeneity of variance is used as part ... The "unbiassedness" condition can ...
    (sci.stat.edu)