Re: Stepwise with categorical variables




Blas Navarro wrote:
> Hi all. I would like to estimate a predictive linear regression model
> with 5 predictors. One predictor is categorical (socioeconomic
> status-SS) with 3 categories, so I generate 2 dummy variables: SS1 and
> SS2.
> The problem appears when using STEPWISE regression with the procedure
> REGRESSION of SPSS, because this method of selection does not take into
> account that the 2 dummy variables must go together in any candidate
> model.
> Does anyone how to solve this problem working with SPSS (I know other
> software make the dummy decomposition automatically so there is no
> problem).

Stepwise is almost always a bad idea, regardless of whether your
predictors are categorical or continuous. There has been lots written
in these newsgroups about this, do a search and you will find some of
the reasons.

--
Paige Miller
paige.miller@xxxxxxx

.



Relevant Pages

  • Re: stepwise regression by GENSTAT
    ... My handbook considers only stepwise regression as a method to select ... leaving behind only "random variation" in the residuals (residuals = ... to which subset of these to use as predictors. ...
    (sci.stat.math)
  • Re: Questions about square errors
    ... Take a look at the 10X10 correlation coefficient matrix and the ... multicollinearities. ... least squares and/or multiple regression. ... Your model may have several unnecessary predictors. ...
    (sci.stat.math)
  • Re: Enter versus forward method for linear regression
    ... Regression, ... present the coefficents and p values of all predictors so that readers ... try Robert Abelson's book "Statistics as Principled Argument." ... and examine the effects on the coefficients. ...
    (sci.stat.edu)
  • Re: Using Ridge Regression to disentangle highly correlated explanatory variables
    ... the regression model, which did it's job at reducing the VIF greatly. ... the relative impact of each of the three explanatory variables. ... impacts of your three correlated predictors, ... and the VIFs should be 1 for each score variable. ...
    (sci.stat.math)
  • Re: Collinearity, confidence intervals and sampling
    ... read that if you have collinear variables, the best fitting plane of the ... predictors are highly correlated so the predictor values fall in a straight ... Does this mean that this is not a problem if you have population level data ... What about other assumptions of regression e.g. ...
    (sci.stat.consult)