Re: Principal Component Analysis- Do I need to scale (i.e. normalize) my variables?
- From: Art Kendall <Arthur.Kendall@xxxxxxxxxxx>
- Date: Sat, 16 May 2009 13:46:16 GMT
When you tell the software to use the covariance matrix in any type of factor analysis (principal components, principal factors, alpha, image, etc) the scales are important. In the more routine situation, where you tell the software to use the correlation matrix the variables are implicitly standardized. z = (value of x-mean of x) /(standard deviation of x).
I don't know what the substantive meaning of x's would be if you used the absolute value of the z-scores aka standardized variables.
This is a correlation matrix on 4 z-scores. (it is the same as for raw variables).
x1 x2 x3 x4
x1 1 .646 -.303 -.533
x2 .646 1 -.437 -.399
x3 -.303 -.437 1 .327
x4 -.533 -.399 .327 1
This is a correlation on the absolute value of the z-scores
abs1 abs2 abs3 abs4
abs1 1 .535 -.269 .125
abs2 .535 1 -.324 .117
abs3 -.269 -.324 1 .022
abs4 .125 .117 .022 1
Art Kendall
Social Research Consultants
Kerry wrote:
Hi,.
I need to perform PCA on 20 or so variables (ex. height of say a tree,
weight of a tree, age, etc), many with different units and/or value
ranges. Will this bias my results? I noticed in a past PCA I did that
I converted all of my values to z scores [i.e. ((abs(value-mean))/std
dev)], but not sure why or if this was even a correct way to
normalize. If I do need to normalize my values, wouldn't it make more
sense to convert them to value/mean? Or what about value/sum(values)?
To be clear, I am referring to making my values unitless prior to
adding them all to my PCA.
Thanks,
K
- Follow-Ups:
- References:
- Prev by Date: Sale Apple Macbook Pro
- Next by Date: Re: Using Mann Whitey test for Likert Scales
- Previous by thread: Re: Principal Component Analysis- Do I need to scale (i.e. normalize) my variables?
- Next by thread: Re: Principal Component Analysis- Do I need to scale (i.e. normalize) my variables?
- Index(es):
Relevant Pages
|
Loading