Re: Why, for Sample Standard Deviation, Divide by N-1, Instead of N?




DarkProtoman wrote:
Kevin E. Thorpe wrote:
DarkProtoman wrote:
Why, when you calculate the sample standard deviation, you divide by
n-1, instead of n? I've heard someone said it has "nice mathematical
properties that make the math work out smoothly". What are those "nice
mathematical properties"? Thanks!!!!!

Supposing you have a random sample of observations from a population.
The sample variance \sum[(x_i - \bar{x})^2]/(n-1) is the unbiased
estimate
of the population variance. The standard deviation is the square root
of
this. It is actually the variance that has the "nice" properties
(unbiasedness,
its relation to the chi-squared distribution).


OK, then, what are the nice mathematical properties of the variance,
then?

I mentioned two.

It's unbiased.

Let s^2 be defined as above and xbar be the sample mean
computed on a sample from a normally distributed population
with mean mu and variance sigma^2. A theorem from
mathematical statistics states that xbar and s^2 are independent
random variables. Furthermore, (n-1)*s^2 / sigma^2 has a
chi-square distribution on n-1 degrees of freedom.

It is also this sample variance that ultimately gives the t-statistic
its distribution (again under normality).

--
Kevin E. Thorpe
Assistant Professor, Department of Public Health Sciences
Faculty of Medicine, University of Toronto

.



Relevant Pages

  • Re: variance
    ... of a sample wee divide by n-1 instead of n? ... The variance of a discrete distribution is defined as ... the mean of the squares minus the square of the mean. ...
    (sci.math)
  • Re: standard error: bootstrap vs standard normal
    ... I denote by Xbar it mean. ... Distribution with n-1 degrees of freedom. ... It is not well known because it is not true when the population variance, which you've denoted by s^2, is not equal to one. ...
    (sci.stat.math)
  • Re: Measuring Turquoise Underwear
    ... that the distribution had to be normal. ... 6/49 game and has stats for about 52 draws. ... he claims the variance for the mean is /12n. ... The revised formula would yield ...
    (rec.gambling.lottery)
  • Re: feedback...
    ... >>>Hi Duncan, ... >>mean (from N sample draws) to fall with 95% confidence. ... The variance of the mean, after N draws, for a given position is ... the variance of the distribution from which a draw is made. ...
    (rec.gambling.lottery)
  • averaging noisy data (was: Re: Spacecraft earth-flyby data reveals dynamical preferred frame)
    ... filtered data contains much less noise than the raw data, ... The obvious thing to do is to average our N measurements by defining ... What can we say about the probability distribution of xbar? ... We'd like to say that the variance of xbar's distribution is about ...
    (sci.physics.research)