Re: variance



In article <1142472447.227577.326260@xxxxxxxxxxxxxxxxxxxxxxxxxxxx>,
gmarkowsky@xxxxxxxxx wrote:
Can someone please tell me why sometimes when calculating the variance
of a sample wee divide by n-1 instead of n? Thanks.

The variance of a discrete distribution is defined as

n n
1 --- _ 2 1 --- 2 _ 2
- > (x - x) = - > x - x
n --- k n --- k
k=1 k=1

That is, the mean of the squares minus the square of the mean.

However, when taking sampling, in general the sample mean differs
from the distribution mean. The sample mean is shifted toward
the samples which, in general, causes the sample variance to be
smaller than the distribution variance. As shown on the page at
<http://www.whim.org/nebula/math/varn-1.html>, the the expected
sample variance is (n-1)/n times the distribution variance.
Therefore, the proper way to correct this is to multiply the
sample variance by n/(n-1). Since the formula for the variance
has n in the denominator, the correction replaces the n with an
n-1.

Rob Johnson <rob@xxxxxxxxxxxxxx>
take out the trash before replying
to view any ASCII art, display article in a monospaced font
.



Relevant Pages

  • Re: Why, for Sample Standard Deviation, Divide by N-1, Instead of N?
    ... It is actually the variance that has the "nice" properties ... computed on a sample from a normally distributed population ... chi-square distribution on n-1 degrees of freedom. ...
    (sci.stat.math)
  • Re: Measuring Turquoise Underwear
    ... that the distribution had to be normal. ... 6/49 game and has stats for about 52 draws. ... he claims the variance for the mean is /12n. ... The revised formula would yield ...
    (rec.gambling.lottery)
  • Re: feedback...
    ... >>>Hi Duncan, ... >>mean (from N sample draws) to fall with 95% confidence. ... The variance of the mean, after N draws, for a given position is ... the variance of the distribution from which a draw is made. ...
    (rec.gambling.lottery)
  • averaging noisy data (was: Re: Spacecraft earth-flyby data reveals dynamical preferred frame)
    ... filtered data contains much less noise than the raw data, ... The obvious thing to do is to average our N measurements by defining ... What can we say about the probability distribution of xbar? ... We'd like to say that the variance of xbar's distribution is about ...
    (sci.physics.research)
  • Re: Questions about a distribution
    ... Let's say the PDF has the mean = 250 ... The variance tells how ... units as the reaction time measurements, so I would have said that the SD ... lets you know how "wide" the distribution is. ...
    (sci.stat.math)

Loading