Re: Biased and unbiased std dev



"Vinay" <vinkolar@xxxxxxxxx> wrote in news:1121288120.240622.52330
@g43g2000cwa.googlegroups.com:

> I was wondering if someone can explain in simple terms why std dev s=
> sqrt(sum_(x_i-\bar{x})/n) is biased. I understand that 's' should be
> invalid when n=1 and hence they divide by (n-1). But is it just to
> avoid this case? Or is there more intersting fact for that.
>
> Why does dividing by n always give biased estimates? Is this because
> there are n-1 intervals between x_1 to x_n?

As Bob has pointed out, it's only the variance that you can get an
unbiased estimate of by dividing by (n-1); taking the square root biases
the result.

The reason for the bias is that when you take the average sum of squares
around the *sample* mean rather than the population mean, you end up
subtracting out the variance of the sample mean itself (which will be
sigma^2/N, where sigma^2 is the population variance) and thus wind up
with a value that's too small by a factor of (n-1)/n. So you multiply
the result by n/(n-1) and come up with the standard formula.

A more intuitive explanation is that the mean of a set of data is, by
definition, the value that minimizes the sum of squared deviations around
it. For a sample, the sample mean will have a smaller sum of squared
deviations around it than any other value. But if you're trying to
estimate the *population* variance, you're really interested in the
deviations around the *population* mean. Since the population mean will
not in general be *exactly* equal to the sample mean, the sum of squared
deviations around it will be greater than the sum of squared deviations
around the sample mean, and therefore the latter will be biased low.
.



Relevant Pages

  • Re: z test how
    ... variance 2 is variance squared ... sum of V2/n = 0.0000145545 ... at the probability that is my probability that this 2 groups are ... It doesn't mean you need to square it. ...
    (sci.stat.edu)
  • Re: Biased and unbiased std dev
    ... > unbiased estimate of by dividing by; taking the square root biases ... That's only ONE consequence of the definition of an unbiased estimate. ... the value that minimizes the sum of squared deviations around ... > deviations around it than any other value. ...
    (sci.stat.math)
  • Re: z test how
    ... Bruce Weaver wrote: ... variance 2 is variance squared ... sum of V2/n = 0.0000145545 ... It doesn't mean you need to square it. ...
    (sci.stat.edu)
  • Re: VHDL code for finding standard deviation for a chunk of numbers
    ... The variance is the mean square minus the ... Define two numbers: sum, ... forced zero mean you can dispense with the calculation of the mean and just ...
    (comp.arch.fpga)
  • Re: Standard Deviation
    ... Suppose you have five observations and want to compute the sum of the squared deviations from the mean. ... Now the square of 10015 is 100300225, which is divided by 5 to yield 20060045, and the SS is 20060055 minus 20060045, which is 10 as before. ... Using the computational formula is dangerous on computers when the mean is large and the variance is small relative to the mean ...
    (borland.public.delphi.non-technical)