Re: The sample s.d. always underestimates




Paul Sanchez wrote:
"Luis A. Afonso" wrote:
Jack wrote

*** Here is a counter-example. Suppose the population variance,
sigma^2 = 1. Then for a sample of size N=2, let x(1) = 0 and
x(2) = 10. The sample variance s^2 = 50, so s = sqrt(50) = 7.07... > 1.
I found one.***

My response

This is idiotic. Without knowing the probability distribution function
(or the density, if it is continuous), which is needed to evaluate the
_____Population Mean
_____Population Variance

I have no way to compare your sample variance with the Population Variance!


What is the probability of getting x(1)=0 and x(2)=10? What is it?

Your *soi-disant* counter-example is TRASH. You have found NOTHING.


______licas (Luis A. Afonso)

No, Jack was right. He told you that the population variance was 1, as
a given. There are an infinite number of distributions which have
population variance 1 which could yield 0 and 10 as outcomes. For
example, the simple distribution p(0) = (1 + sqrt(0.96)) / 2, p(10) = 1
- p(0) does the job nicely. If you do the math, you'll find that the
distribution I've specified has a variance of exactly 1 and both 0 and
10 occur with positive probabilities. QED, Jack's sample containing
{0,10} may be rare, but it's perfectly realizable. And it yields a
sample variance > 1, exactly as claimed.
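
The arithmetic behind "if you do the math" can be checked in a few lines. This is only a sketch in Python (standard library only); the variable names are illustrative, and it assumes the two draws in the sample are independent.

```python
import math
import statistics

# Two-point distribution from the post: P(X=0) = p0, P(X=10) = 1 - p0.
p0 = (1 + math.sqrt(0.96)) / 2
p10 = 1 - p0

# Population mean and variance, computed directly from the definition.
mean = 0 * p0 + 10 * p10
variance = (0 - mean) ** 2 * p0 + (10 - mean) ** 2 * p10

# Sample variance of {0, 10} with the usual n - 1 denominator.
sample_var = statistics.variance([0, 10])

# Chance that two independent draws give one 0 and one 10, in either order.
prob_sample = 2 * p0 * p10

print(f"population variance: {variance:.6f}")
print(f"sample variance of {{0, 10}}: {sample_var}")
print(f"P(sample is {{0, 10}}): {prob_sample:.4f}")
```

Since p0 * p10 = (1 - 0.96) / 4 = 0.01, the population variance is 100 * 0.01 = 1, and the sample {0, 10} turns up with probability about 0.02: rare, but realizable.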

Here is another counter-example. In R I did this:

set.seed(31233)
var(rnorm(100))
[1] 1.211056

The rnorm function generates normal variates, in this case
standard normal (var = 1). The sample variance in this
sample of 100 is clearly bigger than 1.
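
A single draw could be dismissed as luck, so here is a rough sketch of the same check repeated many times. It is in Python rather than R, so the seed will not reproduce the number above; the repetition count of 2000 is an arbitrary choice for illustration.

```python
import random
import statistics

random.seed(31233)  # arbitrary seed; does not reproduce R's random stream

n = 100      # sample size, as in the R example
reps = 2000  # number of repeated samples (an assumption for this sketch)

# Sample variance (n - 1 denominator) of each standard normal sample.
sample_vars = [
    statistics.variance([random.gauss(0.0, 1.0) for _ in range(n)])
    for _ in range(reps)
]

# Share of samples whose variance exceeds the population variance of 1.
frac_above = sum(v > 1.0 for v in sample_vars) / reps
mean_var = statistics.mean(sample_vars)

print(f"fraction of samples with s^2 > 1: {frac_above:.3f}")
print(f"average of the sample variances:  {mean_var:.3f}")
```

A sizeable share of the samples should come out above 1, with the sample variances averaging close to 1, so a sample variance larger than the population variance is nothing unusual.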

--
Kevin E. Thorpe
Assistant Professor, Department of Public Health Sciences
Faculty of Medicine, University of Toronto
