Re: Distribution of a vowel on the page



On Oct 24, 12:28 am, David Winsemius <doe_s...@xxxxxxxxxxx> wrote:
"Graham Jones" <x...@xxx> wrote innews:pbWdnXmtGry-cYDanZ2dneKdnZydnZ2d@xxxxxx:





"David Winsemius" <doe_s...@xxxxxxxxxxx> wrote in message
news:Xns99D1DA83F1BB4dwtttttt@xxxxxxxxxxxxxxxxx
Richard Ulrich <Rich.Ulr...@xxxxxxxxxxx> wrote in
news:9ahph3t1qek63cvlokenoro8g9sfgagb5d@xxxxxxx:

If there were a positive correlation (consecutive repetitions),
the distribution, if otherwise Poisson, would be over-dispersed
(variance too large for the mean). Since there is a negative
correlation, it will be under-dispersed.

Why should the vowel count of one line be dependent on the vowel
count in a prior line?

You seem to be missing Richard's point, which is about lack of
independence within words affecting the *variance* of the
vowel-per-line counts. For example, suppose that all lines contain 10
words, and that all words contain 5 letters and either 1 or 2 vowels.

Please explain to us why your supposition has any connection with real
text. If that were true. it might be interesting....but is it?

--
David Winsemius

Ahhhhh...ok, I misunderstood. Here are the numbers:

So, 35 lines of text:

Number of times letter 'a' occurs per line Frequency
0 0
1 1
2 0
3 2
4 3
5 5
6 9
7 2
8 4
9 5
10 3
11 0
12 1

Or if you want them line by line:

9
5
4
7
6
8
4
9
6
6
3
8
5
6
12
8
5
1
6
5
6
3
6
9
5
10
6
8
10
9
10
6
9
7
4

Cheers,
Luca





.



Relevant Pages

  • Re: Variance question
    ... Poisson sample they calculte the mean and states that as variance. ... Now it just so happens that the mean of a Poisson distribution with parameter lambda is lambda, and it also happens that the variance of that distribution is lambda. ... The sample variance is also a perfectly reasonable estimator of the variance of a Poisson distribution, ...
    (comp.soft-sys.matlab)
  • Re: Measuring Turquoise Underwear
    ... that the distribution had to be normal. ... 6/49 game and has stats for about 52 draws. ... he claims the variance for the mean is /12n. ... The revised formula would yield ...
    (rec.gambling.lottery)
  • Re: feedback...
    ... >>>Hi Duncan, ... >>mean (from N sample draws) to fall with 95% confidence. ... The variance of the mean, after N draws, for a given position is ... the variance of the distribution from which a draw is made. ...
    (rec.gambling.lottery)
  • Re: Need Help Determining the "True" Mean of a Sample
    ... > I'm a software engineer, not a statistician, so please forgive my ... > The distribution for these samples is such that about 75% of the ... only 4% as much of the total variance. ... of dropping immediately to zero when the N is under 58. ...
    (sci.stat.math)
  • Re: Questions about a distribution
    ... Let's say the PDF has the mean = 250 ... The variance tells how ... units as the reaction time measurements, so I would have said that the SD ... lets you know how "wide" the distribution is. ...
    (sci.stat.math)