Re: Unusual formulae for confidence intervals




Greg Heath wrote:
Stephen J. Herschkorn wrote:
Just to clarify, I wrote exactly what I meant.

Ray Koopman wrote:

Stephen J. Herschkorn wrote:

One of my tutees is taking a course in statistics for public policy. I
think this is my first such client. (I have had other studetns in
psychology, sociology, and public health.)

As we were discussing confidence intervals, he pointed out two formulae
in his text and notes. I had never encountered these before.

- For a population proportion, use as the standard error 0.5 /
sqrt(n), where n is the sample size. The justification was that
since we do not know the population proportion pi, use pi = 0.5 for a
conservative interval estimate. Most formulae I have seen substitute
the sample proportion Q (my notation) to get sqrt(Q (1-Q) / n) for
the standard error. If one is going to be a stickler, one can solve the
quadratic inequality
-z <= (Q - pi) / sqrt( pi (1- pi) / n) <= z for explicit bounds on
pi. So using 0.5 seems silly to me.

That's for choosing the sample size (before the data are collected)
so that the standard error is guaranteed to be small enough.

No, they explicity said to use 0.5 / sqrt(n) for all confidence
intervals for the proportion. Actually, more precisely, they
cicuitously said use sqrt(P_u (1-P_u) / n), where P_u is the population
proportion, and set P_u = 0.5 for a conservative estimate. I am
familiar with the standard techniques for determining sample size; the
book was not discussing this issue.

The reasons are:

1. 0<= Q(1-Q) <= 0.25 for 0<= Q <=1 with the max
occuring at Q = 0.5. Therefore

It is true for ALL Q on the real line.

2. It is easy to remember.

Why do you have to remember it?

The latter is used when the standard deviation is known.
The former is used when it is obtained from the sample
estimate of the variance.

Regardless what you call the former or latter, your statement
is at best misleading when you did NOT specify which form
of the "sample estimate" of the unknown population variance
is used. That was a VALID point made by Koopman about
the ambiguous usage of S that probably led to some of the
present confusion about (n-1) and n, because S/sqrt(n-1)
and S/sqrt(n) could be IDENTICAL if different estimates were
used for S.

If S is estimated from the sample variance use sqrt(n-1) and t
with nu = n-2 degrees of freedom.

That is a trivial statistical result, relevant only to PART of the
problem raised.

Hope this helps.

Greg

Helped yourself, perhaps. I doubt if it helped anyone else,
because you are just rehashing everything that had been said,
and said better, by others, because you came late into the
discussion and NOT fully aware of all the statistical issues
involved.

-- Reef Fish Bob.

.



Relevant Pages

  • Re: Unusual formulae for confidence intervals
    ... As we were discussing confidence intervals, ... For a population proportion, use as the standard error 0.5 / ... That is only superficial reason for addressing the symptom without ...
    (sci.stat.math)
  • Re: Unusual formulae for confidence intervals
    ... As we were discussing confidence intervals, ... For a population proportion, use as the standard error 0.5 / ... capital Latin letters for statistics. ...
    (sci.stat.math)
  • Re: Estimation of variance of proportions
    ... I would like to estimate the variance of estimated proportions. ... of a proportion when the estimated proportion is null? ... necessarily zero, then you should not use zero, but rather a small ... One fairly widely known quick approximation for the upper 95% confidence ...
    (sci.stat.math)
  • Re: Estimation of variance of proportions
    ... I would like to estimate the variance of estimated proportions. ... proportion when the estimated proportion is null? ... If you think that the proportion of successes, hits or events is not necessarily zero, then you should not use zero, but rather a small non-zero estimate. ... the Poisson distribution and associated methods might be mathematically convenient. ...
    (sci.stat.math)
  • Unusual formulae for confidence intervals
    ... One of my tutees is taking a course in statistics for public policy. ... For a population proportion, use as the standard error 0.5 / sqrt, where n is the sample size. ... I prefer to use lower-case Greek letters for parameters and capital Latin letters for statistics. ...
    (sci.stat.math)