using the chi-square goodness of fit test for large cell counts
- From: Chris Waterson <waterson@xxxxxxxxx>
- Date: Fri, 1 Feb 2008 14:21:56 -0800 (PST)
Hi there! I'm trying to apply a chi-square goodness-of-fit test to
see if a distribution has changed significantly.
In particular, my system produces some number of events in each time
interval. I've recently changed the system in a way that could,
theoretically, have changed the distribution of events.
For the sake of argument, here is the distribution of events before
the system was changed (baseline), and the distribution after the
system was changed (experiment):
events:         1      2     3    4   5   6  7  8
baseline:   48366  11115  1088  126  25   4  1  1
experiment: 48595  10834  1073  125  40  15  3  1
If I squint, my intuition says, "there is no real difference between
these distributions."
But R's "chisq.test" disagrees with me:
> b = c(48366,11115,1088,126,25,4,1,1)
> x = c(48595,10834,1073,125,40,15,3,1)
> chisq.test(x, p=b, rescale.p=TRUE)
Chi-squared test for given probabilities
data: x
X-squared = 51.6607, df = 7, p-value = 6.81e-09
>
If I cheat and scale the counts down, I get much more "believable" (to
me!) p-values that show the differences to be non-significant:
> chisq.test(x/10, p=b, rescale.p=TRUE)$p.value
[1] 0.6397052
> chisq.test(x/100, p=b, rescale.p=TRUE)$p.value
[1] 0.9993834
>
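The effect of the rescaling can be checked by computing the Pearson statistic by hand. What follows is only a rough sketch under stated assumptions: pearson_stat is a helper name made up for this illustration (it is not part of R), and p is formed as b / sum(b), which is assumed to match what rescale.p=TRUE does. With the proportions held fixed, dividing every count by 10 should divide the statistic by 10 while the degrees of freedom stay at 7.

```r
# Counts copied from the post above
b <- c(48366, 11115, 1088, 126, 25, 4, 1, 1)    # baseline
x <- c(48595, 10834, 1073, 125, 40, 15, 3, 1)   # experiment

# Hypothetical helper (not an R built-in): Pearson statistic for
# observed counts against a vector of reference proportions
pearson_stat <- function(obs, p) {
  expected <- sum(obs) * p
  sum((obs - expected)^2 / expected)
}

# Assumed equivalent of rescale.p=TRUE: normalise baseline to sum to 1
p <- b / sum(b)

stat_full      <- pearson_stat(x, p)
stat_tenth     <- pearson_stat(x / 10, p)
stat_hundredth <- pearson_stat(x / 100, p)

# Per-cell contributions to the full-count statistic
expected <- sum(x) * p
contrib  <- (x - expected)^2 / expected

stats <- c(full = stat_full, tenth = stat_tenth, hundredth = stat_hundredth)
print(stats)
# Upper-tail p-values on length(x) - 1 = 7 degrees of freedom
print(pchisq(stats, df = length(x) - 1, lower.tail = FALSE))
print(round(contrib, 3))
```

The first value printed should agree with the X-squared = 51.6607 reported above, and the other two should be one tenth and one hundredth of it. The last line shows how much each cell contributes; in this data most of the total comes from the sparse cells for 5 and 6 events rather than from the large cells.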
Is it inappropriate to use this test on "too much data"?
Thanks in advance for any advice...
chris