Re: How many data points for statistical significant results?



Paige Miller wrote:
On Sep 8, 2:59 pm, Dave <f...@xxxxxxx> wrote:

Dave

First of all, you don't have good definitions. I think you probably
won't ever get very far unless you have good definitions. And the
problem is not that percentile isn't well defined, the problem is that
"statistically significant" isn't well defined.

Thank you


In particular, you need to know what the words "statistically
significant" mean to the person making the request. In statistics,
they refer to an hypothesis test, and I don't see that you have such a
thing here.

I see.

To define a hypothesis test, you need a null hypothesis to test: for
example, that the mean (or median, standard deviation, 5th percentile,
or whatever) is equal to a specific value. I don't see that here.
Alternatively, you might want the width of an X% confidence interval
for that same quantity to be smaller than a given amount. I don't see
that here either.
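
To illustrate why those extra pieces of information matter, here is a
minimal sketch of the two formulations for the simplest case, the mean
of normally distributed data with a known standard deviation. It is
not from the thread: the function names, the known-sigma normal model,
the two-sided 5% level, the 80% power, and the example numbers are all
assumptions chosen for illustration. A percentile would need a
different calculation. The point is only that no sample size comes out
until a margin of error or a difference to detect has been specified.

```python
import math
from statistics import NormalDist


def n_for_ci_half_width(sigma, half_width, confidence=0.95):
    """Smallest n so that a two-sided confidence interval for the mean
    has half-width (margin of error) at most `half_width`.

    Assumes normal data with known standard deviation `sigma`.
    """
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return math.ceil((z * sigma / half_width) ** 2)


def n_for_one_sample_z_test(sigma, delta, alpha=0.05, power=0.80):
    """Smallest n so that a two-sided one-sample z-test at level `alpha`
    detects a true shift of `delta` in the mean with roughly the
    requested power (the usual normal approximation).

    Assumes normal data with known standard deviation `sigma`.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    return math.ceil(((z_alpha + z_beta) * sigma / delta) ** 2)


# Hypothetical numbers: sigma = 10, margin of error 2, shift to detect 5.
print(n_for_ci_half_width(sigma=10, half_width=2))
print(n_for_one_sample_z_test(sigma=10, delta=5))
```

Halving the margin of error or the difference to detect roughly
quadruples the required sample size, which is why "how many data
points" has no answer until those quantities are stated.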




If someone makes a request of you in the language of statistics, you
and your requester have to use the language and mechanics of
statistics throughout both the problem definition phase and the
problem execution phase, or you have to translate the non-statistical
words into words that mean something statistically. Asking for
something to be "statistically significant" without the additional
information that statistics requires is an undefined problem and
cannot be solved. To do the statistics properly, you will need to
consult the person who made the request to get a much better idea of
what he is looking for and why, so that his request makes sense and
is defined in a statistical way.


Thank you. I suspect the person making the request is not sure, but I have gone back to clarify matters.
--
Paige Miller
paige\dot\miller \at\ kodak\dot\com


