Re: Cluster analysis for beginners
- From: "illywhacker" <illywacker@xxxxxxxxx>
- Date: 29 Mar 2007 08:57:13 -0700
On Mar 29, 4:38 pm, David Winsemius <doe_s...@xxxxxxxxxxx> wrote:
Sidney <milan_y...@xxxxxx> wrote innews:24466740.1175159875339.JavaMail.jakarta@xxxxxxxxxxxxxxxxxxxxxx:
Dear all,
I ran into a problem that I can't solve with my basic statistics high
school knowledge - apologies if this is too trivial to some of you.
Assume you have 5000 proteins that are ordered by their molecular
weight from 1000 Daltons to 100000 Daltons (the numbers don't matter).
If you now find that a certain motif (e.g. a specific phosphorylation
motif) which is only found within a certain molecular weight range,
e.g. only between 77000-81000 Daltons, how do you determine if this
'clustering' is significant? At this point I have no idea what to do
and where to start at. Your input is very much appreciated. Thanks a
lot in advance. Sidney
I did see illywacker's reply, but I disagree. I thought your scientific
question was reasonably clear for one thing. If your null hypothesis is
that there is no association between MW and presence of the motif, you
could start by arranging the proteins in deciles of weight and testing for
uniformity of motif-ication in a multinomial model with 9 degrees of
freedom. Unless most of your proteins are in that specified range above,
with 5000 data points it seems reasonably clear that you will get a
'significant' result using that approach. I would argue that such a test
does not represent one with "strong assumptions".
Check it out, and the Prartt and Savage references therein.
illywhacker;
.
- References:
- Cluster analysis for beginners
- From: Sidney
- Re: Cluster analysis for beginners
- From: David Winsemius
- Cluster analysis for beginners
- Prev by Date: Re: Cluster analysis for beginners
- Next by Date: Re: Cluster analysis for beginners
- Previous by thread: Re: Cluster analysis for beginners
- Next by thread: Re: Cluster analysis for beginners
- Index(es):
Relevant Pages
|
|