Re: Question about the Shannon "entropy" of genomes
- From: Steven Sullivan <ssully@xxxxxxxxx>
- Date: Tue, 15 Jul 2008 01:16:32 -0400 (EDT)
Doug Wedel <dougwedel@xxxxxxxxxxxxx> wrote:
Using Claude Shannon's formulas for measuring the redundancy of symbol
tokens in message strings , and given a large enough text to work with, it
is possible to identify the language of a text simply from the statistical
analysis of token use alone, since all languages have unique "signatures" of
redundancy in symbol token use. It strikes me as possible that different
organisms (or species or genuses) may also have characteristic redundancy
levels in their genome, and I was wondering if anyone knows of statistical
studies of this kind.
look up 'codon bias' for one level of redundancy
Also look up 'sequence logos', Tom Schneider s work primarily, which have been used for years
to represent DNA/protein sequence in terms of Shannon Entropy.
http://www-lmmb.ncifcrf.gov/~toms/
--
-S
A wise man, therefore, proportions his belief to the evidence. -- David Hume, "On Miracles"
(1748)
.
- References:
- Question about the Shannon "entropy" of genomes
- From: Doug Wedel
- Question about the Shannon "entropy" of genomes
- Prev by Date: Re: David Deamer Response to my E-mail Comment
- Next by Date: Re: Evolution is NOT random
- Previous by thread: Question about the Shannon "entropy" of genomes
- Next by thread: Re: Question about the Shannon "entropy" of genomes
- Index(es):
Relevant Pages
|