Question about the Shannon "entropy" of genomes



Using Claude Shannon's formulas for measuring the redundancy of symbol
tokens in message strings , and given a large enough text to work with, it
is possible to identify the language of a text simply from the statistical
analysis of token use alone, since all languages have unique "signatures" of
redundancy in symbol token use. It strikes me as possible that different
organisms (or species or genuses) may also have characteristic redundancy
levels in their genome, and I was wondering if anyone knows of statistical
studies of this kind.



.



Relevant Pages

  • Re: Question about the Shannon "entropy" of genomes
    ... tokens in message strings, and given a large enough text to work with, it ... is possible to identify the language of a text simply from the statistical ... redundancy in symbol token use. ... Also look up 'sequence logos', Tom Schneider s work primarily, which have been used for years ...
    (sci.bio.evolution)
  • Re: compression type
    ... redundancy elimination as their main score. ... is to draw say a curve in the sphere through each word as what you say ... in pattern, like from one way to the other is the sentence backwards. ... doesn't have any patterns at all (the way of tokens, file allocations, ...
    (comp.compression)
  • Re: Strategic Functional Migration and Multiple Inheritance
    ... Would C# be a "simpler" language by making all the keywords single ... tokens mean different things in different contexts, ... I'm not trying to argue this in metrics of information theory, ... I don't believe such metrics give a good idea of the readability of ...
    (microsoft.public.dotnet.languages.csharp)
  • Re: observable language change - "off of" makes it to the NY Times
    ... "barbarism" makes him sound learned. ... By the style experts you've found who feel the same way and whom you've ... I'd elaborate on the absurdity of your determination to ascribe this usage to "literal mindedness" and Americans in particular by giving you examples from the vast history of language that show that this type of thing is an extremely common mechanism for language evolution and that this is true in many, many, many languages, but your reaction EVERY TIME someone has tried to give you the broader perspective is to complain that you don't have time to learn the broader perspective or that it's irrelevant because, darn it, when you dislike something, it's wrongness stands on its own regardless of how consistent the supposedly irritating factors are with normal language features and developments, including the most standard ones you might imagine. ... I'd go into the redundancy involved in gender and case agreement of adjectives with the nouns they modify, the redundancy of the ordinary negative in French and Afrikaans, and other phenomena, but I won't waste my time explaining them since you have chosen to ignore the examples I've already given you and have shown your aversion for actually learning anything. ...
    (sci.lang)
  • Re: Relational model versus object model
    ... >The relational model, on the other hand, provides a very simple ... The "redundancy" is the ability to access data anywhere in the ... >one of the language you work in, as opposed to SQL. ...
    (comp.object)