Re: where do so many tenses come from?
- From: Helmut Richter <a282244@xxxxxxxxxxxxxxxxxxxx>
- Date: 7 Apr 2006 09:39:09 GMT
In article <b9ta32dlgi0kl0o1ou21kktcqb49ipmj19@xxxxxxx>, Ruud Harmsen wrote:
I'm currently writing a program to make such calculations
automatically, so that eventually I'll be able to tell if a Portuguese
and a Hungarian spoken news item contain more or less bits of
information.
Not only the frequency of phonemes plays a rôle but much more so the
frequency of clusters. I have once computed Huffman codes not only for
single letters (not the same as phonemes, but the effect will be the same
for both), but also for pairs, triples, and so on. In the text there were
102 distinct letters, but given one letter there were in the average only
27 pairs beginning with that letter, and for one quadruple of letters
there were in the average less than 2 quintuples beginning with this
quadruple. So the most recent history determines a lot what follows, and
this must be taken into account when calculating information content.
The numbers are in
http://www.lrz-muenchen.de/services/schulung/unterlagen/compress/, but be
warned that this is in no way a reliable source: it is the collection of
arbitrary examples a set up for a single lecture, and published only to
save giving out handouts.
Helmut Richter
.
- Follow-Ups:
- Re: Are some languages really faster than others?
- From: Ruud Harmsen
- Re: Are some languages really faster than others?
- References:
- Re: where do so many tenses come from?
- From: António Marques
- Re: where do so many tenses come from?
- From: Lee Sau Dan
- Re: where do so many tenses come from?
- From: Herman Rubin
- Re: where do so many tenses come from?
- From: Ruud Harmsen
- Re: where do so many tenses come from?
- Prev by Date: Re: aestimare
- Next by Date: Re: Are some languages really faster than others?
- Previous by thread: Re: Are some languages really faster than others?
- Next by thread: Re: Are some languages really faster than others?
- Index(es):
Relevant Pages
|