Re: Chinese character & pinyin frequency analysis
- From: hxy <huangxinyu714@xxxxxxxxx>
- Date: Wed, 10 Oct 2007 09:13:37 -0700
On Oct 2, 9:33 pm, d...@xxxxxxxxxxxx wrote:
> I've put together another Chinese character frequency list, but this
> one has a bit of a twist.
> The usual 100,000 Chinese web pages were analyzed, and unique strings
> at least 50 characters long were analyzed.
> The results were fed into NJStar to attempt translation into pinyin.
> The characters (with their pinyin) were then analyzed.
> Results here:
> http://readmandarin.com/research.htm
> Any feedback is most appreciated.
> Thanks!
> Daniel
Nice page. I have a question. The codes for the Chinese characters on
your page do not look like Unicode. Your codes are &# plus 5 digits.
I remember that the Unicode code for a Chinese character is &#x plus
4 digits; at least that is how I did it with the Unicode chart, and it
works. What type of code is used on your page?
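
For comparison, HTML allows a numeric character reference to be written
either in decimal (&# plus digits) or in hexadecimal (&#x plus hex
digits), so a 5-digit form and a 4-digit form can name the same Unicode
code point. Whether that is what the page does is only a guess. Here is
a minimal Python sketch of the two spellings; the character U+4E2D and
the helper names to_decimal_ref and to_hex_ref are chosen purely for
illustration and do not come from the page:

```python
import html

# Two spellings of one code point, U+4E2D (picked only as an example).
decimal_ref = "&#20013;"   # decimal: 20013 == 0x4E2D
hex_ref = "&#x4E2D;"       # hexadecimal form of the same number


def to_decimal_ref(ch):
    """Return the decimal numeric character reference for one character."""
    return f"&#{ord(ch)};"


def to_hex_ref(ch):
    """Return the hexadecimal numeric character reference for one character."""
    return f"&#x{ord(ch):04X};"


# html.unescape decodes both forms to the same character.
char = html.unescape(decimal_ref)
assert char == html.unescape(hex_ref)

print(to_decimal_ref(char), to_hex_ref(char))
```

If the 5-digit numbers on the page, read as decimal, match the hex
values in the Unicode chart, the two codes are the same thing.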
By the way, is it easy to get the alphabetic letters of the pinyin,
ordered by frequency, from your data? That would help in designing an
optimal keyboard layout for pinyin.
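
To show what I mean, here is a rough Python sketch of the letter count,
assuming the data can be exported as pinyin syllables with occurrence
counts. The function name letter_frequencies and the sample counts are
made up for illustration and are not taken from the page:

```python
from collections import Counter
import unicodedata


def letter_frequencies(syllable_counts):
    """Weight each letter a-z by how often its syllable occurs."""
    totals = Counter()
    for syllable, count in syllable_counts.items():
        # NFD splits a toned vowel into a base letter plus a combining mark,
        # so the loop below sees plain "i" for "ì". Note that "ü" is folded
        # into "u" by this step, which may not be wanted for a keyboard study.
        decomposed = unicodedata.normalize("NFD", syllable.lower())
        for ch in decomposed:
            if "a" <= ch <= "z":   # skips tone marks and tone digits
                totals[ch] += count
    return totals.most_common()    # highest count first


# Placeholder counts, NOT real measurements.
sample = {"de": 5, "shì": 3, "zhōng": 2}

for letter, n in letter_frequencies(sample):
    print(letter, n)
```
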
Thanks.
hxy