Re: Chinese character & pinyin frequency analysis



On Oct 2, 9:33 pm, d...@xxxxxxxxxxxx wrote:
I've put together another Chinese character frequency list, but this
one has a bit of a twist.

The usual 100,000 Chinese web pages were used, and unique strings
at least 50 characters long were extracted for analysis.

The results were fed into NJStar to attempt translation into pinyin.

The characters (with their pinyin) were then analyzed.

Results here:

http://readmandarin.com/research.htm

Any feedback is most appreciated.

Thanks!

Daniel


Nice page. I have a question. The codes for the Chinese characters on
your page do not look like Unicode. Your codes are &# plus 5 digits.
I remember that the Unicode code for a Chinese character is &#x plus
4 digits; at least that is how I did it with the Unicode chart, and it
works. What type of code is used on your page?
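
For anyone comparing the two notations: HTML allows a numeric character reference to be written either in decimal (&# plus digits) or in hexadecimal (&#x plus hex digits), so a 5-digit decimal code and a 4-digit hex code can name the same Unicode code point. Below is a minimal Python sketch of that equivalence. The sample character and the helper names to_decimal_ref and to_hex_ref are illustrative assumptions, not taken from the page being discussed.

```python
import html

# Two spellings of the same Unicode code point, U+4E2D.
decimal_ref = "&#20013;"  # decimal form: "&#" + code point in base 10
hex_ref = "&#x4E2D;"      # hexadecimal form: "&#x" + code point in base 16

# html.unescape decodes both kinds of numeric character reference.
decimal_char = html.unescape(decimal_ref)
hex_char = html.unescape(hex_ref)


def to_decimal_ref(ch):
    """Hypothetical helper: write one character as a decimal reference."""
    return f"&#{ord(ch)};"


def to_hex_ref(ch):
    """Hypothetical helper: write one character as a hexadecimal reference."""
    return f"&#x{ord(ch):X};"


print(decimal_char == hex_char)      # both references decode to one character
print(to_decimal_ref(decimal_char))  # back to the 5-digit decimal form
print(to_hex_ref(decimal_char))      # back to the 4-digit hex form
```

If this reading is right, the codes on the page would simply be the decimal values of the same numbers shown in hex on the Unicode chart.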

By the way, is it easy to get the frequency order of the alphabetic
letters in the pinyin from your data? It would help in designing an
optimal keyboard layout for pinyin.
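
As a rough sketch of what I have in mind: if the data can be reduced to a table of pinyin syllables with their occurrence counts, the letter frequencies follow by weighting each letter by its syllable's count. The function name letter_frequencies and the sample counts below are made up for illustration and are not real results. The sketch also assumes tones are written as trailing digits or omitted; tone-marked vowels would be counted as separate letters.

```python
from collections import Counter


def letter_frequencies(syllable_counts):
    """Return (letter, count) pairs, most frequent first.

    syllable_counts maps a pinyin syllable (e.g. "shi" or "shi4")
    to the number of times it occurred.
    """
    totals = Counter()
    for syllable, count in syllable_counts.items():
        for letter in syllable.lower():
            # Skip tone digits and any other non-letter characters.
            if letter.isalpha():
                totals[letter] += count
    return totals.most_common()


# Hypothetical sample data, NOT taken from the published list.
sample = {"de": 5, "shi4": 3, "yi": 2}

for letter, count in letter_frequencies(sample):
    print(letter, count)
```

The same ranking over the full data set would be the input for a keyboard layout study.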

Thanks.
hxy
