Chinese character & pinyin frequency analysis



I've put together another Chinese character frequency list, but this
one has a bit of a twist.

The usual 100,000 Chinese web pages were used as the source, and the
unique strings at least 50 characters long were extracted from them.

The extracted strings were fed into NJStar to attempt conversion into pinyin.

The frequency of each character (paired with its pinyin) was then counted.
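
For illustration only, here is a rough Python sketch of the filtering and counting steps described above. It is not the script actually used: the function names are made up, the pinyin is assumed to have been attached already by an external tool (NJStar in this case), and "Chinese character" is assumed to mean the basic CJK Unified Ideographs block.

```python
from collections import Counter

MIN_LENGTH = 50  # minimum string length mentioned in the post


def is_cjk(ch):
    """True for characters in the basic CJK Unified Ideographs block (assumption)."""
    return "\u4e00" <= ch <= "\u9fff"


def unique_long_strings(strings, min_length=MIN_LENGTH):
    """Drop duplicates and strings shorter than min_length, keeping first-seen order."""
    seen = set()
    result = []
    for s in strings:
        if len(s) >= min_length and s not in seen:
            seen.add(s)
            result.append(s)
    return result


def count_pairs(annotated_strings):
    """Count (character, pinyin) pairs.

    Each annotated string is a list of (character, pinyin) tuples, as a
    hypothetical stand-in for the output of the pinyin conversion step.
    Non-CJK characters are skipped.
    """
    counts = Counter()
    for pairs in annotated_strings:
        for ch, py in pairs:
            if is_cjk(ch):
                counts[(ch, py)] += 1
    return counts
```

Counting (character, pinyin) pairs rather than bare characters keeps the readings of a character with more than one pronunciation separate, so the same character can appear in the list once per reading.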


Results here:

http://readmandarin.com/research.htm


Any feedback is most appreciated.


Thanks!

Daniel
