Re: Most Komplex Kanji ?

From: Bart Mathias (bartmathias_at_verizon.net)
Date: 09/06/04


Date: Mon, 06 Sep 2004 02:15:23 GMT

jim_breen@idontreadhotmail.com wrote:
> [...]
> I think you'll see that the postings I "read" had been converted to
> ISO-2022-JP by someone else's response. I have to do an off-line
> conversion to read UTF8, and usually I can't be bothered.

I wonder why that's?

My Mozilla is a bit broken (but not as badly as the rest of the stuff on
my desktop), but the only way I know whether something is in UTF8 is
because the characters are small (and seriffed), or there are extended-
ASCII looking characters in the same post.

Bart



Relevant Pages

  • Re: unicode conversion
    ... breaks utf8 output of Chinese characters to an otherwise perfectly utf8- transparent console, see my XML::Simple and utf8 woe posting of ... As I explained in the other thread, what's probably happening is that, without -CS, your data is being read in by Perl as octets, then printed out as octets; however, under -CS your data is still read as octets yet printed to a UTF8-aware filehandle. ... my latest experience is with bulk quantities of utf8 data (latin, CJK material, _tons_ of characters with accents and diacritics in one soup). ... When I try to segment such a string with approx. ...
    (comp.lang.perl.misc)
  • Re: Reg multilanguage support by gnuplot
    ... So far as I know, Matsuda's postscript example used EUC encoding, not utf8. ... Your original query was about an EUC font. ... Unicode is an assignment of "all" characters to unique ... # Test of UTF-8 support by gnuplot terminals. ...
    (comp.graphics.apps.gnuplot)
  • Re: utf8 Problems
    ... I converted to utf8 in the hope that my non ASCII character problems ... use all sorts of special characters, limited only by the fonts you have ... encoding in a standardized way, for example in plain text files. ... $ locale | grep -v en_US ...
    (Debian-User)
  • Re: perl 5.6 multi byte
    ... GB1312 is in fact GB2312 and is used for Simplified Chinese. ... Both GB2312 and ShiftJIS are double byte character sets (DBCS). ... Some characters have on byte, ... they are very different from utf8. ...
    (comp.lang.perl.modules)
  • Re: perl 5.6 multi byte
    ... GB1312 is in fact GB2312 and is used for Simplified Chinese. ... Both GB2312 and ShiftJIS are double byte character sets (DBCS). ... Some characters have on byte, ... they are very different from utf8. ...
    (comp.lang.perl.misc)