Re: Best encoding for a Japanese web site to deliver?

From: Timmy Douglas (timmy+slj_at_delspamcc.gatech.edu)
Date: 02/04/05


Date: Thu, 03 Feb 2005 20:26:39 -0500

Maciej Katafiasz <mnews2@wp.pl> writes:

> I don't know about common japanese practice and rationale for it, I
> admit, but I'd say just go with Unicode and be done with it
> forever. It's really hard to find browser that can't grok UTF-8
> nowadays, and Unicode is the future, unlike all the broken,
> complicated (ISO-2022) standards of the past.

The problem with using a unicode based encoding is that the unicode
character set unifies Chinese characters (without specifing the
language) and people whose unicode font is a non-japanese one will
likely get an ugly display (perhaps mixed fonts) of characters.

If you are just planning to support English and Japanese, it's
probably better to go with an encoding like EUC-JP.



Relevant Pages

  • Re: Help me!! Why java is so popular
    ... Well, Unicode is not a storage encoding system, or anything like that. ... Unicode is primarily a mapping from characters (in the linguistic conceptual ... French, Russian, Japanese and Korean songs. ...
    (comp.lang.java.programmer)
  • Re: utf8 and ftplib
    ... It opens a new local file using utf8 encoding and then reads from a file ... characters from the source file (e.g. foreign characters, ... Is there any way that I can correctly retrieve a utf8 encoded file from an FTP server? ... to be decoded to unicode on being read later. ...
    (comp.lang.python)
  • Re: Unicode Support
    ... >> (I know this is a poor example, but think about other languages, eg ... First things first, when you register your RosAsm windows classes, you ... the messages with ANSI / UNICODE parameters in ANSI or UNICODE form... ... with their alphabet characters, as with the numbers and punctuation...so, ...
    (alt.lang.asm)
  • Re: TCHAR string?
    ... According to Microsoft's documentation the 'A' functions are "ANSI" ... although Unicode is not itself an ISO standard; ... just as much an ISO encoding as any of the ISO encodings ... Windows) *was* to be able to represent any of the characters of the ...
    (microsoft.public.vc.mfc)
  • Re: Unicode support in Smalltalk
    ... Characters 128-255, as they mean both "the bytes 128-255 used in the ... encoding of a String" and "the Unicode Characters whose code points are ... Characters represent the encoding, UnicodeCharacters represent, well, ... EncodedString class that holds explicitly the encoding, ...
    (comp.lang.smalltalk)