Re: Google Beta mangles ASCII-IPA

From: Jukka K. Korpela (jkorpela_at_cs.tut.fi)
Date: 12/19/04


Date: Sun, 19 Dec 2004 14:47:05 +0000 (UTC)


"Reinhold (Rey) Aman" <aman@sonic.net> wrote:

> You may be in the minority if you use "US-ASCII"

Surely not, if you mean the _character repertoire_, which is what
really matters here.

> (unless that's the
> same as "Western" and "Latin 1").

It's not. But the character repertoire ASCII is a subset of virtually
all character sets in use.

> Your header shows that you're
> using "charset=ISO-8859-1," the very same character set I
> recommended.

The charset (character encoding) isn't the issue really - as far as the
codes of ASCII characters are the same as in ASCII, as they are in all
charsets discussed here.

> So I don't understand your problem or objection to my
> recommendation.

What I objected to was saying utf-8 was the problem. The problem is the
use of characters outside the ASCII repertoire.

Followups trimmed (since this is even more off-topic on a.u.e.).

-- 
Yucca, http://www.cs.tut.fi/~jkorpela/


Relevant Pages

  • Re: what does "serialization" mean?
    ... Sorry eddie, but you're dead wrong there as usual. ... >>How about ASCII character 0xB0, ... > Totalitarians and Fascists are often self-appointed language police. ...
    (comp.programming)
  • Re: what does "serialization" mean?
    ... > attempt to present myself as an authority on any and every topic I have ... >> survived and EBCDIC did not because ASCII properly sequenced letters. ... > How about ASCII character 0xB0, ... >> must assert negative facts, for all he knows is there is no knowledge ...
    (comp.programming)
  • Re: Cohens paper on byte order
    ... I think you're using "ASCII" in a notional sense. ... a good reason to teach the *opposite* convention, ... Computers should be as easy to understand as is possible _without_ ... arithmetic on character strings ...
    (sci.crypt)
  • Re: Reading a file.
    ... your program will interpret them as ASCII. ... Bruce.Eitman AT EuroTech DOT com ... buffer is character values, then in memory ASCII values are displayed. ... DWORD d = GetLastError; ...
    (microsoft.public.windowsce.app.development)
  • Re: Get ASCII values for PC arrow keys?
    ... those responsible for standards usually do attempt ... ASCII is a character set, ... ISO/IEC registry for character sets for them to receive identifying ...
    (alt.comp.lang.learn.c-cpp)