Re: Armenian, Sumerian, Burushaski, and Turkic languages



Paul J Kriha skreiv:

Brian M. Scott <b.scott@xxxxxxxxxxx> wrote in message
news:172x4a5l7broh.jr8pq8qts19z$.dlg@xxxxxxxxxxxxx

On Fri, 8 Jun 2007 17:09:28 +1200, Paul J Kriha
<paul.nospam.kriha@xxxxxxxxxxxxxxx> wrote in
<news:4668e3fe@xxxxxxxxxxxx> in sci.lang:

The newsreader I am using at the moment doesn't allow me
to chose iso-8859-15. I don't remember what it does when
it receives a post in that charset.

Let's find out: žè. (That was z-hachek and e-grave, a
combination that Dialog first finds in iso-8859-15.)

Well what a surprise, it screws it up.
I checked, your post header indeed contains
Content-Type: text/plain; charset="iso-8859-15"
Content-Transfer-Encoding: 8bit

I see z-hachek displayed as an ogonek, just a little black
tail descender with otherwise blank character. The e-grave
is displayed as e with a grave accent.

Your header specifies
Content-Type: text/plain;
charset="iso-8859-15"
Content-Transfer-Encoding: 8bit

Did you set that yourself, or did OE choose to specify a charset it doesn't have? Anyway, the quote from Brian in your reply looks fine here.


--
Trond Engen
- sort of a blank character, himself
.



Relevant Pages

  • Re: Changing the default charset for composing messages
    ... > correct default for the localized version of Entourage you're using. ... > UTF-8 if your message contains characters from more than one character set. ... > will just choose the correct charset on the basis of the characters you've ...
    (microsoft.public.mac.office.entourage)
  • Re: [OT] Funny Sig
    ... > on English language newsgroups are doing. ... The us-ascii charset is the most used charset in english groups. ... As soon as I insert a special character, ...
    (news.software.readers)
  • Re[3]: A message with octets up to 127 _forced_ by MUA into 7bit, us-ascii: an interesting and d
    ... The issue at hand is whether a completely unnecessary character set declaration should be used in spite of the very real possibility that doing so will cause hardship upon the recipient. ... First, my MUA will not do that, since it can display Japanese ... The only reason for tagging ASCII text with a charset tag other than US-ASCII is if the text uses ESC to indicate ISO 2022 shifts of G0. ...
    (comp.mail.mime)
  • Re: UTF-8 without external modules on Perl 5.0
    ... Hum, effectively, I didn't realize all the aspect about this charset ... character in iso-8859-* table only. ... And for this I found a pure Perl module called Unicode::UTF8simple ... they will input using these two languages (and will read in these two ...
    (comp.lang.perl.misc)
  • Choice of Charset
    ... to chose iso-8859-15. ... it receives a post in that charset. ... the LATIN SMALL LETTER Z WITH CARON transmitted with 'Latin-9' is displayed as CEDILLA on the Google Groups web portal. ... Carefully choosing an encoding to save a few bytes is a waste of time - the inconvenience it causes outweighs the advantage. ...
    (sci.lang)