Re: Arabic cursive in Unicode



On 21 Nov 2006, Danny wrote:

I'm trying to draw up a table of Arabic cursive characters for a text
editor: I want to take the raw data and translate into a sequence of
cursive variants.

What do you mean by "cursive characters", "cursive variants"?

An example: the letter Alef Maksura (0649) exists in an isolated and
final form at points FEEF and FEF0, but the initial and medial forms
are listed at FBE8 and FBE9 (with the complicated name ARABIC LETTER
UIGHUR KAZAKH KIRGHIZ ALEF MAKSURA INITIAL FORM). To me, this suggests
that standard Arabic includes only the first two forms, and the other
two only appear in a variant.

Never rely on the Unicode *names* for characters! They are never
changed and may be misleading. The most prominent example is the
"byte order mark" U+FEFF, which will be known forever as "zero-width
no-break space". Therefore, do not infer anything from the *name*
"alef maksura".

Second, do not rely on "compatibility characters" such as "Arabic
presentation forms". They exist mainly for compatibility with
older character sets. Never use them.

Think of your letters as follows:
U+0649 all four glyphs have no dots
U+064A all four glyphs have dots
U+06CC two glyphs have dots, two glyphs have no dots

However, most existing fonts supply only two glyphs for U+0649.
In theory, you should use U+0649 for Uighur, for example. But with
existing fonts, you need to take two glyphs from U+06CC.

.



Relevant Pages

  • Re: Arabic cursive in Unicode
    ... separate groups of characters: the ones that you below call "logical ... But one set of the allographs -- what Arabic grammars call "independent ... Isn't that what a grapheme is? ... And the Mac Character Palette, could just as well display only the character name in that range--or, it could display the isolated forms drawn from in the Presentation Forms ranges, even if no fonts had glyphs in the 0600 range. ...
    (sci.lang)
  • Re: If you could add anything you want
    ... The Japanese don't write their characters exactly the same way as the Chinese do and vice versa. ... Some people aren't too happy that the example glyphs are drawn the "wrong" way. ... There'd be no way to express /what/ the standard was standardising. ...
    (comp.lang.java.programmer)
  • X.EXEs virtual keyboard.
    ... // covering 1.3 thousand glyphs. ... SetTextColor(DC, * Hue); ... // If our surface of 100 monospaced characters is used up, ... Paint_Maybe { ...
    (microsoft.public.vc.ide_general)
  • Re: in search of font for Aleut linguistics
    ... It's problematic for us that so many far-out glyphs are being ... Consortium's unwillingness to add the precomposed characters needed ... for indigenous American languages and type designers' reluctance to ... linguists had consulted with some typographers, ...
    (comp.fonts)
  • Re: Arabic cursive in Unicode
    ... separate groups of characters: the ones that you below call "logical ... But one set of the allographs -- what Arabic grammars call "independent ... the "logical characters" have no need of any concrete form. ... A font must store all those glyphs, ...
    (sci.lang)