Re: Arabic cursive in Unicode



Danny wrote:
Hi

I'm trying to draw up a table of Arabic cursive characters for a text
editor: I want to take the raw data and translate into a sequence of
cursive variants. This is working fine, but I'm stuck with trying to
extract the exact set of characters needed. UnicodeData.txt includes a
complete list of characters, including all the variants of non-standard
Arabic and complex ligatures that don't exist in all fonts (we're using
Arial).

An example: the letter Alef Maksura (0649) exists in an isolated and
final form at points FEEF and FEF0, but the initial and medial forms
are listed at FBE8 and FBE9 (with the complicated name ARABIC LETTER
UIGHUR KAZAKH KIRGHIZ ALEF MAKSURA INITIAL FORM). To me, this suggests
that standard Arabic includes only the first two forms, and the other
two only appear in a variant.

Isn't the point of alef maksura--in Arabic, at least--that it's a special character that appears only at the end of a word? So an initial or medial form of alef maksura isn't an expected component of an Arabic-specific glyph inventory any more than an initial or medial ra or da would be, even if such variants exist(ed) in the Arabic-derived scripts of other languages.

But how am I supposed to get the data
that are standard? ArabicShaping.txt isn't much help because it lists
the character as dual-joining.
.



Relevant Pages

  • Re: Arabic cursive in Unicode
    ... I'm trying to draw up a table of Arabic cursive characters for a text ... I want to take the raw data and translate into a sequence of ... including all the variants of non-standard ...
    (sci.lang)
  • Re: Arabic and other languages
    ... > Arabic keyboard mapping and Word will insert the characters and words ... > long-hand connects the letters but doesn't change their shape much by ... > Arabic range of a font like Arial using the Mac's Character Palette). ... > with character variants. ...
    (microsoft.public.mac.office.word)
  • Re: Converting unsigned long to string in C
    ... Once C had made the decision that they wanted to support variants of the ... ISO-646 must contain the same set of characters as ASCII, ... trigraph equivalent, and in fact is relied upon for the trigraph ... int res1=0, res2; ...
    (comp.lang.c)
  • Arabic cursive in Unicode
    ... I'm trying to draw up a table of Arabic cursive characters for a text ... I want to take the raw data and translate into a sequence of ... including all the variants of non-standard ...
    (sci.lang)
  • Re: new be in Fortran
    ... Then the characters are valued as usual for Greek characters ... Only Revelations was known to be a form of Greek at source. ... The header already came in 6 variants: ... This Bible Code contains prophecies also. ...
    (comp.lang.fortran)

Quantcast