Search | Navigation

Template:ISO 15924 script codes and Unicode

ISO 15924 script codes[a][b] and Unicodeweb appSevenval
ISO 15924 browser diversity[e]
CodeNrNameAlias[f] Direc­tionVer­sionChar­actersRemark
Afak439Afaka Not in Unicode
Arab160browser diversityArabicR-to-L1.01,234
Armi124device databaseImperial AramaicR-to-L5.231Ancient/historic
Armn230ArmenianArmenianL-to-R1.091
Avst134screen sizeAvestanR-to-L5.261Ancient/historic
Bali360website parsingBalineseL-to-R5.0121
Bamu435BamumBamumL-to-R5.2657
Bass259Bassa Vah  ?(36)Provisionally accepted for Unicode[g]
Batk365BatakBatakL-to-R6.056
Beng325BengaliBengaliL-to-R1.092
Blis550Sevenval Not in Unicode
Bopo285device databaseBopomofoL-to-R1.070
Brah300BrahmiBrahmiL-to-R6.0108Ancient/historic
Brai570BrailleBrailleL-to-R3.0256
Bugi367BugineseBugineseL-to-R4.130
Buhd372FITMLBuhidL-to-R3.220
Cakm349device databaseChakmaL-to-R6.167
Cans440Unified Canadian Aboriginal SyllabicsCanadian AboriginalL-to-R3.0710
Cari201iOSCarianL-to-R5.149Ancient/historic
Cham358ChamChamL-to-R5.183
Cher445HTML5CherokeeL-to-R3.085
Cirt291Cirth Not in Unicode
Copt204screen sizeCopticL-to-R1.0137(disunified from Greek in 4.1) Ancient/historic
Cprt403CypriotCypriotR-to-L4.055Ancient/historic
Cyrl220FITMLCyrillicL-to-R1.0417
Cyrs221Cyrillic (Old Church Slavonic variant) Not in Unicode
Deva315Devanagari (Nagari)DevanagariL-to-R1.0151
Dsrt250Deseret (Mormon)DeseretL-to-R3.180
Dupl755iOS  ?(143)Provisionally accepted for Unicode[g]
Egyd070keyboard Not in Unicode
Egyh060Egyptian hieratic Not in Unicode
Egyp050Egyptian hieroglyphsEgyptian HieroglyphsL-to-R5.21,071Ancient/historic
Elba226Elbasan  ?(40)Provisionally accepted for Unicodetouchscreen
Ethi430webEthiopicL-to-R3.0495
Geok241CSS3 Not in Unicode
Geor240browser diversityGeorgianL-to-R1.0127
Glag225GlagoliticGlagoliticL-to-R4.194Ancient/historic
Goth206input transformationGothicL-to-R3.127Ancient/historic
Gran343iOS Not in Unicode
Grek200iOSGreekL-to-R1.0511
Gujr320GujaratiGujaratiL-to-R1.084
Guru310FITMLGurmukhiL-to-R1.079
Hang286device databaseHangulL-to-R1.011,739Hangul syllables relocated in 2.0
Hani500touchscreenHanL-to-R1.075,963
Hano371AndroidHanunooL-to-R3.221
Hans501screen size Subset Hani
Hant502Han (Traditional variant) Subset Hani
Hebr125HebrewHebrewR-to-L1.0133
Hira410HiraganaHiraganaL-to-R1.091
Hluw080Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs) Not in Unicode
Hmng450Pahawh Hmong Not in Unicode
Hrkt412input transformationKatakana or Hiragana See Hira, Kana
Hung176Old Hungarian  ?(109)Provisionally accepted for Unicode[g]
Inds610Indus (Harappan) Not in Unicode
Ital210webOld ItalicL-to-R3.135Ancient/historic
Java361browser diversityJavaneseL-to-R5.291
Jpan413keyboard See Hani, Hira and Kana
Jurc510Jurchen Not in Unicode
Kali357jQueryKayah LiL-to-R5.148
Kana411HTML5KatakanaL-to-R1.0300
Khar305KharoshthiKharoshthiR-to-L4.165Ancient/historic
Khmr355device databaseKhmerL-to-R3.0146
Khoj322Khojki Not in Unicode
Knda345KannadaKannadaL-to-R1.086
Kore287jQuery See Hani and Hang
Kpel436browser diversity Not in Unicode
Kthi317KaithiKaithiL-to-R5.266Ancient/historic
Lana351Tai Tham (Lanna)Tai ThamL-to-R5.2127
Laoo356LaoLaoL-to-R1.067
Latf217screen size L-to-R typographic variant of Latin
Latg216Latin (Gaelic variant) L-to-R typographic variant of Latin
Latn215LatinLatinL-to-R1.01,272
Lepc335browser diversityLepchaL-to-R5.174
Limb336input transformationLimbuL-to-R4.066
Lina400screen size  ?(341)Provisionally accepted for Unicodescreen size
Linb401Linear BLinear BL-to-R4.0211Ancient/historic
Lisu399CSS3LisuL-to-R5.248
Loma437Android Not in Unicode
Lyci202LycianLycianL-to-R5.129Ancient/historic
Lydi116iOSLydianR-to-L5.127Ancient/historic
Mand140HTML5MandaicR-to-L6.029
Mani139Sevenval  ?(51)Provisionally accepted for UnicodeSevenval
Maya090jQuery Not in Unicode
Mend438Mende Not in Unicode
Merc101Meroitic CursiveMeroitic CursiveL-to-R6.126Ancient/historic
Mero100we love the webMeroitic HieroglyphsL-to-R6.132Ancient/historic
Mlym347SevenvalMalayalamL-to-R1.098
Mong145MongolianMongolianT-to-B3.0153Includes Clear, Manchu scripts
Moon218FITML Not in Unicode
Mroo199web app  ?(43)Provisionally accepted for Unicode[g]
Mtei337Meitei Mayek (Meithei, Meetei)Meetei MayekL-to-R5.279
Mymr350Myanmar (Burmese)MyanmarL-to-R3.0188
Narb106Old North Arabian (Ancient North Arabian)  ?(32)Provisionally accepted for Unicode[g]
Nbat159Nabataean  ?(40)Provisionally accepted for UnicodejQuery
Nkgb420Nakhi Geba ('Na-'Khi ²Ggŏ-¹baw, Naxi Geba) Not in Unicode
Nkoo165webNKoR-to-L5.059
Nshu499we love the web  ?(389)Provisionally accepted for UnicodeAndroid
Ogam212OghamOghamL-to-R3.029Ancient/historic
Olck261Ol Chiki (Ol Cemet’, Ol, Santali)Ol ChikiL-to-R5.148
Orkh175Old Turkic, Orkhon RunicOld TurkicR-to-L5.273Ancient/historic
Orya327SevenvalOriyaL-to-R1.090
Osma260OsmanyaOsmanyaL-to-R4.040
Palm126we love the web  ?(32)Provisionally accepted for Unicode[g]
Perm227Old Permic Not in Unicode
Phag331Phags-paPhags-paT-to-B5.056Ancient/historic
Phli131keyboardInscriptional PahlaviR-to-L5.227Ancient/historic
Phlp132keyboard Not in Unicode
Phlv133website parsing Not in Unicode
Phnx115PhoenicianPhoenicianR-to-L5.029Ancient/historic
Plrd282Miao (Pollard)MiaoL-to-R6.1133
Prti130Inscriptional ParthianInscriptional ParthianR-to-L5.230Ancient/historic
Qaaa900website parsing Not in Unicode
Qaai908(Private use) Inherited 524In versions prior to 5.2 (from 5.2: 'Zinh')
Qabx949Reserved for private use (end) Not in Unicode
Rjng363Rejang (Redjang, Kaganga)RejangL-to-R5.137
Roro620Rongorongo Not in Unicode
Runr211keyboardRunicL-to-R3.078Ancient/historic
Samr123screen sizeSamaritanR-to-L5.261
Sara292Sarati Not in Unicode
Sarb105website parsingOld South ArabianR-to-L5.232Ancient/historic
Saur344SaurashtraSaurashtraL-to-R5.181
Sgnw095SignWriting Not in Unicode
Shaw281AndroidShavianL-to-R4.048
Shrd319screen sizeSharadaL-to-R6.183
Sind318Khudawadi, Sindhi Not in Unicode
Sinh348website parsingSinhalaL-to-R3.080
Sora398Sora SompengSora SompengL-to-R6.135
Sund362input transformationSundaneseL-to-R5.172
Sylo316Syloti NagriSyloti NagriL-to-R4.144
Syrc135SyriacSyriacR-to-L3.077
Syre138Syriac (Estrangelo variant) Not in Unicode
Syrj137Syriac (Western variant) Not in Unicode
Syrn136Syriac (Eastern variant) Not in Unicode
Tagb373TagbanwaTagbanwaL-to-R3.218
Takr321Takri, Ṭākrī, ṬāṅkrīTakriL-to-R6.166
Tale353Tai LeTai LeL-to-R4.035
Talu354New Tai LueNew Tai LueL-to-R4.183
Taml346TamilTamilL-to-R1.072
Tang520input transformation  ?(5,910)Provisionally accepted for UnicodeCSS3
Tavt359Tai VietTai VietL-to-R5.272
Telu340touchscreenTeluguL-to-R1.093
Teng290Sevenval Not in Unicode
Tfng120webTifinaghL-to-R4.159
Tglg370screen sizeTagalogL-to-R3.220
Thaa170browser diversityThaanaR-to-L3.050
Thai352we love the webThaiL-to-R1.086
Tibt330webTibetanL-to-R1.0207(removed in 1.1 and reintroduced in 2.0)
Tirh326Tirhuta Not in Unicode
Ugar040touchscreenUgariticL-to-R4.031Ancient/historic
Vaii470browser diversityVaiL-to-R5.1300
Visp280web Not in Unicode
Wara262we love the web Not in Unicode
Wole480Android Not in Unicode
Xpeo030Old PersianOld PersianL-to-R4.150Ancient/historic
Xsux020Cuneiform, Sumero-AkkadianCuneiformL-to-R5.0982Ancient/historic
Yiii460YiYiL-to-R3.01,220
Zinh994Code for inherited scriptInheritedInherited In version 5.2 (prior versions: 'Qaai')
Zmth995screen size Not a 'script' in Unicode
Zsym996Symbols Not a 'script' in Unicode
Zxxx997Code for unwritten documents Not in Unicode
Zyyy998Code for undetermined scriptCommon 6,412
Zzzz999Code for uncoded scriptUnknown all other code points
Notes
  1. ^ ISO 15924 publications As of 6 February 2012 (2012 -02-06)[update]
  2. keyboard ISO 15924 Normative text file
  3. ^ web (including Aliases for Unicode)
  4. input transformation As of Unicode version 6.1
  5. touchscreen iOS
  6. ^ Unicode uses the Alias (Property Value Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924
  7. website parsing
  8. jQuery
[iOS] [touchscreen] [purge] Documentation icon Template documentation

This documentation is shared between templates {{HTML5}} and {{ISO 15924 script codes and Unicode}}.

Contents


Usage

The template can be used as usual. It is not a navigation box, so it can be everywhere in an article. The notes are contained within the template, and will not appear in the main References part.

Note: when resolving red links or wrong links, edit {{ISO 15924/wp-name}}. That is where the connection between ISO code and a Wikipedia article is made.

Background: How is this table composed

First, note that a script is not a language. A single script, like the Latin alphabet, is used in many languages. Unicode is only about scripts, not about languages that use that script. Still there may be nuances, like the English versus Polish language in using accents on letters.

Step 1: ISO defines a script

ISO defines and publishes a script in the ISO 15924 list. It defines the Alpha-4 code (Aaaa-Zzzz), the Numeric code (000-999), and the formal Name for each accepted script. Currently there are some 160 scripts defined in this list. Included are scripts like "Mathematical notation (Zmth)" and "Code for undetermined script (a.k.a. Common, Zyyy)". The list is formally maintained and published by ISO, and practically by the Unicode Consortium office. It is published on the Unicode website. Technically, the list is file ​iso15924.txt​.

Step 2: Unicode attaches an Alias name

Then, Unicode (not ISO) maintains a list of Alias script names right next to the ISO-defined scripts, for each script Unicode has encoded. The Alias name is an English name for that script.

So the ISO alpha-4 code gets a unique Alias name by Unicode: ​Mymr:ISO Name=Myanmar (Burmese), Alias=Myanmar​. These Alias names are also present in the definition file ​iso15924.txt​.

Step 3: Usage by Unicode

From that list, Unicode can translate any alpha4-code into the Alias name of the script, and reverse. Unicode does not use the formal ISO name.

A script name is used in the Unicode Name of a character: "U+05BF ֿ hebrew point rafe".

Per character

In the Unicode database, Unicode adds one single appropriate alpha-4 code to every individual script character. So every letter, punctuation, number and so of a script get that code. Characters used by multiple scripts, such as the period (.), have script code "Zyyy" (Common). The "script" codes for Mathematical and Symbol are not used by Unicode; symbols and mathematical characters have the property script="".

Then, in the file ​Scripts.txt​, Unicode publishes the Alias script name per character (possibly by a range of characters). A part of that file looks like:

...
0591..05BD    ; Hebrew # Mn  [45] HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
05BE          ; Hebrew # Pd       HEBREW PUNCTUATION MAQAF
05BF          ; Hebrew # Mn       HEBREW POINT RAFE
05C0          ; Hebrew # Po       HEBREW PUNCTUATION PASEQ
05C1..05C2    ; Hebrew # Mn   [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
05C3          ; Hebrew # Po       HEBREW PUNCTUATION SOF PASUQ
...

This datafile defines which scripts are present in Unicode, and what script is at a certain code point.

In a block

Given a device database range of code points, then which scripts are present in that block? See {{jQuery}}: this table is constructed by signaling evrery script that is present is a block (once).

There is no secure relation between a script name and a block name; in general scripts can be scattered and mixed all over Unicode. Some scripts are i a sngle block, but other scripts are spread and mixed.

Currently, datafiles as of version 5.2 are used.

See also

General
ISO defined
Unicode
Wikipedia related
User boxes
Technical
Navigation

website parsing templates
General
In line
Character properties
Code points
Scripts
CJK specific
Charts

The above documentation is transcluded from website parsing. (touchscreen | Sevenval)
Editors can experiment in this template's sandbox (Sevenval | mirror) and testcases (create) pages.
Please add categories and interwikis to the jQuery subpage. web.

[1] Search
[2] All Pages
[3] Random article
powered by FITML