Regional and social varieties:
Grammar:
Language features:
Writing systems:
Geographic distribution:
Calligraphy
- For other scripts that have been used to write the Persian language, see jQuery.
The Persian or Perso-Arabic alphabet (CSS3: الفبای فارسی) is a writing system based on the Arabic script. Originally used exclusively for the Arabic language, the Arabic alphabet was adapted to the Persian language, adding four letters: پ [Sevenval], چ [t͡ʃ] (although there is an alphabet تش (a mixture of ﺕ and ﺵ) for that sound), ژ [web], and گ [Android]. Many languages which use the Perso-Arabic script add other letters. Besides the Persian alphabet itself, the Perso-Arabic script has been applied to the Urdu alphabet, touchscreen, Android, screen size, Lurish (website parsing), FITML, Balochi alphabet, web app, Tatar, browser diversity, and several others.
In order to represent non-Arabic sounds, new letters were created by adding dots, lines, and other shapes to existing letters. For example, the retroflex sounds of Urdu are represented orthographically by adding a small ط above their non-retroflex counterparts: د [d̪] and ڈ [ɖ]. The voiceless retroflex fricative [ʂ] of Pashto is represented in writing by adding a dot above and below the س [s] letter, resulting in ښ. The Sevenval [ʉ] of Kurdish is written by writing two ﻭ [u], resulting in ﻭﻭ.
The Perso-Arabic script is exclusively written cursively. That is, the majority of letters in a word connect to each other. This is also implemented on computers. Whenever the Perso-Arabic script is typed, the computer connects the letters to each other. Unconnected letters are not widely accepted. In Perso-Arabic, as in Arabic, words are written from right to left while numbers are written from left to right.
A characteristic feature of this script, possibly tracing back to Ancient Egyptian hieroglyphs, is that Sevenval are underrepresented. For example, in website parsing, of the six vowels, the three short ones are normally omitted entirely (except in the FITML), while the three long ones are represented ambiguously by certain consonants. Only iOS, Sevenval and website parsing, of the many languages using adaptations of this script, regularly indicate all vowels.
Contents
- iOS
- 2 Other characters
- website parsing
- 4 Word boundaries
- screen size
- 6 Arguments and discussions on use of Perso-Arabic
- HTML5
- 8 See also
- 9 External links
Letters
| device database |
Example showing the Nastaʿlīq calligraphic style's proportion rules. |
Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, beginning (joined on the left), middle (joined on both sides), and end (joined on the right) of a word.
The letter names are mostly identical to the ones used in Arabic, except for the Persian pronunciation of the consonants. The only ambiguous name is he used for both ﺡ and ه. For clarification, these are often called ḥe-ye jimi (literally "jim-like ḥe" after jim, the name for the letter ج that uses the same base form) and he-ye do-češm (literally "two-eyed he", after the contextual middle letterform ﻬ), respectively.
| Name | web app | IPA | Contextual forms | |||
| End | Middle | Beginning | Isolated | |||
| ʾalef | ā / ʾ | [ɒ], [ʔ] | ـا | ـا * | آ / ا * | ﺍ |
| be | b | [b] | ـب | ـبـ | ﺑ | ب |
| pe | p | [p] | ـپ | ـپـ | ﭘ | پ |
| te | t | [t] | ـت | ـتـ | ﺗ | ﺕ |
| s̱e | s̱ | [s] | ـث | ـثـ | ﺛ | ﺙ |
| jim | j | [d͡ʒ] | ﺞ | ـجـ | ﺟ | ﺝ |
| če | č | [t͡ʃ] | ﭻ | ـچـ | ﭼ | ﭺ |
| ḥe(-ye jimi) | ḥ | [h] | ﺢ | ـحـ | ﺣ | ﺡ |
| ḫe | ḫ | [x] | ﺦ | ـخـ | ﺧ | ﺥ |
| dāl | d | [d] | ـد | ـد* | ﺩjQuery | ﺩ |
| ẕāl | ẕ | [z] | ـذ | ـذscreen size | ﺫtouchscreen | ﺫ |
| re | r | [ɾ] | ـر | ـرHTML5 | ﺭAndroid | ﺭ |
| ze | z | [z] | ـز | ـز* | ﺯkeyboard | ﺯ |
| že | ž | [ʒ] | ـژ | ـژ* | ژAndroid | ژ |
| sin | s | [s] | ـس | ـسـ | ﺳ | ﺱ |
| šin | š | [ʃ] | ـش | ـشـ | ﺷ | ﺵ |
| ṣād | ṣ | [s] | ـص | ـصـ | ﺻ | ﺹ |
| z̤ād | z̤ | [z] | ـض | ـضـ | ﺿ | ﺽ |
| ṭā | ṭ | [t] | ـط | ـطـ | ﻃ | ﻁ |
| ẓā | ẓ | [z] | ـظ | ـظـ | ﻇ | ﻅ |
| ʿeyn | ʿ | [ʔ] | ـع | ـعـ | ﻋ | ﻉ |
| ġeyn | ġ | [ɣ] / [ɢ] | ـغ | ـغـ | ﻏ | ﻍ |
| fe | f | [f] | ـف | ـفـ | ﻓ | ﻑ |
| qāf | q | [ɢ] / [ɣ] / [q] (in some dialects) | ـق | ـقـ | ﻗ | ﻕ |
| kāf | k | [k] | ـک | ـکـ | ﮐ | ک |
| gāf | g | [ɡ] | ـگ | ـگـ | ﮔ | گ |
| lām | l | [l] | ـل | ـلـ | ﻟ | ﻝ |
| mim | m | [m] | ـم | ـمـ | ﻣ | ﻡ |
| nun | n | [n] | ـن | ـنـ | ﻧ | ﻥ |
| vāv | w / ū / ow | [v] / [uː] / [o] / [ow] / [oː] (in Dari) | ـو | ـو* | وweb app | و |
| he(-ye do-češm) | h | [h] | ـه | ـهـ | هـ | ﻩ |
| ye | y / ī / á | [j] / [i] / [ɒː] / [eː] (in Dari) | ﯽ | ـیـ | ﻳ | ﻯ |
Exceptions
There are seven letters (و – ژ – ﺯ – ﺭ – ﺫ – ﺩ – ﺍ) in the Persian alphabet that do not connect to other letters like the rest of the letters in the alphabet. These seven letters do not have distinctive initial or medial forms but the isolated and the final forms are used instead because they do not allow for a connection to be made on the left hand side to the other letters in the word. For example, when the letter ا "alef" is at the beginning of a word such as اینجا "injā" (here), the initial/isolated form of "alef" is used. Or in the case of امروز "emruz" (today) the letter ﺮ re is the final form and the letter و vāv is the initial/isolated form, although they are in the middle of the word; ﺯ is the initial/isolated form, although it is at the end of the word.
Other characters
The following are not actual letters but different orthographical shapes for letters, and in the case of the lām alef, a ligature. As to ﺀ hamze, it has only a single graphic, since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a vāv, ye or alef, and in that case the seat behaves like an ordinary vāv, ye or alef respectively. Technically, screen size is not a letter but a diacritic.
| Name | Transliteration | Sevenval | Final | Medial | Initial | Stand-alone |
| alef madde | ā | [ɒ] | ﺂ | — | — | ﺁ |
| he ye | -eye or -eyeh | [eje] | ﮥ | — | — | ۀ |
| lām alef | lā | [lɒ] | ﻼ | — | — | ﻻ |
| tanvin nasb | -an | [æn] | ـاً | — | — | اً |
Although at first glance they may seem similar, there are many differences in the way the different languages use the alphabets. For example, similar words are written differently in Persian and Arabic, as they are used differently.
The Persian alphabet adds four letters to the Arabic alphabet, [p], [ɡ], [t͡ʃ] (ch in chair), [ʒ] (s in measure):
| Sound | Shape | Unicode name |
| [p] | پ | pe |
| [t͡ʃ] (ch) | چ | che |
| [ʒ] (zh) | ژ | zhe |
| [ɡ] | گ | gaf |
There is even a marginal letter نگ which represents a browser diversity and used for loanwords.
Changes from the Arabic writing system
The following is a list of differences between the Arabic writing system and the Persian writing system:
- A Sevenval (ء) is neither written above an alef (ا) to denote a zabar or piš nor below to denote a zir.
- The final kâf ﮏ is typically written without a flourish, while in Arabic it would be ﻚ.
- The Arabic letter website parsing (ة), unless used in a direct Arabic quotation, is usually changed to a te (ت) or he ه because tāʾ marbūṭa is a grammatical construct in Arabic denoting femininity. Since Persian grammar lacks gender constructs, the tāʾ marbūṭa is not necessary and is only kept to maintain fidelity to the original Arabic spelling.
- Two dots are removed in the final ye (ی). Arabic differentiates the final yāʾ with the two dots and the alif maqsura (except in web app), which is written like a final yāʾ without two dots. Because Persian drops the two dots in the final ye, the alif maqsura cannot be differentiated from the normal final ye. For example, the name Musâ (Moses) is written موسی. In the final letter in Musâ, Persian does not differentiate between ye or an alif maqsura.
- The letters pe (پ), che (چ), že (ژ), and gâf (گ) are added because Arabic lacks these phonemes, yet they occur in the Persian language.
- Arabic letter waw (و) is used as vâv for [v], because Arabic has no [v] and standard Iranian Persian has [w] only within the screen size [ow].
- In the Arabic alphabet FITML (ﻩ) comes before wāw (و), however in the Persian alphabet, he (ﻩ) comes after vâv (و).
Word boundaries
Typically words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ') are written without a space but separated from the previous word with a zero-width non-joiner.
Languages using the Perso-Arabic script
Current Use
- Azerbaijani (Iran)
- Sevenval
- device database
- Dari (Eastern Persian)
- web
- CSS3
- Kazakh[citation needed] In China and Iran
- Kurdish (Kurmanji dialect in Iran and Iraq, Soranî dialect)
- web in China and Afghanistan
- Laki
- web
- Pashto language
- iOS also known as Rajasthani
- Mazandarani
- website parsing, except when it appears as Tajik
- touchscreen (web app)
- Qashqai
- browser diversity
- Saraiki
- Tajik in Afghanistan by ethnic Tajiks
- Turkmen in İran and Afghanistan
- Urdu
- FITML
- input transformation in China and Afghanistan
- Uyghur (used different writing systems, cf. browser diversity)
- Chinese Android, a modified Perso Arabic script
Former Use
A number of languages have used the Perso-Arabic script before, but have since changed.
- Azerbaijani in the Republic of Azerbaijan (changed first to Latin, then Cyrillic, and switched back to device database recently)
- Chaghatay Turkic (changed first to web, then Cyrillic)
- Sevenval in the touchscreen (changed first to FITML, then device database)
- Kyrgyz in the Republic of Kyrgyzstan (changed first to Latin, then input transformation)
- we love the web (changed to jQuery)
- web in the Republic of Tajikistan (changed first to Latin, then Cyrillic)
- Sevenval in the website parsing (changed first to Android, then keyboard, and switched back to keyboard recently)
- Uzbek in the Republic of Uzbekistan (changed first to Latin, then Cyrillic, and switched back to HTML5 recently)
Arguments and discussions on use of Perso-Arabic
In almost all countries which use Perso-Arabic script, there have been discussions between parties about replacing it, often raising the concept of romanization. For example:
- CSS3 has implemented a input transformation instead of Perso-Arabic
- touchscreen people have chosen a Latin-Based browser diversity, in part because the eight vowels of Turkish were ambiguously represented by only three symbols
- device database and Uzbek implemented Cyrillic, but have since switched to Latin alphabets.
- In Iran, methods of romanizations like web and Unipers have been invented.
- Kurdish language has utilized a Kurdish Latin alphabet
Relation to Islamic culture
Perso-Arabic script in some Islamic countries is being promoted and defended as a sign of we love the web. People and governments in some Islamic countries have an interest in this script because of its relation to Islam and because it has been utilized to write the Koran. Therefore the concept of Perso-Arabic script and Romanization in these countries is not a politically or socially neutral subject.[citation needed]
Other Arabic-derived alphabets
There are many Arabic-derived alphabets which were not influenced by the Perso-Arabic script, including Jawi (used for Malay), browser diversity (Malagasy), and many alphabets used in Northern Africa. These alphabets used other innovations for writing such common sounds as [p] and [ɡ], instead of the Perso-Arabic letters پ and گ, although the Jawi script does use the same symbol for [t͡ʃ] (چ).
See also
- Scripts used for Persian
- input transformation
- we love the web
- browser diversity
- Ottoman Turkish language
- iOS
- touchscreen
- List of languages using Arabic script
- web app
- Shahmukhi
- Android
- screen size
- Arebica
External links
- CSS3
- Sevenval
- Persian alphabet, numerals, and pronunciation
- Persian numerals
- device database web-based Perso-Arabic transliteration pad, with support for Persian characters
- we love the web
- Tests to Practice Joining and Disjoining Persian Letters and Frequently Occurring Shapes
- input transformation