The Semitic languages are a branch of the Afroasiatic language family. They are spoken by more than 330 million people across much of West Asia,[note 1] the Horn of Africa,[note 2] and latterly North Africa,[note 3] Malta,[note 4] West Africa, and in large immigrant and expatriate communities in North America, Europe, and Australasia. The terminology was first used in the 1780s by members of the Göttingen school of history, who derived the name from Shem, one of the three sons of Noah in the Book of Genesis.
Semitic languages occur in written form from a very early historical date in West Asia, with East Semitic Akkadian and Eblaite texts (written in a script adapted from Sumerian cuneiform) appearing from the 30th century BCE and the 25th century BCE in Mesopotamia and the north eastern Levant respectively. The only earlier attested languages are Sumerian and Elamite (2800 BCE to 550 BCE), both language isolates, and Egyptian (a sister branch of the Afroasiatic family, related to the Semitic languages but not part of them). Amorite appeared in Mesopotamia and the northern Levant circa 2000 BC, followed by the mutually intelligible Canaanite languages (including Hebrew, Phoenician, Moabite, Edomite and Ammonite, and perhaps Ekronite, Amalekite and Sutean), the still spoken Aramaic, and Ugaritic during the 2nd millennium BC.
Most scripts used to write Semitic languages are abjads – a type of alphabetic script that omits some or all of the vowels, which is feasible for these languages because the consonants are the primary carriers of meaning in the Semitic languages. These include the Ugaritic, Phoenician, Aramaic, Hebrew, Syriac, Arabic, and ancient South Arabian alphabets. The Geʽez script, used for writing the Semitic languages of Ethiopia and Eritrea, is technically an abugida – a modified abjad in which vowels are notated using diacritic marks added to the consonants at all times, in contrast with other Semitic languages which indicate diacritics based on need or for introductory purposes. Maltese is the only Semitic language written in the Latin script and the only Semitic language to be an official language of the European Union.
The Semitic languages are notable for their nonconcatenative morphology. That is, word roots are not themselves syllables or words, but instead are isolated sets of consonants (usually three, making a so-called triliteral root). Words are composed out of roots not so much by adding prefixes or suffixes, but rather by filling in the vowels between the root consonants (although prefixes and suffixes are often added as well). For example, in Arabic, the root meaning "write" has the form k-t-b. From this root, words are formed by filling in the vowels and sometimes adding additional consonants, e.g. كتاب kitāb "book", كتب kutub "books", كاتب kātib "writer", كتّاب kuttāb "writers", كتب kataba "he wrote", يكتب yaktubu "he writes", etc.
Name and identification
The similarity of the Hebrew, Arabic and Aramaic languages has been accepted by all scholars since medieval times. The languages were familiar to Western European scholars due to historical contact with neighbouring Near Eastern countries and through Biblical studies, and a comparative analysis of Hebrew, Arabic, and Aramaic was published in Latin in 1538 by Guillaume Postel. Almost two centuries later, Hiob Ludolf described the similarities between these three languages and the Ethiopian Semitic languages.[page needed] However, neither scholar named this grouping as "Semitic".[page needed]
The term "Semitic" was created by members of the Göttingen School of History, and specifically by August Ludwig von Schlözer (1781). Johann Gottfried Eichhorn, (1787) coined the name "Semitic" in the late 18th century to designate the languages closely related to Arabic, Aramaic, and Hebrew. The choice of name was derived from Shem, one of the three sons of Noah in the genealogical accounts of the biblical Book of Genesis, or more precisely from the Koine Greek rendering of the name, Σήμ (Sēm). Eichhorn is credited with popularising the term, particularly via a 1795 article "Semitische Sprachen" (Semitic languages) in which he justified the terminology against criticism that Hebrew and Canaanite were the same language despite Canaan being "Hamitic" in the Table of Nations:
In the Mosaic Table of Nations, those names which are listed as Semites are purely names of tribes who speak the so-called Oriental languages and live in Southwest Asia. As far as we can trace the history of these very languages back in time, they have always been written with syllabograms or with alphabetic script (never with hieroglyphs or pictograms); and the legends about the invention of the syllabograms and alphabetic script go back to the Semites. In contrast, all so called Hamitic peoples originally used hieroglyphs, until they here and there, either through contact with the Semites, or through their settlement among them, became familiar with their syllabograms or alphabetic script, and partly adopted them. Viewed from this aspect too, with respect to the alphabet used, the name "Semitic languages" is completely appropriate.— Johann Gottfried Eichhorn, Semitische Sprachen, 1795
Previously these languages had been commonly known as the "Oriental languages" in European literature. In the 19th century, "Semitic" became the conventional name; however, an alternative name, "Syro-Arabian languages", was later introduced by James Cowles Prichard and used by some writers.
Ancient Semitic-speaking peoples
Semitic languages were spoken and written across much of the Middle East and Asia Minor during the Bronze Age and Iron Age, the earliest attested being the East Semitic Akkadian of Mesopotamia (Akkad, Assyria, Isin, Larsa and Babylonia) from the third millennium BC.
The origin of Semitic-speaking peoples is still under discussion. Several locations were proposed as possible sites of a prehistoric origin of Semitic-speaking peoples: Mesopotamia, the Levant, the Eastern Mediterranean region, the Arabian Peninsula, and North Africa. Some claim that the Semitic languages originated in the Levant around 3800 BC, and were introduced to the Horn of Africa at about 800 BC from the southern Arabian peninsula, and to North Africa via Phoenician colonists at approximately the same time. Others assign the arrival of Semitic speakers in the Horn of Africa to a much earlier date and say that the view that Proto-Semitic speaking groups in the Horn of Africa originated in Western Asia cannot be supported by archaeological, epigraphic and linguistic evidence. Some of these claim that Proto-Semitic separated from Afroasiatic in the Horn of Africa and that the original road of Semitic migration into the Near East was from Ethiopia.
The various extremely closely related and mutually intelligible Canaanite languages, a branch of the Northwest Semitic languages included Amorite, first attested in the 21st century BC, Edomite, Hebrew, Ammonite, Moabite, Phoenician (Punic/Carthaginian), Samaritan Hebrew, Ekronite, Amalekite and Sutean. They were spoken in what is today Israel, Syria, Lebanon, the Palestinian territories, Jordan, the northern Sinai peninsula, some northern and eastern parts of the Arabian peninsula, southwest fringes of Turkey, and in the case of Phoenician, coastal regions of Tunisia (Carthage), Libya, Algeria and parts of Morocco, Spain and possibly in Malta and other Mediterranean islands. Ugaritic, a Northwest Semitic language closely related to but distinct from the Canaanite group was spoken in the kingdom of Ugarit in north western Syria.
A hybrid Canaano-Akkadian language also emerged in Canaan (Israel, Jordan, Lebanon) during the 14th century BC, incorporating elements of the Mesopotamian East Semitic Akkadian language of Assyria and Babylonia with the West Semitic Canaanite languages.
Aramaic, a still living ancient Northwest Semitic language, first attested in the 12th century BC in the northern Levant, gradually replaced the East Semitic and Canaanite languages across much of the Near East, particularly after being adopted as the lingua franca of the vast Neo-Assyrian Empire (911–605 BC) by Tiglath-Pileser III during the 8th century BC, and being retained by the succeeding Neo-Babylonian and Achaemenid Empires.
The Chaldean language (not to be confused with Aramaic or its Biblical variant, sometimes referred to as Chaldean) was a Northwest Semitic language, possibly closely related to Aramaic, but no examples of the language remain, as after settling in south eastern Mesopotamia from the Levant during the 9th century BC, the Chaldeans appear to have rapidly adopted the Akkadian and Aramaic languages of the indigenous Mesopotamians.
Old South Arabian languages (classified as South Semitic and therefore distinct from the Central-Semitic Arabic) were spoken in the kingdoms of Dilmun, Meluhha, Sheba, Ubar, Socotra and Magan, which in modern terms encompassed part of the eastern coast of Saudi Arabia, and Bahrain, Qatar, Oman and Yemen. South Semitic languages are thought to have spread to the Horn of Africa circa 8th century BC where the Ge'ez language emerged (though the direction of influence remains uncertain).
The Akkadian-influenced Syriac, a 5th-century BC Mesopotamian (Assyrian), descendant of Aramaic used in Mesopotamia, northeastern Syria, and south east Anatolia, rose to importance as a literary language of early Christianity in the third to fifth centuries and continued into the early Islamic era.
The Arabic language, although originating in the Arabian Peninsula, first emerged in written form in the 1st to 4th centuries CE in the southern regions of The Levant. With the advent of the early Arab conquests of the seventh and eighth centuries, Classical Arabic eventually replaced many (but not all) of the indigenous Semitic languages and cultures of the Near East. Both the Near East and North Africa saw an influx of Muslim Arabs from the Arabian Peninsula, followed later by non-Semitic Muslim Iranian and Turkic peoples. The previously dominant Aramaic dialects maintained by the Assyrians, Babylonians and Persians gradually began to be sidelined, however descendant dialects of Eastern Aramaic (including the Akkadian influenced Assyrian Neo-Aramaic, Chaldean Neo-Aramaic, Turoyo and Mandaic) survive to this day among the Assyrians and Mandaeans of northern and southern Iraq, northwestern Iran, northeastern Syria and southeastern Turkey, with up to a million fluent speakers. Eastern Aramic is a recognized language in Iraq, furthermore, Mesopotamian Arabic is the most Aramaic-Syriac influenced dialect of Arabic, due to Aramaic-Syriac having originated in Mesopotamia. Meanwhile Western Aramaic is now only spoken by a few thousand Aramean Syriac Christians in western Syria. The Arabs spread their Central Semitic language to North Africa (Egypt, Libya, Tunisia, Algeria, Morocco and northern Sudan and Mauritania), where it gradually replaced Egyptian Coptic and many Berber languages (although Berber is still largely extant in many areas), and for a time to the Iberian Peninsula (modern Spain, Portugal and Gibraltar) and Malta.
With the patronage of the caliphs and the prestige of its liturgical status, Arabic rapidly became one of the world's main literary languages. Its spread among the masses took much longer, however, as many (although not all) of the native populations outside the Arabian Peninsula only gradually abandoned their languages in favour of Arabic. As Bedouin tribes settled in conquered areas, it became the main language of not only central Arabia, but also Yemen, the Fertile Crescent, and Egypt. Most of the Maghreb followed, specifically in the wake of the Banu Hilal's incursion in the 11th century, and Arabic became the native language of many inhabitants of al-Andalus. After the collapse of the Nubian kingdom of Dongola in the 14th century, Arabic began to spread south of Egypt into modern Sudan; soon after, the Beni Ḥassān brought Arabization to Mauritania. A number of Modern South Arabian languages distinct from Arabic still survive, such as Soqotri, Mehri and Shehri which are mainly spoken in Socotra, Yemen and Oman.
Meanwhile, the Semitic languages that had arrived from southern Arabia in the 8th century BC were diversifying in Ethiopia and Eritrea, where, under heavy Cushitic influence, they split into a number of languages, including Amharic and Tigrinya. With the expansion of Ethiopia under the Solomonic dynasty, Amharic, previously a minor local language, spread throughout much of the country, replacing both Semitic (such as Gafat) and non-Semitic (such as Weyto) languages, and replacing Ge'ez as the principal literary language (though Ge'ez remains the liturgical language for Christians in the region); this spread continues to this day, with Qimant set to disappear in another generation.
Arabic is currently the native language of majorities from Mauritania to Oman, and from Iraq to the Sudan. Classical Arabic is the language of the Quran. It is also studied widely in the non-Arabic-speaking Muslim world. The Maltese language is genetically a descendant of the extinct Siculo-Arabic, a variety of Maghrebi Arabic formerly spoken in Sicily. The modern Maltese alphabet is based on the Latin script with the addition of some letters with diacritic marks and digraphs. Maltese is the only Semitic official language within the European Union.
Successful as second languages far beyond their numbers of contemporary first-language speakers, a few Semitic languages today are the base of the sacred literature of some of the world's major religions, including Islam (Arabic), Judaism (Hebrew and Aramaic), churches of Syriac Christianity (Syriac) and Ethiopian and Eritrean Orthodox Christianity (Ge'ez). Millions learn these as a second language (or an archaic version of their modern tongues): many Muslims learn to read and recite the Qur'an and Jews speak and study Biblical Hebrew, the language of the Torah, Midrash, and other Jewish scriptures. Ethnic Assyrian followers of the Assyrian Church of the East, Chaldean Catholic Church, Ancient Church of the East, Assyrian Pentecostal Church, Assyrian Evangelical Church and Assyrian members of the Syriac Orthodox Church both speak Mesopotamian eastern Aramaic and use it also as a liturgical tongue. The language is also used liturgically by the primarily Arabic-speaking followers of the Maronite, Syriac Catholic Church and some Melkite Christians. Greek and Arabic are the main liturgical languages of Oriental Orthodox Christians in the Middle East, who compose the patriarchates of Antioch, Jerusalem and Alexandria. Mandaic is both spoken and used as a liturgical language by the Mandaeans.
Despite the ascendancy of Arabic in the Middle East, other Semitic languages still exist. Biblical Hebrew, long extinct as a colloquial language and in use only in Jewish literary, intellectual, and liturgical activity, was revived in spoken form at the end of the 19th century. Modern Hebrew is the main language of Israel, with Biblical Hebrew remaining as the language of liturgy and religious scholarship of Jews worldwide.
Ethnic groups, in particular the Assyrians, Kurdish Jews, and Gnostic Mandeans, continue to speak and write Mesopotamian Aramaic languages, particularly Neo-Aramaic languages descended from Syriac, in those areas roughly corresponding to Kurdistan (northern Iraq, northeast Syria, south eastern Turkey and northwestern Iran). Syriac language itself, a descendant of Eastern Aramaic languages (Mesopotamian Old Aramaic), is used also liturgically by the Syriac Christians throughout the area. Although the majority of Neo-Aramaic dialects spoken today are descended from Eastern varieties, Western Neo-Aramaic is still spoken in 3 villages in Syria.
In Arab-dominated Yemen and Oman, on the southern rim of the Arabian Peninsula, a few tribes continue to speak Modern South Arabian languages such as Mahri and Soqotri. These languages differ greatly from both the surrounding Arabic dialects and from the (unrelated but previously thought to be related) languages of the Old South Arabian inscriptions.
Historically linked to the peninsular homeland of Old South Arabian, of which only one language, Razihi, remains, Ethiopia and Eritrea contain a substantial number of Semitic languages; the most widely spoken are Amharic in Ethiopia, Tigre in Eritrea, and Tigrinya in both. Amharic is the official language of Ethiopia. Tigrinya is a working language in Eritrea. Tigre is spoken by over one million people in the northern and central Eritrean lowlands and parts of eastern Sudan. A number of Gurage languages are spoken by populations in the semi-mountainous region of central Ethiopia, while Harari is restricted to the city of Harar. Ge'ez remains the liturgical language for certain groups of Christians in Ethiopia and in Eritrea.
The phonologies of the attested Semitic languages are presented here from a comparative point of view. See Proto-Semitic language#Phonology for details on the phonological reconstruction of Proto-Semitic used in this article. The reconstruction of Proto-Semitic (PS) was originally based primarily on Arabic, whose phonology and morphology (particularly in Classical Arabic) is very conservative, and which preserves as contrastive 28 out of the evident 29 consonantal phonemes. with *s [s] and *š [ʃ] merging into Arabic /s/ ⟨س⟩ and *ś [ɬ] becoming Arabic /ʃ/ ⟨ش⟩.
|Obstruent||Stop||voiceless||*p [p]||*t [t]||*k [k]|
|emphatic||(pʼ)[a]||*ṭ [tʼ]||*q/ḳ [kʼ]||*ʼ,ˀ [ʔ]|
|voiced||*b [b]||*d [d]||*g [g]|
|Fricative||voiceless||*ṯ [θ]||*s [s]||*š [ʃ]||*ś [ɬ]||*ḫ [x~χ]||*ḥ [ħ]||*h [h]|
|emphatic||*ṱ[b]/θ̣/ẓ [θʼ]||*ṣ [sʼ]||*ṣ́/ḏ̣ [ɬʼ]||(xʼ~χʼ)[c]|
|voiced||*ḏ [ð]||*z [z]||*ġ/ǵ [ɣ~ʁ]||*ʻ,ˤ [ʕ]|
|Approximant||*w [w]||*y [j]||*l [l]|
|Nasal||*m [m]||*n [n]|
Note: the fricatives *s, *z, *ṣ, *ś, *ṣ́, *ṱ may also be interpreted as affricates (/t͡s/, /d͡z/, /t͡sʼ/, /t͡ɬ/, /t͡ɬʼ/, /t͡θʼ/), as discussed in Proto-Semitic language § Fricatives.
This comparative approach is natural for the consonants, as sound correspondences among the consonants of the Semitic languages are very straightforward for a family of its time depth. Sound shifts affecting the vowels are more numerous and, at times, less regular.
Each Proto-Semitic phoneme was reconstructed to explain a certain regular sound correspondence between various Semitic languages. Note that Latin letter values (italicized) for extinct languages are a question of transcription; the exact pronunciation is not recorded.
Most of the attested languages have merged a number of the reconstructed original fricatives, though South Arabian retains all fourteen (and has added a fifteenth from *p > f).
In Aramaic and Hebrew, all non-emphatic stops occurring singly after a vowel were softened to fricatives, leading to an alternation that was often later phonemicized as a result of the loss of gemination.
In languages exhibiting pharyngealization of emphatics, the original velar emphatic has rather developed to a uvular stop [q].
|*b||[b]||ب||b||/b/||b||/b/||b||𐎁||b||𐤁||b||b||ḇ, b5||ב||b5||/b/||/v/, /b/||ḇ, b5||/v/, /b/||𐡁||ܒ||ḇ, b5||በ||/b/|
|*g||[ɡ]||ج||ǧ||/ɟ ~ d͡ʒ/9||/d͡ʒ/11||ġ||/d͡ʒ/||g||𐎂||g||𐤂||g||g||ḡ, g5||ג||g5||/g/||/ɣ/, /g/||g5||/ɡ/||𐡂||ܓ||ḡ, g5||ገ||/ɡ/|
|*p||[p]||ف||p̄||/f/||f||/f/||p||𐎔||p||𐤐||p||p||p̄, p5||פ||p5||/p/||/f/, /p/||f, p5||/f/, /p/||𐡐||ܦ||p̄, p5||ፈ||/f/|
|*k||[k]||ك||k||/k/||k||/k/||k||𐎋||k||𐤊||k||k||ḵ, k5||כ||k5||/k/||/x/, /k/||ḵ, k5||/χ/, /k/||𐡊||ܟ||ḵ, k5||ከ||/k/|
|*ḳ||[kʼ]||ق||q||/g ~ q/9||/q/12||q||/ʔ ~ q/||q||𐎖||ḳ||𐤒||q||q||q||ק||q||/q/||/q/||q||/k/||𐡒||ܩ||q||ቀ||/kʼ/|
|*d||[d]||د||d||/d/||d||/d/||d||𐎄||d||𐤃||d||d||ḏ, d5||ד||d5||/d/||/ð/, /d/||dh, d5||/d/||𐡃||ܕ||ḏ, d5||ደ||/d/|
|*ḏ||[ð]||ذ||ḏ||/ð/||z||𐎏||ḏ > d||𐤆||z||z||z||ז||z||/z/||/z/||z||/z/||𐡆3, 𐡃||ܖ3, ܕ||ḏ3, d||ዘ||/z/|
|*s||[s]||س||s||/s/||s||/s/||s||𐎒||s||𐤎||ṡ||s||ṡ1||ס||s||/s/||/s/||s||/s/||𐡎||ܤ||s||ሰ||/s/||/s/, /ʃ/||/s/, /ʃ/|
|*ś||[ɬ]||ش||š||/ʃ/||x||/ʃ/||s1||שׂ1||ś1||/ɬ/||/s/||ś1||/s/||𐡔3, 𐡎||ܫ3, ܤ||ś3, s||ሠ||/ɬ/|
|*ṯ||[θ]||ث||ṯ||/θ/||t||/t/||𐎘||ṯ||š||שׁ||š||/ʃ/||/ʃ/||sh||/ʃ/||𐡔3, 𐡕||ܫ3, ܬ||ṯ3, t||ሰ||/s/|
|*t||[t]||ت||t||/t/||t||𐎚||t||𐤕||t||t||ṯ, t5||ת||t5||/t/||/θ/, /t/||th, t5||/t/||𐡕||ܬ||ṯ, t5||ተ||/t/|
|*ṱ||[θʼ]||ظ||ṱ||/ðˤ/||d||/d/||ṣ||𐎑||ẓ13 > ġ||𐤑||ṩ||ṣ||ṩ||צ||ṣ||/sˤ/||/sˤ/||ts||/ts/||𐡑3, 𐡈||ܨ3, ܛ||ṯʼ3, ṭ||ጸ||/tsʼ/,
|/tsʼ ~ sʼ/||/tsʼ ~ sʼ/,
|*ṣ́||[ɬʼ]||ض||s̭||/ɮˤ/||/dˤ/||d||/d/||𐡒3, 𐡏||ܩ3, ܥ||*ġʼ3, ʻ||ፀ||/ɬʼ/|
|*ġ||[ɣ]~[ʁ]||غ||ʻ̱||/ɣ ~ ʁ/||għ||/ˤː/||ḫ||𐎙||ġ,ʻ||𐤏||o̯||ʿ||o̯||ע2||ʻ2||/ʁ/||/ʕ/||ʻ2||/ʔ/, -,
|𐡏3||ܥ3||ġ3, ʻ||ዐ||/ʕ/||/ʔ/, –|
|*ʼ||[ʔ]||ء||ʼ||/ʔ/||–||–||–, ʾ||𐎀, 𐎛, 𐎜||ʼa, ʼi, ʼu10||𐤀||q̇||ʾ||q̇||א||ʼ||/ʔ/||/ʔ/||ʼ||/ʔ/, -||𐡀||ܐ||ʼ||አ||/ʔ/|
|*ḫ||[x]~[χ]||خ||h̭||/x ~ χ/||ħ||/ħ/||ḫ||𐎃||ḫ||𐤇||h||ḥ||h2||ח2||ḥ2||/χ/||/ħ/||ḫ, ḥ2||/χ/,
|𐡇3||ܟ3||ḫ3, ḥ||ኀ||/χ/||/ħ/, /x/||/h/, /ʔ/, –|
|*r||[ɾ]||ر||r||/r/||r||/r/||r||𐎗||r||𐤓||r||r||r||ר||r||/r/||/ʀ/, /r/, /ʀː/||r||/ʁ/||𐡓||ܪ||r||ረ||/r/|
|*w||[w]||و||w||/w/||w||/w/||w||𐎆||w||𐤅||w||w||w||ו||w||/w/||/w/||v, w||/v/, /w/||𐡅||ܘ||w||ወ||/w/|
Note: the fricatives *s, *z, *ṣ, *ś, *ṣ́, *ṱ may also be interpreted as affricates (/t͡s/, /d͡z/, /t͡sʼ/, /t͡ɬ/, /t͡ɬʼ/, /t͡θʼ/).
- Proto-Semitic *ś was still pronounced as [ɬ] in Biblical Hebrew, but no letter was available in the Early Linear Script, so the letter ש did double duty, representing both /ʃ/ and /ɬ/. Later on, however, /ɬ/ merged with /s/, but the old spelling was largely retained, and the two pronunciations of ש were distinguished graphically in Tiberian Hebrew as שׁ /ʃ/ vs. שׂ /s/ < /ɬ/.
- Biblical Hebrew as of the 3rd century BCE apparently still distinguished the phonemes ġ /ʁ/ and ḫ /χ/ from ʻ /ʕ/ and ḥ /ħ/, respectively, based on transcriptions in the Septuagint. As in the case of /ɬ/, no letters were available to represent these sounds, and existing letters did double duty: ח /χ/ /ħ/ and ע /ʁ/ /ʕ/. In both of these cases, however, the two sounds represented by the same letter eventually merged, leaving no evidence (other than early transcriptions) of the former distinctions.
- Although early Aramaic (pre-7th century BCE) had only 22 consonants in its alphabet, it apparently distinguished all of the original 29 Proto-Semitic phonemes, including *ḏ, *ṯ, *ṱ, *ś, *ṣ́, *ġ and *ḫ – although by Middle Aramaic times, these had all merged with other sounds. This conclusion is mainly based on the shifting representation of words etymologically containing these sounds; in early Aramaic writing, the first five are merged with z, š, ṣ, š, q, respectively, but later with d, t, ṭ, s, ʿ. (Also note that due to begadkefat spirantization, which occurred after this merger, OAm. t > ṯ and d > ḏ in some positions, so that PS *t,ṯ and *d, ḏ may be realized as either of t, ṯ and d, ḏ respectively.) The sounds *ġ and *ḫ were always represented using the pharyngeal letters ʿ ḥ, but they are distinguished from the pharyngeals in the Demotic-script papyrus Amherst 63, written about 200 BCE. This suggests that these sounds, too, were distinguished in Old Aramaic language, but written using the same letters as they later merged with.
- The earlier pharyngeals can be distinguished in Akkadian from the zero reflexes of *ḥ, *ʕ by e-coloring adjacent *a, e.g. pS *ˈbaʕal-um 'owner, lord' > Akk. bēlu(m).
- Hebrew and Aramaic underwent begadkefat spirantization at a certain point, whereby the stop sounds /b ɡ d k p t/ were softened to the corresponding fricatives [v ɣ ð x f θ] (written ḇ ḡ ḏ ḵ p̄ ṯ) when occurring after a vowel and not geminated. This change probably happened after the original Old Aramaic phonemes /θ, ð/ disappeared in the 7th century BCE, and most likely occurred after the loss of Hebrew /χ, ʁ/ c. 200 BCE.[note 5] It is known to have occurred in Hebrew by the 2nd century CE. After a certain point this alternation became contrastive in word-medial and final position (though bearing low functional load), but in word-initial position they remained allophonic. In Modern Hebrew, the distinction has a higher functional load due to the loss of gemination, although only the three fricatives /v χ f/ are still preserved (the fricative /x/ is pronounced /χ/ in modern Hebrew).
- In the Northwest Semitic languages, */w/ became */j/ at the beginning of a word, e.g. Hebrew yeled "boy" < *wald (cf. Arabic walad).
- There is evidence of a rule of assimilation of /j/ to the following coronal consonant in pre-tonic position,[clarification needed] shared by Hebrew, Phoenician and Aramaic.
- In Assyrian Neo-Aramaic, [ħ] is nonexistent. In general cases, the language would lack pharyngeal fricative [ʕ] (as heard in Ayin). However, /ʕ/ is retained in educational speech, especially among Assyrian priests.
- The palatalization of Proto-Semitic gīm /g/ to Arabic /d͡ʒ/ jīm, is most probably connected to the pronunciation of qāf /q/ as a /g/ gāf (this sound change also occurred in Yemenite Hebrew), hence in most of the Arabian peninsula (which is the homeland of the Arabic language) ج is jīm /d͡ʒ/ and ق is gāf /g/, except in western and southern Yemen and parts of Oman where ج is gīm /g/ and ق is qāf /q/.
- Ugaritic orthography indicated the vowel after the glottal stop.
- The Arabic letter jīm (ج) has three main pronunciations in Modern Standard Arabic. [d͡ʒ] in north Algeria, Iraq, also in most of the Arabian peninsula and as the predominant pronunciation of Literary Arabic outside the Arab world, [ʒ] occurs in most of the Levant and most North Africa; and [ɡ] is used in northern Egypt and some regions in Yemen and Oman. In addition to other minor allophones.
- The Arabic letter qāf (ق) has three main pronunciations in spoken varieties. [ɡ] in most of the Arabian Peninsula, Northern and Eastern Yemen and parts of Oman, Southern Iraq, Upper Egypt, Sudan, Libya, some parts of the Levant and to lesser extent in some parts (mostly rural) of Maghreb. [q] in most of Tunisia, Algeria and Morocco, Southern and Western Yemen and parts of Oman, Northern Iraq, parts of the Levant especially Druze dialects. [ʔ] in most of the Levant and Lower Egypt, as well as some North African towns such as Tlemcen and Fez. In addition to other minor allophones.
- ṱ can be written ẓ, and always is in the Ugaritic and Arabic contexts. In Ugaritic, sometimes assimilates to ġ, as in ġmʔ 'thirsty' (Arabic ẓmʔ, Hebrew ṣmʔ, but Ugaritic mẓmủ 'thirsty', root ẓmʔ, is also attested).
- Early Amharic might have had a different phonology.
- The pronunciations /ʕ/ and /ħ/ for ʿAyin and Ḥet, respectively, still occur among some older Mizrahi speakers, but for most modern Israelis, ʿAyin and Ḥet are realized as /ʔ, -/ and /χ ~ x/, respectively.
The following table shows the development of the various fricatives in Hebrew, Aramaic and Arabic through cognate words:
|*/ð/ *ḏ||*/ð/ ذ||*/d/ ד||*/z/ ז||ذهب
|*/z/1 *z||*/z/ ز||*/z/ ז||موازين
|*/s/ *s||*/s/ س
|*/s/ ס||*/s/ ס||سكين
|*/ɬ/ *ś||*/ʃ/ ش||*/s/ שׂ||*/s/ שׂ||عشر||עשׂר||עשׂר||'ten'|
|*/ʃ/ *š||*/s/ س||*/ʃ/ שׁ||*/ʃ/ שׁ||سنة
|*/θ/ *ṯ||*/θ/ ث||*/t/ ת||ثلاثة
|*/θʼ/1 *ṱ||*/ðˤ/ ظ||*/tʼ/ ט||*/sˤ~ts/1 צ||ظل
|*/ɬʼ/1 *ṣ́||*/dˤ/ ض||*/ʕ/ ע||أرض
|*/sʼ/1 *ṣ||*/sˤ/ ص||*/sʼ/ צ||صرخ
'water melon like plant'
|*/χ/ *ḫ||*/x~χ/ خ||*/ħ/ ח||*/ħ~χ/ ח||خمسة
|*/ħ/ *ḥ||*/ħ/ ح||ملح
|*/ʁ/ *ġ||*/ɣ~ʁ/ غ||*/ʕ/ ע||*/ʕ~ʔ/ ע||غراب
|*/ʕ/ *ʻ||*/ʕ/ ع||عبد
- possibly affricated (/dz/ /tɬʼ/ /ʦʼ/ /tθʼ/ /tɬ/)
Proto-Semitic vowels are, in general, harder to deduce due to the nonconcatenative morphology of Semitic languages. The history of vowel changes in the languages makes drawing up a complete table of correspondences impossible, so only the most common reflexes can be given:
|*a||a||a||a||ə||ā||a||ɛ||a, later ä||a, e, ē5|
|*u||u||u||u, o||ə||ō||o||o||ə, ʷə6||u|
|*ā||ā||ā||ā||ō[note 6]||ā later a||ā, ē|
|*ay||ay||ē, ay||BA, JA ay(i), ē,
WSyr. ay/ī & ay/ē
- in a stressed open syllable
- in a stressed closed syllable before a geminate
- in a stressed closed syllable before a consonant cluster
- when the proto-Semitic stressed vowel remained stressed
- pS *a,*ā > Akk. e,ē in the neighborhood of pS *ʕ,*ħ and before r.
- i.e. pS *g,*k,*ḳ,*χ > Ge'ez gʷ, kʷ,ḳʷ,χʷ / _u
The Semitic languages share a number of grammatical features, although variation — both between separate languages, and within the languages themselves — has naturally occurred over time.
The reconstructed default word order in Proto-Semitic is verb–subject–object (VSO), possessed–possessor (NG), and noun–adjective (NA). This was still the case in Classical Arabic and Biblical Hebrew, e.g. Classical Arabic رأى محمد فريدا ra'ā muħammadun farīdan. (literally "saw Muhammad Farid", Muhammad saw Farid). In the modern Arabic vernaculars, however, as well as sometimes in Modern Standard Arabic (the modern literary language based on Classical Arabic) and Modern Hebrew, the classical VSO order has given way to SVO. Modern Ethiopian Semitic languages follow a different word order: SOV, possessor–possessed, and adjective–noun; however, the oldest attested Ethiopian Semitic language, Ge'ez, was VSO, possessed–possessor, and noun–adjective. Akkadian was also predominantly SOV.
Cases in nouns and adjectives
The proto-Semitic three-case system (nominative, accusative and genitive) with differing vowel endings (-u, -a -i), fully preserved in Qur'anic Arabic (see ʾIʿrab), Akkadian and Ugaritic, has disappeared everywhere in the many colloquial forms of Semitic languages. Modern Standard Arabic maintains such case distinctions, although they are typically lost in free speech due to colloquial influence. An accusative ending -n is preserved in Ethiopian Semitic.[note 7] In the northwest, the scarcely attested Samalian reflects a case distinction in the plural between nominative -ū and oblique -ī (compare the same distinction in Classical Arabic). Additionally, Semitic nouns and adjectives had a category of state, the indefinite state being expressed by nunation.
Number in nouns
Semitic languages originally had three grammatical numbers: singular, dual, and plural. Classical Arabic still has a mandatory dual (i.e. it must be used in all circumstances when referring to two entities), marked on nouns, verbs, adjectives and pronouns. Many contemporary dialects of Arabic still have a dual, as in the name for the nation of Bahrain (baħr "sea" + -ayn "two"), although it is marked only on nouns. It also occurs in Hebrew in a few nouns (šana means "one year", šnatayim means "two years", and šanim means "years"), but for those it is obligatory. The curious phenomenon of broken plurals – e.g. in Arabic, sadd "one dam" vs. sudūd "dams" – found most profusely in the languages of Arabia and Ethiopia, may be partly of proto-Semitic origin, and partly elaborated from simpler origins.
Verb aspect and tense
All Semitic languages show two quite distinct styles of morphology used for conjugating verbs. Suffix conjugations take suffixes indicating the person, number and gender of the subject, which bear some resemblance to the pronominal suffixes used to indicate direct objects on verbs ("I saw him") and possession on nouns ("his dog"). So-called prefix conjugations actually takes both prefixes and suffixes, with the prefixes primarily indicating person (and sometimes number or gender), while the suffixes (which are completely different from those used in the suffix conjugation) indicate number and gender whenever the prefix does not mark this. The prefix conjugation is noted for a particular pattern of ʔ- t- y- n- prefixes where (1) a t- prefix is used in the singular to mark the second person and third-person feminine, while a y- prefix marks the third-person masculine; and (2) identical words are used for second-person masculine and third-person feminine singular. The prefix conjugation is extremely old, with clear analogues in nearly all the families of Afroasiatic languages (i.e. at least 10,000 years old). The table on the right shows examples of the prefix and suffix conjugations in Classical Arabic, which has forms that are close to Proto-Semitic.
In Proto-Semitic, as still largely reflected in East Semitic, prefix conjugations are used both for the past and the non-past, with different vocalizations. Cf. Akkadian niprus "we decided" (preterite), niptaras "we have decided" (perfect), niparras "we decide" (non-past or imperfect), vs. suffix-conjugated parsānu "we are/were/will be deciding" (stative). Some of these features, e.g. gemination indicating the non-past/imperfect, are generally attributed to Afroasiatic. Proto-Semitic had an additional form, the jussive, which was distinguished from the preterite only by the position of stress: the jussive had final stress while the preterite had non-final (retracted) stress.
The West Semitic languages significantly reshaped the system. The most substantial changes occurred in the Central Semitic languages (the ancestors of modern Hebrew, Arabic and Aramaic). Essentially, the old prefix-conjugated jussive or preterite became a new non-past (or imperfect), while the stative became a new past (or perfect), and the old prefix-conjugated non-past (or imperfect) with gemination was discarded. New suffixes were used to mark different moods in the non-past, e.g. Classical Arabic -u (indicative), -a (subjunctive), vs no suffix (jussive). (It is not generally agreed whether the systems of the various Semitic languages are better interpreted in terms of tense, i.e. past vs. non-past, or aspect, i.e. perfect vs. imperfect.) A special feature in classical Hebrew is the waw-consecutive, prefixing a verb form with the letter waw in order to change its tense or aspect. The South Semitic languages show a system somewhere between the East and Central Semitic languages.
Later languages show further developments. In the modern varieties of Arabic, for example, the old mood suffixes were dropped, and new mood prefixes developed (e.g. bi- for indicative vs. no prefix for subjunctive in many varieties). In the extreme case of Neo-Aramaic, the verb conjugations have been entirely reworked under Iranian influence.
Morphology: triliteral roots
All Semitic languages exhibit a unique pattern of stems called Semitic roots consisting typically of triliteral, or three-consonant consonantal roots (two- and four-consonant roots also exist), from which nouns, adjectives, and verbs are formed in various ways (e.g., by inserting vowels, doubling consonants, lengthening vowels or by adding prefixes, suffixes, or infixes).
For instance, the root k-t-b, (dealing with "writing" generally) yields in Arabic:
- katabtu كَتَبْتُ or كتبت "I wrote" (f and m)
- yuktab(u) يُكْتَبُ or يكتب "being written" (masculine)
- tuktab(u) تُكتَبُ or تكتب "being written" (feminine)
- yatakātabūn(a) يَتَكَاتَبُونَ or يتكاتبون "they write to each other" (masculine)
- istiktāb اِسْتِكْتابَ or استكتاب "causing to write"
- kitāb كِتابٌ or كتاب "book" (the hyphen shows end of stem before various case endings)
- kutayyib كُتَيِّبٌ or كتيب "booklet" (diminutive)
- kitābah كِتابةٌ or كتابة "writing"
- kuttāb كُتّابٌ or كتاب "writers" (broken plural)
- katabah كَتَبةٌ or كتبة "clerks" (broken plural)
- maktab مَكْتَبٌ or مكتب "desk" or "office"
- maktabah مَكْتَبةٌ or مكتبة "library" or "bookshop"
- maktūb مَكْتُوبٌ or مكتوب "written" (participle) or "postal letter" (noun)
- katībah كَتِيبةٌ or كتيبة "squadron" or "document"
- iktitāb اِكْتِتابٌ or اكتتاب "registration" or "contribution of funds"
- muktatib مُكتَتِبٌ or مكتتب "subscription"
and the same root in Hebrew: (A line under k and b mean a fricative, x for k and v for b.)
- kāṯaḇti כָּתַבְתִּי or כתבתי "I wrote"
- kattāḇ כַּתָּב or כתב "reporter" (m)
- katteḇeṯ כַּתָּבֶת or כתבת "reporter" (f)
- kattāḇā כַּתָּבָה or כתבה "article"
- miḵtāḇ מִכְתָּב or מכתב "postal letter"
- miḵtāḇā מִכְתָבָה or מכתבה "writing desk"
- kəṯōḇeṯ כְּתֹבֶת or כתבת "address"
- kəṯāḇ כְּתָב or כתב "handwriting"
- kāṯūḇ כָּתוּב or כתוב "written"
- hiḵtīḇ הִכְתִּיב or הכתיב "he dictated"
- hiṯkattēḇ הִתְכַּתֵּב or התכתב "he corresponded
- niḵtaḇ נִכְתָּב or נכתב "it was written" (m)
- kəṯīḇ כְּתִיב or כתיב "spelling" (m)
- taḵtīḇ תַּכְתִּיב or תכתיב "prescript" (m)
- məḵuttāḇ מְכוּתָּב or מכותב "addressee"
- kəṯubbā כְּתֻבָּה or כתבה "ketubah (a Jewish marriage contract)" (f)
In Tigrinya and Amharic, this root used to be used widely but is now seen as an Archaic form. Ethiopic-derived languages use different roots for things that have to do with writing (and in some cases counting) primitive root: ṣ-f and trilateral root stems: m-ṣ-f, ṣ-h-f, and ṣ-f-r are used. This roots also exists in other Semitic languages like (Hebrew: sep̄er "book", sōp̄er "scribe", mispār "number" and sippūr "story"). (this root also exists in Arabic and is used to form words with a close meaning to "writing", such as ṣaḥāfa "journalism", and ṣaḥīfa "newspaper" or "parchment"). Verbs in other non-Semitic Afroasiatic languages show similar radical patterns, but more usually with biconsonantal roots; e.g. Kabyle afeg means "fly!", while affug means "flight", and yufeg means "he flew" (compare with Hebrew, where hap̄lēḡ means "set sail!", hap̄lāḡā means "a sailing trip", and hip̄līḡ means "he sailed", while the unrelated ʕūp̄, təʕūp̄ā and ʕāp̄ pertain to flight).
Independent personal pronouns
|I||*ʔanāku,[note 8] *ʔaniya||anāku||أنا ʔanā||ʔanā, anā, ana, āni, āna, ānig||አነ ʔana||אנכי, אני ʔānōḵī, ʔănī||אנא ʔanā||ānā||jiena, jien|
|You (sg., masc.)||*ʔanka > *ʔanta||atta||أنت ʔanta||ʔant, ant, inta, inte, inti, int, (i)nta||አንተ ʔánta||אתה ʔattā||אנת ʔantā||āt, āty, āten||int, inti|
|You (sg., fem.)||*ʔanti||atti||أنت ʔanti||ʔanti, anti, inti, init (i)nti, intch||አንቲ ʔánti||את ʔatt||אנת ʔanti||āt, āty, āten||int, inti|
|He||*suʔa||šū||هو huwa, hū||huwwa, huwwe, hū||ውእቱ wəʔətu||הוא hū||הוא hu||owā||hu, huwa|
|She||*siʔa||šī||هي hiya, hī||hiyya, hiyye, hī||ይእቲ yəʔəti||היא hī||היא hi||ayā||hi, hija|
|We||*niyaħnū, *niyaħnā||nīnu||نحن naħnu||niħna, iħna, ħinna||ንሕነ ʔnəħnā||אנו, אנחנו ʔānū, ʔănaħnū||נחנא náħnā||axnan||aħna|
|You (dual)||*ʔantunā||أنتما ʔantumā||Plural form is used|
|They (dual)||*sunā[note 9]||*sunī(ti)||هما humā||Plural form is used|
|You (pl., masc.)||*ʔantunū||attunu||أنتم ʔantum, ʔantumu||ʔantum, antum, antu, intu, intum, (i)ntūma||አንትሙ ʔantəmu||אתם ʔattem||אנתן ʔantun||axtōxūn||intom|
|You (pl., fem.)||*ʔantinā||attina||أنتنّ ʔantunna||ʔantin, antin, ʔantum, antu, intu, intum, (i)ntūma||አንትን ʔantən||אתן ʔatten||אנתן ʔanten||axtōxūn||intom|
|They (masc.)||*sunū||šunu||هم hum, humu||hum, humma, hūma, hom, hinne(n)||እሙንቱ ʔəmuntu||הם, המה hēm, hēmmā||הנן hinnun||eni||huma|
|They (fem.)||*sinā||šina||هنّ hunna||hin, hinne(n), hum, humma, hūma||እማንቱ ʔəmāntu||הן, הנה hēn, hēnnā||הנן hinnin||eni||huma|
|One||*ʼaḥad-, *ʻišt-||ʔaħad, ʔiʃt||واحد، أحد waːħid-, ʔaħad-||אחד ʼeḥáḏ, ʔeˈχad||ʔḥd||xā||wieħed||አሐዱ ʾäḥädu|
|Two||*ṯin-ān (nom.), *ṯin-ayn (obl.), *kilʼ-||θinaːn, θinajn, kilʔ||اثنان iθn-āni (nom.), اثنين iθn-ajni (obj.), اثنتان fem. iθnat-āni, اثنتين iθnat-ajni||שנים ešnáyim ˈʃn-ajim, fem. שתים eštáyim ˈʃt-ajim||*ṯny||treh||tnejn||ክልኤቱ kəlʾetu|
|Three||*śalāṯ- > *ṯalāṯ-[note 10]||ɬalaːθ > θalaːθ||ثلاث θalaːθ-||fem. שלוש šālṓš ʃaˈloʃ||*ślṯ||ṭlā||tlieta||ሠለስቱ śälästu|
|Four||*ʼarbaʻ-||ʔarbaʕ||أربع ʔarbaʕ-||fem. ארבע ʼárbaʻ ˈʔaʁba||*ʼrbʻ||arpā||erbgħa||አርባዕቱ ʾärbaʿtu|
|Five||*ḫamš-||χamʃ||خمس χams-||fem. חמש ḥā́mēš ˈχameʃ||*ḫmš||xamšā||ħamsa||ኀምስቱ ḫämsətu|
|Six||*šidṯ-[note 11]||ʃidθ||ستّ sitt- (ordinal سادس saːdis-)||fem. שש šēš ʃeʃ||*šdṯ/šṯ||ëštā||sitta||ስድስቱ sədsətu|
|Seven||*šabʻ-||ʃabʕ||سبع sabʕ-||fem. שבע šéḇaʻ ˈʃeva||*šbʻ||šowā||sebgħa||ሰብዐቱ säbʿätu|
|Eight||*ṯamāniy-||θamaːnij-||ثماني θamaːn-ij-||fem. שמונה šəmṓneh ʃˈmone||*ṯmny/ṯmn||*tmanyā||tmienja||ሰማንቱ sämantu|
|Nine||*tišʻ-||tiʃʕ||تسع tisʕ-||fem. תשע tḗšaʻ ˈtejʃa||*tšʻ||*učā||disgħa||ተስዐቱ täsʿätu|
|Ten||*ʻaśr-||ʕaɬr||عشر ʕaʃ(a)r-||fem. עשר ʻéśer ˈʔeseʁ||*ʻśr||*uṣrā||għaxra||ዐሠርቱ ʿäśärtu|
These are the basic numeral stems without feminine suffixes. Note that in most older Semitic languages, the forms of the numerals from 3 to 10 exhibit polarity of gender (also called "chiastic concord" or "reverse agreement"), i.e. if the counted noun is masculine, the numeral would be feminine and vice versa.
Due to the Semitic languages' common origin, they share some words and roots. Others differ. For example:
|heart||*lib(a)b-||libb-||lubb-, (qalb-)||lebb-āʼ||lëbā||lëḇ, lëḇāḇ||ləbb||ḥa-wbēb||ilbieba, (qalb)|
|house||*bayt-||bītu, bētu||bayt-, (dār-)||bayt-āʼ||bētā||báyiṯ||bet||beyt, bêt||bejt, (dar)|
|water||*may-/*māy-||mû (root *mā-/*māy-)||māʼ-/māy||mayy-āʼ||mēyā||máyim||māy||ḥə-mō||ilma|
Terms given in brackets are not derived from the respective Proto-Semitic roots, though they may also derive from Proto-Semitic (as does e.g. Arabic dār, cf. Biblical Hebrew dōr "dwelling").
Sometimes, certain roots differ in meaning from one Semitic language to another. For example, the root b-y-ḍ in Arabic has the meaning of "white" as well as "egg", whereas in Hebrew it only means "egg". The root l-b-n means "milk" in Arabic, but the color "white" in Hebrew. The root l-ḥ-m means "meat" in Arabic, but "bread" in Hebrew and "cow" in Ethiopian Semitic; the original meaning was most probably "food". The word medina (root: d-y-n/d-w-n) has the meaning of "metropolis" in Amharic, "city" in Arabic and Ancient Hebrew, and "State" in Modern Hebrew.
Of course, there is sometimes no relation between the roots. For example, "knowledge" is represented in Hebrew by the root y-d-ʿ, but in Arabic by the roots ʿ-r-f and ʿ-l-m and in Ethiosemitic by the roots ʿ-w-q and f-l-ṭ.
For more comparative vocabulary lists, see Wiktionary appendices:
There are six fairly uncontroversial nodes within the Semitic languages: East Semitic, Northwest Semitic, North Arabian, Old South Arabian (also known as Sayhadic), Modern South Arabian, and Ethiopian Semitic. These are generally grouped further, but there is ongoing debate as to which belong together. The classification based on shared innovations given below, established by Robert Hetzron in 1976 and with later emendations by John Huehnergard and Rodgers as summarized in Hetzron 1997, is the most widely accepted today. In particular, several Semiticists still argue for the traditional (partially nonlinguistic) view of Arabic as part of South Semitic, and a few (e.g. Alexander Militarev or the German-Egyptian professor Arafa Hussein Mustafa,) see the South Arabian languages,[clarification needed] as a third branch of Semitic alongside East and West Semitic, rather than as a subgroup of South Semitic. However, a new classification groups Old South Arabian as Central Semitic instead.
Roger Blench notes, that the Gurage languages are highly divergent and wonders whether they might not be a primary branch, reflecting an origin of Afroasiatic in or near Ethiopia. At a lower level, there is still no general agreement on where to draw the line between "languages" and "dialects" – an issue particularly relevant in Arabic, Aramaic and Gurage – and the strong mutual influences between Arabic dialects render a genetic subclassification of them particularly difficult.
A computational phylogenetic analysis by Kitchen et al. (2009), considers the Semitic languages to have originated in the Levant about 5,750 years ago during the Early Bronze Age, with early Ethiosemitic originating from southern Arabia approximately 2,800 years ago. Evidence for gene movements consistent with this were found in Almarri et al. (2021).
- East Semitic (†)
- West Semitic
The following is a list of some modern and ancient Semitic-speaking peoples and nations:
- Ammonite speakers of Ammon
- Amorites – 20th century BC
- Ancient North Arabian-speaking bedouins
- Arameans – 16th to 8th centuries BC / Akhlames (Ahlamu) 14th century BC.
- Canaanite-speaking nations of the early Iron Age:
- Chaldea – appeared in southern Mesopotamia c. 1000 BC and eventually disappeared into the general Babylonian population.
- Hebrews/Israelites – founded the nation of Israel which later split into the Kingdoms of Israel and Judah. The remnants of these people became the Jews and the Samaritans.
- Phoenicia – founded Mediterranean colonies including Tyre, Sidon and ancient Carthage. The remnants of these people became the modern inhabitants of Lebanon.
- Ugarit, 14th to 12th centuries BC
- Nasrani (Syrian Christian)
- Akkadian Empire – ancient Semitic speakers moved into Mesopotamia in the fourth millennium BC and settled among the local peoples of Sumer.
- Babylonian Empire
- Assyrian Empire
- Ebla – 23rd century BC
- Kingdom of Aksum – 4th century BC to 7th century AD
- Amhara people
- Argobba people
- Dahalik people
- Gurage people
- Harari people
- Mehri people
- Old South Arabian-speaking peoples
- Sabaeans of Yemen – 9th to 1st centuries BC
- Silt'e people
- Tigrigna People
- Tigray people
- Tigre people
- Zay people
- Arabic is one of the world's largest, spoken natively by about 300 million speakers, and as a second language by perhaps another 60 million.
- Amharic has perhaps fifteen million speakers, in Africa probably fewer than only Arabic, Swahili, Hausa, and Oromo, and is the second most populous Semitic language, after just Arabic. It is the lingua franca and constitutionally recognized national language of Ethiopia, and the national language of instruction of Ethiopian public education in the primary grades. 
- Tigrinya, not to be confused with the related but distinct language Tigre, is, like Amharic, a northern Ethiopian Semitic language, is spoken as a native language by the overwhelming majority of the population in the Tigre province of Ethiopia and in the highland part of Eritrea (the provinces of Akkele Guzay, Serae and Hamasien, where the capital of the state, Asmara, is situated). Outside of this area Tigrinya is also spoken in the Tambien and Wolqayt historical districts (Ethiopia) and in the administrative districts of Massara and Keren (Eritrea), these being respectively the southern and northern limits of its expansion. The number of speaker of Tigrinya has been estimated at 4 million in 1995; 1.3 million of them live in Eritrea (around 50 percent of the population of the country), in 2008 by an estimated 5 million. Hebrew speaking about ~5 million native/L1 speakers, Gurage has around 1.5 million speakers, Tigre has c. ~1.05 million speakers, Aramaic is spoken by around 575,000 to 1 million largely Assyrian speakers).
- Maltese has around 483,000 speakers,
- According to the generally accepted view, it is unlikely that begadkefat spirantization occurred before the merger of /χ, ʁ/ and /ħ, ʕ/, or else [x, χ] and [ɣ, ʁ] would have to be contrastive, which is cross-linguistically rare. However, Blau argues that it is possible that lenited /k/ and /χ/ could coexist even if pronounced identically, since one would be recognized as an alternating allophone (as apparently is the case in Nestorian Syriac).
- see Canaanite shift
- "In the historically attested Semitic languages, the endings of the singular noun-flexions survive, as is well known, only partially: in Akkadian and Arabic and Ugaritic and, limited to the accusative, in Ethiopic."
- While some believe that *ʔanāku was an innovation in some branches of Semitic utilizing an "intensifying" *-ku, comparison to other Afro-Asiatic 1ps pronouns (e.g. 3nk, Coptic anak, anok, proto-Berber *ənakkʷ) suggests that this goes further back.
- The Akkadian form is from Sargonic Akkadian. Among the Semitic languages, there are languages with /i/ as the final vowel (this is the form in Mehri). For a recent discussion concerning the reconstruction of the forms of the dual pronouns, see Bar-Asher, Elitzur. 2009. "Dual Pronouns in Semitics and an Evaluation of the Evidence for their Existence in Biblical Hebrew," Ancient Near Eastern Studies 46: 32–49
- This root underwent regressive assimilation.[page needed] This parallels the non-adjacent assimilation of *ś... > *š...š in proto-Canaanite or proto-North-West-Semitic in the roots *śam?š > *šamš 'sun' and *śur?š > *šurš 'root'. The form *ṯalāṯ- appears in most languages (e.g. Aramaic, Arabic, Ugaritic), but the original form ślṯ appears in the Old South Arabian languages, and a form with s < *ś (rather than š < *ṯ) appears in Akkadian.
- This root was also assimilated in various ways. For example, Hebrew reflects *šišš-, with total assimilation; Arabic reflects *šitt- in cardinal numerals, but less assimilated *šādiš- in ordinal numerals. Epigraphic South Arabian reflects original *šdṯ; Ugaritic has a form ṯṯ, in which the ṯ has been assimilated throughout the root.[page needed]
- Owens 2013, p. 2.
- Hudson & Kogan 1997, p. 457.
- Hudson & Kogan 1997, p. 424; Austin 2008, p. 74
- Kuntz 1981, p. 25.
- Ruhlen 1991.
- Kiraz 2001, p. 25; Baasten 2003, p. 67
- Kitto 1845, p. 192.
- Baasten 2003, p. 68.
- Kiraz 2001, p. 25.
- Baasten 2003, p. 69.
- Eichhorn 1794, pp. 773–6; Baasten 2003, p. 69
- Kiraz 2001, p. 25; Kitto 1845, p. 192
-  Archived 2020-07-31 at the Wayback Machine Andrew George, "Babylonian and Assyrian: A History of Akkadian", In: Postgate, J. N., (ed.), Languages of Iraq, Ancient and Modern. London: British School of Archaeology in Iraq, pp. 37.
- Kitchen, Ehret & Assefa 2009, pp. 2703–10.
- "Semite". Encyclopædia Britannica. Retrieved 24 March 2014.
- Phillipson 2012, p. 11.
- Hodgson, Jason A.; Mulligan, Connie J.; Al-Meeri, Ali; Raaum, Ryan L. (2014). "Early Back-to-Africa Migration into the Horn of Africa". PLOS Genetics. 10 (6): e1004393. doi:10.1371/journal.pgen.1004393. ISSN 1553-7404. PMC 4055572. PMID 24921250.
- Levine 2000, p. 28.
- Brandão 2020, p. 23. sfn error: no target: CITEREFBrandão2020 (help)
- Izre'el 1987c, p. 4.
- Waltke & O'Connor 1990, p. 8.
- Brock 1998, p. 708.
- Harrak 1992, pp. 209–14.
- Afsaruddin & Zahniser 1997, p. 464; Smart 2013, p. 253; Sánchez 2013, p. 129
- Nebes 2005, p. 335.
- Versteegh 1997, p. 13.
- Kogan (2011), p. 54.
- Kogan 2012, pp. 54–151.
- Watson 2002, p. 13.
- Bekins, Peter (12 September 2008). "Old Aramaic (c. 850 to c. 612 BCE)". Retrieved 22 August 2011.
- Harrison, Shelly. "LIN325: Introduction to Semitic Languages. Common Consonant Changes" (PDF). Archived from the original (PDF) on 21 August 2006. Retrieved 25 June 2006.
- Kaufman, Stephen (1997), "Aramaic", in Hetzron, Robert (ed.), The Semitic Languages, Routledge, pp. 117–119.
- Dolgopolsky 1999, p. 35.
- Dolgopolsky 1999, p. 72.
- Blau 2010, p. 56.
- Dolgopolsky 1999, p. 73.
- Blau (2010:78–81)
- Garnier, Romain; Jacques, Guillaume (2012). "A neglected phonetic law: The assimilation of pretonic yod to a following coronal in North-West Semitic". Bulletin of the School of Oriental and African Studies. 75 (1): 135–145. CiteSeerX 10.1.1.395.1033. doi:10.1017/s0041977x11001261. S2CID 16649580.
- Brock, Sebastian (2006). An Introduction to Syriac Studies. Piscataway, NJ: Gorgias Press. ISBN 1-59333-349-8.
- Dolgopolsky 1999, pp. 85–86.
- Greenberg 1999, p. 157.
- Moscati 1958, pp. 142–43.
- Hetzron 1997, p. 123.
- "Semitic languages | Definition, Map, Tree, Distribution, & Facts". Encyclopedia Britannica. Retrieved 23 January 2020.
- Hetzron, Kaye & Zuckermann 2018, p. 568.
- Dolgopolsky 1999, pp. 10–11.
- Weninger, Stefan (2011). "Reconstructive Morphology". In Semitic languages: an international handbook, Stefan Weninger, ed. Berlin: Walter de Gruyter. p. 166.
- Lipiński 2001.
- Dolgopolsky 1999, pp. 61–62.
- Müller 1995, pp. 261–71; Coghill 2016[page needed]
- Hackett 2006, pp. 929–35.
- Almarri, Mohamed A.; Haber, Marc; Lootah, Reem A.; Hallast, Pille; Turki, Saeed Al; Martin, Hilary C.; Xue, Yali; Tyler-Smith, Chris (2020). "The Genomic History of the Middle East". Cell. 184 (18): 4612–4625.e14. bioRxiv 10.1101/2020.10.18.342816. doi:10.1016/j.cell.2021.07.013. PMC 8445022. PMID 34352227.
- "Aramaean – Britannica Online Encyclopedia". Britannica.com. Retrieved 27 January 2013.
- "Akhlame – Britannica Online Encyclopedia". Britannica.com. Retrieved 27 January 2013.
- "Mesopotamian religion – Britannica Online Encyclopedia". Britannica.com. Retrieved 27 January 2013.
- "Akkadian language – Britannica Online Encyclopedia". Britannica.com. Retrieved 27 January 2013.
- Afsaruddin, Asma; Zahniser, A. H. Mathias (1997). Humanism, Culture, and Language in the Near East: Studies in Honor of Georg Krotkoff. Winona Lake, Ind.: Penn State University Press. doi:10.5325/j.ctv1w36pkt. ISBN 978-1-57506-020-0. JSTOR 10.5325/j.ctv1w36pkt.
- Austin, Peter K., ed. (2008). One Thousand Languages: Living, Endangered, and Lost. Berkeley: University of California Press. ISBN 978-0-520-25560-9.
- Baasten, Martin F. J. (2003). "A Note on the History of 'Semitic'". In Baasten, M. F. J.; Van Peursen, W. Th. (eds.). Hamlet on a Hill: Semitic and Greek Studies Presented to Professor T. Muraoka on the Occasion of His Sixty-fifth Birthday. Peeters. pp. 57–73. ISBN 90-429-1215-4.
- Bennett, Patrick R. (1998). Comparative Semitic Linguistics: A Manual. Winona Lake, Indiana: Eisenbrauns. ISBN 1-57506-021-3.
- Blau, Joshua (2010). Phonology and Morphology of Biblical Hebrew. Winona Lake, Indiana: Eisenbrauns. ISBN 978-1-57506-129-0.
- Coghill, Eleanor (2016). The Rise and Fall of Ergativity in Aramaic: Cycles of Alignment Change. Oxford: Oxford University Press. ISBN 978-0-19-872380-6.
- Davies, John (1854). "On the Semitic Languages, and their relations with the Indo-European Class. Pt I. On the Nature and Development of Semitic Roots". Transactions of the Philological Society (10).
- Davies, John (1854). "On the Semitic Languages, and their relations with the Indo-European Class. Pt II. On the Connection of Semitic Roots with corresponding forms in the Indo-European Class of Languages". Transactions of the Philological Society (13).
- Dolgopolsky, Aron (1999). From Proto-Semitic to Hebrew. Milan: Centro Studi Camito-Semitici di Milano.
- Eichhorn, Johann Gottfried (1794). Allgemeine Bibliothek der biblischen Literatur [General Library of Biblical Literature] (in German). Vol. 6.
- Brock, Sebastian (1998). "Syriac Culture, 337–425". In Cameron, Averil; Garnsey, Peter (eds.). The Cambridge Ancient History. Vol. 13: The Late Empire, A.D. 337–425. Cambridge: Cambridge University Press. pp. 708–719. ISBN 0-521-85073-8.
- Greenberg, Joseph H. (1999). "The Diachronic Typological Approach to Language". In Shibatani, Masayoshi; Bynon, Theodora (eds.). Approaches to Language Typology. Oxford: Oxford University Press. pp. 145–166. ISBN 0-19-823866-5.
- Bergsträsser, Gotthelf (1995). Introduction to the Semitic Languages: Text Specimens and Grammatical Sketches. Translated by Daniels, Peter T. Winona Lake, Indiana: Eisenbrauns. ISBN 0-931464-10-2.
- Garbini, Giovanni (1984). Le lingue semitiche: studi di storia linguistica [Semitic languages: studies of linguistic history] (in Italian). Naples: Istituto Orientale.
- Garbini, Giovanni; Durand, Olivier (1994). Introduzione alle lingue semitiche [Introduction to Semitic languages] (in Italian). Brescia: Paideia.
- Goldenberg, Gideon (2013). Semitic Languages: Features, Structures, Relations, Processes. Oxford. ISBN 978-0-19-964491-9.
- Hackett, Jo Ann (2006). "Semitic Languages". In Keith Brown; Sarah Ogilvie (eds.). Concise Encyclopedia of Languages of the World. Elsevier. pp. 929–935. ISBN 9780080877754 – via Google Books.
- Harrak, Amir (1992). "The ancient name of Edessa". Journal of Near Eastern Studies. 51 (3): 209–214. doi:10.1086/373553. JSTOR 545546. S2CID 162190342.
- Hetzron, Robert (1997). The Semitic Languages. Routledge. ISBN 978-0-415-05767-7.
- Hetzron, Robert; Kaye, Alan S.; Zuckermann, Ghil'ad (2018). "Semitic Languages". In Comrie, Bernard (ed.). The World's Major Languages (3rd ed.). London: Routledge. pp. 568–576. doi:10.4324/9781315644936. ISBN 978-1-315-64493-6.
- Hudson, Grover; Kogan, Leonid E. (1997). "Amharic and Argobba". In Hetzron, Robert (ed.). The Semitic Languages. New York: Routledge. pp. 457–485. ISBN 0-415-05767-1.
- Izre'el, Shlomo (1987c), Canaano-Akkadian (PDF)
- Kiraz, George Anton (2001). Computational Nonlinear Morphology: With Emphasis on Semitic Languages. Cambridge University Press. p. 25. ISBN 9780521631969.
The term "Semitic" is borrowed from the Bible (Gene. x.21 and xi.10–26). It was first used by the Orientalist A. L. Schlözer in 1781 to designate the languages spoken by the Aramæans, Hebrews, Arabs, and other peoples of the Near East (Moscati et al., 1969, Sect. 1.2). Before Schlözer, these languages and dialects were known as Oriental languages.
- Kitchen, A.; Ehret, C.; Assefa, S. (2009). "Bayesian phylogenetic analysis of Semitic languages identifies an Early Bronze Age origin of Semitic in the Near East". Proceedings. Biological Sciences. 276 (1668): 2703–10. doi:10.1098/rspb.2009.0408. PMC 2839953. PMID 19403539.
- Kitto, John (1845). A Cyclopædia of Biblical Literature. London: W. Clowes and Sons.
That important family of languages, of which the Arabic is the most cultivated and most widely-extended branch, has long wanted an appropriate common name. The term Oriental languages, which was exclusively applied to it from the time of Jerome down to the end of the last century, and which is even now not entirely abandoned, must always have been an unscientific one, inasmuch as the countries in which these languages prevailed are only the east in respect to Europe; and when Sanskrit, Chinese, and other idioms of the remoter East were brought within the reach of our research, it became palpably incorrect. Under a sense of this impropriety, Eichhorn was the first, as he says himself (Allg. Bibl. Biblioth. vi. 772), to introduce the name Semitic languages, which was soon generally adopted, and which is the most usual one at the present day. [...] In modern times, however, the very appropriate designation Syro-Arabian languages has been proposed by Dr. Prichard, in his Physical History of Man. This term, [...] has the advantage of forming an exact counterpart to the name by which the only other great family of languages with which we are likely to bring the Syro-Arabian into relations of contrast or accordance, is now universally known—the Indo-Germanic. Like it, by taking up only the two extreme members of a whole sisterhood according to their geographical position when in their native seats, it embraces all the intermediate branches under a common band; and, like it, it constitutes a name which is not only at once intelligible, but one which in itself conveys a notion of that affinity between the sister dialects, which it is one of the objects of comparative philology to demonstrate and to apply.
- Kogan, Leonid (2012). "Proto-Semitic Phonology and Phonetics". In Weninger, Stefan (ed.). The Semitic Languages: An International Handbook. Walter de Gruyter. ISBN 978-3-11-025158-6.
- Kuntz, Marion Leathers (1981). Guillaume Postel: Prophet of the Restitution of All Things His Life and Thought. The Hague: Nijhoff. ISBN 90-247-2523-2.
- Kogan, Leonid (2011). "Proto-Semitic Phonology and Phonetics". In Weninger, Stefan (ed.). The Semitic Languages: An International Handbook. Walter de Gruyter. ISBN 978-3-11-025158-6.
- Levine, Donald N. (2000). Greater Ethiopia: The Evolution of a Multiethnic Society (2. ed.). Chicago. ISBN 978-0-226-22967-6.
- Lipiński, Edward (2001). Semitic Languages: Outline of a Comparative Grammar (2nd ed.). Leuven: Peeters. ISBN 90-429-0815-7.
- Mustafa, Arafa Hussein. 1974. Analytical study of phrases and sentences in epic texts of Ugarit. (German title: Untersuchungen zu Satztypen in den epischen Texten von Ugarit). Dissertation. Halle-Wittenberg: Martin-Luther-University.
- Moscati, Sabatino (1969). An Introduction to the Comparative Grammar of the Semitic Languages: Phonology and Morphology. Wiesbaden: Harrassowitz.
- Moscati, Sabatino (1958). "On Semitic Case-Endings". Journal of Near Eastern Studies. 17 (2): 142–144. doi:10.1086/371454. S2CID 161828505.
- Müller, Hans-Peter (1995). "Ergative Constructions In Early Semitic Languages". Journal of Near Eastern Studies. 54 (4): 261–271. doi:10.1086/373769. JSTOR 545846. S2CID 161626451.
- Nebes, Norbert (2005). "Epigraphic South Arabian". In Uhlig, Siegbert (ed.). Encyclopaedia Aethiopica. Wiesbaden: Harrassowitz. ISBN 978-3-447-05238-2.
- Ullendorff, Edward (1955). The Semitic Languages of Ethiopia: A Comparative Phonology. London: Taylor's (Foreign) Press.
- Owens, Jonathan (2013). The Oxford Handbook of Arabic Linguistics. Oxford University Press. ISBN 978-0199344093.
- Phillipson, David (2012). Foundations of an African Civilization, Aksum and the Northern Horn 1000 BC-AD 1300. Boydell & Brewer. ISBN 9781846158735. Retrieved 6 May 2021.
The former belief that this arrival of South-Semitic-speakers took place in about the second quarter of the first millennium BC can no longer be accepted in view of linguistic indications that these languages were spoken in the northern Horn at a much earlier date.
- Ruhlen, Merritt (1991). A Guide to the World's Languages: Classification. Stanford, California: Stanford University Press. ISBN 0-8047-1894-6.
The other linguistic group to be recognized in the eighteenth century was the Semitic family. The German scholar Ludwig von Schlozer is often credited with having recognized, and named, the Semitic family in 1781. But the affinity of Hebrew, Arabic, and Aramaic had been recognized for centuries by Jewish, Christian and Islamic scholars, and this knowledge was published in Western Europe as early as 1538 (see Postel 1538). Around 1700 Hiob Ludolf, who had written grammars of Geez and Amharic (both Ethiopic Semitic languages) in the seventeenth century, recognized the extension of the Semitic family into East Africa. Thus when von Schlozer named the family in 1781 he was merely recognizing genetic relationships that had been known for centuries. Three Semitic languages (Aramaic, Arabic, and Hebrew) were long familiar to Europeans both because of their geographic proximity and because the Bible was written in Hebrew and Aramaic.
- Sánchez, Francisco del Río (2013). Monferrer-Sala, Juan Pedro; Watson, Wilfred G. E. (eds.). Archaism and Innovation in the Semitic Languages. Selected Papers. Córdoba: Oriens Academic. ISBN 978-84-695-7829-2.
- Smart, J. R. (2013). Tradition and modernity in Arabic language and literature. Smart, J. R., Shaban Memorial Conference (2nd : 1994 : University of Exeter). Richmond, Surrey, U.K. ISBN 978-1-13678-812-3.
- Versteegh, Kees (1997). The Arabic Language. New York: Columbia University Press. ISBN 978-0-231-11152-2.
- Waltke, Bruce K.; O'Connor, Michael Patrick (1990). An Introduction to Biblical Hebrew Syntax. Vol. 3. Winona Lake, Indiana: Eisenbrauns. ISBN 0-931464-31-5.
- Watson, Janet C. E. (2002). The Phonology and Morphology of Arabic (PDF). New York: Oxford University Press. ISBN 0-19-824137-2. Archived from the original (PDF) on 1 March 2016 – via Wayback Machine.
- Woodard, Roger D., ed. (2008). The Ancient Languages of Syrio-Palestine and Arabia (PDF). Cambridge: Cambridge University Press.
- Wright, William; Smith, William Robertson (1890). Lectures on the Comparative Grammar of the Semitic Languages. Cambridge: Cambridge University Press. [2002 edition: ISBN 1-931956-12-X]
- Semitic genealogical tree (as well as the Afroasiatic one), presented by Alexander Militarev at his talk "Genealogical classification of Afro-Asiatic languages according to the latest data" (at the conference on the 70th anniversary of Vladislav Illich-Svitych, Moscow, 2004; short annotations of the talks given there (in Russian)
- Pattern-and-root inflectional morphology: the Arabic broken plural
- Ancient snake spell in Egyptian pyramid may be oldest Semitic inscription
- Alexis Neme and Sébastien Paumier (2019), Restoring Arabic vowels through omission-tolerant dictionary lookup, Lang Resources & Evaluation, Vol 53, 1–65 pages
- Swadesh vocabulary lists of Semitic languages (from Wiktionary's Swadesh-list appendix)