Proto-Austroasiatic language

Reconstruction of Austroasiatic languages
Lower-order reconstructions

Proto-Austroasiatic is the reconstructed ancestor of the Austroasiatic languages. Proto-Mon–Khmer (i.e., all Austroasiatic branches except for Munda) has been reconstructed in Harry L. Shorto's Mon–Khmer Comparative Dictionary, while a new Proto-Austroasiatic reconstruction is currently being undertaken by Paul Sidwell.

Scholars generally date the ancestral language to 5,000-4,000 B.P. (i.e. 3,000-2,000 BCE) with a homeland in southern China or the Mekong River valley.


Shorto (2006)

The Proto-Mon–Khmer language is the reconstructed ancestor of the Mon–Khmer languages, a purported primary branch of the Austroasiatic language family. However, Mon–Khmer as a taxon has been abandoned in recent classifications, making Proto-Mon–Khmer synonymous with Proto-Austroasiatic.[1] The Munda languages, which are not well documented and have been restructured through external language contact, have not been included in the reconstructions.

Proto-Mon–Khmer as reconstructed by Harry L. Shorto (2006) has a total of 21 consonants, 7 distinct vowels, which can be lengthened and glottalized, and 3 diphthongs.

Proto-Mon–Khmer consonants
Labial Alveolar Palatal Velar Glottal
Unvoiced stop p /p/ t /t/ c /c/ k /k/ ʔ /ʔ/
Voiced stop b /b/ d /d/ j /ɟ/ g /g/
Implosive stop ɓ ɗ
Nasal m /m/ n /n/ ɲ /ɲ/ ŋ /ŋ/
Semivowel w /w/ y /j/
Liquid r /r/, l /l/
Fricative s /ç/ h /h/

Proto-Mon–Khmer is rich in vowels. The vowels are:

  • *a, *aa
  • *e, *ee
  • *ə, *əə
  • *i, *-iʔ, *ii, *-iiʔ
  • *o, *oo
  • *ɔ, *ɔɔ
  • *u, *uu, *-uuʔ
Proto-Mon–Khmer vowels
Height Front Central Back
Close i /i/, ii /iː/ u /u/, uu /uː/
Mid e /e/, ee /eː/ ə /ə/, əə /əː/ o /o/, oo /oː/
Open a /a/, aa /aː/ ɔ /ɔ/, ɔɔ /ɔː/

The diphthongs are:

  • *iə, *uə, *ai

Sidwell & Rau (2015)

Paul Sidwell and Felix Rau (2015)[2][3] propose the following syllable structure for Proto-Austroasiatic.

  • *Cᵢ(Cm)VCf

Also possible are more complex forms with prefixes and infixes, as well as presyllable "coda-copying" from main syllables.

  • *(Cp(n/r/l))CᵢVCf

Sidwell & Rau (2015)[2] reconstruct 21-22 Proto-Austroasiatic consonants (the reconstruction of *ʄ is uncertain).

All of the Proto-Austroasiatic consonants except for implosives and voiced stops can occur as syllable finals (Cf).

All of the Proto-Austroasiatic unvoiced stops and voiced stops, as well as *m-, *N-, *r-, *l-, and *s-, can occur as presyllables or sesquisyllables (Cp).

Medial consonants (Cm) are *-w-, *-r-, *-l-, *-j-, and *-h-.

Proto-Austroasiatic consonants
                   | Labial | Alveolar | Palatal | Velar | Glottal
Unvoiced Stop      | p      | t        | c       | k     | ʔ
Voiced Stop        | b      | d        | ɟ       | ɡ     |
Implosive          | ɓ      | ɗ        | (ʄ)     |       |
Nasal              | m      | n        | ɲ       | ŋ     |
Unvoiced Fricative |        | s        |         |       | h
Approximant        | w      | l        | j       |       |
Rhotic             |        | r        |         |       |

Sidwell & Rau (2015)[2] reconstruct 8 Proto-Austroasiatic vowels, each of which contrasts in length. Long vowels are written with the length mark (ː) rather than by doubling the vowel letter.

Proto-Austroasiatic vowels
Height | Front | Central | Back
Close  | i     |         | u
Mid    | e     | ə       | o
Open   | ɛ     | a       | ɔ

Proto-Austroasiatic diphthongs are *iə and *uə, and possibly *ie and *uo.[4]
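
The syllable template and the inventories given above can be combined into a quick well-formedness check. The following Python sketch is only an illustration and not part of Sidwell & Rau's proposal. The function name fits_main_syllable is hypothetical. The check covers only the main syllable *Cᵢ(Cm)VCf (no presyllables or affixes), includes the uncertain *ʄ, leaves out the merely possible diphthongs *ie and *uo, and accepts both ɡ and g for the voiced velar stop.

```python
import re

# Inventories as listed in this section (assumption: one code point per segment).
INITIALS = "ptckʔbdɟɡgɓɗʄmnɲŋshwljr"   # any consonant may be the initial Ci
MEDIALS = "wrljh"                      # optional medial Cm
FINALS = "ptckʔmnɲŋshwljr"             # Cf: no implosives, no voiced stops

# A nucleus is a diphthong (iə, uə) or one of the 8 vowels, optionally long (ː).
NUCLEUS = r"(?:iə|uə|[iueəoɛaɔ]ː?)"

MAIN_SYLLABLE = re.compile(rf"[{INITIALS}][{MEDIALS}]?{NUCLEUS}[{FINALS}]")


def fits_main_syllable(form: str) -> bool:
    """Return True if the form matches the Ci(Cm)VCf main-syllable template."""
    # Drop the reconstruction asterisk before matching the whole string.
    return MAIN_SYLLABLE.fullmatch(form.lstrip("*")) is not None


if __name__ == "__main__":
    # *ʔaɲ, *tiːʔ and *ʔjeːʔ are pronoun and demonstrative forms cited in this
    # article; "*tab" is a made-up form that ends in a voiced stop.
    for form in ["*ʔaɲ", "*tiːʔ", "*ʔjeːʔ", "*tab"]:
        print(form, fits_main_syllable(form))
```

Forms written with optional segments in parentheses, such as *gi(ː)ʔ, would need to be expanded into their variants before being checked this way.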


Common structures include *CV(C) and *CCV(C) roots. *CVC roots can also be affixed either via prefixes or infixes, as in *C-CVC or *C⟨C⟩VC (Shorto 2006). Sidwell (2008) gives the following phonological shapes for two types of stems.

  • Monosyllabic - C(R)V(V)C
  • Sesquisyllabic - CCV(V)C

Note: R is one of the optional medial consonants /r, l, j, w, h/.
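
The two stem shapes can be unpacked mechanically by expanding their optional slots. The sketch below is illustrative only: expand_template is a hypothetical helper, and it assumes that parentheses in a template are never nested.

```python
def expand_template(template: str) -> list[str]:
    """List every skeleton licensed by a template with optional (X) slots."""
    start = template.find("(")
    if start == -1:
        # No optional slot left: the template is already a plain skeleton.
        return [template]
    end = template.index(")", start)
    head = template[:start]
    optional = template[start + 1:end]
    tail = template[end + 1:]
    # Each optional slot is either omitted or filled in, and the remainder
    # of the template is expanded recursively.
    return [head + choice + rest
            for choice in ("", optional)
            for rest in expand_template(tail)]


if __name__ == "__main__":
    print(expand_template("C(R)V(V)C"))  # ['CVC', 'CVVC', 'CRVC', 'CRVVC']
    print(expand_template("CCV(V)C"))    # ['CCVC', 'CCVVC']
```

Read this way, the monosyllabic shape covers four skeletons and the sesquisyllabic shape covers two. The sequence VV stands for the second vowel slot of the original notation.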

Sidwell (2008) considers the two most morphologically conservative Mon–Khmer branches to be Khmuic and Aslian. On the other hand, Vietnamese morphology is far more similar to that of Chinese and the Tai languages and has lost many morphological features found in Proto-Mon–Khmer.

The following Proto-Mon–Khmer affixes, which are still tentative, have been reconstructed by Paul Sidwell (Sidwell 2008:257-263).

  • Nominalizing *-n- (instrumental in Kammu, resultative in Khmu)
  • Nominalizing agentive *-m-
  • Nominalizing iterative (expressive of repetitiveness/numerousness) *-l-/*-r-
  • Nominalizing instrumental *-p-
  • Causative *p(V)- (allomorphs: p-, pn-, -m-)
  • Reciprocal *tr-/*t(N)-
  • Stative *h-/*hN- (?)

Roger Blench (2012)[5] notes that Austroasiatic and Sino-Tibetan share many similarities in word structure, particularly nominal affixes (otherwise known as sesquisyllables or minor syllable prefixes). Blench (2012) does not draw any definitive conclusions about how these similarities could have arisen, but suggests that this typological diffusion might have come about as a result of intensive contact in an area between northern Vietnam, Laos, and northeast Myanmar.


Like the Tai languages, Proto-Mon–Khmer has an SVO, or verb-medial, order. Proto-Mon–Khmer also makes use of noun classifiers and serial verb constructions (Shorto 2006).

However, Paul Sidwell (2018)[3] suggests that Proto-Austroasiatic may have in fact been verb-initial, with SVO order occurring in Indochina due to convergence in the Mainland Southeast Asia linguistic area. Various modern-day Austroasiatic languages display verb-initial word order, including Pnar and Wa (Jenny 2015).[6] Nicobarese also displays verb-initial word order.[3]



Proto-Austroasiatic personal pronouns are as follows, with reconstructions from Sidwell & Rau (2015) and Shorto (2006).

Pronoun English gloss Proto-Austroasiatic
1s. 'I' *ʔaɲ
2s. 'you (sg.)' *miːʔ/*mi(ː)ʔ
3s./3p. 'third person' *gi(ː)ʔ
1p. (incl.) 'we (incl.)' *ʔiːʔ
1p. (excl.) 'we (excl.)' *ʔjeːʔ
2p. 'you (pl.)' *piʔ
Interrogative (animate) 'who' *mVh
Interrogative (inanimate) 'what' *məh/*m(o)ʔ; *m(o)h


English gloss Proto-Austroasiatic
'that (distal)' *tiːʔ
'that (medial)' *tɔʔ
'this (proximal)' *niʔ/*neʔ
'here' *nɔ(ː)ʔ


English gloss Proto-Austroasiatic
'used up, finished, lacking' *ʔət; *ʔəːt; *[ʔ]it
'not' *ʔam

Branch reconstructions

Austroasiatic branch-level reconstructions include:

Origin and dispersal

Theories of the Austroasiatic homeland and dispersal have evolved rapidly in the 21st century. Combining phylogenetic linguistics with recent archaeological findings, scholarly opinion has been converging on an origin in southern China.

Austroasiatic migration

Paul Sidwell (2009)[1] suggested that the likely homeland of Austroasiatic is in the Mekong River region, and that the family is not as old as frequently assumed, dating to perhaps 2,000 BCE.[19]

However, Peiros (2011) heavily criticized Sidwell's 2009 riverine dispersal hypothesis, claiming that it contains many contradictions. On the basis of his own analysis, he argued that the homeland of Austroasiatic lies somewhere near the Yangtze, and suggested the Sichuan Basin as the likely homeland of Proto-Austroasiatic before its speakers migrated to other parts of central and southern China and then into Southeast Asia. He further suggests that the family must be as old as Proto-Austronesian and Proto-Sino-Tibetan, or even older.[20]

Georg van Driem (2011) proposed that the homeland of Austroasiatic is somewhere in southern China. He suggested that the region around the Pearl River (China) is the likely homeland of the Austroasiatic languages and their speakers. He further suggested, based on genetic studies, that the migration of Kra–Dai people from Taiwan replaced the original Austroasiatic language, but that the effect on the people themselves was only minor: local Austroasiatic speakers adopted Kra–Dai languages and, in part, Kra–Dai culture.[21]

The linguists Sagart (2011) and Bellwood (2013) supported the theory of an origin of Austroasiatic along the Yangtze River in southern China.[22]

Genetic and linguistic research in 2015 on ancient people in East Asia suggests an origin and homeland of Austroasiatic in what is today southern China, or even farther north.[23]

Integrating computational phylogenetic linguistics with recent archaeological findings, Paul Sidwell (2015)[4] further expanded his Mekong riverine hypothesis by proposing that Austroasiatic had ultimately expanded into Indochina from the Lingnan area of southern China, with the subsequent Mekong riverine dispersal taking place after the initial arrival of Neolithic farmers from southern China. He tentatively suggests that Austroasiatic may have begun to split up 5,000 years B.P. during the Neolithic transition era of mainland Southeast Asia, with all the major branches of Austroasiatic formed by 4,000 B.P. Austroasiatic would have had two possible dispersal routes from the western periphery of the Pearl River watershed of Lingnan: either a coastal route down the coast of Vietnam, or a route downstream along the Mekong River via Yunnan.[4] Both the reconstructed lexicon of Proto-Austroasiatic and the archaeological record clearly show that early Austroasiatic speakers around 4,000 B.P. cultivated rice and millet, kept livestock such as dogs, pigs, and chickens, and thrived mostly in estuarine rather than coastal environments.[4] At 4,500 B.P., this "Neolithic package" suddenly arrived in Indochina from the Lingnan area without cereal grains and displaced the earlier pre-Neolithic hunter-gatherer cultures, with grain husks found in northern Indochina by 4,100 B.P. and in southern Indochina by 3,800 B.P.[4] However, Sidwell found that iron is not reconstructable in Proto-Austroasiatic, since each Austroasiatic branch has different terms for iron that were borrowed relatively recently from Tai, Chinese, Tibetan, Malay, and other languages. During the Iron Age, about 2,500 B.P., relatively young Austroasiatic branches in Indochina such as Vietic, Katuic, Pearic, and Khmer were formed, while the more internally diverse Bahnaric branch (dating to about 3,000 B.P.) underwent more extensive internal diversification.[4] By the Iron Age, all of the Austroasiatic branches were more or less in their present-day locations, with most of the diversification within Austroasiatic taking place during the Iron Age.[4]

Paul Sidwell (2018)[24] considers the Austroasiatic language family to have rapidly diversified around 4,000 years B.P. during the arrival of rice agriculture in Indochina, but notes that the origin of Proto-Austroasiatic itself is older than that date. The lexicon of Proto-Austroasiatic can be divided into an early and late stratum. The early stratum consists of basic lexicon including body parts, animal names, natural features, and pronouns, while the names of cultural items (agriculture terms and words for cultural artifacts, which are reconstructable in Proto-Austroasiatic) form part of the later stratum.

Roger Blench (2018)[25][26] suggests that vocabulary related to aquatic subsistence strategies (such as boats, waterways, river fauna, and fish capture techniques) can be reconstructed for Proto-Austroasiatic. Blench (2018) finds widespread Austroasiatic roots for 'river, valley', 'boat', 'fish', 'catfish sp.', 'eel', 'prawn', 'shrimp' (Central Austroasiatic), 'crab', 'tortoise', 'turtle', 'otter', 'crocodile', 'heron, fishing bird', and 'fish trap'. Archaeological evidence for the presence of agriculture in northern Indochina (northern Vietnam, Laos, and other nearby areas) dates back only to about 4,000 years B.P. (2,000 BCE), with agriculture ultimately being introduced from farther north, in the Yangtze valley, where it has been dated to 6,000 B.P.[25] Hence, this points to a relatively late riverine dispersal of Austroasiatic as compared to Sino-Tibetan, whose speakers had a distinct non-riverine culture. In addition to living an aquatic-based lifestyle, early Austroasiatic speakers would also have had access to livestock, crops, and newer types of watercraft. As early Austroasiatic speakers dispersed rapidly via waterways, they would have encountered speakers of older language families who were already settled in the area, such as Sino-Tibetan.[25]


