Learning Celebrations

Richard Powell - Fall 2024 - Intro to Linguistics

Proto-Indo-European Cognates

The Proto-Indo-European homeland according to the steppe hypothesis (dark green) and the present distribution of Indo-European languages in Eurasia (light green). Cropped, from Joe Roe, Wikimedia Commons

The Proto-Indo-European homeland according to the steppe hypothesis (dark green) and the present distribution of Indo-European languages in Eurasia (light green). Cropped, from Joe Roe, Wikimedia Commons.

Proto-Indo-European

The Indo-European language group consists of over 450 diverse languages spanning across Eurasia. Examples include Italian, Greek, Irish, Hindi, Spanish, German, Russian, and of course, English. It is the most spoken language group by far, with over 3.3 billion speakers worldwide.

The idea of an Indo-European language group was most famously put forward by William Jones in 1786 when he noticed similarities between the vocabulary and grammar of Sanskrit and those of Greek and Latin. He was not the first, but his work was the most influential in establishing the idea of a common proto-language. Since then, historical and comparative linguists have largely reconstructed an estimate of the Proto-Indo-European language and have worked out approximate timelines of the divergence of each language, though much debate still occurs in both areas.

Swadesh List

The Swadesh list, named for creator Morris Swadesh, is a list of basic words used in comparative linguistics. There are multiple versions of this list, ranging from 100 to over 200 words. Words were chosen for the list because of their universality and cultural independence, so that any given language should have nearly every word. By getting translations of these concepts in multiple languages and comparing them, linguists can determine how related those languages are and attempt to date their divergence.

Cognates

The following is a sample of words from the Swadesh List in a variety of Indo-European languages.

English Rom- anian Scottish Gaelic Lithu- anian Ukra- inian Bengali Urdu
three trei trì trys три
try
তিন
tina
تین
tin
snake şarpe nathair gyvatė змія
zmiya
সাপ
sāpa
سانپ
sanp
nose nas sròn nosies ніс
nis
নাক
nāka
ناک
naak
to see vedea fhaicinn žr бачити
bachyty
দেখা
dēkhā
دیکھیں
dekhen
to fall cădea tuiteam kristi падати
padaty
পড়া
paṛā
گرنا
garna
lake lac loch ežeras озеро
ozero
হ্রদ
hrada
جھیل
jhel
year an bliadhna metų рік
rik
বছর
bachara
سال
sal

Three

It seems that every language on this list has a cognate of the English “three.” they all include the [t] followed by an [i], and most include the [r] as well. English, of course, uses the [θ] here instead of [t] due to Grimm’s Law

Snake

Romanian, Bengali, and Urdu all have [s], then [a], then [p]. These are likely cognates with the English “serpent,” through probably not “snake.”

Nose

Languages here with likely cognates of the English “nose” are Romanian, Lituanian, and possibly Bengali and Urdu. All of their translations start with [n], and the Romanian and Lithuanian both have the [s] like in English.

See

The only word that could reasonably be cognate the english “see” is the Lithuanian “žr,” pronounced like “zher.” The Romanian “vedea” is clearly related to English words like “vision,” though, and the Bengali and Urdu words seem to be very close cognates.

Fall

None of the languages here have words cognate with English “fall,” but interestingly it seems that the Ukrainian and Bengali words may be cognates. They both start with [p] and [a], and the Bengali “ṛ” is pronounced somewhat similarly to a [d].

Lake

The English “lake” is an obvious cognate of Romanian “lac” and Scottish Gaelic “loch.” The Lithuanian and Ukrainian words seem related too, as they both include a vowel followed by [z]/[ʒ] then [e] then [r]/[ɾ].

Year

Interestingly, none of the words in this row seem to be cognate with each other. Romanian “an” is related to English words like “annual,” though.