search 
Welcome to the Languages of the World
welcome
 
about language
 
language study
 
world languages
 
test yourself
The Indo-European Language Family
introductionstructurebranchesresources
 
Introduction

IE MapIndo-European is a family of languages that first spread throughout Europe and many parts of South Asia, and later to every corner of the globe as a result of colonialization. The family includes most of the languages of Europe, as well as many languages in Southwest, Central and South Asia. With close to 3 billion speakers, the Indo-European language family has the largest number of speakers of all language families.

It would not have been possible to establish the existence of the Indo-European language family if scholars had not compared the systematic resemblances between European languages and Sanskrit, the oldest language of the Indian subcontinent that left many written documents. The common origin of European languages and Sanskrit was first proposed by Sir William Jones (1746-1794). Systematic comparisons between these languages by Franz Bopp supported this theory and laid the foundation for postulating that all Indo-European languages descended from a common ancestor, Proto-Indo-European (PIE), thought to have been spoken before 3,000 B.C. It then split into different branches that, in turn, split into different languages in the subsequent millennia.


Kurgan
Kurgan

Sir William Jones
Sir William Jones

Franz Bopp
Franz Bopp

August Schleicher
August Schleicher

Scholars disagree about the original homeland of PIE.

  • Some scholars propose that PIE originated in the steppes north of the Black and Caspian Seas (Kurgan hypothesis). Proponents of this hypothesis date PIE as far back as 4,000 BC.
  • Other scholars suggest that it originated in Anatolia around 7,000 BC. broad peninsula that lies between the Black and Mediterranean seas. Called Asia Minor (Lesser Asia) by the Romans, the land is the Asian part of modern Turkey. It lies across the Aegean Sea to the east of Greece and is usually known by its Greek name Anatolia.

Since *Proto-Indo-European has no no written records, historical linguists construct family trees, an idea pioneered by August Schleicher, on the basis of the comparative method used to detect genetic relationships between languages and to establish a consistent relationship hypothesis. Using this method, a common ancestral language is reconstructed, and a sequence of regular changes by which these languages can be derived from it are described. The chart below shows how this method actually works with some Indo-European languages.

Please note that * means "reconstructed, not attested" form; > means "became.".

PIE *dek^m> Proto-Germanic *texun > Old English teon > Modern English ten
PIE *dek^m> Proto-Italic *dekem > Latin decem > Modern Italian dieci
PIE *dek^m> Old Church Slavonic desenti > Modern Bulgarian deset
PIE *dek^m> Proto-Indo-Iranian *daca > Sanskrit dáça > Hindi/Urdu das
PIE *dek^m> Greek deka


Structure

Sound Changes

Jacob Grimm
Jacob Grimm

 

As Proto-Indo-European broke up, its sound system diverged as well, following different laws in the daughter languages. One of them is Grimm's law.

Grimm's law
You may think of Jacob Grimm as the author, together with his brother, of children's fairy tales and songs. But did you know that he is also known as a historical linguist? He explained how the consonants of different Indo-European languages relate to each other. For instance, there is a regular relationship between words beginning with p in Latin and f in Germanic languages (as in pater and father) and between initial t- in Greek and initial th- in English as in treis and three. The rules governing these sound changes are known as Grimm's law. You can easily see the resemblances among these common words across five languages.

English Greek Latin Gothic Sanskrit
father
pater
pater
fadar
pita
brother
phrater
frater
brothar
bhratar
foot
poda
pedem
fotu
pada
three
tris
tres
thri
trí

Click here for an amusing illustration of Grimm's Law and of words for family, plants, animals, sky, and counting in nine Indo-European languages.

Centum-Satem division
The Centum-Satem division explains the evolution of PIE labiovelar, velar, and palatovelar consonants.

  • Labiovelar consonants include [kw, gw, xw, ngw] which are pronounced like [k, g, x, and ng] but with rounded lips.
  • Velars are consonants articulated with the back part of the tongue (the dorsum) against the soft palate (the back part of the roof of the mouth, known also as the velum). They include [k, g, x, ng].
  • Palatovelar consonants are articulated with the back part of the tongue against the hard palate. They include [k', g', x', ng']. For example, [k'] is pronounced as the k in keen.

The terms centum-satem come from the words for "one hundred" in representative languages of each group. Please note that not all languages fall neatly into these categories.

  • Satem languages include Baltic, Slavic, Albanian, Armenian, and Indo-Iranian languages. For example, Sanskrit satam, Lithuanian simtas, Russia sto.
    Click here to see the complete satem language tree.
  • Centum languages include Romance, Celtic, Germanic, and Greek. For example, Latin centum, Irish cead, English hundred, Greek hekaton.
    Click here to see the complete centum language tree.
Grammar Changes
Hittite Relief
Hittite Relief

Reconstructions of Proto-Indo-European show that its nouns were marked for gender (masculine, feminine, and neuter), number (singular, plural, and dual), case (nominative, vocative, accusative, genitive, ablative, locative, and instrumental). Adjectives agreed with nouns in case, number, and gender. Verbs were marked for aspect, mood, tense, voice, person, and number. They agreed with their subjects in person and number. These characteristics are present to various degrees in all modern Indo-European languages.

With a few exceptions, all Indo-European words underwent inflection. The structure of all inflected words, regardless of part of speech, was the same: root plus one or more suffixes plus ending. This structure is apparent in all present-day Indo-European languages.

Vocabulary
Relief
The comparative method enables us to reconstruct a basic vocabulary for Proto-Indo-European that extends to most aspects of their culture. This basic vocabulary is not uniformly attested across all languages which suggests that some words may have developed later or were borrowed from other languages. For instance *wĪro "man" can be found in English virile, while *gwenā "woman" is found in English gynecology and Slavic zhena "woman, wife."

Click here to learn about PIE vocabulary in greater detail.
Branches

Irish
Irish

Gypsy Children
Gypsy

Kurd Child
Kurd


German

Greek
Greek

Nepali
Nepali

Indian
Indian

Iranian
Iranian

Italian
Italian


Afghan


Armenian

Russian Children
Russian

Tajik Children
Tajik

Branches of the Indo-European Language Family
There are 449 languages in this family. For a complete listing of all the languages, click on Ethnologue. The branches of the Indo-European family and some of its most important and populous languages are listed below. Click on the name of the language to learn more about it on this website.

Branch/language
Number of speakers
Where spoken primarily
Baltic (3)
  Latvian 1.5 million Latvia
  Lithuanian 3.1million Lithuania
Celtic (7)
  Breton (coming soon) 533,000. France
  Irish Gaelic 355,000 Ireland
  Scots Gaelic (coming soon) 62,175. Scotland
  Welsh 575,000 Wales
Germanic (53)
  West Germanic
    Afrikaans 6 million South Africa
    Dutch 17 million Holland
    English 309 million UK, US, Australia, Canada
    German 95 million Germany
    Yiddish 50,000 Germany, Israel
  North Germanic
    Danish 5.3million Denmark
    Icelandic 240,000 Iceland
    Norwegian 4.6 million Norway
    Swedish 8.8 million Sweden
Romance (47)
  Catalan 6.7 million Spain
  French 65 million France
  Italian 61.5 million Italy
  Portuguese 178 million Portugal, Brazil
  Romanian 23.5 million Romania
  Spanish 322 million Spain, Latin America
Slavic (19)
  West Slavic (8)
    Czech 11.5 million Czech Republic
    Polish 43 million Poland
    Slovak 5 million Slovakia
    Sorbian (coming soon) 70,000 to 110,000 Germany
  East Slavic (4)
    Belarusian 9 million Belorusia
    Russian 9 million Russia
    Ukrainian 37.1 million Ukraine
  South Slavic (7)
    Bosnian 4 million Bosnia and Hercegovina
    Croatian 6.2 million Croatia
    Macedonian 1.6 million Macedonia
    Serbian 11.1 million Serbia
    Slovenian 2 million Slovenia
Indo-Iranian (308)
  Indo-Aryan (219)
    Balochi 1.8 million Pakistan
    Bengali 100 million 1st language speakers; 211 million including 2nd language speakers Bangladesh
    Bhojpuri (coming soon) 26.6 million India
    Hindi 180.8 million India
    Gujarati 46.1 million India
    Marathi 68 million India
    Nepali (coming soon) 17.2 million Nepal
    Maithili (coming soon) 24.8 million India
    Oriya 31.7 million India
    Punjabi (Panjabi) 60.8 million India
    Romani 1.5 million Romania & elsewhere
    Sanskrit 194,000 2nd language speakers India & elsewhere
    Sindhi (coming soon) 21.3 million Pakistan
    Sinhalese (coming soon) 13.2 million Sri Lanka
    Urdu 60.5 million Pakistan
  Iranian (87)
    Dari 7.6 million Afghanistan
    Farsi (Persian) 24.3 million Iran
    Kurdish 11 million Iraq & elsewhere
    Pashto 19 million Afghanistan & elsewhere
    Tajik 4.3 million Tajikistan
Languages without close relatives
  Albanian (4) 5 million Albania
  Armenian 6.7 million Armenia
  Greek (6) 12.3 million Greece

Figures in parentheses indicates number of languages/dialects listed by Ethnologue.
Resources
Resources

Resources
American Heritage Dictionary: Indo-European and the Indo-Europeans
Indo-European Home Page
University of Texas Indo-European Documentation Center
Wikipedia article on Indo-European Language Family
Ethnologue listing of Indo-European languages

 


home privacy policy National Virtual Translation Center

Copyright 2007 © National Virtual Translation Center