search 
welcome
 
about language
 
language study
 
world languages
 
test yourself
Language Families and  Their Branches

Family Tree

 

Comparative Linguistics

Majo rLanguage FamilieWhat is a language family?
Most languages belong to language families. A language family is a group of related languages that developed from a common historic ancestor, referred to as protolanguage (proto- means 'early' in Greek). The ancestral language is usually not known directly, but it is possible to discover many of its features by applying the comparative method that can demonstrate the family status of many languages.

Sometimes a protolanguage can be identified with a historically known language. Thus, provincial dialects of Vulgar Latin gave rise to the modern Romance languages, so the *Proto-Romance language is more or less identical with Latin. Similarly, Old Norse was the ancestral of Norwegian, Swedish, Danish and Icelandic. Sanskrit was the protolanguage of many of the languages of the Indian subcontinent, such as Bengali, Hindi, Marathi, and Urdu.

Further back in time, all these ancestral languages descended, in turn, from one common ancestor. We call this ancestor *Proto-Indo-European (PIE).

Language families can be subdivided into smaller units called branches.

Coliseum

Barcelona

How do linguists establish relationships among languages?
In some cases, it is relatively easy to establish relationships among languages. Let's look at the Romance languages. We know that Italian, for instance, is a descendant of Latin, a language that was spoken in Italy two thousand years ago, and one which left a great number of written documents. The Roman conquest helped spread Latin throughout Europe where it eventually developed into regional dialects. When the Roman Empire broke up, these regional dialects evolved into the modern Romance languages that we know today: French, Italian, Portuguese, Spanish, and others. These languages form the Romance branch of the Indo-European language family. By looking at the word for 'water' in three Romance languages, one can easily see the similarities.

Italian aqua
Spanish agua
Portuguese agua

Norse Mythology

slavic Mythology

What if the ancestral language left no records?
The case with Romance languages is unusually easy because their common ancestor — Latin — left many written documents. In most cases, however, the ancestral language was not written. As a result, linguists look at similarities among the modern descendants to establish common origins. Take a look at these examples:

English
water
German
wasser
Danish
vand
 
Russian
voda
Polish
woda
Czech
voda

It is clear that the word for 'water' looks very similar within each group, but not so similar across groups. Languages in the first group belong to the Germanic branch of the Indo-European language family. Languages in the second group belong to the Slavic branch. Although there are no written records of the ancestral *Proto-Germanic or *Proto-Slavic languages, we have to assume that these two ancestral languages must have existed, just like Latin did.

Riga

Albania

 

Where do these mystery languages belong?
Here is the word for 'water' in two more languages. Do you think these languages belong to any of the three branches above?

Latvian udens
Albanian uje
Basque ur

As it turns out, Latvian belongs to the Baltic branch of the Indo-European language family, Albanian has no close relatives and does not belong to any branch within the Indo-European language family, and Basque does not belong to any language family at all. In fact, it is a language isolate, i.e., a language that cannot be reliably assigned to any family.

click here to listenClick here to listen to Exploratorium Magazine lecture on the evolution of languages.

Ancient Scribe
What if there are no records, and we know little about the languages?
In many parts of the world, there are no written records, and we don't know enough about the languages themselves. Consequently, we have to resort to grouping languages on the basis of geography. This is the case with many of the aboriginal languages of Australia, the native Indian languages of the Americas, the tribal languages of Africa, and countless other languages all over the world.

Tower of Babel

European Man

Chinese Man

Arab an

Swahili Woman

Indonesian Boy

Papua New Guinea Man

Tamil Woman

Afghan Woman

 

How many language families are there?
According to Ethnologue, there are 218 language families in the world. This figure is probably an overestimate because of our limited knowledge about the languages of the most linguistically diverse areas of the world. For example, the Austronesian family (Pacific area) has 1,262 languages, many of them little known. So the actual number of families, once these languages are studied and relationships among them are established, will probably be smaller.

The largest language families are listed below. Together they account for nearly two-thirds of all languages and five-sixths of the world's population. The Indo-European language family has the largest number of speakers, followed by Sino-Tibetan. Niger-Congo has the largest number of languages, followed by Austronesian.

Language family & membership
Number of languages
Number of speakers
Where the languages that belong to the family
are spoken
Afro-Asiatic
353
339,478,607
Afghanistan, Algeria, Bahrain, Cameroon, Chad, Cyprus, Egypt, Eritrea, Ethiopia, Georgia, Iran, Iraq, Israel, Jordan, Kenya, Libya, Mali, Malta, Mauritania, Morocco, Niger, Nigeria, Oman, Saudi Arabia, Somalia, Sudan, Syria, Tanzania, Tunisia, Turkey, United Arab Emirates, Uzbekistan, Yemen
Austronesian
1,246
311,740,132
Brunei, Cambodia, China, Cook Islands, East Timor, Fiji, French Polynesia, Guam, Indonesia, Kiribati, Madagascar, Malaysia, Marshall Islands, Mayotte, Micronesia, Myanmar, Nauru, New Caledonia, New Zealand, Niue, Northern Mariana Islands, Palau, Papua New Guinea, Philippines, Samoa, Solomon Islands, Suriname, Taiwan, Thailand, Tokelau, Tonga, Tuvalu, USA, Vanuatu, Viet Nam, Wallis and Futuna
Indo-European
430
2,562,896,428
Afghanistan, Albania, Armenia, Austria, Azerbaijan, Bangladesh, Belarus, Belgium, Bosnia and Herzegovina, Bulgaria, Canada, China, Croatia, Czech Republic, Denmark, Fiji, Finland, France, Georgia, Germany, Greece, Iceland, India, Iran, Iraq, Ireland, Israel, Italy, Latvia, Lithuania, Luxembourg, Macedonia, Maldives, Nepal, Netherlands, Norway, Oman, Pakistan, Peru, Poland, Portugal, Romania, Russia, Serbia and Montenegro, Slovakia, Slovenia, South Africa, Spain, Sri Lanka, Suriname, Sweden, Switzerland, Tajikistan, Turkey, USA, Ukraine, United Kingdom, Venezuela
Niger-Congo
1,514
358,091,103
Angola, Benin, Botswana, Burkina Faso, Burundi, Cameroon, Central African Republic, Chad, Comoros, Congo, Cote d'Ivoire, Democratic Republic of the Congo, Equatorial Guinea, Gabon, Gambia, Ghana, Guinea, Guinea-Bissau, Kenya, Lesotho, Liberia, Malawi, Mali, Mozambique, Namibia, Niger, Nigeria, Rwanda, Senegal, Sierra Leone, Somalia, South Africa, Sudan, Swaziland, Tanzania, Togo, Uganda, Zambia, Zimbabwe
Sino-Tibetan
399
1,275,531,921
Bangladesh, Bhutan, China, India, Kyrgyzstan, Laos, Myanmar, Nepal, Pakistan, Thailand, Viet Nam
Trans New Guinea
564
3,359,894
Australia, East Timor, Indonesia, Papua New Guinea

Additional large language families include the following:

Nilo-Saharan
197
34,953,324
Algeria, Benin, Burkina Faso, Central African Republic, Chad, Democratic Republic of the Congo, Eritrea, Ethiopia, Kenya, Mali, Niger, Nigeria, Sudan, Tanzania, Uganda
Tai-Kadai
74
78,375,999
China, India, Laos, Myanmar, Thailand, Viet Nam
Dravidian
73
221,515,995
India, Nepal, Pakistan
Altaic
64
145,069,278 Afghanistan, Azerbaijan, China, Georgia, Iran, Kazakhstan, Kyrgyzstan, Lithuania, Moldova, Mongolia, Russia, Turkey, Turkmenistan, Uzbekistan
Uralic
36
22,623,030
Estonia, Finland, Hungary, Latvia, Norway, Russia, Sweden

There are no large language families in the Americas and in Australia. Instead, these continents have a large number of small language families consisting of languages with small numbers of speakers.

 
home privacy policy National Virtual Translation Center

Copyright 2007 © NVTC