There are probably hundreds of major and minor languages and many hundreds of recognized dialects in India, whose languages belong to four different language families: Indo-Iranian (a subfamily of the Indo-European language family), Dravidian, Austroasiatic, and Tibeto-Burman (a subfamily of Sino-Tibetan). There are also several isolate languages, such as Nahali, which is spoken in a small area of Madhya Pradesh state. The overwhelming majority of Indians speak Indo-Iranian or Dravidian languages.
The difference between language and dialect in India is often arbitrary, however, and official designations vary notably from one census to another. This is complicated by the fact that, owing to their long-standing contact with one another, India's languages have come to converge and to form an amalgamated linguistic area—a sprachbund—comparable, for example, to that found in the Balkans. Languages within India have adopted words and grammatical forms from one another, and vernacular dialects within languages often diverge widely. Over much of India, and especially the Indo-Gangetic Plain, there are no clear boundaries between one vernacular and another (although ordinary villagers are sensitive to nuances of dialect that differentiate nearby localities). In the mountain fringes of the country, especially in the northeast, spoken dialects are often sufficiently different from one valley to the next to merit classifying each as a truly distinct language. There were at one time, for example, no fewer than 25 languages classified within the Naga group, not one of which was spoken by more than 60,000 people.
Lending order to this linguistic mix are a number of written, or literary, languages used on the subcontinent, each of which often differs markedly from the vernacular with which it is associated. Many people are bilingual or multilingual, knowing their local vernacular dialect (“mother tongue”), its associated written variant, and, perhaps, one or more other languages. The official national language is Hindi, but there are 22 (originally 14) so-called “scheduled languages” recognized in the Indian constitution that may be used by states in official correspondence. Of these, 15 are Indo-European (Assamese, Bengali, Dogri, Gujarati, Hindi, Kashmiri, Konkani, Maithili, Marathi, Nepali, Oriya, Punjabi, Sanskrit, Sindhi, and Urdu), 4 are Dravidian (Kannada, Malayalam, Tamil, and Telugu), 2 are Sino-Tibetan (Bodo and Manipuri), and 1 is Austroasiatic (Santhali). These languages have become increasingly standardized since independence because of improved education and the influence of mass media. English is an “associate” official language and is widely spoken.
Most Indian languages are written using some variety of Devanagari script, but other scripts are used. Sindhi, for instance, is written in a Persianized form of Arabic script, but it also is sometimes written in the Devanagari or Gurmukhi scripts.
Indo-European languages
The Indo-Iranian branch of the Indo-European family is the largest language group in the subcontinent, with nearly three-fourths of the population speaking a language of this family as a mother tongue. It can be further split into three subfamilies: Indo-Aryan, Dardic, and Iranian. The numerous languages of this family all derive from Sanskrit, the language of the ancient Aryans. Sanskrit, the classic language of India, underwent a process of systematization and grammatical refinement at an early date, rendering it unique among Indo-Aryan languages in its degree of linguistic cultivation. Subsequently, the Prakrit languages developed from local vernaculars but later were refined into literary tongues. The modern Indian languages were derived from the Prakrit languages.
By far the most widely spoken Indo-Iranian language is Hindi, which is used in one form or another by some three-fifths of the population. Hindi has a large number of dialects, generally divided into Eastern and Western Hindi, some of which are mutually unintelligible. Apart from its nationally preeminent position, Hindi has been adopted as the official language by each of a large contiguous bloc of northern states—Bihar, Chhattisgarh, Haryana, Himachal Pradesh, Jharkhand, Madhya Pradesh, Rajasthan, Uttaranchal, and Uttar Pradesh—as well as by the national capital territory of Delhi.
Other Indo-European languages with official status in individual states are Assamese, in Assam; Bengali, in West Bengal and Tripura; Gujarati, in Gujarat; Kashmiri, in Jammu and Kashmir; Konkani, in Goa; Marathi, in Maharashtra; Nepali, in portions of northern West Bengal; Oriya, in Orissa; and Punjabi, in Punjab. Urdu, the official language of Pakistan, is also the language of most Muslims of northern and peninsular India as far south as Chennai (Madras). Sindhi is spoken mainly by inhabitants of the Kachchh district of Gujarat, which borders the Pakistani province of Sind, as well as in other areas by immigrants (and their descendants) who fled Sind after the 1947 partition.
Dravidian and other languages
- Distribution of Dravidian languages.
Dravidian languages are spoken by about one-fourth of all Indians, overwhelmingly in southern India. Dravidian speakers among tribal peoples (e.g., Gonds) in central India, in eastern Bihar, and in the Brahui-speaking region of the distant Pakistani province of Balochistan suggest a much wider distribution in ancient times. The four constitutionally recognized Dravidian languages also enjoy official state status: Kannada, in Karnataka; Malayalam, in Kerala; Tamil (the oldest of the main Dravidian tongues), in Tamil Nadu; and Telugu, in Andhra Pradesh. Manipuri and other Sino-Tibetan languages are spoken by small numbers of people in northeastern India.
Lingua francas
The two major lingua francas in India are Hindustani and English. Hindustani is based on an early dialect of Hindi, known by linguists as Khari Boli, which originated in Delhi and an adjacent region within the Ganges-Yamuna Doab (interfluve). During the Mughal period (early 16th to mid-18th century), when political power became centred on Delhi, Khari Boli absorbed numerous Persian words and came to be used as a lingua franca throughout the empire, especially by merchants who needed a common commercial language. Hindustani was promoted by the British during the colonial period.
In the 19th century two literary languages arose from this colloquial tongue: among Hindus, the modern form of Hindi, which derives its vocabulary and script (Devanagari) mainly from Sanskrit; and among Muslims, Urdu, which, though grammatically identical with Hindi, draws much of its vocabulary from Persian and Arabic and is written in the Perso-Arabic script. Despite this rift, Hindi and Urdu remain mutually intelligible, while their Hindustani progenitor still serves as a lingua franca in many parts of the subcontinent, particularly in the north.
English, a remnant of British colonial rule, is the most widely used lingua franca. The great size of India's population makes it one of the largest English-speaking communities in the world, although English is claimed as the mother tongue by only a small number of Indians and is spoken fluently by less than 5 percent of the population. English serves as the language linking the central government with the states, especially with those in which Hindi is not widely understood. English is also the principal language of commerce and the language of instruction in almost all of the country's prestigious universities and private schools. The English-language press remains highly influential; scholarly publication is predominantly in English (almost exclusively so in science); and many Indians are devotees of literature in English (much of it written by Indians), as well as of English-language film, radio, television, popular music, and theatre.
Minor languages and dialects
Although many tribal communities are gradually abandoning their tribal languages, scores of such languages survive. Few, however, are still spoken by more than a million persons, with the exception of Bhili (Indo-European) and Santhali (of the Munda branch of the Austroasiatic family), which are both estimated as having more than five million speakers. Others include Gondi (Dravidian), Kurukh, or Oraon (Dravidian), Ho (Munda), Manipuri (Sino-Tibetan), and Mundari (Munda). Generally, tribal languages lack a written tradition, though many are now written in the Roman script or, less often, in scripts adapted from those of neighbouring nontribal regions.




