:: wikimiki.org ::
| Classical Language |
Classical languageA classical language is a language with a literary tradition that can be judged as "classical".
According to George L. Hart:[To] qualify as a classical tradition, a language must fit several criteria: it should be ancient, it should be an independent tradition that arose mostly on its own not as an offshoot of another tradition, and it must have a large and extremely rich body of ancient literature.
Note that the judgement of a language as "classical" is a judgement of its literature, not the language itself. Although languages may change at different rates, languages that change at slower rates are not necessarily the same as languages judged as "classical" or "ancient". Also, since there is no language immune from change, classical languages tend to be either dead languages, or show a high degree of diglossia, as the spoken varieties of the language diverge further and further away from the classical written language over centuries.
List of classical languages
- Indo-European languages:
- Classical Greek
- Latin
- Sanskrit
- Classical Persian
- Pali
- Afro-Asiatic languages:
- Classical Arabic
- Hebrew
- Dravidian languages:
- Tamil
- Sino-Tibetan languages:
- Classical Chinese
See also
- history of the Greek language
- literary language
- official language
- sacred language
- standard language
External links
- [http://tamil.berkeley.edu/Tamil%20Chair/TamilClassicalLanguage/TamilClassicalLgeLtr.html Prof. Hart's defense of the inclusion of Tamil as a Classical language]
Category:Classical languages
LanguageA language is a system of symbols, generally known as lexemes and the rules by which they are manipulated. The word language is also used to refer to the whole phenomenon of language, i.e., the common properties of languages. Though language is commonly used for communication, it is not synonymous with it.
Human language is a natural phenomenon, and language learning is instinctive in childhood. In their natural form, human languages use patterns of sound or gesture for the symbols in order to communicate with others through the senses. Though there are thousands of human languages, they all share a number of properties from which there are no known deviations.
Humans have also invented (or arguably in some cases discovered) many other languages, including constructed human languages such as Esperanto or Klingon, programming languages such as Python or Ruby, and various mathematical formalisms. These languages are not restricted to the properties shared by natural human languages.
Properties of language
Languages are not just sets of symbols. They also contain a grammar, or system of rules, used to manipulate the symbols. While a set of symbols may be used for expression or communication, it is primitive and relatively unexpressive, because there are no clear or regular relationships between the symbols. Because a language also has a grammar, it can manipulate its symbols to express clear and regular relationships between them.
For example, imagine going on a walk with a person who only knew individual symbols, or words. If you saw a dog, he might say, "Dog scare" or "Scare Dog". Although any English speaker would have some notion of what he was talking about, the relationship between the words is unclear. Is he scared of dogs? Or just that dog? Or does he want to scare the dog off? Does he think the dog is scared? But if you respond, "I’m not scared of dogs," the relationship between dog and scare is quite apparent and hence the meaning of the utterance.
Another important property of language is the arbitrariness of the symbols. Any symbol can be mapped onto any concept (or even onto one of the rules of the grammar). For instance, there is nothing about the Spanish word nada itself that forces Spanish speakers to use it to mean nothing. That is the meaning all Spanish speakers have memorized for that sound pattern. But for Croatian speakers nada means hope.
However, it must be understood that just because in principle the symbols are arbitrary does not mean that a language cannot have symbols that are iconic of what they stand for. Words such as meow sound similar to what they represent, but they could be replaced with words such as jarn, and as long as everyone memorized the new word, the same concepts could be expressed with it.
Human languages
Human languages are usually referred to as natural languages, and the science studying them is linguistics.
Making a principled distinction between one language and another is usually impossible. For example, the boundaries between named language groups are in effect arbitrary due to blending between populations (the dialect continuum). For instance, there are dialects of German very similar to Dutch which are not mutually intelligible with other dialects of (what Germans call) German.
Some like to make parallels with biology, where it is not always possible to make a well-defined distinction between one species and the next. In either case, the ultimate difficulty may stem from the interactions between languages and populations. (See Dialect or August Schleicher for a longer discussion.)
The concepts of Ausbausprache, Abstandsprache, and Dachsprache are used to make finer distinctions about the degrees of difference between languages or dialects.
Origins of human language
Scientists do not yet agree on when language was first used by humans (or their ancestors). Estimates range from about two million (2,000,000) years ago, during the time of Homo habilis, to as recently as forty thousand (40,000) years ago, during the time of Cro-Magnon man. The nature of speech means that there is almost no data on which to base conclusions on the subject.
Language taxonomy
The classification of natural languages can be performed on the basis of different underlying principles (different closeness notions, respecting different properties and relations between languages); important directions of present classifications are:
- paying attention to the historical evolution of languages results in a genetic classification of languages—which is based on genetic relatedness of languages,
- paying attention to the internal structure of languages (grammar) results in a typological classification of languages—which is based on similarity of one or more components of the language’s grammar across languages,
- and respecting geographical closeness and contacts between language-speaking communities results in areal groupings of languages.
The different classifications do not match each other and are not expected to, but the correlation between them is an important point for many linguistic research works. (There is a parallel to the classification of species in biological phylogenetics here: consider monophyletic vs. polyphyletic groups of species.)
The task of genetic classification belongs to the field of historical-comparative linguistics, of typological—to linguistic typology.
See also: Taxonomy, Taxonomic classification—for the general idea of classification and taxonomies.
Genetic classification
The world’s languages have been grouped into families of languages that are believed to have common ancestors. Some of the major families are the Indo-European languages, the Afro-Asiatic languages, the Austronesian languages, and the Sino-Tibetan languages.
The shared features of languages from one family can be due to shared ancestry. (Compare with homology in biology.)
Typological classification
An example of a typological classification is the classification of languages on the basis of the basic order of the verb, the subject and the object in a sentence into several types: SVO, SOV, VSO, and so on, languages. (, for instance, belongs to the SVO language type.)
The shared features of languages of one type (= from one typological class) may have arisen completely independently. (Compare with analogy in biology.) Their cooccurence might be due to the universal laws governing the structure of natural languages—language universals.
Areal classification
The following language groupings can serve as some linguistically significant examples of areal linguistic units, or sprachbunds: Balkan linguistic union, or the bigger group of European languages; Caucasian languages. Although the members of each group are not closely genetically related, there is a reason for them to share similar features, namely: their speakers have been in contact for a long time within a common community and the languages converged in the course of the history. These are called areal features.
NB. One should be careful about the underlying classification principle for groups of languages which have apparently a geographical name: besides areal linguistic units, the taxa of the genetic classification (language families) are often given names which themselves or parts of which refer to geographical areas.
Constructed languages
One prominent artificial language, called Esperanto, was created by L. L. Zamenhof. It is a compilation of various elements of different languages, and it is intended to be an easy-to-learn language. Another prominent artificial language, called Ido, is intended to be reformed Esperanto.
Other constructed languages strive to be more logical than natural languages; a prominent example of this is Lojban.
Other writers, such as J. R. R. Tolkien, have created fantasy languages, for literary, artistic, or personal reasons. One of Tolkien’s languages is called Quenya, which is a form of Elvish. It has its own alphabet, and its phonology and syntax are modelled on Finnish. Linguist Mark Okrand has devised Klingon and Vulcan for Star Trek, which have since been developed into full languages.
The study of language
The oldest surviving written grammar for any language is believed to be the Tolkāppiyam (தொல்காப்பியம்), a book on the grammar of the Tamil language, written around 200 BCE by Tolkāppiyar. Its classification of the alphabet into consonants and vowel was a breakthrough.
The historical record of the study of language begins in North India with Pāṇini, the 5th century BCE grammarian who formulated 3,959 rules of Sanskrit morphology, known as the (अष्टाध्यायी). grammar is highly systematized and technical. Inherent in its analytic approach are the concepts of the phoneme, the morpheme, and the root; the phoneme was only recognized by Western linguists some two millennia later.
In the Middle East, the Persian linguist Sibawayh made a detailed and professional description of Arabic in 760 CE in his monumental work, Al-kitab fi an-nahw (الكتاب في النحو, The Book on Grammar), bringing many linguistic aspects of language to light. In his book he distinguished phonetics from phonology.
Later in the West, the success of science, mathematics, and other formal systems in the 20th century led many to attempt a formalization of the study of language as a "semantic code". This resulted in the academic discipline of linguistics, the founding of which is attributed to Ferdinand de Saussure.
Animal (nonhuman) language
While the term animal languages is widely used, most researchers agree that they are not as complex or expressive as human language; a more accurate term is animal communication. Some researchers argue that there are significant differences separating human language from the communication of other animals, and that the underlying principles are not related.
In several widely publicised instances, animals have been trained to mimic certain features of human language. For example, chimpanzees and gorillas have been taught hand signs based on American Sign Language; however, they have never been taught its grammar. There was also a case in 2003 of Kanzi, a captive bonobo chimpanzee allegedly independently creating some words to mean certain concepts. While animal communication has debated levels of semantics, it has not been shown to have syntax in the sense that human languages do.
Some researchers argue that a continuum exists among the communication methods of all social animals, pointing to the fundamental requirements of group behaviour and the existence of "mirror cells" in primates. This, however, may not be a scientific question, but is perhaps more one of definition. What exactly is the definition of the word "language"? Most researchers agree that, although human and more primitive languages have analogous features, they are not homologous.
Formal languages
Mathematics and computer science use artificial entities called formal languages (including programming languages and markup languages, but also some that are far more theoretical in nature). These often take the form of character strings, produced by some combination of formal grammar and semantics of arbitrary complexity.
See also
- Common phrases in different languages
- Computer-assisted language learning (a historical perspective)
- Deception
- Ethnologue, which provides a fairly complete list of languages, locations, population and genetic affiliation
- Extinct language
- FOXP2 (Language gene)
- ILR scale (defines five levels of language proficiency)
- ISO 639 (2- and 3-letter codes for language names)
- Language education
- Language reform
- Language policy
- Language school
- Linguistic protectionism
- Linguistics basic topics
- List of language academies
- List of languages
- List of official languages
- Naming
- Non-verbal communication
- Non-sexist language
- Official language
- Orthography
- Philology and Historical linguistics
- Philosophy of language
- Profanity
- Psycholinguistics
- Sapir-Whorf Hypothesis
- Slang
- Symbolic communication
- Speech therapy
- Terminology
- Tongue-twister
- Translation
- Whistled language
References
- Crystal, David (1997). The Cambridge Encyclopedia of Language. Cambridge, Cambridge University Press.
- Crystal, David (2001). The Cambridge Encyclopedia of the English Language. Cambridge, Cambridge University Press.
- Katzner, K. (1999). The Languages of the World. New York, Routledge.
- McArthur, T. (1996). The Concise Companion to the English Language. Oxford, Oxford University Press.
- Kandel, Jessel, and Schwartz (1991). Principles of Neural Science. McGraw Hill (esp. p. 1173).
External links
- [http://www.zompist.com/ Mark Rosenfelder’s Metaverse] provides a useful listing of 5000 languages and dialects (grouped by their relationships), where the numbers one to ten in each language may be found
- [http://www.geocities.com/agihard/mohl/mohl_languages.html Museum of Languages]
- The [http://www.ethnologue.com/ Ethnologue], a catalog of the world’s languages
- [http://www.language-capitals.com Language Capitals] Guide to 8 major languages of the world with facts, characteristics and varieties
- [http://www.vistawide.com/languages/ World Languages and Cultures] — Practical information and resources on languages and language learning
- [http://www.georgetown.edu/faculty/ballc/animals/animals.html Animal sounds in different languages]
- [http://www.netz-tipp.de/languages.html Distribution of languages on the Internet]
- [http://classweb.gmu.edu/accent/ Speech accent archive]
- [http://ourworld.compuserve.com/homepages/G_Kunkel/homepage.htm a collection of bird songs] provides many kinds of bird songs
- [http://acp.eugraph.com The Animal Communication Project]
- [http://reference.allrefer.com/encyclopedia/categories/lang.html Language Articles]
- [http://www.primitivism.com/language.htm Language: Origin and Meaning by John Zerzan]
Category:Technology
als:Sprache
zh-min-nan:Gí-giân
ko:언어
ms:Bahasa
nb:Språk
ja:言語
simple:Language
th:ภาษา
George L. HartGeorge L. Hart is a professor of Tamil language at the University of California, Berkeley.
Hart received his Ph.D. in Sanskrit from Harvard University and taught at the University of Wisconsin before joining the faculty at Berkeley. He is conversant with Latin and Greek as well as several modern European and Indian languages.
Hart is best-known for his translations of several Tamil epics into English and for asserting that Tamil should be classified as a classical language. In 2002, Hart, along with co-author Hank Heifetz, was the recipient of the AAS South Asia Council (SAC) Ramanujan Book Prize. The two produced a volume of translated Tamil poetry entitled Four Hundred Songs of War and Wisdom.
Hart is also the author of several Tamil and Sanskrit textbooks. He is married to Kausalya Hart, herself a professor and Tamil textbook author.
Works
- The Poems of Ancient Tamil, Their Milieu and Their Sanskrit Counterparts
- A Rapid Sanskrit Method
- The Four Hundred Songs of War and Wisdom: An Anthology of Poems from Classical Tamil, the Purananuru
- Poets of the Tamil Anthologies: Ancient Poems of Love and War
- The Forest Book of the Ramayana of Kampan
DiglossiaIn linguistics, diglossia is a situation where, in a given society, there are two (often) closely-related languages, one of high prestige, which is generally used by the government and in formal texts, and one of low prestige, which is usually the spoken vernacular tongue. The high-prestige language tends to be the more formalised, and its forms and vocabulary often 'filter down' into the vernacular, though often in a changed form.
The French term diglossie was first coined (as a translation of Greek διγλωσσία, 'bilingualism') by the Greek linguist and demoticist Ioannis Psycharis. The Arabist William Marçais used the term in 1930 to describe the linguistic situation in Arabic-speaking countries. In Charles Ferguson's article "Diglossia" in the journal Word (1959), diglossia was described as a kind of bilingualism in a given society in which one of the languages is (H), i.e. has high prestige, and another of the languages is (L), i.e. has low prestige. In Ferguson's definition, (H) and (L) are always closely related. Fishman also talks about diglossia with unrelated languages: "extended diglossia" (Fishman 1967), for example Sanskrit as (H) and Kannada as (L) or Alsatian (Elsässisch) in Alsace as (L) and French as (H). Kloss calls the (H) variant exoglossia and the (L) variant endoglossia.
In some cases, the nature of the connection between (H) and (L) is disputed; for example, Jamaican Creole as (L) and Standard English as (H) in Jamaica.
(H) is usually the written language whereas (L) is the spoken language. In formal situations, (H) is used; in informal situations, (L) is used.
The (L) variants are not just simplifications of the (H) variants. Many (L) languages have certain features that are more complex than the corresponding (H) languages: some Swiss German dialects have , and while Standard German only has and . Jamaican Creole has fewer vowel phonemes than standard Englishes, but it has additional palatal and phonemes.
Especially in endoglossia the (L) form may also be called "basilect", the (H) form "acrolect", and an intermediate form "mesolect". Note however that there is no "mesolect" in German-speaking Switzerland and in Luxembourg. Whether Paraguay has a form of diglossia is controversial. Guaraní and Spanish are both official languages of Paraguay. Some scholars argue that there are Paraguayans who actually don't speak Guaraní. See Guaraní language. The Chinese language also offers an interesting case.
Ferguson's classic examples include Standard German/Swiss German, Standard Arabic/vernacular Arabic, Standard French/Kréyòl in Haiti, and Katharevousa/Dhimotiki in Greece. However, Kréyòl is now recognised as a standard language in Haiti. Swiss German dialects are hardly languages with low prestige in Switzerland; and colloquial Arabic has more prestige in some respects than standard Arabic nowadays (see Chambers, Sociolinguistic Theory). And after the end of the military regime in 1974, Dhimotiki was made into Greece's only standard language (1976). Nowadays, Katharevousa is no longer used. Harold Schiffman writes about Swiss German: "it seems to be the case that Swiss German was once consensually agreed to be in a diglossic hierarchy with Standard German, but that this consensus is now breaking." There is also a lot of code switching especially in the Arabic world; according to Andrew Freeman this is "different from Ferguson's description of diglossia which states that the two forms are in complementary distribution." To a certain extent, there is code switching and overlap in all diglossic societies, even German-speaking Switzerland. Furthermore, in Ferguson's definition, diglossia is not bilingualism; however this depends on the scholar's definition of language. For example, different kinds of Arabic are not mutually intelligible; even though many are, but this may also be due to exposure to different varieties rather than inherent linguistic properties.
Examples where the High/Low dichotomy is justified in terms of social prestige include Italian dialects as (L) and Standard Italian as (H) in Italy and German dialects and standard German in Germany. In Italy and Germany, those speakers who still speak dialects typically use dialect in informal situations, especially in the family. In German-speaking Switzerland, on the other hand, Swiss German dialects are to a certain extent even used in schools and to a larger extent in churches. Ramseier calls German-speaking Switzerland's diglossia a "medial diglossia", whereas Felicity Rash prefers "functional diglossia". Paradoxically, Swiss German offers both the best example for diglossia (all speakers are native speakers of Swiss German and thus diglossic) and the worst, because there is no clear-cut hierarchy.
Catalan
Catalan is in most of catalan countries (it could be excluded the case of Andorra, in spite that in this case the pressure of the Castilian —"Español", but the name of that language is Castellano according to The Constitution of 1978— and of the French is very strong) in a situation of external diglossia; not only because the other languages (Castilian, French, Italian) are the languages used, to a large extent, by the corresponding official organizations; also the immigration, in the case of the Castilian at less, it has been so strong in the 20th century, that it has made that we can speak of esternal diglossia, produced by the relative increase of Castilian speakers; Also due that the Spanish Government as well as the French during many years have left apart the education in Catalan and provide education only for theirs respectives languages. In Algiers the Catalan is in severe danger, which has been recognized by UNESCO. It could be said that the Catalan is in danger everywhere due to the pressure of the official languages of the different states.
Brazilian Portuguese
According to many Brazilian linguists (Bortoni, Kato, Mattos e Silva, Bagno, Perini), Brazilian Portuguese is a highly diglossic language. L-variant (also known as Brazilian vernacular) is the mother tongue of all Brazilians, and H-variant (standard Brazilian Portuguese) is acquired through schooling. L-variant is a simplified form of archaic Portuguese, influenced by Amerindian and African languages, while H-variant is a form based on 19th century European Portuguese (and it is very similar to Standard European Portuguese, with only orthography being a little different). Mário A. Perini, a Brazilian linguist, compares the differences between L- and H- variants of Brazilian Portuguese with those between Standard Spanish and Standard Portuguese.
The L-variant is the spoken form of Brazilian Portuguese, avoided only in very formal speech (court interrogation, political debate) while the H-variant is the written form of Brazilian Portuguese, avoided only in informal writing (such as songs lyrics, love letters, intimate friends correspondence). Even language professors many times use the L-variant while explaining students the structure and usage of the H-variant; in essays, nevertheless, all students are expected to use H-variant.
The L-variant is used in songs, movies, soap operas, sitcoms and other television shows, although, at times, the H-variant is used in historic films or soap operas to make the language used sound more ‘elegant’ and/or ‘archaic’. The H-variant used to be preferred when dubbing foreign films and series into Brazilian Portuguese, but nowadays the L-variant is preferred. Movie subtitles normally use a mixture of L- and H-variants, but remain closer to the H-variant.
Most literary works are written in the H-variant. They have been attempts at writing in the L-variant (masterpiece Macunaíma, written by Brazilian modernist Mário de Andrade), but, presently, the L-variant is used only in dialogue. Still, many contemporary writers like using the H-variant even in informal dialogue. This is also true of translated books, which never use the L-variant, only the H one. Childrens books seem to be more L-friendly, but, again, if they are translated from another language (The Little Prince, for instance) they will use the H-variant only.
Most linguist use the term "Brazilian Portuguese" to describe the mesolect of Brazilian Vernacular, not the Standard Brazilian Portuguese which is almost identical to Standard European Portuguese. This idiom is characterized by simplification in verbal and pronominal systems and many changes in prepositional system, but the most striking differences are those affecting syntax. Brazilian linguist Fernando Tarallo claims that "the Portuguese language variety used in Brazil has developed quite a reasonable number of syntactic features different from the European system. These differences are large enough to allow for a description of the Brazilian variety in the sense of a Brazilian grammar". The same was confirmed later by Brazilian-based French linguist Galves.
The mesolect form of Brazilian Vernacular (that is, the one used in the speech of middle class Brazilians) is the form of Brazilian Portuguese language taught at American universities. H-varieties are explained later after students have mastered the L-variants.
Mário A. Perini, a famous Brazilian linguist, has said:
:"There are two languages in Brazil. The one we write (and which is called "Portuguese"), and another one that we speak (which is so despised that there is not a name to call it). The latter is the mother tongue of Brazilians, the former has to be learned in school, and a majority of population does not manage to master it appropriately.... Personally, I do not object to us writing Portuguese, but I think it is important to make clear that Portuguese is (at least in Brazil) only a written language. Our mother tongue is not Portuguese, but Brazilian Vernacular. This is not a slogan, nor a political statement, it is simply recognition of a fact.... There are linguistic teams working hard in order to give the full description of the structure of the Vernacular. So, there are hopes, that within some years, we will have appropriate grammars of our mother tongue, the language that has been ignored, denied and despised for such a long time."
According to Milton M. Azevedo (Brazilian linguist):
:"The relationship between Vernacular Brazilian Portuguese and the formal prescriptive variety fulfills the basic conditions of Ferguson's definition [of diglossia]...[...] Considering the difficulty encountered by vernacular speakers to acquire the standard, an understanding of those relationships appears to have broad educational significance. The teaching of Portuguese has traditionally meant imparting a prescriptive formal standard based on a literary register (Cunha 1985: 24) that is often at variance with the language with which students are familiar. As in a diglossic situation, vernacular speakers must learn to read and write in a dialect they neither speak nor fully understand, a circumstance that may have a bearing on the high dropout rate in elementary schools..."
Tamil
Tamil is a diglossic language. The classic form (chentamil) of the language is different from the colloquial (koduntamil) form. This difference in the language has existed since ancient times.
The classic form is preferred for writing, and is also used for public speaking. While written Tamil is mostly standard across various Tamil speaking regions, spoken form of the language differs widely from the written form.
Novels, even popular ones, will use (H) for all description and narration and use (L) only for dialogue, if they use it at all. The (L) variant is often only used for dialogue of rural or less educated speakers. Even though all Tamilians in ordinary conversation will use (L), novels often depict educated people speaking in an (H) form.
Regional and caste differences predominate in (L) variation. Tamil in the state capital Chennai (formerly Madras) is often quite distinct from that spoken elsewhere. Due to its proximity to Andhra Pradesh, there are often more Telugu words. Chennai (L) Tamil also often has more words of Urdu (or Deccani) than do varieties of Tamil from elsewhere in the state. Because of the larger role of English, Chennai Tamil also shows a great influence from this language. Of course, the Tamil spoken in Sri Lanka, while fully intelligible, also has clear differences in vocabulary and pronunciation. Throughout the state, a tripartite caste-based division is also common. Brahmans and Scheduled Castes (formerly called Untouchables) speak forms of (L) Tamil with clear grammatical differences from that of the members of other castes.
Bibliography
- Azevedo, Milton. 2005. "Portuguese. A linguistic introduction". Cambridge University Press.
- Azevedo, Milton; University of California. "Vernacular Features in Educated Speech in Brazilian Portuguese" http://www.cervantesvirtual.com/servlet/SirveObras/79117399329793384100080/p0000008.htm
- Bagno, Marcos. "Português ou Brasileiro? (Portuguese or Brazilian?)" http://paginas.terra.com.br/educacao/marcosbagno/
- Eeden, Petrus van. "Diglossie" http://www.afrikaans.nu/pag7.htm
- Freeman, Andrew. "Andrew Freeman's Perspectives on Arabic Diglossia" http://www-personal.umich.edu/~andyf/digl_96.htm
- Lubliner, Jacob. "Reflections on Diglossia" http://www.ce.berkeley.edu/~coby/essays/refdigl.htm
- Módolo, Marcelo. "As duas línguas do Brasil.(Two languages of Brazil)" Editora FAUUSP.
- Perini, Mário. 2002. "Modern Portuguese. A Reference Grammar." Yale University Press. New Haven.
- Rash, Felicity. 1998. The German Language in Switzerland. Multilingualism, Diglossia and Variation. Bern: Peter Lang.
- Schiffman, Harold. "Diglossia as a Sociolinguistic Situation" http://ccat.sas.upenn.edu/~haroldfs/messeas/diglossia/node1.html
Category:Sociolinguistics
Indo-European languages
The Indo-European languages include some 443 (SIL estimate) languages and dialects, including most of the major language families of Europe, as well as many languages of Southwest and South Asia, which belong to a single superfamily. Contemporary languages in this superfamily include Bengali, English, French, German, Hindi, Portuguese, Russian, and Spanish (each with more than 100 million native speakers), as well as numerous smaller national or minority languages. The Indo-European is the largest widely accepted family of languages in the world today, spoken by approximately 3 billion native speakers. (The second most common family of tongues being Sino-Tibetan)
Classification
The various subgroups of the Indo-European family include (in historical order of their first attestation):
- Anatolian languages — earliest attested branch, from the 18th century BC; extinct, most notable was the language of the Hittites.
- Indo-Iranian languages, descending from a common ancestor, Proto-Indo-Iranian
- Indo-Aryan, including Sanskrit, attested from the 2nd millennium BC
- Iranian languages, attested from roughly 1000 BC, including Avestan and Persian.
- Greek language — fragmentary records in Mycenaean from the 14th century BC; Homer dates to the 8th century BC. See History of the Greek language.
- Italic languages — including Latin and its descendants, the Romance languages, attested from the 1st millennium BC.
- Celtic languages — Gaulish inscriptions date as early as the 6th century BC; Old Irish texts from the 6th century AD.
- Germanic languages (including English) — earliest testimonies in runic inscriptions from around the 2nd century, earliest coherent texts in Gothic, 4th century.
- Armenian language — attested from the 5th century.
- Tocharian languages — extinct tongues of the Tocharians, extant in two dialects, attested from roughly the 6th century.
- Balto-Slavic languages, believed by many Indo-Europeanists to derive from a common proto-language later than Proto-Indo-European, while others are skeptical and think that Baltic and Slavic are no more closely related than any other two branches of Indo-European.
- Slavic languages — attested from the 9th century, earliest texts in Old Church Slavonic.
- Baltic languages — attested from the 14th century, and, for languages attested that late, they retain unusually many archaic features attributed to Proto-Indo-European.
- Albanian language — attested from the 15th century (1462); relations with Illyrian, Dacian, or Thracian proposed.
In addition to the classical ten branches listed above, there are several extinct languages, about which very little is known:
- Illyrian languages — possibly related to Messapian or Venetic; relation to Albanian also proposed.
- Venetic language — close to Italic.
- Liburnian language — apparently grouped with Venetic.
- Messapian language — not conclusively deciphered.
- Phrygian language — language of ancient Phrygia, possibly close to Greek, Thracian, or Armenian.
- Paionian language — extinct language once spoken north of Macedon.
- Thracian language — possibly close to Dacian.
- Dacian language — possibly close to Thracian and Albanian.
- Ancient Macedonian language — probably related to Greek, others propose relation to Ilyrian, Thracian or Phrygian.
- Ligurian language — possibly not Indo-European; possibly close to or part of Celtic
There were no doubt other Indo-European languages which are now lost without a trace. The fragmentary Raetian language cannot be classified with any certainty.
Further subfamilies have been suggested, among them Italo-Celtic and Graeco-Aryan. Neither of these is widely accepted. Indo-Hittite refers to the hypothesis that there is a significant separation between Anatolian and all the remaining groups.
Satem and Centum languages
Indo-Hittite/Srubna cultures).]]
The Indo-European sub-branches are often classified in a Satem and a Centum group. This is based on the varying treatments of the three original velar rows. Satem languages lost the distinction between labiovelar and pure velar sounds, and at the same time assibilated the palatal velars. The centum languages, on the other hand, lost the distinction between palatal velars and pure velars. Thus, geographically, the "eastern" languages are Satem (Indo-Iranian, Balto-Slavic, but not including Tocharian and Anatolian), and the "western" languages are Centum (Germanic, Italic, Celtic). The Satem-Centum isogloss runs right between the Greek (Centum) and Armenian (Satem) languages (thought to be related by a number of scholars), with Greek exhibiting some marginal Satem features. Some scholars think that there may be some languages that classify neither as Satem nor as Centum (Anatolian, Tocharian, and possibly Albanian). It should be noted that the grouping does not imply a claim of monophyly: there never was a "proto-Centum" or a "proto-Satem", but the sound changes spread by areal contact among already distinct post-PIE languages (say, during the 3rd millennium BC).
Suggested superfamilies
Some linguists propose that Indo-European languages are part of a hypothetical Nostratic language superfamily, and attempt to relate Indo-European to other language families, such as South Caucasian languages, Altaic languages, Uralic languages, Dravidian languages, Afro-Asiatic languages. This theory is controversial, as is the similar Eurasiatic theory of Joseph Greenberg, and the Proto-Pontic of John Colarusso.
History
See also: Proto-Indo-European, Historical linguistics, Glottochronology.
The possibility of common origin for some of these languages was first proposed by Marcus Zuerius van Boxhorn in 1647, proposing their derivation from "Scythian". However, the suggestions of van Boxhorn did not become widely known and were not pursued. The hypothesis was again proposed by Sir William Jones, who noticed similarities between four of the oldest languages known in his time, Latin, Greek, Sanskrit, and Persian. Systematic comparison of these and other old languages conducted by Franz Bopp supported this theory, and Bopp's Comparative Grammar, appearing between 1833 and 1852 is considered the starting point of Indo-European studies as an academic discipline.
The common ancestral (reconstructed) language is called Proto-Indo-European (PIE). There is disagreement as to the original geographic location (the so-called "Urheimat" or "original homeland") from where it originated. There are two main candidates today:
# the steppes north of the Black Sea and the Caspian Sea (see Kurgan)
# Anatolia (see Colin Renfrew).
Proponents of the Kurgan hypothesis tend to date the proto-language to ca. 4000 BC, while proponents of Anatolian origin usually date it several millennia earlier, associating the spread of Indo-European languages with the Neolithic spread of farming (see Indo-Hittite).
Kurgan hypothesis
The Kurgan hypothesis was originally suggested by Marija Gimbutas in the 1950s. According to the Kurgan hypothesis, early PIE was spoken in the chalcolithic steppe cultures of the 5th millennium BC between the Black Sea and the Volga.
Timeline
- 4500–4000: Early PIE. Sredny Stog, Dnieper-Donets and Sarama cultures, domestication of the horse.
- 4000–3500: The Yamna culture, the prototypical kurgan builders, emerges in the steppe, and the Maykop culture in the northern Caucasus. Indo-Hittite models postulate the separation of Proto-Anatolian before this time.
- 3500–3000: Middle PIE. The Yamna culture is at its peak, representing the classical reconstructed Proto-Indo-European society, with stone idols, early two-wheeled proto-chariots, predominantly practicing animal husbandry, but also with permanent settlements and hillforts, subsisting on agriculture and fishing, along rivers. Contact of the Yamna culture with late Neolithic Europe cultures results in the "kurganized" Globular Amphora and Baden cultures. The Maykop culture shows the earliest evidence of the beginning Bronze Age, and bronze weapons and artefacts are introduced to Yamna territory. Probable early Satemization.
- 3000–2500: Late PIE. The Yamna culture extends over the entire Pontic steppe. The Corded Ware culture extends from the Rhine to the Volga, corresponding to the latest phase of Indo-European unity, the vast "kurganized" area disintegrating into various independent languages and cultures, still in loose contact enabling the spread of technology and early loans between the groups, except for the Anatolian and Tocharian branches, which are already isolated from these processes. The Centum-Satem break is probably complete, but the phonetic trends of Satemization remain active.
- 2500–2000: The breakup into the proto-languages of the attested dialects is complete. Proto-Greek is spoken in the Balkans, Proto-Indo-Iranian north of the Caspian in the Sintashta-Petrovka culture. The Bronze Age reaches Central Europe with the Beaker culture, likely composed of various Centum dialects. Proto-Balto-Slavic (or alternatively, Proto-Slavic and Proto-Baltic communities in close contact) develops in north-eastern Europe. The Tarim mummies possibly correspond to proto-Tocharians.
- 2000–1500: The chariot is invented, leading to the split and rapid spread of Iranian and Indo-Aryan from the Andronovo culture and the Bactria-Margiana Archaeological Complex over much of Central Asia, Northern India, Iran and Eastern Anatolia. Proto-Anatolian is split into Hittite and Luwian. The pre-Proto-Celtic Unetice culture has an active metal industry (Nebra skydisk).
- 1500–1000: The Nordic Bronze Age develops (pre-)Proto-Germanic, and the (pre-)Proto-Celtic Urnfield and Hallstatt cultures emerge in Central Europe, introducing the Iron Age. Proto-Italic migration into the Italian peninsula. Redaction of the Rigveda and rise of the Vedic civilization in the Punjab. Flourishing and decline of the Hittite Empire. The Mycenaean civilization gives way to the Greek Dark Ages.
- 1000 BC–500 BC: The Celtic languages spread over Central and Western Europe. Northern Europe enters the Pre-Roman Iron Age, the formative phase of Proto Germanic. Homer initiates Greek literature and early Classical Antiquity. The Vedic Civilization gives way to the Mahajanapadas. Zoroaster composes the Gathas, rise of the Achaemenid Empire, replacing the Elamites and Babylonia. The Cimmerians (Srubna culture) are replaced by Scythians in the Pontic steppe. Armenians succeed the Urartu culture. Separation of Proto-Italic into Osco-Umbrian and Latin-Faliscan, and foundation of Rome. Genesis of the Greek and Old Italic alphabets. A variety of Paleo-Balkan languages are spoken in Southern Europe. The Anatolian languages are extinct.
Competing hypotheses
Colin Renfrew in 1987 suggested that the spread of Indo-European was associated with the Neolithic revolution, spreading peacefully into Europe from Asia Minor from around 7000 BC with the advance of farming (wave of advance). Accordingly, all of Neolithic Europe would have been Indo-European speaking, and the Kurgan migrations would at best have replaced Indo-European dialects with other Indo-European dialects.
Thomas Gamkrelidze and Vyacheslav V. Ivanov in 1984 placed the Indo-European homeland on Lake Urmia. They suggested that Armenian was the language which stayed in the Indo-European cradle while other Indo-European languages left the homeland. They are also the originators of the Glottalic theory.
Some people have pointed to the Black Sea deluge theory, dating the genesis of the Sea of Azov to ca. 5600 BC, as a direct cause of the Indo-European expansion. This event occurred in still clearly Neolithic times and is rather too early to fit with Kurgan archaeology. It may still be imagined as an event in the remote past of the Sredny Stog culture, and the people living on the land now beneath the Sea of Azov as possible pre-Proto-Indo-Europeans.
Other theories exist, often with a nationalistic flavour, sometimes bordering on national mysticism, typically positing the development in situ of the proponents' respective homes. One prominent example of such are the Indian theories that derive Vedic Sanskrit from the Indus valley civilization, postulating that Vedic Sanskrit is essentially identical to Proto-Indo-European, and that all other dialects must ultimately trace back to the early Indus valley civilization of ca. 3000 BC. This theory is not widely accepted by scholars. See Indo-Aryan migration for a discussion. Another example may be the Paleolithic Continuity Theory proposed by Italian theorists that derives Indo-European from the European Paleolithic cultures.
References
-
- August Schleicher, A Compendium of the Comparative Grammar of the Indo-European Languages (1861/62).
- Leszek Bednarczuk (red.), Języki indoeuropejskie. PWN. Warszawa. 1986 (in Polish).
See also
- Language family
- Indo-European studies
- Proto-Indo-European language
- List of Indo-European roots
- List of Indo-European languages
- List of languages
External links
- [http://www.HJHolm.de Slide-show of subgrouping roughly agreeing with the figures of the article.]
- [http://www.ship.edu/%7Ecgboeree/indoeuropean.html The Evolution of the Indo-European Languages, by Dr. C. George Boeree].
- [http://www.bartleby.com/61/IEroots.html Indo-European Roots, from the American Heritage Dictionary].
- [http://www.utexas.edu/cola/depts/lrc/iedocctr/ie.html Indo-European Documentation Center] at the University of Texas
- [http://www.grsampson.net/Q_PIE.html Say something in Proto-Indo-European] (by Geoffrey Sampson)
- [http://www.ethnologue.com/show_family.asp?subid=90017 IE language family overview (SIL)]
- [http://www.psych.auckland.ac.nz/psych/research/Evolution/Gray&Atkinson2003.pdf Gray & Atkinson, article on PIE Phylogeny]
- [http://www.geocities.com/protoillyrian Indo-European Root/lemmas] (by Andi Zeneli)
- [http://www.indoeuropean.nl The Indo-European Database]
- [http://groups.yahoo.com/group/cybalist/message/17084 Discussion on proto-Satem]
- [http://www.geocities.com/Paris/LeftBank/6507/chronicle120.html The Early History of Indo-European Languages]
-
ja:インド・ヨーロッパ語族
ko:인도유럽어족
th:ภาษากลุ่มอินโด-ยูโรเปียน
Latin (language)
Latin is an ancient Indo-European language originally spoken in the region around Rome called Latium. It gained great importance as the formal language of the Roman Empire. All Romance languages, those being most notably Spanish, French, Portuguese, Italian, and Romanian, are descended from Latin, and many words based on Latin are found in other modern languages such as English. The Latin alphabet, derived from the Greek, remains the most widely-used alphabet in the world. It is said that 80 percent of scholarly English words are derived from Latin (in a large number of cases by way of French). Moreover, in the Western world, Latin was a lingua franca, the learned language for scientific and political affairs, for more than a thousand years, being eventually replaced by French in the 18th century and English in the late 19th. Ecclesiastical Latin remains the formal language of the Roman Catholic Church to this day, and thus the official national language of the Vatican. The Church used Latin as its primary liturgical language until the Second Vatican Council in the 1960s. Latin is also still used (drawing heavily on Greek roots) to furnish the names used in the scientific classification of living things. The modern study of Latin, along with Greek, is known as Classics.
Main features
Latin is a synthetic inflectional language: affixes (which usually encode more than one grammatical category) are attached to fixed stems to express gender, number, and case in adjectives, nouns, and pronouns, which is called declension; and person, number, tense, voice, mood, and aspect in verbs, which is called conjugation. There are five declensions (declinationes) of nouns and four conjugations of verbs.
There are six noun cases:
#nominative (used as the subject of the verb or the predicate nominative),
#genitive (used to indicate relation or possession, often represented by the English of or the addition of s to a noun),
#dative (used of the indirect object of the verb, often represented by the English to or for),
#accusative (used of the direct object of the verb, or object of the preposition in some cases),
#ablative (separation, source, cause, or instrument, often represented by the English by, with, from),
#vocative (used of the person or thing being addressed).
In addition, some nouns have a locative case used to express location (otherwise expressed by the ablative with a preposition such as in), but this survival from Proto-Indo-European is found only in the names of lakes, cities, towns, small islands, and a few other words related to locations, such as "house", "ground", and "countryside". Latin itself, being a very old language, is far closer to Proto-Indo-European than are most modern Western European languages; it has, in fact, about the same relationship with PIE as modern Italian or French has to Latin.
There are six general tenses in Latin (technically they are tense/aspect/mood complexes). The indicative mood can be used with all of them. The subjunctive mood, however, has only present, imperfect, perfect, and pluperfect tenses. These tenses in the subjunctive mood do not completely correlate in meaning to the tenses in the indicative. The following examples are of the first conjugation verb "laudare" ("to praise") in the indicative mood and the active voice:
Primary sequence tenses
# present (laudo, "I praise")
# imperfect (laudabam, "I was praising")
# future (laudabo, "I shall praise," "I will praise")
Secondary sequence tenses
# perfect (laudavi, "I praised", "I have praised")
# pluperfect (laudaveram, "I had praised")
# future perfect (laudavero, "I shall have praised," "I will have praised")
The future perfect tense can also imply a normal future idea (like in "When I will have run...") and so may also sometimes be included in the primary sequence.
Latin and Romance
After the collapse of the Roman Empire, Latin evolved into the various Romance languages. These were for many centuries only spoken languages, Latin still being used for writing. For example, Latin was the official language of Portugal until 1296 when it was replaced by Portuguese.
The Romance languages evolved from Vulgar Latin, the spoken language of common usage, which in turn evolved from an older speech which also produced the formal classical standard. Latin and Romance differ (for example) in that Romance had distinctive stress, whereas Latin had distinctive length of vowels. In Italian and Sardo logudorese, there is distinctive length of consonants and stress, in Spanish only distinctive stress, and in French even stress is no longer distinctive.
Another major distinction between Romance and Latin is that all Romance languages, excluding Romanian, have lost their case endings in most words except for some pronouns. Romanian retains a direct case (nominative/accusative), an indirect case (dative/genitive), and vocative.
In Italy, Latin is still compulsory in secondary schools as Liceo Classico and Liceo Scientifico which are usually attended by people who aim to the highest level of education. In Liceo Classico Ancient Greek is a compulsory subject.
Latin and English
See Latin influence in English for a more complete exposition.
English grammar is independent of Latin grammar, though prescriptive grammarians in English have been heavily influenced by Latin. Attempts to make English grammar follow Latin rules — such as the prohibition against the split infinitive — have not worked successfully in regular usage. However, as many as half the words in English were derived from Latin, including many words of Greek origin first adopted by the Romans, not to mention the thousands of French, hundreds of Spanish, Portuguese and Italian words of Latin origin that have also enriched English.
During the 16th and on through the 18th century English writers created huge numbers of new words from Latin and Greek roots. These words were dubbed "inkhorn" or "inkpot" words (as if they had spilled from a pot of ink). Many of these words were used once by the author and then forgotten, but some remain. Imbibe, extrapolate, dormant and inebriation are all inkhorn terms carved from Latin words. In fact, the word etymology is derived from the Greek word etymologia, meaning "true sense of the word."
Latin was once taught in many of the schools in Britain with academic leanings - perhaps 25% of the total [http://www.channel4.com/history/microsites/T/teachem2/thennow/]. However, the requirement for it was gradually abandoned in the professions such as the law and medicine, and then, from around the late 1960s, for admission to university. After the introduction of the Modern Language GCSE in the 1980s, it was gradually replaced by other languages, although it is now being taught by more schools along with other classical languages.
Latin education
The linguistic element of Latin courses offered in high schools or secondary schools, and in universities, is primarily geared toward an ability to translate Latin texts into modern languages, rather than using it in oral communication. As such, the skill of reading is heavily emphasized, whereas speaking and listening skills are barely touched upon. However, there is a growing movement, sometimes known as the Living Latin movement, whose supporters believe that Latin can, or should, be taught in the same way that modern "living" languages are taught, that is, as a means of both spoken and written communication. One of the most interesting aspects of such an approach is that it assists speculative insight into how many of the ancient authors spoke and incorporated sounds of the language stylistically; without understanding how the language is meant to be heard it is very difficult to identify patterns in Latin poetry. Institutions offering Living Latin instruction include the Vatican and the University of Kentucky. In Britain the Classical Association encourages this approach, and there has been something of a vogue for books describing the adventures of a mouse called Minimus. In the United States there is a thriving competitive organization for high school Latin students, the National Junior Classical League (the second-largest youth organization in the world after the Boy Scouts), backed up by the Senior Classical League for college students. Many would-be international auxiliary languages have been heavily influenced by Latin, and the moderately successful Interlingua considers itself to be the modernized and simplified version of the language (le latino moderne international e simplificate).
Latin translations of modern literature such as Paddington Bear, Winnie the Pooh, Harry Potter and the Philosopher's Stone, Le Petit Prince, Max und Moritz, and The Cat in the Hat have also helped boost interest in the language.
See also
About the Latin language
- Latin grammar
- Latin spelling and pronunciation
- Latin declension
- Latin conjugation
- Latin alphabet
- List of Latin words with English derivatives
- Latin verbs with English derivatives
- Latin nouns with English derivatives
- ablative absolute
- Word order in Latin
About the Latin literary heritage
- Latin literature
- Romance languages
- Loeb Classical Library
- List of Latin phrases
- List of Latin proverbs
- Brocard
- List of Latin and Greek words commonly used in systematic names
- List of Latin place names in Europe
- Carmen Possum
Other related topics
- Roman Empire
- Internationalism
References
- Bennett, Charles E. Latin Grammar (Allyn and Bacon, Chicago, 1908)
- N. Vincent: "Latin", in The Romance Languages, M. Harris and N. Vincent, eds., (Oxford Univ. Press. 1990), ISBN 0195208293
- Waquet, Françoise, Latin, or the Empire of a Sign: From the Sixteenth to the Twentieth Centuries (Verso, 2003) ISBN 1859844022; translated from the French by John Howe.
- Wheelock, Frederic. Latin: An Introduction (Collins, 6th ed., 2005) ISBN 0060784237
External links
- [http://www.jambell.com/latin.html Latin Phrases for after dinner conversation (Thanks to Elaine Poole)]
- [http://www.ethnologue.com/show_language.asp?code=lat Ethnologue report for Latin]
- [http://forumromanum.org/literature/index.html Corpus Scriptorum Latinorum] is a comprehensive webography of Latin texts and their translations.
- [http://www.perseus.tufts.edu/ The Perseus Project] has many useful pages for the study of classical languages and literatures, including [http://www.perseus.tufts.edu/cgi-bin/resolveform?lang=Latin an interactive Latin dictionary].
- [http://lysy2.archives.nd.edu/cgi-bin/words.exe words by William whitaker] is a dictionary program online capable of looking up various word forms.
- [http://retiarius.org/ Retiarius.Org] includes a Latin text search engine.
- [http://www.nd.edu/~archives/latgramm.htm Latin-English dictionary and Latin grammar from U of Notre Dame]
- [http://latin-language.co.uk/ Latin language] History of Latin language, Latin texts with English translation and a collection of dictionaries.
- [http://augustinus.eresmas.net/scl/ Societas Circulorum Latinorum] gathers together Latin Circles all over the world.
- [http://www.learnlatin.tk LearnLatin.tk] - Free online course in Latin
- [http://www.latintests.net/ LatinTests.net] - Lets Latin learners test their grammar and vocabulary with self-checking quizzes.
- [http://thelatinlibrary.com/ The Latin Library] contains many Latin etexts
- [http://www.textkit.com/ Textkit] has Latin textbooks and etexts.
- [http://www.websters-online-dictionary.org/definition/Latin-english/ Latin–English Dictionary]: from Webster's Rosetta Edition.
- [http://www.language-reference.com/ Language reference] Cross-foreign-language lexicon powered by its own search engine. All cross combinations between Latin and French, German, Italian, Spanish.
- [http://comp.uark.edu/~mreynold/rhetor.html Rhetor by Gabriel Harvey] was originally published in 1577 and never again reprinted.
- [http://freewebs.com/omniamundamundis omniamundamundis] Latin hypertexts from fourteen ancient Roman authors.
- [http://www.saltspring.com/capewest/pron.htm Pronunciation of Biological Latin, Including Taxonomic Names of Plants and Animals]
- [http://www.yleradio1.fi/nuntii Nuntii Latini (News in Latin)], written and spoken (RealAudio) news in latin. Weekly review of world news in Classical Latin, the only international broadcast of its kind in the world, produced by YLE, the Finnish Broadcasting Company.
- [http://www.tranexp.com:2000/InterTran?url=http%3A%2F%2F&type=text&text=Replace%20Me&from=eng&to=ltt InterTran Latin], Translate from Latin to ENGLISH or vice versa.
- [http://www.latinvulgate.com Latin Vulgate] The Latin and English of the Old & New Testaments in parallel, along with the Complete Sayings of Jesus in parallel Latin and English.
Category:Classical languages
Category:Ancient languages
Category:Fusional languages
Category:Languages of Italy
Category:Languages of Vatican City
als:Latein
zh-min-nan:Latin-gí
ko:라틴어
ja:ラテン語
simple:Latin language
th:ภาษาละติน
Persian language
Persian (فارسی = Fârsi ... پارسی = Pârsi), (local name in Iran, Afghanistan and Tajikistan: ‘Fârsi’), ‘Pârsi’ (older local name, but still used by some speakers), Tajik (a Central Asian dialect) or Dari (another local name in Tajikistan and Afghanistan), is a language spoken in Iran, Tajikistan, Afghanistan, Uzbekistan, Bahrain, and elsewhere. Prior to British colonization, Persian was also widely used as a second language in the Indian subcontinent; it took prominence as the language of culture and education in several Muslim courts in the subcontinent throughout the Middle Ages and became the official court language under the Mughal emperors. Evidence of its former rank in the region can still be seen by the extent of its influence on Hindi or Urdu, as well as the popularity that Persian literature still enjoys in the region. Persian or its dialects have official-language status in the countries of Iran, Afghanistan, and Tajikistan. There are 61-71 million native speakers [http://www.ethnologue.com/show_family.asp?subid=90035]. It belongs to the Indo-European language family, and is of the Subject Object Verb type.
History
Persian is a member of the Indo-European family of languages, and within that family it belongs to the Indo-Iranian (Aryan) branch. Scholars believe the Iranian sub-branch consists of the following chronological linguistic path: Old Persian (Avestan and Achaemenid Persian) → Middle Persian (Pahlavi, Parthian, and Sassanids Persian) → Modern Persian (Dari, c. 900 to present Persian).
Old Persian, the main language of the Achaemenid inscriptions, should not be confused with the non-Indo-European Elamite language (see Behistun inscription). Over this period, the morphology of the language was simplified from the complex conjugation and declension system of Old Persian to the almost completely regularized morphology and rigid syntax of Modern Persian, in a manner often described as paralleling the development of English. Additionally, many words were introduced from neighboring languages, including Aramaic and Greek in earlier times, and later Arabic and to a lesser extent Turkish. In more recent times, some Western European words have entered the language (notably from French and English).
The language itself has greatly developed during the centuries. Due to technological developments, new words and idioms are created and enter into Persian like any other language. In Tehran the Academy of Persian Language and Literature is a center that evaluates the new words in order to initiate and advise their Persian equivalents. In Afghanistan, the Academy of Sciences of Afghanistan does the same for Afghan Persian (among other languages).
Nomenclature
Persian, the more widely used name of the language in English, is an Anglicized form derived from Latin - Persianus, < Latin Persia, < Greek Persis, a hellenized form of Old Persian Parsa. Farsi is the Arabicized form of Parsi, due to a lack of the /p/ phoneme in Standard Arabic. Its use in the English language is very recent (since the 1970s). Native Persian speakers typically call it "Fârsi" in modern usage. ISO, the Academy of Persian Language and Literature, and many other sources call the language Persian. The government of Afghanistan uses both "Dari" and "Persian" in English communications.
The Academy of Persian Language and Literature as well as many lexicographers have announced that "Farsi" is not the appropriate term to use for the Persian language in English. In the ISO 639-1, the local names form the basis for the language codes and for this reason "fa" is the designation for the Persian language in that system.
Dialects and close languages
ISO 639-1
Communication is generally mutually intelligible between Iranians, Tajiks, and Persian-speaking Afghans; however, by popular definition:
- Dari is the local name for the eastern dialect of Persian, one of the two official languages of Afghanistan, including Hazaragi — spoken by the Hazara people of central Afghanistan.
- Tajik could also be considered an eastern dialect of Persian, but, contrary to Iranian and Afghan Persian, it is written in the Cyrillic script.
The following are some of the closely related languages of various Iranian peoples within modern Iran proper:
- Mazerooni, or Mazandarani, spoken in northern Iran mainly in the province of Mazandaran.
- Guilaki, or Gilaki — spoken in the province of Guilan.
- Talysh, or Talishi — spoken in northern Iran and southern parts of the Republic of Azerbaijan.
- Luri, or Lori — spoken mainly in the southwestern Iranian province of Lorestan.
- (a.k.a. Tati, or Eshtehardi) — spoken in parts of the Iranian provinces of East Azarbaijan, Zanjan and Qazvin.
- Dari or Gabri — spoken originally in Yazd and Kerman by the Zoroastrians of Iran. Also called Yazdi by some.
- Dzhidi or Judæo-Persian — a collection of languages or dialects spoken by the many varied and ancient Jewish communities throughout the former greatest extent of the Persian Empire, one of the many Jewish languages of Persian Jews.
Orthography
The vast majority of modern Persian text is written in a form of the Arabic alphabet. In recent years the Latin alphabet has been used by some for technological or internationalization reasons.
Arabic Alphabet
Modern Persian is normally written using a modified variant of the Arabic alphabet.
The adoption of the Arabic script
After the conversion of Persia to Islam (see Islamic conquest of Iran), it took approximately one hundred and fifty years before Persians adopted the Arabic alphabet as a replacement for the older alphabet. Previously, two different alphabets were used for the Persian language (Middle Persian, or Pahlavi, at that time): one was also called Pahlavi and was a modified version of the Aramaic alphabet, and the other was a native Iranian alphabet called Dîndapirak (literally: religion script).
Note: "independence" of Arabic and Persian languages
One should note that despite their shared standard alphabet, Persian and Arabic are entirely different languages: they are not closely genetically related (they belong to separate genetic language families, namely, Indo-European and Afro-Asiatic) and naturally have different phonology and grammar.
The features of the Persian variant
The Persian variant adds four letters to the Arabic alphabet for its use, due to the fact that four sounds that exist in Persian do not exist in Arabic. Additionally, it changes the shape of another two. Some people call this modified alphabet the Perso-Arabic alphabet. The additional four letters are:
The letters different in shape are:
The diacritical marks used in the Arabic script, a.k.a. harakat, are also used in Persian, although some of them have different pronunciations. For example, an Arabic Damma is pronounced as /u/, while in Persian it is pronounced as /o/.
The Persian variant also adds the notion of a pseudo-space to the Arabic script, called a Zero-width non-joiner (ZWNJ) by the Unicode Standard. It acts like a space in disconnecting two otherwise-joining adjacent letters, but does not have a visual width.
Note: Spelling of Arabic words in Arabic and in Persian
It should also be noted that many Persian words with an Arabic root are spelled differently from the original Arabic word. Alef with hamza below ( إ ) always changes to alef ( ا ); teh marbuta ( ة ) usually, but not always, changes to teh ( ت ) or heh ( ه ); and words using various hamzas get spelled with yet another kind of hamza (so that مسؤول becomes مسئول).
Further expansion of the Persian variant
The features of the Persian variant have been taken for other languages, such as Pashto or Urdu, and have sometimes been further extended with new letters or punctuation.
Latin Alphabet
The Universal Persian (UniPers / Pârsiye Jahâni) Alphabet is a Latin-based alphabet created over 50 years ago in Iran and popularized by Mohamed Keyvan, who had used it in a number of Persian textbooks for foreigners and travellers. It sidesteps the difficulties of the traditional Arabic-based alphabet, with its multiple letter shapes and ambiguous spellings, and fits particularly well in contemporary electronically written media.
Fingilish is the name given to texts written in Persian using the Basic Latin alphabet. It is most commonly used in chat, emails and SMS applications.
Phonology
:Main article: Persian phonology
The Persian language has six vowels and twenty-three consonants, including two affricates /ʧ/ (ch) and /ʤ/ (j).
Historically, Persian distinguished length: the long vowels , , contrasting with the short vowels , , respectively. Modern spoken Persian, however, generally does not make this distinction anymore.
Persian phonology
Consonants
| | labial |
alveolars |
post-alveolars |
velars |
glottals |
| voiceless stops | |
| |
| |
| voiced stops | |
| |
| |
| voiceless fricatives | |
| |
| |
| voiced fricatives | |
|
|
| |
| nasals | |
| | | |
| liquids | | , |
| | |
| glides | | |
|
| |
Note that and are affricates, not stops.
Grammar
:Main article: Persian grammar
Suffixes predominate Persian morphology, though there are a small number of prefixes. Verbs can express tense and aspect, and they agree with the subject in person and number. There is no grammatical gender for nouns, nor are pronouns marked for natural gender.
Normal sentences are structured as "(S) (PP) (O) V". If the object is definite, then the order is "(S) (O + "rɑ:") (PP) V".
Vocabulary
There are many loanwords in the Persian language, mostly coming from Arabic, English, French, and the Turkic languages.
Persian has likewise influenced the vocabularies of other languages, especially Indo-Iranian languages and Turkic languages. Many Persian words have also found their way into the English language. See List of English words of Persian origin.
See also
- Academy of Persian Language and Literature
- Arabic numerals
- Dzhidi language
- History of Urdu
- List of English words of Persian origin
- List of Persian poets and authors
- Middle Persian literature
- Persian grammar
- Persian literature
- Persian mythology
- Persian phonology
- Persian or Farsi? - The announcement of the Academy of Persian Language and Literature
References
- Mace, J. (2003). Persian Grammar: For reference and revision. Routledge-Curzon, London.
- Mahootian, S. (1997). Persian. Descriptive Grammars. Routledge, London.
- Windfuhr, G. L. (1987). Persian. In Comrie, B., editor, The World’s Major Languages, pages 523–546. Oxford University Press, Oxford.
External links
- [http://www.ethnologue.org/show_language.asp?code=PRS Ethnologue report for Eastern Persian]
- [http://www.ethnologue.org/show_language.asp?code=PES Ethnologue report for Western Persian]
- [http://www.easypersian.com/ Easypersian.com]
- [http://www.websters-online-dictionary.org/translation/Farsi/ Dictionary] with Farsi - English Translations from [http://www.websters-online-dictionary.org Webster's Online Dictionary] - the Rosetta Edition
- [http://www.aryanpour.com/ Aryanpour Persian-English English-Persian Dictionary]
- [http://www.lmp.ucla.edu/Profile.aspx?LangID=63 UCLA Language Materials Project: Persian]
- [http://www.unipers.com/ UniPers.com A proposed Latin-based writing system designed specifically for the Persian language.]
- [http://www.persiandirect.com Persian Linguistics Association]
- [http://homepages.nyu.edu/%7Emmk4/AATP.htm American Association of Teachers of Persian (AATP)]
- [http://www.apersian.org The Centre for Promotion of Persian Language and Literature]
- [http://www.voanews.com/persian VOA’s Persian Language Service]
- [http://www.bbc.co.uk/persian BBC’s Persian Language Service]
- [http://www.dwelle.de/persian Deutche Welle’s Persian Service]
- [http://iran-heritage.org/interestgroups/language-articles.htm An Online Persian Language Forum]
Category:Classical languages
Category:Iranian culture
Category:Iranian languages
Category:Languages of Afghanistan
Category:Languages of Iran
Category:Languages of Tajikistan
Category:Languages of Uzbekistan
Category:Languages of Pakistan
Category:Languages of Russia
ko:페르시아어
ja:ペルシア語
th:ภาษาเปอร์เซีย
Arabic language
The Arabic language (; , less formally, ) is the largest member of the Semitic branch of the Afro-Asiatic language family (classification: South Central Semitic) and is closely related to Hebrew and Aramaic. It is spoken throughout the Arab world and is widely studied and known throughout the Islamic world. Arabic has been a literary language since at least the 6th century and is the liturgical language of Islam.
Literary and Modern Standard Arabic
The term "Arabic" may refer either to literary Arabic, which no Arab speaks as a mother tongue, or Modern Standard Arabic or to the many spoken varieties of Arabic commonly called "colloquial Arabic." Arabs consider literary Arabic as the standard language and tend to view everything else as mere dialects. Literary Arabic, (Literally: "the most eloquent Arabic language" — ) refers both to the language of present-day media across North Africa and the Middle East and to the more archaic language of the Qur'an. (The expression media here includes most television and radio, and all written matter, including all books, newspapers, magazines, documents of every kind, and reading primers for small children.) "Colloquial" or "dialectal" Arabic refers to the many national or regional dialects/languages derived from Classical Arabic, spoken daily across North Africa and the Middle East, which constitute the everyday spoken language. These sometimes differ enough to be mutually incomprehensible. These dialects are not typically written, although a certain amount of literature (particularly plays and poetry) exists in many of them. They are often used to varying degrees in informal spoken media, such as soap operas and talk shows.
Literary Arabic or classical Arabic, is the official language of all Arab countries and is the only form of Arabic taught in schools at all stages.
The sociolinguistic situation of Arabic in modern times provides a prime example of the linguistic phenomenon of Diglossia -the normal use of two separate varieties of the same language, usually in different social situations. In the case of Arabic, educated Arabs of whatever nationality can be assumed to speak both their local dialect and their school-taught literary Arabic (to an equal or lesser degree). This diglossic situation facilitates code switching in which a speaker switches back and forth unaware between the two varieties of the language, sometimes even within the same sentence. In instances in which Arabs of different nationalities engage in conversation only to find their dialects mutually unintelligible (e.g. a Moroccan speaking with a Lebanese), both should be able to code switch into Literary Arabic for the sake of communication.
Since the written Arabic of today differs from the written Arabic of the Qur'anic era, it has become customary in western scholarship and among non-Arab scholars of Arabic to refer to the language of the Qur'an as Classical Arabic and the modern language of the media and of formal speeches as Modern Standard Arabic. Arabs, on the other hand, often use the term to refer to both forms, thus placing greater emphasis on the similarities between the two. The difference between Arabic of the Qur'anic era and today's Classical Arabic is only in the degree of eloquance. The vocabulary, the syntatic and grammatical rules are the same.
Quite a few English words are ultimately derived from Arabic, often through other European languages, especially Spanish, among them every-day vocabulary like sugar (sukkar), cotton (qutn) or magazine (). More recognizable are words like algorithm, algebra, alchemy, alcohol, azimuth, nadir, and zenith (see List of English words of Arabic origin). The Maltese language spoken on the Mediterranean island of Malta is the only surviving European language to derive primarily from Arabic (a North African dialect), though it contains a large number of Italian and English borrowings.
Arabic and Islam
It is sometimes | | |