The Romance languages, a major branch of the Indo-European language family, comprise all languages that descended from Latin, the language of the Roman Empire. The Romance languages have more than 600 million native speakers worldwide, mainly in the Americas, Europe, and Africa, as well as in many smaller regions scattered through the world.

All Romance languages (sometimes referred to as Romanic) descend from Vulgar Latin, the language of the soldiers, settlers and slaves of the Roman Empire, which was substantially different from the Classical Latin of the Roman literati. Between 200 BCE and 100 CE, the expansion of the Empire, coupled with the administrative and educational policies of Rome, made Vulgar Latin the dominant native language over a wide area spanning from the Iberian Peninsula to the western coast of the Black Sea. During the Empire's decline, and after its collapse and fragmentation in the 5th century, Vulgar Latin began to evolve independently within each local area, and eventually diverged into dozens of distinct languages. The overseas empires established by Spain, Portugal and France after the 15th century then spread Romance to the other continents, to such an extent that about two-thirds of all Romance speakers are now outside Europe.

Despite multiple influences from pre-Roman languages and from later invasions, the phonology, morphology, lexicon, and syntax of all Romance languages are predominantly derived from Vulgar Latin. As a result, the group shares a number of linguistic features that set it apart from other Indo-European branches. In particular, with only one or two exceptions, Romance languages have lost the declension system of Classical Latin and, as a result, have a relatively rigid SVO sentence structure and make extensive use of prepositions.

History

Vulgar Latin

There is very little documentary evidence about the nature of Vulgar Latin, and what little there is is often hard to interpret or generalize.
In any case, many of its speakers were soldiers, slaves, displaced peoples and forced resettlers; that is, they were more likely to be natives of the conquered lands than natives of Rome. It is believed that Vulgar Latin already had most of the features that are shared by all Romance languages and that distinguish them from Classical Latin, such as the almost complete loss of the Latin declension system and its replacement by prepositions, the loss of the neuter gender, of comparative inflections, and of many verbal tenses, the use of articles, and the change in pronunciation of C and G.

Fall of the Empire

The political decline of the Roman Empire in the 5th century and the large-scale migrations of the period, notably the Germanic incursions, led to a fragmentation of the Latin-speaking world into several independent states. Central Europe and the Balkans were occupied by Germanic and Slavic tribes, Huns, and Turks, isolating Romania from the rest of Latin Europe. Latin also disappeared from England, which had been for a time part of the Empire. On the other hand, the Germanic tribes that had entered Italy, France, and the Iberian Peninsula eventually adopted Latin and the remains of Roman culture, and thus Latin continued to be the dominant language in those areas.

Latent incubation

Between the 5th and 10th centuries, spoken Vulgar Latin underwent divergent evolution in various parts of its domain, leading to dozens of distinct languages. This evolution is poorly documented, as the written language for all purposes continued to be a Latin close to the Classical variant.

Recognition of the vernaculars

Between the 10th and 13th centuries, some local vernaculars came to be written and began to supplant Latin in many of its roles. In some countries, such as Portugal, this transition was expedited by force of law, whereas in other countries, such as Italy, the rise of the vernacular was the result of many prominent poets and writers adopting it as their medium.
Uniformization and standardization

The invention of the printing press apparently slowed down the evolution of the Romance languages from the 16th century on, and brought instead a tendency towards greater uniformity of language within political boundaries. In France, for instance, the "Francien" spoken in the region of Paris gradually spread over the whole country, while the Langue d'Oc and Franco-Provençal of the south lost much ground.

History of the name

The term "Romance" comes from the Vulgar Latin adverb romanice, derived from romanicus, as used in the expression romanice loqui ("to speak the Roman vernacular", contrasted with barbarice loqui, "to speak the non-Latin (barbarian) languages of the invaders", and latine loqui, "to speak the Latin taught in schools"). From this adverb originated the noun romance, which applied initially to anything written romanice, "in the Roman vernacular".

Incidentally, "romance" meaning "love story, love affair" has the same origin. In the medieval literature of Western Europe, while serious writing was usually in Latin, popular tales, often focusing on love, were composed in the vernacular and came to be called "romances".

Status

The most spoken Romance language is Spanish, followed by Portuguese, French, Italian, Romanian and Catalan. The first five are each a main and official national language in more than one country. A few other languages have official status on a regional or otherwise limited level, for instance Friulian, Sardinian and Valdôtain in Italy; Romansh in Switzerland; and Galician, Aranese and Catalan in Spain (the last of which is also the only official language in the small sovereign state of Andorra). Spanish, Portuguese, French and Italian are also official and working languages of the European Union (Romanian will also become an official and working language of the EU on January 1, 2007), and Spanish and French are two of the six official languages of the United Nations.
The remaining Romance languages survive mostly as spoken languages for informal contact. National governments have historically viewed linguistic diversity as an economic, administrative or military liability, as well as a potential source of separatist movements; therefore, they have generally fought to eliminate it by extensively promoting the use of the official language, by restricting the use of the "other" languages in the media, by characterizing them as mere "dialects", or worse. In the last decades of the 20th century, however, increased sensitivity to the rights of minorities has allowed these languages to recover some of their prestige and lost rights. Yet it is unclear whether these political changes will be enough to reverse the decline of the non-official languages.

Features inherited from Indo-European

As members of the Indo-European (IE) family, Romance languages have a number of features that are shared with other IE subfamilies (such as the Celtic, Germanic, Slavic, and Indo-Persian languages, Albanian, Armenian, Greek, Lithuanian, etc.), and in particular with English, but which set them apart from non-IE languages like Arabic, Basque, Hungarian, Tamil, and many more.

Features inherited from Classical Latin

The Romance languages share a number of features that were inherited from Classical Latin and that collectively set them apart from most other Indo-European languages.

Features inherited from Vulgar Latin

Romance languages also have a number of features that are not shared with Classical Latin. Most of these features are thought to be inherited from Vulgar Latin. Even though the Romance languages are all derived from Latin, they are arguably much closer to each other than to their common ancestor.
The main difference is the loss of the case system of Classical Latin, an essential feature which allowed great freedom of word order and has no counterpart in any Romance language (except to some extent in Romanian, which preserved three of Latin's seven noun cases). In this regard, the distance between any modern Romance language and Latin is comparable to that between Modern English and Old English. While speakers of French, Spanish or Italian, for example, can quickly learn to see through the spelling changes and thus recognize many Latin words, they will often fail to understand the meaning of Latin sentences.

Other shared features

The Romance languages also share a number of features that were not the result of common inheritance, but rather of various cultural diffusion processes in the Middle Ages, such as literary diffusion, commercial and military interactions, political domination, the influence of the Catholic Church, and (especially in later times) conscious attempts to "purify" the languages by reference to Classical Latin. Some of those "late origin" features have in fact spread to other non-Romance (and even non-Indo-European) languages, chiefly in Europe.

Divergent features

In spite of their common origin, the descendants of Vulgar Latin have many differences. These occur at all levels, including the sound systems, the orthography, the nominal, verbal, and adjectival inflections, the auxiliary verbs and the semantics of verbal tenses, the function words, the rules for subordinate clauses, and, especially, the vocabularies. While most of those differences are clearly due to independent development after the breakup of the Roman Empire (including invasions and cultural exchanges), one must also consider the influence of prior languages in the territories of Latin Europe that fell under Roman rule, and possible inhomogeneities in Vulgar Latin itself.
It is often said that Portuguese and French are the most innovative of the Romance languages, each in different ways, that Sardinian and Romanian are the most isolated and conservative variants, and that the languages of Italy other than Sardinian (including Italian) occupy a middle ground. Some even claim that Languedocian Occitan is the "most average" western Romance language. However, these evaluations are largely subjective, as they depend on how much weight one assigns to specific features. In fact, all Romance languages, including Sardinian and Romanian, are vastly different from their common ancestor.

Romanian (together with other related minor languages, like Aromanian) has a number of grammatical features which are unique within Romance but are shared with other non-Romance languages of the Balkans, such as Albanian, Bulgarian, Greek, and Serbian. These features include, for example, the structure of the vestigial case system, the placement of articles as suffixes of the nouns (cer = "sky", cerul = "the sky"), and several more. This phenomenon, called the Balkan linguistic union, may be due to contacts between those languages in post-Roman times.

Sound changes

The vocabularies of Romance languages have undergone massive change since their birth, through various phonological processes that were characteristic of each language. Those changes applied more or less systematically to all words, but were often conditioned by the sound context or morphological structure.

Some languages have dropped sounds from the original Latin words. French, in particular, has dropped all final vowels, and sometimes also the preceding consonant: thus Latin LUPUS and LUNA became Italian lupo and luna but French loup and lune. Catalan, Occitan, and Romanian (Daco-Romanian) lost the final vowels in most masculine nouns and adjectives, but retained them in the feminine.
Other languages, including Portuguese, Spanish, Italian, Franco-Provençal, and the southern dialects of Romanian, have retained those vowels.

Some languages, like Portuguese, Spanish, and Venetian, have lost the final vowel -E from verbal infinitives, e.g. DĪCERE → Portuguese dizer ("to say"). Other common cases of final truncation are the verbal endings, e.g. Latin AMĀT → Italian ama ("he loves"), AMĀBAM → amavo ("I loved"), AMĀBAT → amava ("he loved"), AMĀBATIS → amavate ("you (pl.) loved"), etc.

Sounds have often been dropped in the middle of the word, too; e.g. Latin LUNA → Galician and Portuguese lua, CRĒDERE → Spanish creer ("to believe").

On the other hand, some languages have inserted many epenthetic vowels in certain contexts. For instance, Spanish and Portuguese have generally inserted an e in front of Latin words that began with S + consonant, such as SPERŌ → espero ("I hope"). French has gone the same way, but then dropped the s: SPATULA → épaule ("shoulder"). In the case of Italian, a special article, lo for the definite and uno for the indefinite, is used for masculine words beginning with S + consonant (sbaglio, "mistake"), as well as all masculine words beginning with Z (zaino, "backpack").

Lexical stress

The position of the stressed syllable in a word generally varies from word to word in each Romance language, and often moves as the word is inflected. Sometimes the stress is lexically significant, e.g. Italian Papa ("Pope") and papà ("daddy"), or Spanish imperfect subjunctive cantara ("he would sing") and future cantará ("he will sing"). However, the main function of Romance stress appears to be as a clue for speech segmentation, namely to help the listener identify the word boundaries in normal speech, where inter-word spaces are usually absent.

In Romance languages, the stress is usually confined to one of the last three syllables of the word. That limit may be occasionally exceeded by some verbs with attached clitics, e.g.
Italian mettiamocene or Metintilu in Friulian ("let's put some of it in there") or Spanish entregándomelo ("delivering it to me"). Originally the stress was predominantly on the penultimate syllable, but that pattern has changed considerably in some languages. In French, for instance, the loss of final vowels has left the stress almost exclusively on the last syllable.

Formation of plurals

Some Romance languages form plurals by adding -s (derived from the plural of the Latin accusative case), while others form the plural by changing the final vowel (by influence of the Latin nominative ending -i). See La Spezia-Rimini Line for more information.

Borrowed words

Vulgar Latin borrowed many words, often from Germanic languages, that replaced words from Classical Latin during the Migration Period, even including common basic vocabulary.

Words for "more"

Some Romance languages use a version of Latin plus, others a version of magis.

Words for "nothing"

The common word for "nothing" is nada in Spanish and Portuguese, nada and ren in Galician, rien in French, res in Catalan, ren in Occitan, nimic in Romanian, niente and nulla in Italian, and nue and nuie in Friulian. It is said that all three roots derive from different parts of a Latin phrase NULLAM REM NATAM ("no thing born"), an emphatic idiom for "nothing".

The number 16

Romanian constructs the names of the numbers 11–19 by a regular pattern which could be translated as "one-over-ten", "two-over-ten", etc. All the other Romance languages use a pattern like "one-ten", "two-ten", etc. for 11–15, and the pattern "ten-and-seven", "ten-and-eight", "ten-and-nine" for 17–19.
For 16, however, they split into two groups: some use "six-ten", some use "ten-and-six".

To have and to hold

The verbs derived from Latin HABĒRE "to have", TENĒRE "to hold", and ESSE "to be" are used differently in the various Romance languages to express possession, to construct perfect tenses, and to make existential statements ("there is"). In the examples below, T stands for TENĒRE, H for HABĒRE, and E for ESSE, and the three letters give the verb used for possession, for the perfect, and for the existential, in that order:

English: I have, I have done, there is
Friulian: (jo) o ai, (jo) o ai fat, a 'nd è, al è (HHE)
Romanian: (eu) am, (eu) am făcut, este (or e) (HHE)
Italian: (io) ho, (io) ho fatto, c'è (HHE)
French: j'ai, j'ai fait, il y a (HHH)
Catalan: (jo) tinc, (jo) he fet, hi ha (THH)
Spanish: (yo) tengo, (yo) he hecho, hay (THH)
Portuguese: (eu) tenho, (eu) tenho feito, há (TTH)

Galician-Portuguese used to employ the auxiliary H for permanent states, such as Eu hei um nome "I have a name" (i.e. for all my life), and T for non-permanent states, such as Eu tenho um livro "I have a book" (i.e. perhaps not so tomorrow). Informal Brazilian Portuguese uses the T verb even in the existential sense, e.g. Tem água no copo "There is water in the glass".

In most languages, the descendant of TENĒRE still has the sense of "to hold" as well, e.g. Italian tieni il libro, French tu tiens le livre, Catalan tens el llibre, Romanian ţine cartea, Friulian Tu tu tegnis il libri "You're holding the book". In others, like Portuguese, this sense has been mostly lost, and a different verb is currently used for "to hold".

To have or to be

Some languages use their equivalent of "have" as an auxiliary verb to form the perfect forms (e.g. French passé composé) of all verbs; others use "be" for some verbs and "have" for others.

I did or I have done

Some languages (e.g. Spanish, Catalan, Occitan and written French and Italian) make a distinction between a preterite and a perfect tense (cf. English I did vs. I have done).
Others (spoken French and Italian) have only one tense, which renders both meanings. French and Italian use the compound past for this, while Sicilian uses the simple past.

Portuguese is unique in that its equivalent of the passé composé, made with the auxiliary ter (Spanish tener), is uncommon and does not have the same meaning as in other Romance languages. The phrase eu tenho feito is closer in meaning to I have been doing than to I have done, which would be rendered with the simple past eu fiz (although the meaning of the compound tense overlaps with "I have done" in some contexts). Galician is also unique in that it does not use auxiliary verbs in perfect tenses, except for a use of ter similar to the Portuguese one (never haver).

Writing systems

Letter values

While most of the 22 basic Latin letters have similar sound values in all Romance languages, the values of some letters have diverged considerably, and the new letters added since the Middle Ages have been put to different uses in different scripts. Some letters, notably H and Q, have been variously combined in digraphs or trigraphs (see below) to represent phonetic phenomena not recorded in Latin, or to get around previously established spelling conventions.

A characteristic feature of the writing systems of all Romance languages is that the Latin letters C and G, which originally always represented /k/ and /g/ respectively, represent other sounds when they come before E, I, and in some cases Y. This is due to a general palatalization of /k/ and /g/ before front vowels like /e/ and /i/, which is believed to have occurred in the transition from Classical to Vulgar Latin. Since the written form of all the affected words was tied to the classical language, the shift was accommodated by a change in the pronunciation rules. However, the new sounds of C and G in those contexts differ from language to language.

The spelling rules of most Romance languages are fairly complex, and subject to considerable regional variation.
To a first approximation, the phonetic representation of non-combined letters can be summarized as follows:

C: generally /k/, but "softened" before E, I, or Y in most Romance languages: to /s/ in French, Portuguese, Occitan, Catalan, and American Spanish; to /θ/ in Peninsular Spanish; and to /tʃ/ in Italian and Romanian.
G: generally /g/ or /ɣ/, but "softened" before E, I, or Y in most languages: to /ʒ/ in French, Portuguese, Occitan and Catalan; to /x/ or /h/ in Spanish (according to dialect); and to /dʒ/ in Italian and Romanian.
H: silent in most languages, but represents /h/ in Romanian and Gascon Occitan. Used in various digraphs (see below).
J: represents /ʒ/ in most languages; /x/ or /h/ in Spanish; /j/ in several of Italy's languages, though it is normally replaced with I in native Italian words.
S: normally represents /s/ (either laminal or apical) at syllable onset, but usually /z/ between vowels. Intervocalic S is, however, pronounced /s/ in Spanish, Romanian, and several varieties of Italian. In the syllable coda, it may have special allophonic pronunciations.
W: used only in Walloon. Represents /v/ in French, with the exception of words borrowed from English.
X: has several different values, depending on the language and on whether it stands at the beginning of a word or between vowels, in French, Spanish, Portuguese, Catalan, Galician and Romanian. Not used in Italian.
Y: used in French and Spanish for the vowel /i/, and also as a consonant, /j/ (especially in French), /ʝ/, or /ʒ/.
Z: /z/ in most languages; either /θ/ or /s/ in Galician and Spanish; either /ts/ or /dz/ in Italian.

Otherwise, letters that are not combined as digraphs generally have the same sounds as in the International Phonetic Alphabet (IPA), whose design was, in fact, greatly influenced by the Romance spelling systems.
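The "softening" rule for C and G can be sketched as a small lookup. This is only an illustration, not a pronunciation engine: the function name, the language labels, and the sound values in the two tables are assumptions filled in from standard descriptions of these languages, and the sketch deliberately ignores digraphs (such as Italian CH), accented vowels, and dialect variation.

```python
# Illustrative sketch only. The sound values below are assumptions taken from
# standard descriptions of each language; digraphs (CH, GH, GU, QU, ...),
# accented vowels, and dialect variation are deliberately ignored.
FRONT_LETTERS = frozenset("eiy")

SOFT_C = {
    "French": "s", "Portuguese": "s", "Occitan": "s", "Catalan": "s",
    "American Spanish": "s", "Peninsular Spanish": "θ",
    "Italian": "tʃ", "Romanian": "tʃ",
}

SOFT_G = {
    "French": "ʒ", "Portuguese": "ʒ", "Occitan": "ʒ", "Catalan": "ʒ",
    "American Spanish": "x", "Peninsular Spanish": "x",
    "Italian": "dʒ", "Romanian": "dʒ",
}


def letter_value(word: str, index: int, language: str) -> str:
    """Return a rough sound value for the C or G at word[index]."""
    letter = word[index].lower()
    # An empty string stands for "end of word"; it is never a front letter.
    following = word[index + 1].lower() if index + 1 < len(word) else ""
    softened = following in FRONT_LETTERS
    if letter == "c":
        return SOFT_C[language] if softened else "k"
    if letter == "g":
        return SOFT_G[language] if softened else "g"
    raise ValueError(f"only C and G are handled, got {word[index]!r}")
```

For example, `letter_value("cena", 0, "Italian")` returns `"tʃ"`, while `letter_value("casa", 0, "Italian")` returns `"k"`, because the C is not followed by E, I, or Y.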
Digraphs and trigraphs

Since most Romance languages have more sounds than can be accommodated in the Roman Latin alphabet, they all resort to the use of digraphs and trigraphs: combinations of two or three letters with a single sound value. The concept (but not the actual combinations) derives from Classical Latin, which used, for example, TH, PH, and CH when transliterating the Greek letters "θ", "ϕ" (later "φ"), and "χ" (these were once aspirated sounds in Greek before changing to the corresponding fricatives).

CI: used in Italian and Romanian to represent /tʃ/ before A, O, or U.
CH: used in Italian, Romanian and Sardinian to represent /k/ before E or I; /tʃ/ in Spanish and Galician; and /ʃ/ in most other languages.
ÇH: used in Poitevin-Saintongeais for the voiceless palatal fricative /ç/.
DD: used in Sicilian and Sardinian to represent the voiced retroflex plosive /ɖ/.
DJ: used in Walloon for /dʒ/.
GI: used in Italian and Romanian to represent /dʒ/ before A, O, or U.
GH: used in Italian, Romanian and Sardinian to represent /g/ before E or I, and in Galician for the voiceless pharyngeal fricative /ħ/ (not a standard sound).
GLI: used in Italian for /ʎ/.
GN: used in French and Italian for /ɲ/, as in champignon or gnocchi.
GU: used before E or I to represent /g/ or /ɣ/ in all Romance languages except Italian and Romanian.
LH: used in Portuguese, reintegrationist Galician and Occitan for /ʎ/.
LL: used in Spanish, Catalan, Galician, Norman and Dgèrnésiais, originally for /ʎ/, which has in some cases merged with /j/. Represents /l/ in French unless it follows I (i), when it represents /j/ (or /ʎ/ in some dialects).
ĿL: used in Catalan for a long /l/.
NH: used in Portuguese, reintegrationist Galician and Occitan for /ɲ/; used in (official) Galician for /ŋ/ (in reintegrationist Galician MH is used for this sound, or less frequently ũ, since it only occurs in the indefinite article and its derivatives).
NY: used in Catalan for /ɲ/.
QU: used before E or I to represent /k/ in all Romance languages except Italian and Romanian.
RR: used between vowels in several languages to denote a trilled or a guttural R, instead of the flap /ɾ/.
SC: used before E or I in Italian for /ʃ/, and in French and Spanish as /s/ in words of certain etymology.
SCI: used in Italian to represent /ʃ/ before A, O, or U.
SH: used in Aranese Catalan for /ʃ/.
SS: used in French, Portuguese, Occitan and Catalan for /s/ between vowels.
TH: used in Jèrriais for /θ/ (as in English "thick"); also used in Aranese.

While the digraphs CH, PH, RH and TH were at one time used in many words of Greek origin, most languages have now replaced them with C/QU, F, R and T. Only French has kept these etymological spellings, which now represent /k/ or /ʃ/, /f/, /r/ and /t/, respectively.

Gemination

For most languages in this family, consonant length is no longer phonemically distinctive or present. The double consonants in French spelling are due to etymology. However, Italian, Sardinian and Sicilian do have long consonants like BB, CC, DD, etc., where the doubling indicates a short hold before the consonant is released, which often has lexical value: e.g. note ("notes") vs. notte ("night"). They may even occur at the beginning of words in Romanesco, Neapolitan and Sicilian, and are occasionally written, e.g. Sicilian cchiù ("more") and ddà ("there"). In general, the letters B, R and Z are long at the start of a word. In Jèrriais, long consonants are marked with an apostrophe: S'S is a long /z/, SS'S is a long /s/, and T'T is a long /t/. In Catalan, there exists a geminate sound written ŀl, but it is usually pronounced as a simple sound in colloquial (and even some formal) speech.

Diacritics and special characters

Diacritics common across Romance languages are the acute accent (á), the grave accent (à), the circumflex accent (â), the diaeresis mark (ü), the cedilla (ç), and the tilde (ã). French spelling includes the etymological ligatures œ and (more rarely) æ. Romanian has a few diacritics of its own.
An accent mark placed over a vowel generally denotes stress, height, or both. In Spanish, only stress is indicated, with an acute accent. Romanian â/î and ă are central vowels; stress is not marked in this language. Catalan regularly marks stress with an acute accent on high vowels, and with a grave accent on low vowels. Similarly, French é is a high-mid vowel and French è is a low-mid vowel, although in French stress is not indicated with diacritics. Italian marks stress with the grave accent, except on high e and o, which are sometimes marked with an acute accent. Portuguese marks stress with the acute accent, except for high a, e, o, which take a circumflex accent. Homophones may be differentiated by a grave accent in Italian and French, and by an acute accent in Spanish.

Upper and lower case

Most languages are written with a mixture of two distinct but phonetically identical variants or "cases" of the alphabet: majuscule ("uppercase" or "capital letters"), derived from Roman stone-carved letter shapes, and minuscule ("lowercase"), derived from Carolingian writing and medieval quill pen handwriting, which were later adapted by printers in the 15th and 16th centuries.

In particular, all Romance languages presently capitalize (use uppercase for the first letter of) the following words: the first word of each complete sentence, most words in names of people, places, and organizations, and most words in titles of books. Text in all upper case is used for emphasis and is generally interpreted as shouting. The Romance languages do not follow the German practice of capitalizing all nouns, including common ones. Unlike in English, the names of months (except in European Portuguese), days of the week, and derivatives of proper nouns are not capitalized: thus, in Italian one capitalizes Francia ("France") and Francesco ("Francis"), but not francese ("French") or francescano ("Franciscan"). However, each language has some exceptions to this general rule.
Vocabulary comparison

The table below provides a vocabulary comparison that illustrates a number of examples of sound shifts that have occurred between Latin and the main Romance languages, along with a selection of minority languages. Cells marked "…" are truncated or missing in this copy.

{| class="wikitable"
|-
!width="75"|English
! Latin
! Catalan
! French
! Italian
! Jèrriais
! Lombard (literary Milanese)
! Occitan (Languedocien)
! Portuguese
! Romanian
! Sardinian
! Sicilian
! Spanish
|-
|Apple || Mattiana Mala (Pomum, fruit) || Poma || Pomme || Mela / Pomo || Poumme || Pomm || Poma || Maçã || Măr || Mela || Pumu || Manzana
|-
|Arm || Bracchium || Braç || Bras || Braccio || Bras || Brasc || Braç || Braço || Braţ || Bratzu || Vrazzu || Brazo
|-
|Arrow || Sagitta (Frankish fleuka) || Fletxa / Sageta || Flèche || Freccia / Saetta || Èrchelle || Frecia || Sageta || Seta / Flecha || Săgeată || Fretza || Fileccia || Flecha / Saeta
|-
|Bed || Lectus (Camba, for sleeping) || Llit || Lit || Letto || Liet || Lecc || Lièch || Leito / Cama || Pat || … || … || …
|-
|… || … Illi/Illae/Illa, acc. Illum/Illam/Illud, Illos/Illas/Illa) || el/la/ho els/les/los || le/la les || il/lo/la i/gli/le || lé/la || el/la i || lo/la los/las || o/a os/as || -ul/-a -i/-le || su/sa sos/sas (is) || … li/i || el/la/lo los/las
|-
|Throw || Iacio (Lanceo, -are, to throw a weapon) || Llençar || Lancer || Lanciare || Pitchi || Trà || … || … || … || … || … || …
|}

List of languages

The classification of the Romance languages and their dialects is inherently difficult, since most of the linguistic area is a continuum. Top-level groups are conventionally listed roughly from west to east.

Mixed languages

There are some languages that developed from a mixture of two established Romance languages. It is not always clear whether they should be classified as pidgins, creole languages, or mixed languages.
Proposed subfamilies

Several subfamilies have been proposed within the various classification schemes for Romance languages.

Pidgins and creoles

The global spread of colonial Romance languages has given rise to numerous creole languages and pidgins. Some of the lesser-spoken languages have also had influences on varieties spoken far from their traditional regions. These creoles and pidgins are usually grouped by their main source language.

Although they are neither pidgins nor creoles, English (see Middle English creole hypothesis), Basque and Albanian have a substantial Romance influence in their vocabularies.

Constructed languages

Latin and the Romance languages have also given rise to numerous constructed languages, both international auxiliary languages (such as Interlingua, Latino sine flexione, Occidental, Lingua Franca Nova, and Esperanto, which derives the majority of its word roots from Latin and the other Romance languages) and languages created for artistic purposes only (such as Brithenig and Wenedyk).