Navigation
  • Home
  • Recent
  • Most Active
  • Popular
  • Credits
  • RSS
  •  
      Help
  • How to Edit
  • Help



  • [Edit]



    The Latin alphabet, also called the Roman alphabet, is the most widely used alphabetic writing system in the world today. The basic alphabet comprises 26 letters and is used, with some modification, for most of the languages of Europe (excluding some Eastern European countries), the Americas, Sub-Saharan Africa, and the islands of the Pacific. Languages that use the Latin alphabet include the descendants of the Latin language (i.e. the Romance languages: French, Spanish, Italian, Portuguese, Romanian, Catalan, Galician), Germanic languages (like English, German, Dutch), Western Slavic languages (e.g., Polish, Czech), Celtic languages (e.g., Irish, Welsh, Scots Gaelic, Manx) , Finno-Ugric language (Finnish, Estonian, Hungarian) Austronesian languages (Hawaiian, Indonesian, Javanese, Malay, Tagalog) and other languages such as Vietnamese, Turkish, Hausa, Swahili, and many others. In modern usage, the term Latin alphabet is used for any straightforward derivation of the alphabet used by the Romans. These variants may drop letters (e.g., Hawaiian) or add letters (e.g., Czech, Lithuanian) to or from the classical Roman script, and of course many letter shapes have changed over the centuries — such as the lower-case letters which the Romans would not have recognized. The Latin alphabet evolved from the Greek alphabet which is based upon the Phoenician alphabet.


        Latin alphabet
            Overview
            Letters of the alphabet
            Extensions
                New forms
                Ligatures
                Diacritics
            Evolution
                Medieval and later developments
                Spread of the Latin alphabet
            Collating sequence with extensions
            See also
    NameLatin alphabet
    TypeAlphabet
    LanguagesSome variation exists for almost all existing...
    Time~400 B.C. to the present
    Fam1Proto-Canaanite alphabet
    Fam2Phoenician alphabet
    Fam3Greek alphabet
    Fam4Old Italic alphabet
    SistersCyrillic alphabet
    SampleLatin alphabet.png
    Image Size200px
    UnicodeSee Unicode Latin

    top

    Overview
    The default Latin alphabet is the Roman, supplemented with G, J, U, W, Y, Z, and lower-case variants:

    A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, T, U, V, W, X, Y, Z


    Additional letters may be formed
      by modification, as J was from I, G from C, Ø from O, eth Ð from D, yogh from G, ''schwa'' Ə from E, or ezh from z;
      may even be borrowed from another alphabet entirely, as thorn Þ and wynn were from Futhark.

    However, these glyphs are not always considered independent letters of the alphabet.

    top

    Letters of the alphabet
    As used in modern English, the Latin alphabet consists of the following characters (cf. English alphabet):



    top

    Extensions
    In the course of its use, the Latin alphabet was adapted for use in new languages, sometimes representing phonemes not found in languages that were already written with the Roman characters. To represent these new sounds, extensions were therefore sometimes created. They were made by adding marks to create diacritics, by joining multiple letters together to make ligatures, or by creation of completely new forms.

    These new forms are given a place in the alphabet by defining a collating sequence. This is language-dependent, as shown below.

    top

    New forms
    Eth Ðð and the Runic letters thorn Þþ, and wynn were added to the Old English alphabet. Eth and thorn were later replaced with 'th', and wynn with the new letter 'w'. Although these three letters are no longer part of the English alphabet, eth and thorn are still used in the modern Icelandic alphabet.

    For a short time in Roman history, three new letters, called the Claudian letters, were added to the alphabet, but they were not widely received and were eventually removed.

    Some West, Central and Southern African languages use a few additional letters which have a similar sound value to their equivalents in the IPA. For example, Ga uses the letters , Ŋŋ and and Adangme uses and . Hausa uses and for implosives and for an ejective. Africanists have standardized these into the African reference alphabet.

    top

    Ligatures
    A ligature is a fusion of two or more ordinary letters into a new glyph. Examples are Æ from AE, Œ from OE, ß (eszett) from ſʒ, Dutch ij from i and j, and & from et. The "ſs" pair is simply an archaic double s. The first glyph is the archaic medial form, and the second the final form. Note that ij is capitalised as IJ (never Ij).

    top

    Diacritics
    Diacritics are marks that are added to specific letters to modify their pronunciation. The effect is language dependent.

      the cedilla in ç, originally a small z written below the c (once symbolized in Romance languages, now gives c a 'soft' sound before a, o, and u, for example, in French façade, Portuguese Caçar and in Catalan Barça). While in Albanian and Turkish the "ç" changes the quality of the sound "c" and is pronounced as the "ch" (/t/) in the word "check" in English;

      the caron (háček, meaning 'little hook' in Czech amongst others) in ě ř ť š ď ľ ž č ň (used in some Baltic and Slavic languages).

      the tilde in Portuguese ã and õ, Estonian õ. In Portuguese, it was originally a small n written above the letter (once used to mark the elision of a former n, now marks nasalization of the base letter). In Estonian, õ is considered a separate letter of the alphabet. In Spanish ñ is considered a different letter from n and represents a palatal nasal ;




      the circumflex in the vowels â ê î ô û in French, Portuguese, Romanian (in Romanian, such vowels are considered completely distinct letters and appear in the alphabet), and other languages, the semi-vowels ŵ ŷ in Welsh and in the consonants ĉ ĝ ĥ ĵ ŝ in Esperanto;

      the umlaut in ä ö ü in German, Finnish and Swedish and other languages, and ë in Albanian, French and Ladin, which changes the quality (sound) of the vowel. In German, this mark was formerly written as a small e over the affected vowel. Modern German spelling accepts ae oe and ue as variants when the umlaut is unavailable; in Turkish ö ü are used to represent front rounded vowels;

      the diaeresis (same visual appearance as the umlaut above) in ä ë ï ö ü in several languages, indicates that the vowel is pronounced separately from the preceding one when it would otherwise be interpreted as a diphthong; for example in Portuguese (Brazilian), Spanish and Catalan, it is used only on ü to mark where it sounds a semi-vowel after G, as in "lingüiça" (sausage), "pingüino" (penguin) and "aigües" (waters). In Dutch, a second e is turned into an ë to distinguish between syllables, as in "biografieën" (biographies).


      the dot below in ạ ặ ậ ẹ ệ ị ọ ộ ợ ụ ự ỵ in Vietnamese to indicate constricted voice;

      the ogonek in ą ę in Polish, į ų in Lithuanian, and several Native American languages to indicate vowel nasalization (although, in Lithuanian it indicates length);




      the comma underneath, as used in ş and ţ in Romanian (often rendered less than optimally in fonts as a cedilla). Also used for ķ ļ ņ ŗ in Latvian;

      the dotless i (a "negative diacritic") in ı (minuscule form of normal latin I, majuscule of i is İ) as used in Turkish to represent , (essentially a without lip rounding);

      the hook used in ả ẳ ẩ ẻ ể ỉ ỏ ổ ở ủ ử ỷ in Vietnamese to indicate "Dipping rising" tone.

    There are other diacritics and other uses for the ones described here. Please see Alphabets derived from the Latin for a more complete list.

    top

    Evolution




    It is generally held that the Latins adopted the western variant of the Greek alphabet in the 7th century BC from Cumae, a Greek colony in southern Italy. Roman legend credited the introduction to one Evander, son of the Sibyl, supposedly 60 years before the Trojan war, but there is no historically sound basis to this tale. From the Cumae alphabet, the Etruscan alphabet was derived and the Latins finally adopted 21 of the original 26 Etruscan letters.

    In the original Latin alphabet,

      C stood for both g and k.
      I stood for both i and j.
      V stood for both u and v.

    Later, probably during the 3rd century BC, the Z was dropped and a new letter G was placed in its position. An attempt by the emperor Claudius to introduce three additional letters was short-lived, but after the conquest of Greece in the first century BC the letters Y and Z were, respectively, adopted and readopted from the Greek alphabet and placed at the end. Now the new Latin alphabet contained 23 letters:





    W is a letter made up from two Vs or Us. It was added in late Roman times to represent a Germanic sound. The letters U and J, similarly, were originally not distinguished from V and I, respectively.

    The Latin names of some of the letters are disputed. In general, however, the Romans did not use the traditional (Semitic-derived) names as in Greek: the names of the stop consonant letters were formed by adding to the sound (except for C, K, and Q which needed different vowels to distinguish them) and the names of the continuants consisted either of the bare sound, or the sound preceded by . The letter Y when introduced was probably called hy as in Greek (the name upsilon being not yet in use) but was changed to i Graeca ("Greek i") as Latin speakers had difficulty distinguishing and . Z was given its Greek name, zeta. For the Latin sounds represented by the various letters see Latin spelling and pronunciation; for the names of the letters in English see English alphabet.

    top

    Medieval and later developments
    It was not until the Middle Ages that the letter J (representing non-syllabic I) and the letters U and W (to distinguish them from V) were added.

    The alphabet used by the Romans consisted only of capital (upper case or majuscule) letters. The lower case (minuscule) letters developed in the Middle Ages from cursive writing, first as the uncial script, and later as minuscule script. The old Roman letters were retained for formal inscriptions and for emphasis in written documents. The languages that use the Latin alphabet generally use capital letters to begin paragraphs and sentences and for proper nouns. The rules for capitalization have changed over time, and different languages have varied in their rules for capitalization. Old English, for example, was rarely written with even proper nouns capitalised; whereas Modern English of the 18th century had frequently all nouns capitalised, in the same way that Modern German is today, e.g. "All the Sisters of the old Town had seen the Birds".

    top

    Spread of the Latin alphabet

    The Latin alphabet spread from Italy, along with the Latin language, to the lands surrounding the Mediterranean Sea with the expansion of the Roman Empire. The eastern half of the Roman Empire, including Greece, Asia Minor, the Levant, and Egypt, continued to use Greek as a lingua franca, but Latin was widely spoken in the western half of the Empire, and as the western Romance languages, including Spanish, French, Catalan, Portuguese and Italian, evolved out of Latin they continued to use and adapt the Latin alphabet. With the spread of Western Christianity the Latin alphabet spread to the peoples of northern Europe who spoke Germanic languages, displacing their earlier Runic alphabets, as well as to the speakers of Baltic languages, such as Lithuanian and Latvian, and several (non-Indo-European) Finno-Ugric languages, most notably Hungarian, Finnish and Estonian. During the Middle Ages the Latin alphabet also came into use among the peoples speaking West Slavic languages, including the ancestors of modern Poles, Czechs, Croats, Slovenes, and Slovaks, as these peoples adopted Roman Catholicism; the speakers of East Slavic languages generally adopted both Orthodox Christianity and the Cyrillic alphabet.

    As late as 1492, the Latin alphabet was limited primarily to the languages spoken in western, northern and central Europe. The Orthodox Christian Slavs of eastern and southern Europe mostly used the Cyrillic alphabet, and the Greek alphabet was still in use by Greek-speakers around the eastern Mediterranean. The Arabic alphabet was widespread within Islam, both among Arabs and non-Arab nations like the Iranians, Indonesians, Malays, and Turkic peoples. Most of the rest of Asia used a variety of Brahmic alphabets or the Chinese script.



    Over the past 500 years, the Latin alphabet has spread around the world. It spread to the Americas, Australia, and parts of Asia, Africa, and the Pacific with European colonization, along with the Spanish, Portuguese, English, French, and Dutch languages. In the late eighteenth century, the Romanians adopted the Latin alphabet; although Romanian is a Romance language, the Romanians were predominantly Orthodox Christians, and until the nineteenth century the Church used the Cyrillic alphabet. Vietnam, under French rule, adapted the Latin alphabet for use with the Vietnamese language, which had previously used Chinese characters. The Latin alphabet is also used for many Austronesian languages, including Tagalog and the other languages of the Philippines, and the official Malaysian and Indonesian languages, replacing earlier Arabic and indigenous Brahmic alphabets. In 1928, as part of Kemal Atatürk's reforms, Turkey adopted the Latin alphabet for the Turkish language, replacing the Arabic alphabet. Most of Turkic-speaking peoples of the former USSR, including Tatars, Bashkirs, Azeri, Kazakh, Kyrgyz and others, used the Uniform Turkic alphabet in the 1930s. In the 1940s all those alphabets were replaced by Cyrillic. After the collapse of the Soviet Union in 1991, several of the newly-independent Turkic-speaking republics adopted the Latin alphabet, replacing Cyrillic. Azerbaijan, Uzbekistan, and Turkmenistan have officially adopted the Latin alphabet for Azeri, Uzbek, and Turkmen, respectively. In the 1970s, the People's Republic of China developed an official transliteration of Mandarin Chinese into the Latin alphabet, called Pinyin, although use of the Pinyin has been very rare outside educational and tourism purposes.

    West Slavic and most South Slavic languages use the Latin alphabet rather than the Cyrillic, a reflection of the dominant religion practiced among those peoples. Among these, Polish uses a variety of diacritics and digraphs to represent special phonetic values, as well as the l with stroke - ł - for a sound similar to w. Czech uses diacritics as in Dvořák — the term háček (caron) originates from Czech. Croatian and the Latin version of Serbian use carons in č, š, ž, an acute in ć and a bar in đ. The languages of Eastern Orthodox Slavs generally use Cyrillic instead which is much closer to the Greek alphabet. The Serbian language uses two alphabets.

    top

    Collating sequence with extensions
    Alphabets derived from the Latin have varying collating rules:

      In Azerbaijani, there are 7 additional letters. 4 of them are vowels: ı, ö, ü, ə and 3 are consonants: ç, ş, ğ. The alphabet is the same as the Turkish alphabet, with the same sounds written with the same letters, except for three additional letters: q, x and ə for sounds that do not exist in Turkish. Although all the "Turkish letters" are collated in their "normal" alphabetical order like in Turkish, the three extra letters are collated arbitrarly after letters whose sounds approach theirs. So, q is collated just after k, x (pronounced like a German ch) is collated just after h and ə (pronounced roughly like an English short a) is collated just after e.
      In Breton, there is no "c" but there are the ligatures "ch" and "c'h", which are collated between "b" and "d". For example: « buzhugenn, chug, c'hoar, daeraouenn » (earthworm, juice, sister, teardrop).
      In Bosnian, Croatian and Serbian and other related South Slavic languages, the five accented characters and three conjoined characters are sorted after the originals: ..., C, Č, Ć, D, DŽ, Đ, E, ..., L, LJ, M, N, NJ, O, ..., S, Š, T, ..., Z, Ž.
      In Czech and Slovak, accented vowels have secondary collating weight - compared to other letters, they are treated as their unaccented forms (A-Á, E-É-Ě, I-Í, O-Ó-Ô, U-Ú-Ů, Y-Ý), but then they are sorted after the unaccented letters (for example, the correct lexicographic order is baa, baá, báa, bab, báb, bac, bác, bač, báč). Accented consonants (the ones with caron) have primary collating weight and are collocated immediately after their unaccented counterparts, with exception of Ď, Ň and Ť, which have again secondary weight. CH is considered to be a separate letter and goes between H and I. In Slovak, DZ and are also considered separate letters and are positioned between Ď and E (A-Á-Ä-B-C-Č-D-Ď-DZ-DŽ-E-É…).
      In the Danish and Norwegian alphabets, the same extra vowels as in Swedish (see below) are also present but in a different order and with different glyphs (..., X, Y, Z, Æ, Ø, Å). Also, "Aa" collates as an equivalent to "Å". The Danish alphabet has traditionally seen "W" as a variant of "V", but nowadays "W" is considered a separate letter.
      In Dutch the combination IJ (representing IJ) was formerly to be collated as Y (or sometimes, as a separate letter Y < IJ < Z), but is currently mostly collated as 2 letters (II < IJ < IK). Exceptions are phone directories; IJ is always collated as Y here because in many Dutch family names Y is used where modern spelling would require IJ. Note that a word starting with ij that is written with a capital I is also written with a capital J, for example, the town IJmuiden and the river IJssel.
      In Estonian õ, ä, ö and ü are considered separate letters and collate after w. Letters š, z and ž appear in loanwords and foreign proper names only and follow the letter s in the Estonian alphabet, which otherwise does not differ from the basic Latin alphabet.
      The Faroese alphabet also has some of the Danish, Norwegian, and Swedish extra letters, namely Æ and Ø. Furthermore, the Faroese alphabet uses the Icelandic eth, which follows the D. Five of the six vowels A, I, O, U and Y can get accents and are after that considered separate letters. The consonants C, Q, X, W and Z are not found. Therefore the first five letters are A, Á, B, D and Ð, and the last five are V, Y, Ý, Æ, Ø
      In Filipino and other Philippine languages, the letter Ng is treated as a separate letter. It is pronounced as in sing, ping-pong, etc. By itself, it is pronounced nang, but in general Philippine orthography, it is spelled as if it were two separate letters (n and g). Also, letter derivatives (such as Ñ) immediately follow the base letter. Filipino also is written with accents and other marks, but the marks are not in very wide use (except the tilde). (Philippine orthography also includes spelling.)
      The Finnish alphabet and collating rules are the same as in Swedish, except for the addition of the letters Š and Ž, which are considered variants of S and Z.
      In German letters with umlaut (Ä, Ö, Ü) are treated generally just like their non-umlauted versions; ß is always sorted as ss. This makes the alphabetic order Arg, Ärgerlich, Arm, Assistent, Aßlar, Assoziation. For phone directories and similar lists of names, the umlauts are to be collated like the letter combinations "ae", "oe", "ue". This makes the alphabetic order Udet, Übelacker, Uell, Ülle, Ueve, Üxküll, Uffenbach.
      The Hungarian vowels have accents, umlauts, and double accents, while consonants are written with single or with double characters (digraphs). In collating, accented vowels always follow their non-accented counterparts and double characters follow their single originals. Hungarian alphabetic order is: A, Á, B, C, CS, D, DZ, DZS, E, É, F, G, GY, H, I, Í, J, K, L, LY, M, N, NY, O, Ó, Ö, Ő, P, Q, R, S, SZ, T, TY, U, Ú, Ü, Ű, V, W, X, Y, Z, ZS. (For example, the correct lexicographic order is baa, baá, bab, bac, bacs, ..., baz, bazs, báa, báá, báb, bác, bács).
      In Icelandic, Þ is added, and D is followed by Ð. Each vowel (A, E, I, O, U, Y) is followed by its correspondent with acute: Á, É, Í, Ó, Ú, Ý. There is no Z, and after Ý, it goes like this: ... Þ, Æ, Ö.
        Both letters were also used by Anglo-Saxon scribes who also used the Runic letter Wynn to represent /w/.
        Þ (called thorn; lowercase þ) is also a Runic letter.
        Ð (called eth; lowercase ð) is the letter D with an added stroke.
      In Lithuanian, specifically Lithuanian letters go after their Latin originals. Another change is that Y comes just before J: ... G, H, I, Į, Y, J, K...
      In Polish, specifically Polish letters derived from the Latin alphabet are collated after their originals: A, Ą, B, C, Ć, D, E, Ę, ..., L, Ł, M, N, Ń, O, Ó, P, ..., S, Ś, T, ..., Z, Ź, Ż.
      In Romanian, special characters derived from the Latin alphabet are collated after their originals: A, Ă, Â, ..., I, Î, ..., S, Ş, T, Ţ, ..., Z.
      In the Swedish alphabet, there are three extra vowels placed at its end (..., X, Y, Z, Å, Ä, Ö), similar to the Danish and Norwegian alphabet, but with different glyphs and a different collating order. The letter "W" has been treated as a variant of "V", but in the 13th edition of Svenska Akademiens ordlista (2006) "W" was considered a separate letter.
      Some languages have more complex rules: for example, Spanish treated (until 1997) "CH" and "LL" as single letters, giving an ordering of CINCO, CREDO, CHISPA and LOMO, LUZ, LLAMA. This is not true anymore since in 1997 RAE adopted the more conventional usage, and now LL is collated between LK and LM, and CH between CG and CI. The only Spanish specific collating question is Ñ (eñe) as a different letter collated after N.
      In Tatar and Turkish, there are 9 additional letters. 5 of them are vowels, paired with main alphabet vowels as hard-smooth: a-ä, o-ö, u-ü, í-i, ı-e. The four remaining are consonants: ş is sh, ç is ch, ñ is ng and ğ is gh.
      Welsh also has complex rules: the combinations CH, DD, FF, NG, LL, PH, RH and TH are all considered single letters, and each is listed after the letter which is the first character in the combination, with the exception of NG which is listed after G. However, the situation is further complicated by these combinations not always being single letters. An example ordering is LAWR, LWCUS, LLONG, LLOM, LLONGYFARCH: the last of these words is a juxtaposition of LLON and GYFARCH, and, unlike LLONG, does not contain the letter NG.

    The Unicode Collation Algorithm can be used to get any of the collation sequences
    described above, by tailoring its default collation table. Several such tailorings
    are collected in Common Locale Data Repository.

    top

    See also
     


    Search more:
     

       
    Source Privacy License Download Contact Us Atlas
    Scientus.org Dictionary (Yet Another Wiki) RC : 1.41
    MIT OpenCourseWare
    This article is licensed under the GNU Free Documentation License [copyleft]. It uses material from the Wikipedia article "Latin alphabet". link