Victionarium lawiktionary https://la.wiktionary.org/wiki/Victionarium:Pagina_prima MediaWiki 1.39.0-wmf.23 case-sensitive Media Specialis Disputatio Usor Disputatio Usoris Victionarium Disputatio Victionarii Fasciculus Disputatio Fasciculi MediaWiki Disputatio MediaWiki Formula Disputatio Formulae Auxilium Disputatio Auxilii Categoria Disputatio Categoriae TimedText TimedText talk Module Module talk Gadget Gadget talk Gadget definition Gadget definition talk septem 0 1302 220201 220156 2022-08-14T13:46:42Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|la|septem}} <!-- {{subst:PAGENAME}} --> =={{-lingua-|la|septem}}== {{la-numeralia|VII}} {{progressor|retro2=VI|sex|septem|octō|porro2=VIII}} {{vicipaedia|numero septem}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-septem.ogg|/ˈsep.tem/|{{la-cls-appellatio}}|la}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sep|tem|morph=sept-em}} <!-- Exemplum syllabificationis --> ==={{notatio}}=== [[Categoria:Radice sept|Septem]] A lingua prisca Indoeuropaea ''*septm.'' Vide [[:Categoria:Radice sept|radicem ''sept'']]. ==={{cardinalis|la}}=== '''septem''' ''(indeclinabile; numerales:'' [[ⅤⅠⅠ]], [[7]]) #{{la-nx|sex|Sex}} et alius. ==={{usus}}=== #[[duo]] [[et]] [[quinque]] [[sum|sunt]] '''septem'''. #:''{{en}}:'' [[two]] [[plus (en)|plus]] [[five]] [[is]] '''[[seven]]'''. #:''{{pt}}:'' [[dois]] [[mais]] [[cinco]] [[ser (pt)|são]] '''[[sete]]'''. ==={{trans}}=== <!-- Translationes --> {{trans-tab|Numerus cardinalis VII| *{{en}}: {{t+|en|seven}} *{{ar}}: {{t+|ar|سبعة|tr=sabʻa}} *{{br}}: {{t+|br|seizh}} *{{cy}}: {{t+|cy|saith}} *{{ca}}: {{t+|ca|set}} *{{hr}}: {{t+|hr|sedam}} *{{ro}}: {{t+|ro|șapte}} *{{et}}: {{t+|et|seitse}} *{{fi}}: {{t+|fi|seitsemän}} *{{fr}}: {{t+|fr|sept}} *{{fy}}: {{t+|fy|sân}} *{{de}}: {{t+|de|sieben}} *{{got}}: {{t|got|𐍃𐌹𐌱𐌿𐌽}} *{{el}}: {{t+|el|επτά}} *{{grc}}: {{t|grc|ἑπτά}} *{{kl}}: {{t+|kl|arfineq marluk}} *{{ga}}: {{t+|ga|seacht}} *{{es}}: {{t+|es|siete}} *{{hu}}: {{t+|hu|hét}} *{{ja}}: {{t+|ja|七|tr=shichi}}, {{t+|ja|七つ|tr=nanatsu}} *{{sq}}: {{t+|sq|shtatë}} *{{is}}: {{t+|is|sjö}} *{{it}}: {{t+|it|sette}} *{{pt}}: {{t+|pt|sete}} *{{fa}}: {{t+|fa|هَفت|tr=hæft}} *{{pl}}: {{t+|pl|siedem}} *{{ru}}: {{t+|ru|семь}} *{{zh}}: {{t+|zh|七|tr=qī}} *{{sr}}: {{t+|sr|седам}} *{{sl}}: {{t+|sl|sedem}} *{{sv}}: {{t+|sv|sju}} *{{tr}}: {{t+|tr|yedi}} *{{uk}}: {{t+|uk|сім}} *{{eu}}: {{t+|eu|zaspi}} }} 9vfitkbb4p0grrogldmh7i80ddp9zn8 mille 0 1340 220230 177459 2022-08-15T10:02:15Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|la|mille}} <!-- {{subst:PAGENAME}} --> =={{-lingua-|la|mille}}== {{progressor|retro2=CM|nōngentī|mīlle|mīlle centum|porro2=MC}} {{progressor|retro2=N|cifra|mīlle|duo mīlia|porro2=MM}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-mille.ogg|/ˈmiːl.le/|{{la-cls-appellatio}}|la|mīlle}} <!-- Exemplum Latinum apellationis --> :{{syllabae|mīl|le|morph=mille}} <!-- Exemplum syllabificationis --> ==={{notatio}}=== [[Categoria:Radice mill]] Fortasse “unum mille” a lingua prisca Indoeuropaea ''*smiH₂ ǵh(e)sli(H₂)''. Confer '''[[सहस्र]]''' (''sahasra'') a lingua Sanscrita. Vide [[:Categoria:Radice mill|radicem ''mill'']]. ==={{cardinalis|la}}=== '''mīlle''' (''indeclinabilis; pluraliter:'' '''[[milia|mīlia]]'''; ''numerales:'' [[Ⅿ]], [[1000]]) # √ Decem centuriae; centum deniones; nongenti nonaginta novem et alius. # Numerus indefinite magnus. ==={{usus|la}}=== *'''mille''' [[nox|noctes]] ::—{{en}}: [[a]] [[thousand]] [[night]]s *'''mille''' [[ovum|ova]] ::—{{de}}: [[tausend]] [[Ei]]er ==={{trans}}=== <!-- Translationes --> {{trans-tab|Nongenti nonaginta novem et alius. Numerus M| *{{en}}: {{t+|en|thousand}} *{{nl}}: {{t+|nl|duizend}} *{{cs}}: {{t+|cs|tisíc}} *{{bg}}: {{t+|bg|хиляда}} *{{br}}: {{t+|br|mil}} *{{cy}}: {{t+|cy|mil}} *{{ro}}: {{t+|ro|mie|f}} *{{eo}}: {{t+|eo|mil}} *{{et}}: {{t+|et|tuhat}} *{{fi}}: {{t+|fi|tuhat}} *{{fr}}: {{t+|fr|mille|m}} *{{fy}}: {{t+|fy|tûzen}} *{{de}}: {{t+|de|Tausend|n}}, {{t+|de|tausend}} *{{grc}}: {{t+|grc|χίλιοι}} *{{es}}: {{t+|es|mil|m}} *{{ja}}: {{t+|ja|千|tr=sen}} *{{sq}}: {{t+|sq|mijë|f}} *{{is}}: {{t+|is|þúsund|n}} *{{it}}: {{t+|it|mille|m}} *{{lv}}: {{t+|lv|tūkstotis}} *{{lt}}: {{t+|lt|tūkstantis}} *{{pt}}: {{t+|pt|mil}} *{{mk}}: {{t+|mk|илјада}} *{{mt}}: {{t+|mt|elf}} *{{el}}: {{t+|el|χίλια}} *{{no}}: {{t+|no|tusen}} *{{fa}}: {{t+|fa|هِزار|tr=hezār}} *{{pl}}: {{t+|pl|tysiąc}} *{{ru}}: {{t+|ru|тысяча}} *{{sk}}: {{t+|sk|tisíc}} *{{sl}}: {{t+|sl|tisoč}} *{{sv}}: {{t+|sv|tusen}} *{{tr}}: {{t+|tr|bin}} *{{uk}}: {{t+|uk|тисяча}} }} {{caput|mul|mille}} <!-- {{subst:PAGENAME}} --> {{discretiva}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|fi}}=== <!-- Divisio Finnica --> ===={{pars|mille}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Finnicum declinationis --> {{declin|fi|mille|mikä|allativus|sg||pron}} {{declin|fi|mille|mikä|allativus|pl||pron}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|Fi-mille.ogg|/ˈmilːe/||fi}} <!-- Exemplum Finnicum apellationis --> :{{syllabae|mi|lle|morph=mi-lle}} <!-- Exemplum syllabificationis --> jh1sg7lfxz4k0ofgvjkwsvj8hoaxrns sto 0 12758 220219 220172 2022-08-14T17:44:57Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|la|sto}} <!-- {{subst:PAGENAME}} --> =={{-lingua-|la|sto}}== ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sto.ogg|/stoː/|{{la-cls-appellatio}}|la|stō}} <!-- Exemplum Latinum apellationis --> :{{syllabae|stō|morph=st-o}} <!-- Exemplum syllabificationis --> ==={{intransitivum|la}}=== '''st'''|'''ō, -āre, stĕtī, stătum''' <ref name=Forcellini /><ref name=Freund /><ref name=Georges /><ref name=Langenscheidt /><ref name=Olivetti /> # Esse erecte positus. ==={{coniugatio}}=== {{la-coniugatio-1|st|stet|stat|stāt|act=|impas=}} ==={{derivatae}}=== *{{la-nx|statārius}} *{{la-nx|statim}} *{{la-nx|statiō}} *{{la-nx|statīvus}} *{{la-nx|statūra}} *{{la-nx|status}}, {{la-nx|statūs}} {{compos}} *{{la-nx|abstō}}, {{la-nx|abstāre}} *{{la-nx|adstō}}, {{la-nx|adstāre}} *{{la-nx|antestō}}, {{la-nx|antestāre}} *{{la-nx|circumstō}}, {{la-nx|circumstāre}} *{{la-nx|cōnstō}}, {{la-nx|cōnstāre}} *{{la-nx|exstō}}, {{la-nx|exstāre}} *{{la-nx|dīstō}}, {{la-nx|dīstāre}} *{{la-nx|instō}}, {{la-nx|instāre}} *{{la-nx|interstō}}, {{la-nx|interstāre}} *{{la-nx|obstō}}, {{la-nx|obstāre}} *{{la-nx|perstō}}, {{la-nx|perstāre}} *{{la-nx|praestō}}, {{la-nx|praestāre}} *{{la-nx|prōstō}}, {{la-nx|prōstāre}} *{{la-nx|sistō}}, {{la-nx|sistere}} *{{la-nx|substō}}, {{la-nx|substāre}} ==={{collatae}}=== {{anton}} *{{la-nx|eō}}, {{la-nx|īre}} ==={{trans}}=== <!-- Translationes --> {{trans-tab|Esse erecte positus| *{{en}}: {{t+|en|stand}} *{{nl}}: {{t+|nl|stan}} *{{cs}}: {{t+|cs|stát}} *{{ro}}: {{t+|ro|sta}} *{{eo}}: {{t+|eo|stari}} *{{fr}}: {{t+|fr|être debout}} *{{de}}: {{t+|de|stehen}} *{{is}}: {{t+|is|standa}} *{{it}}: {{t+|it|stare}} *{{el}}: {{t+|el|στέκομαι}} *{{pl}}: {{t+|pl|stać}} *{{ru}}: {{t+|ru|стоять}} *{{sv}}: {{t+|sv|stå}} *{{uk}}: {{t+|uk|стояти}} }} {{caput|mul|sto}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|cs}} *{{nexus-d|hr}} *{{nexus-d|pl}} *{{nexus-d|sk}} *{{nexus-d|sl}} *{{nexus-d|sv}} ==={{similes}}=== *{{nexus-d|bg|сто|x=}} *{{nexus-d|hr|što}} *{{nexus-d|is|stó}} *{{nexus-d|mk|сто|x=}} *{{nexus-d|ru|сто|x=}} *{{nexus-d|sr|сто|x=mk}} *{{nexus-d|uk|сто|x=}} =={{affines}}== ==={{-lingua-|it}}=== <!-- Divisio Italica --> ===={{pars|sto}}==== ====={{proprietates}}===== {{=cform=}} <!-- Exemplum Italicum coniugationis --> {{coniug|it|sto|stare|1|sg|praes|ind|act}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|It-sto.ogg|/stɔ/||it}} <!-- Exemplum Italicum apellationis --> :{{syllabae|sto|morph=st-o}} <!-- Exemplum syllabificationis --> {{fontes|noref=|ref= <!-- Eventus <ref>…</ref> --> <ref name=Forcellini>{{Forcellini|IV|523|com=“STO, stas, stĕti, stătum, stare, n. 1.”}}</ref> <ref name=Freund>{{Freund|III|320|com=“sto, stĕti, stătum, 1.”}}</ref> <ref name=Georges>{{Georges|1. sto, stetī, statum, stātūrus, āre (Stamm sta, wie in εστη-κα, εστάναι, ahd. stân)|n=20002662655|tom=2|p=2808}}</ref> <ref name=Langenscheidt>{{Langenscheidt|stare}}</ref> <ref name=Olivetti>{{Olivetti|sto}}</ref> }} 2mam1vtixemfyx0dps44d6e39po6uju sedem 0 21874 220195 180038 2022-08-14T12:36:00Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> 1a34h7dkaca0x3j6wr9l2434clbd3jj 220196 220195 2022-08-14T13:06:10Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=sh}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> 1p7mzy74w4e5vi8wnqyuy8201u964hl 220197 220196 2022-08-14T13:11:27Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=mk}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> iocxgkcn0ogvis1zlzy6uhdeie54nl1 220199 220197 2022-08-14T13:37:44Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=sh}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> 1p7mzy74w4e5vi8wnqyuy8201u964hl 220200 220199 2022-08-14T13:39:56Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> 1a34h7dkaca0x3j6wr9l2434clbd3jj 220204 220200 2022-08-14T14:14:49Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=mk}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> iocxgkcn0ogvis1zlzy6uhdeie54nl1 220208 220204 2022-08-14T14:50:33Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=sr}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> ecpad9mo4zn5783mz3shmfh5j3cwi3s 220211 220208 2022-08-14T14:56:45Z YaganZ 4537 mk workaround wikitext text/x-wiki {{caput|mul|sedem}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|sedm}} *{{nexus-d|bg|седем|x=}} *{{nexus-d|mk|седум|x=}} *{{nexus-d|pl|siedem}} *{{nexus-d|hr|sedam}} *{{nexus-d|sr|седам|x=mk}} =={{affines}}== <!-- Formae affines --> ==={{-lingua-|la}}=== <!-- Divisio Latina (prima) --> ===={{pars|sēdem}}==== ====={{proprietates}}===== {{=cflex=}} <!-- Exemplum Latinum declinationis --> {{declin|la|sēdem|sēdēs|acc|sg||sub}} {{=cfin=}} ====={{appellatio}}===== <!-- Appellatio et syllabificatio --> :{{Audio|La-cls-sedem.ogg|/ˈseːdem/|{{la-cls-appellatio}}|la|sēdem}} <!-- Exemplum Latinum apellationis --> :{{syllabae|sē|dem|morph=sēd-em}} <!-- Exemplum syllabificationis --> iocxgkcn0ogvis1zlzy6uhdeie54nl1 Module:languages/data2 828 24123 220202 217987 2022-08-14T13:50:23Z YaganZ 4537 curatura V25 Scribunto text/plain -- Module:languages/data2 -- imported from en.wiktionary -- 2022-08-14 -- V25 -- sh-translit module, last modified by Usor:YaganZ -- 2021-12-27 -- V24 -- bn, ma, sc experimental, last modified by Usor:YaganZ -- 2021-03-14 -- V23 -- +kv = kpv = Komiense, last modified by Usor:YaganZ -- 2020-12-06 -- V22 -- +cu, last modified by Usor:YaganZ -- 2020-10-14 -- V21 -- +oj, last modified by Usor:YaganZ -- 2020-07-16 -- V20 -- +genplf, last modified by Usor:YaganZ -- canonicalNames are translated into Latin adverbial, ablative or neuter forms (if available in Categoria:Formulae linguarum), -- missing entries added. -- otherNames 1-6 are used for mostly used inflected forms, 7=own name, 8=non-inflected form, 9-n are rarely used inflected forms: -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf local u = mw.ustring.char -- UTF-8 encoded strings for some commonly-used diacritics local GRAVE = u(0x0300) local ACUTE = u(0x0301) local CIRC = u(0x0302) local TILDE = u(0x0303) local MACRON = u(0x0304) local BREVE = u(0x0306) local DOTABOVE = u(0x0307) local DIAER = u(0x0308) local CARON = u(0x030C) local DGRAVE = u(0x030F) local INVBREVE = u(0x0311) local DOTBELOW = u(0x0323) local RINGBELOW = u(0x0325) local CEDILLA = u(0x0327) -- Puncuation to be used for standardChars field local PUNCTUATION = ' \!\#\$\%\&\*\+\,\-\.\/\:\;\<\=\>\?\@\^\_\`\|\~\'\(\)' local m = {} m["aa"] = { canonicalName = "Afarice", otherNames = {"Afarica", "Afaricae", "Afarici", "Afaricum", "Afarica", "Afaricae", "Qafaraf", "Afaricus", "Afarica", "Afaricarum", "Qafar"}, scripts = {"Latn"}, family = "cus", } m["ab"] = { canonicalName = "Abasce", otherNames = {"Abasca", "Abascae", "Abasci", "Abascum", "Abasca", "Abascae", "аҧсшәа", "Abascus", "Abasca", "Abascarum", "Abkhazian", "Abxazo"}, scripts = {"Cyrl", "Geor", "Latn"}, family = "cau-abz", translit_module = "ab-translit", entry_name = { from = {GRAVE, ACUTE}, to = {}} , } m["ae"] = { canonicalName = "Avestane", otherNames = {"Avestana", "Avestanae", "Avestani", "Avestanum", "Avestana", "Avestanae", "zend", "Avestanus", "Avestana", "Avestanarum", "Old Bactrian"}, scripts = {"Avst", "Gujr"}, family = "ira-eas", translit_module = "Avst-translit", } m["af"] = { canonicalName = "Africanice", otherNames = {"Africanica", "Africanicae", "Africanici", "Africanicum", "Africanica", "Africanicae", "Afrikaans", "Africanicus", "Africanica", "Africanicarum"}, scripts = {"Latn", "Arab"}, family = "gmw", ancestors = {"nl"}, sort_key = { from = {"[äáâà]", "[ëéêè]", "[ïíîì]", "[öóôò]", "[üúûù]", "[ÿýŷỳ]", "^-", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" }} , } m["ak"] = { canonicalName = "Akan", otherNames = {"Twi-Fante", "Twi", "Fante", "Fanti", "Asante", "Akuapem"}, scripts = {"Latn"}, family = "alv-kwa", } m["am"] = { canonicalName = "Aethiopice", otherNames = {"Aethiopica", "Aethiopicae", "Aethiopici", "Aethiopicum", "Aethiopica", "Aethiopicae", "አማርኛ", "Aethiopicus", "Aethiopica", "Aethiopicarum", "Amharica"}, scripts = {"Ethi"}, family = "sem-eth", translit_module = "Ethi-translit", } m["an"] = { canonicalName = "Aragonice", otherNames = {"Aragonica", "Aragonicae", "Aragonici", "Aragonicum", "Aragonica", "Aragonicae", "aragonés", "Aragonicus", "Aragonica", "Aragonicarum", "Aragonensis"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oan"}, } m["ar"] = { canonicalName = "Arabice", otherNames = {"Arabica", "Arabicae", "Arabici", "Arabicum", "Arabica", "Arabicae", "العربية", "Arabicus", "Arabica", "Arabicarum", "Modern Standard Arabic", "Standard Arabic", "Literary Arabic", "Classical Arabic"}, scripts = {"Arab"}, family = "sem-arb", entry_name = { from = {u(0x0671), u(0x064B), u(0x064C), u(0x064D), u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652), u(0x0670), u(0x0640)}, to = {u(0x0627)}}, translit_module = "ar-translit", } m["as"] = { canonicalName = "Assamice", otherNames = {"Assamica", "Assamicae", "Assamici", "Assamicum", "Assamica", "Assamicae", "অসমীয়া", "Assamicus", "Assamica", "Assamicarum", "Asamiya"}, scripts = {"Beng"}, family = "inc", ancestors = {"pka"}, } m["av"] = { canonicalName = "Avar", otherNames = {"Avaric"}, scripts = {"Cyrl"}, family = "cau-nec", ancestors = {"oav"}, translit_module = "av-translit", } m["ay"] = { canonicalName = "Aymare", otherNames = {"Southern Aymara", "Central Aymara"}, scripts = {"Latn"}, family = "sai-aym", } m["az"] = { canonicalName = "Atropatenice", otherNames = {"Atropatenica", "Atropatenicae", "Atropatenici", "Atropatenicum", "Atropatenica", "Atropatenicae", "Azərbaycan dili", "Atropatenicus", "Atropatenica", "Atropatenicarum", "Azerbaijani", "Azari", "Azeri Turkic", "Azerbaijani Turkic", "North Azerbaijani", "South Azerbaijani"}, scripts = {"Latn", "Cyrl", "fa-Arab"}, family = "trk-ogz", } m["ba"] = { canonicalName = "Baschkirice", otherNames = {"Baschkirica", "Baschkiricae", "Baschkirici", "Baschkiricum", "Baschkirica", "Baschkiricae", "башҡортса", "Baschkiricus", "Baschkirica", "Baschkiricarum", "Bashkir"}, scripts = {"Cyrl"}, family = "trk-kip", translit_module = "ba-translit", } m["be"] = { canonicalName = "Albaruthenice", otherNames = {"Albaruthenica", "Albaruthenicae", "Albaruthenici", "Albaruthenicum", "Albaruthenica", "Albaruthenicae", "беларуская мова", "Albaruthenicus", "Albaruthenica", "Albaruthenicarum", "Belorussian", "Belarusan", "Bielorussian", "Byelorussian", "Belarussian", "White Russian"}, scripts = {"Cyrl"}, family = "zle", translit_module = "be-translit", sort_key = { from = {"Ё", "ё"}, to = {"Е" , "е"}}, entry_name = { from = {"Ѐ", "ѐ", GRAVE, ACUTE}, to = {"Е", "е"}}, } m["bg"] = { canonicalName = "Bulgarice", otherNames = {"Bulgarica", "Bulgaricae", "Bulgarici", "Bulgaricum", "Bulgarica", "Bulgaricae", "български език", "Bulgaricus", "Bulgarica", "Bulgaricarum" }, scripts = {"Cyrl"}, family = "zls", translit_module = "bg-translit", entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["bh"] = { canonicalName = "Bihari", scripts = {"Deva"}, family = "inc", ancestors = {"pka"}, } m["bi"] = { canonicalName = "Bislama", scripts = {"Latn"}, family = "crp", ancestors = {"en"}, } m["bm"] = { canonicalName = "Bambara", otherNames = {"Bamanankan"}, scripts = {"Latn"}, family = "dmn", } m["bn"] = { "Bengale", "Q9610", "inc-eas", canonicalName = "Bengale", otherNames = {"Bengala", "Bengalae", "Bengali", "Bengalum", "Bengala", "Bengalae", "বাংলা", "Bengalus", "Bengala", "Bengalarum", "Bangla", "Bengali"}, scripts = {"Beng", "Newa"}, ancestors = {"inc-mbn"}, translit_module = "bn-translit", } m["bo"] = { canonicalName = "Tibetane", otherNames = {"Tibetana", "Tibetanae", "Tibetani", "Tibetanum", "Tibetana", "Tibetanae", "བོད་སྐད།", "Tibetanus", "Tibetana", "Tibetanarum"}, scripts = {"Tibt"}, family = "tbq", ancestors = {"xct"}, translit_module = "bo-translit", } m["br"] = { canonicalName = "Britonice", otherNames = {"Britonica", "Britonicae", "Britonici", "Britonicum", "Britonica", "Britonicae", "brezhoneg", "Britonicus", "Britonica", "Britonicarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"xbm"}, } m["bs"] = { canonicalName = "Bosnice", otherNames = {"Bosnica", "Bosnicae", "Bosnici", "Bosnicum", "Bosnica", "Bosnicae", "Bosnian", "bosanski jezik", "Bosnicus", "Bosnica", "Bosnicarum"}, scripts = {"Latn"}, family = "zlw", } m["ca"] = { canonicalName = "Catalane", otherNames = {"Catalana", "Catalanae", "Catalani", "Catalanum", "Catalana", "Catalanae", "català", "Catalanus", "Catalana", "Catalanarum", "Valencian"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oca"}, sort_key = { from = {"à", "[èé]", "[íï]", "[òó]", "[úü]", "ç", "l·l"}, to = {"a", "e" , "i" , "o" , "u" , "c", "ll" }} , } m["ce"] = { canonicalName = "Chechen", scripts = {"Cyrl"}, family = "cau-nkh", translit_module = "ce-translit", entry_name = { from = {MACRON}, to = {}}, } m["ch"] = { canonicalName = "Chamorre", otherNames = {"Chamoru"}, scripts = {"Latn"}, family = "poz-sus", } m["co"] = { canonicalName = "Corse", otherNames = {"Corsa", "Corsae", "Corsi", "Corsum", "Corsa", "Corsae", "corsu", "Corsus", "Corsa", "Corsarum", "Corsican"}, scripts = {"Latn"}, family = "roa", } m["cr"] = { canonicalName = "Cree", scripts = {"Cans", "Latn"}, family = "alg", translit_module = "cr-translit", } m["cs"] = { canonicalName = "Bohemice", otherNames = {"Bohemica", "Bohemicae", "Bohemici", "Bohemicum", "Bohemica", "Bohemicae", "čeština", "Bohemicus", "Bohemica", "Bohemicarum"}, scripts = {"Latn"}, family = "zlw", ancestors = {"zlw-ocs"}, sort_key = { from = {"á", "é", "í", "ó", "[úů]", "ý"}, to = {"a", "e", "i", "o", "u" , "y"}} , } m["cu"] = { "Slavica Antiqua", "Q35499", "zls", otherNames = {"Slavica Antiqua", "Slavicae Antiquae", "Slavici Antiqui", "Slavicum Antiquum", "Slavica Antiqua", "Slavicae Antiquae", "словѣньскъ ѩзыкъ", "Slavicus Antiquus", "Slavica Antiqua", "Slavicarum Antiquarum", "Old Church Slavic", "Old Church Slavonic"}, scripts = {"Cyrs", "Glag"}, translit_module = "Cyrs-Glag-translit", entry_name = { from = {u(0x0484)}, -- kamora to = {}}, sort_key = { from = {"оу", "є"}, to = {"у" , "е"}} , } m["cv"] = { canonicalName = "Chuvash", scripts = {"Cyrl"}, family = "trk-ogr", translit_module = "cv-translit", } m["cy"] = { canonicalName = "Cambrice", otherNames = {"Cambrica", "Cambricae", "Cambrici", "Cambricum", "Cambrica", "Cambricae", "Cymraeg", "Cambricus", "Cambrica", "Cambricarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"wlm"}, sort_key = { from = {"[âáàä]", "[êéèë]", "[îíìï]", "[ôóòö]", "[ûúùü]", "[ŵẃẁẅ]", "[ŷýỳÿ]", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "w" , "y" }} , } m["da"] = { canonicalName = "Danice", otherNames = {"Danica", "Danicae", "Danici", "Danicum", "Danica", "Danicae", "dansk", "Danicus", "Danica", "Danicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-oda"}, } m["de"] = { "Germanice", "Q188", "gmw", otherNames = {"Germanica", "Germanicae", "Germanici", "Germanicum", "Germanica", "Germanicae", "Deutsch", "Germanicus", "Germanica", "Germanicarum", "High German", "New High German", "Deutsch"}, -- the last name is indeed also used in English scripts = {"Latn", "Latf"}, ancestors = {"gmh"}, sort_key = { from = {"[äàáâå]", "[ëèéê]", "[ïìíî]", "[öòóô]", "[üùúû]", "ß" }, to = {"a" , "e" , "i" , "o" , "u" , "ss"}} , } m["dv"] = { canonicalName = "Dhivehi", otherNames = {"Divehi", "Mahal", "Mahl", "Maldivian"}, scripts = {"Thaa"}, family = "inc", ancestors = {"pmh"}, translit_module = "dv-translit", } m["dz"] = { canonicalName = "Dzongkha", scripts = {"Tibt"}, family = "tbq", ancestors = {"xct"}, translit_module = "bo-translit", } m["ee"] = { canonicalName = "Ewe", scripts = {"Latn"}, family = "alv", } m["el"] = { canonicalName = "Neograece", otherNames = {"Neograeca", "Neograecae", "Neograeci", "Neograecum", "Neograeca", "Neograecae", "Νέα Ελληνικά", "Neograecus", "Neograeca", "Neograecarum", "Modern Greek", "Neo-Hellenic"}, scripts = {"Grek"}, family = "grk", ancestors = {"grc"}, translit_module = "el-translit", sort_key = { -- Keep this synchronized with grc, cpg, pnt from = {"[ᾳάᾴὰᾲᾶᾷἀᾀἄᾄἂᾂἆᾆἁᾁἅᾅἃᾃἇᾇ]", "[έὲἐἔἒἑἕἓ]", "[ῃήῄὴῂῆῇἠᾐἤᾔἢᾒἦᾖἡᾑἥᾕἣᾓἧᾗ]", "[ίὶῖἰἴἲἶἱἵἳἷϊΐῒῗ]", "[όὸὀὄὂὁὅὃ]", "[ύὺῦὐὔὒὖὑὕὓὗϋΰῢῧ]", "[ῳώῴὼῲῶῷὠᾠὤᾤὢᾢὦᾦὡᾡὥᾥὣᾣὧᾧ]", "ῥ", "ς"}, to = {"α" , "ε" , "η" , "ι" , "ο" , "υ" , "ω" , "ρ", "σ"}} , } m["en"] = { canonicalName = "Anglice", otherNames = {"Anglica", "Anglicae", "Anglici", "Anglicum", "Anglica", "Anglicae", "English", "Anglicus", "Anglica", "Anglicarum", "Modern English", "New English", "Hawaiian Creole English", "Hawai'ian Creole English", "Hawaiian Creole", "Hawai'ian Creole", "Polari", "Yinglish"}, -- all but the first three are names and alt names of subsumed dialects which once had ISO codes scripts = {"Latn", "Shaw", "Dsrt"}, -- last two are rare but probably attested; entries in them might require community approval, but it's good for the script codes not to be orphans family = "gmw", ancestors = {"enm"}, sort_key = { from = {"[äàáâåā]", "[ëèéêē]", "[ïìíîī]", "[öòóôō]", "[üùúûū]", "æ" , "œ" , "[çč]", "ñ", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "ae", "oe", "c" , "n"}}, wikimedia_codes = {"en", "simple"}, standardChars = "A-Za-z0-9" .. PUNCTUATION .. u(0x2800) .. "-" .. u(0x28FF) } m["eo"] = { canonicalName = "Esperantice", otherNames = {"Esperantica", "Esperanticae", "Esperantici", "Esperanticum", "Esperantica", "Esperanticae", "Esperanto", "Esperanticus", "Esperantica", "Esperanticarum"}, scripts = {"Latn"}, family = "art", sort_key = { from = {"[áà]", "[éè]", "[íì]", "[óò]", "[úù]", "[ĉ]", "[ĝ]", "[ĥ]", "[ĵ]", "[ŝ]", "[ŭ]"}, to = {"a" , "e" , "i" , "o" , "u", "cĉ", "gĉ", "hĉ", "jĉ", "sĉ", "uĉ"}} , } m["es"] = { canonicalName = "Hispanice", otherNames = {"Hispanica", "Hispanicae", "Hispanici", "Hispanicum", "Hispanica", "Hispanicae", "español", "Hispanicus", "Hispanica", "Hispanicarum", "Castilian"}, scripts = {"Latn"}, family = "roa", ancestors = {"osp"}, sort_key = { from = {"á", "é", "í", "ó", "[úü]", "ç", "ñ"}, to = {"a", "e", "i", "o", "u" , "c", "n"}}, standardChars = "A-VXYZa-vxyz0-9ÁáÉéÍíÓóÚúÑñ¿¡" .. PUNCTUATION } m["et"] = { canonicalName = "Estonice", otherNames = {"Estonica", "Estonicae", "Estonici", "Estonicum", "Estonica", "Estonicae", "eesti keel", "Estonicus", "Estonica", "Estonicarum"}, scripts = {"Latn"}, family = "fiu-fin", } m["eu"] = { canonicalName = "Vasconice", otherNames = {"Vasconica", "Vasconicae", "Vasconici", "Vasconicum", "Vasconica", "Vasconicae", "Euskara", "Vasconicus", "Vasconica", "Vasconicarum"}, scripts = {"Latn"}, family = "euq", } m["fa"] = { canonicalName = "Persice", otherNames = {"Persica", "Persicae", "Persici", "Persicum", "Persica", "Persicae", "فارسی", "Persicus", "Persica", "Persicarum", "Farsi", "New Persian", "Modern Persian", "Western Persian", "Iranian Persian", "Eastern Persian", "Dari", "Aimaq", "Aimak", "Aymaq", "Eimak"}, scripts = {"fa-Arab"}, family = "ira-wes", ancestors = {"pal"}, entry_name = { from = {u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652)}, to = {}} , } m["ff"] = { canonicalName = "Fula", otherNames = {"Adamawa Fulfulde", "Bagirmi Fulfulde", "Borgu Fulfulde", "Central-Eastern Niger Fulfulde", "Fulani", "Fulfulde", "Maasina Fulfulde", "Nigerian Fulfulde", "Pular", "Pulaar", "Western Niger Fulfulde"}, -- Maasina, etc are dialects, subsumed into this code scripts = {"Latn"}, family = "alv-sng", } m["fi"] = { canonicalName = "Finnice", otherNames = {"Finnica", "Finnicae", "Finnici", "Finnicum", "Finnica", "Finnicae", "suomi", "Finnicus", "Finnica", "Finnicarum"}, scripts = {"Latn"}, family = "fiu-fin", entry_name = { from = {"ˣ"}, -- Used to indicate gemination of the next consonant to = {}}, sort_key = { from = {"[áàâã]", "[éèêẽ]", "[íìîĩ]", "[óòôõ]", "[úùûũ]", "[ýỳŷüű]", "[øõő]", "æ" , "œ" , "[čç]", "š", "ž", "ß" , "[':]"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "ö" , "ae", "oe", "c" , "s", "z", "ss"}} , } m["fj"] = { canonicalName = "Fidziane", otherNames = {"Fidziana", "Fidzianae", "Fidziani", "Fidzianum", "Fidziana", "Fidzianae", "?", "Fidzianus", "Fidziana", "Fidzianarum"}, scripts = {"Latn"}, family = "poz-occ", } m["fo"] = { canonicalName = "Faeroice", otherNames = {"Faeroica", "Faeroicae", "Faeroici", "Faeroicum", "Faeroica", "Faeroicae", "føroyskt", "Faeroicus", "Faeroica", "Faeroicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"non"}, } m["fr"] = { canonicalName = "Francogallice", otherNames = {"Francogallica", "Francogallicae", "Francogallici", "Francogallicum", "Francogallica", "Francogallicae", "français", "Francogallicus", "Francogallica", "Francogallicarum", "Modern French"}, scripts = {"Latn"}, family = "roa", ancestors = {"frm"}, sort_key = { from = {"[áàâä]", "[éèêë]", "[íìîï]", "[óòôö]", "[úùûü]", "[ýỳŷÿ]", "ç", "æ" , "œ" , "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "c", "ae", "oe"}}, standardChars = "A-Za-z0-9ÀÂÇÉÈÊËÎÏÔŒÛÙÜàâçéèêëîïôœûùü" .. PUNCTUATION } m["fy"] = { canonicalName = "Frisice", otherNames = {"Frisica", "Frisicae", "Frisici", "Frisicum", "Frisica", "Frisicae", "?", "Frisicus", "Frisica", "Frisicarum", "Western Frisian", "Frisian", "Frysk"}, scripts = {"Latn"}, family = "gmw-fri", ancestors = {"ofs"}, } m["ga"] = { canonicalName = "Hibernice", otherNames = {"Hibernica", "Hibernicae", "Hibernici", "Hibernicum", "Hibernica", "Hibernicae", "Gaeilge", "Hibernicus", "Hibernica", "Hibernicarum", "Irish Gaelic"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"á", "é", "í", "ó", "ú", "ý", "ḃ" , "ċ" , "ḋ" , "ḟ" , "ġ" , "ṁ" , "ṗ" , "ṡ" , "ṫ" }, to = {"a", "e", "i", "o", "u", "y", "bh", "ch", "dh", "fh", "gh", "mh", "ph", "sh", "th"}} , } m["gd"] = { canonicalName = "Gaelice", otherNames = {"Gaelica", "Gaelicae", "Gaelici", "Gaelicum", "Gaelica", "Gaelicae", "Gàidhlig", "Gaelicus", "Gaelica", "Gaelicarum", "Highland Gaelic", "Scots Gaelic", "Scottish"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"[áà]", "[éè]", "[íì]", "[óò]", "[úù]", "[ýỳ]"}, to = {"a" , "e" , "i" , "o" , "u" , "y" }} , } m["gl"] = { canonicalName = "Gallaice", otherNames = {"Gallaica", "Gallaicae", "Gallaici", "Gallaicum", "Gallaica", "Gallaicae", "galego", "Gallaicus", "Gallaica", "Gallaicarum", "Galician"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-opt"}, sort_key = { from = {"á", "é", "í", "ó", "ú"}, to = {"a", "e", "i", "o", "u"}} , } m["gn"] = { canonicalName = "Guaraní", scripts = {"Latn"}, family = "tup", } m["gu"] = { canonicalName = "Gujarati", scripts = {"Gujr"}, family = "inc", ancestors = {"inc-ogu"}, translit_module = "gu-translit", } m["gv"] = { canonicalName = "Manx", otherNames = {"Manx Gaelic"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"ç", "-"}, to = {"c"}} , } m["ha"] = { canonicalName = "Hausa", scripts = {"Latn", "Arab"}, family = "cdc-wst", } m["he"] = { canonicalName = "Hebraice", otherNames = {"Hebraica", "Hebraicae", "Hebraici", "Hebraicum", "Hebraica", "Hebraicae", "עִבְרִית", "Hebraicus", "Hebraica", "Hebraicarum", "Ivrit"}, scripts = {"Hebr", "Phnx"}, family = "sem-can", entry_name = { from = {"[" .. u(0x0591) .. "-" .. u(0x05BD) .. u(0x05BF) .. "-" .. u(0x05C5) .. u(0x05C7) .. "]"}, to = {}} , } m["hi"] = { canonicalName = "Hindice", otherNames = {"Hindica", "Hindicae", "Hindici", "Hindicum", "Hindica", "Hindicae", "हिन्दी", "Hindicus", "Hindica", "Hindicarum", "hindī"}, scripts = {"Deva"}, family = "inc", ancestors = {"inc-ohi"}, translit_module = "hi-translit", } m["ho"] = { canonicalName = "Hiri Motu", otherNames = {"Pidgin Motu", "Police Motu"}, scripts = {"Latn"}, family = "crp", ancestors = {"meu"}, } m["hr"] = { canonicalName = "Croate", otherNames = {"Croata", "Croatae", "Croati", "Croatum", "Croata", "Croatae", "hrvatski", "Croatus", "Croata", "Croatarum", "Croatian"}, scripts = {"Latn"}, family = "zlw", } m["ht"] = { canonicalName = "Haitiane", otherNames = {"Haitiana", "Haitianae", "Haitiani", "Haitianum", "Haitiana", "Haitianae", "kreyòl", "Haitianus", "Haitiana", "Haitianarum", "Creole", "Haitian"}, scripts = {"Latn"}, family = "crp", } m["hu"] = { canonicalName = "Hungarice", otherNames = {"Hungarica", "Hungaricae", "Hungarici", "Hungaricum", "Hungarica", "Hungaricae", "magyar", "Hungaricus", "Hungarica", "Hungaricarum"}, scripts = {"Latn"}, family = "fiu-ugr", ancestors = {"ohu"}, sort_key = { from = {"á", "é", "í", "ó", "ú", "ő", "ű"}, to = {"a", "e", "i", "o", "u", "ö", "ü"}} , } m["hy"] = { canonicalName = "Armenie", otherNames = {"Armenia", "Armeniae", "Armenii", "Armenium", "Armenia", "Armeniae", "Հայերէն", "Armenius", "Armenia", "Armeniarum", "Modern Armenian", "Eastern Armenian", "Western Armenian"}, scripts = {"Armn"}, family = "hyx", ancestors = {"axm"}, translit_module = "Armn-translit", sort_key = { from = {"ու", "և", "եւ"}, to = {"ւ", "եվ", "եվ"}}, entry_name = { from = {"՞", "՜", "՛", "՟", "և", "<sup>յ</sup>", "<sup>ի</sup>"}, to = {"", "", "", "", "եւ", "յ", "ի"}} , } m["hz"] = { canonicalName = "Herero", scripts = {"Latn"}, family = "bnt", } m["ia"] = { canonicalName = "Interlingua", otherNames = {"Interlingua"}, scripts = {"Latn"}, family = "art", } m["id"] = { canonicalName = "Indonesie", otherNames = {"Indonesia", "Indonesiae", "Indonesii", "Indonesium", "Indonesia", "Indonesiae", "Bahasa Indonesia", "Indonesius", "Indonesia", "Indonesiarum"}, scripts = {"Latn"}, family = "poz-mly", ancestors = {"ms"}, } m["ie"] = { canonicalName = "Interlingue", otherNames = {"Occidental"}, scripts = {"Latn"}, family = "art", } m["ig"] = { canonicalName = "Igbo", scripts = {"Latn"}, family = "nic-bco", } m["ii"] = { canonicalName = "Sichuan Yi", otherNames = {"Nuosu", "Nosu", "Northern Yi", "Liangshan Yi"}, scripts = {"Yiii"}, family = "tbq-lol", } m["ik"] = { canonicalName = "Inupiak", otherNames = {"Inupiaq", "Iñupiaq", "Inupiatun"}, scripts = {"Latn"}, family = "esx-inu", } m["io"] = { canonicalName = "Ido", scripts = {"Latn"}, family = "art", } m["is"] = { canonicalName = "Islandice", otherNames = {"Islandica", "Islandicae", "Islandici", "Islandicum", "Islandica", "Islandicae", "íslenska", "Islandica", "Islandica", "Islandicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"non"}, } m["it"] = { canonicalName = "Italice", otherNames = {"Italica", "Italicae", "Italici", "Italicum", "Italica", "Italicae", "italiano", "Italicus", "Italica", "Italicarum", "Italiana"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oit"}, sort_key = { from = {"[àáâäå]", "[èéêë]", "[ìíîï]", "[òóôö]", "[ùúûü]"}, to = {"a" , "e" , "i" , "o" , "u" }} , } m["iu"] = { canonicalName = "Inuktitut", otherNames = {"Eastern Canadian Inuktitut", "Eastern Canadian Inuit", "Western Canadian Inuktitut", "Western Canadian Inuit", "Western Canadian Inuktun", "Inuinnaq", "Inuinnaqtun", "Inuvialuk", "Inuvialuktun", "Nunavimmiutit", "Nunatsiavummiut", "Aivilimmiut", "Natsilingmiut", "Kivallirmiut", "Siglit", "Siglitun"}, scripts = {"Cans", "Latn"}, family = "esx-inu", translit_module = "iu-translit", } m["ja"] = { canonicalName = "Iaponice", otherNames = {"Iaponica", "Iaponicae", "Iaponici", "Iaponicum", "Iaponica", "Iaponicae", "日本語", "Iaponicus", "Iaponica", "Iaponicarum", "Nihongo", "Modern Japanese", "Nipponese"}, scripts = {"Jpan", "Latn", "Hira"}, family = "jpx", ancestors = {"ojp"}, } m["jv"] = { canonicalName = "Iavense", otherNames = {"Iavensis", "Iavenses", "Iavenses", "Iavense", "Iavensia", "Iavensis", "basa Jawa", "Iavensis", "Iavensi", "Iavensium"}, scripts = {"Latn", "Java"}, family = "poz-sus", translit_module = "jv-translit", ancestors = {"kaw"}, link_tr = true, } m["ka"] = { canonicalName = "Georgiane", otherNames = {"Georgiana", "Georgianae", "Georgiani", "Georgianum", "Georgiana", "Georgianae", "ქართული", "Georgianus", "Georgiana", "Georgianarum", "Kartvelian"}, scripts = {"Geor", "Geok"}, family = "ccs-gzn", ancestors = {"oge"}, translit_module = "Geor-translit", entry_name = { from = {"̂"}, to = {""}}, } m["kg"] = { canonicalName = "Kongo", otherNames = {"Kikongo", "Koongo", "Laari", "San Salvador Kongo", "Yombe"}, scripts = {"Latn"}, family = "bnt", } m["ki"] = { canonicalName = "Kikuyu", otherNames = {"Gikuyu", "Gĩkũyũ"}, scripts = {"Latn"}, family = "bnt", } m["kj"] = { canonicalName = "Kwanyama", otherNames = {"Kuanyama", "Oshikwanyama"}, scripts = {"Latn"}, family = "bnt", } m["kk"] = { "Kazachice", "Q9252", "trk-kno", otherNames = {"Kazachica", "Kazachicae", "Kazachici", "Kazachicum", "Kazachica", "Kazachicae", "Қазақ тілі", "Kazachicus", "Kazachica", "Kazachicarum"}, scripts = {"Cyrl", "Latn", "kk-Arab"}, translit_module = "kk-translit", override_translit = true, } m["kl"] = { canonicalName = "Groenlandice", otherNames = {"Groenlandica", "Groenlandicae", "Groenlandici", "Groenlandicum", "Groenlandica", "Groenlandicae", "Kalaallisut", "Groenlandicus", "Groenlandica", "Groenlandicarum"}, scripts = {"Latn"}, family = "esx-inu", } m["km"] = { canonicalName = "Khmer", otherNames = {"Cambodian"}, scripts = {"Khmr"}, family = "mkh", ancestors = {"mkh-mkm"}, translit_module = "km-translit", } m["kn"] = { canonicalName = "Kannada", scripts = {"Knda"}, family = "dra", translit_module = "kn-translit", } m["ko"] = { canonicalName = "Coreane", otherNames = {"Coreana", "Coreanae", "Coreani", "Coreanum", "Coreana", "Coreanae", "한국어", "Coreanus", "Coreana", "Coreanarum", "Modern Korean"}, scripts = {"Kore"}, family = "qfa-kor", ancestors = {"okm"}, translit_module = "ko-translit", } m["kr"] = { canonicalName = "Kanuri", otherNames = {"Kanembu", "Bilma Kanuri", "Central Kanuri", "Manga Kanuri", "Tumari Kanuri"}, scripts = {"Latn"}, family = "ssa", } m["ks"] = { canonicalName = "Caspirice", otherNames = {"Caspirica", "Caspiricae", "Caspirici", "Caspiricum", "Caspirica", "Caspiricae", "कॉशुर / کٲشُر", "Caspiricus", "Caspirica", "Caspiricarum", "Kashmiri"}, scripts = {"ks-Arab", "Deva"}, family = "inc-dar", } m["ku"] = { canonicalName = "Corduene", otherNames = {"Corduena", "Corduenae", "Cordueni", "Corduenum", "Corduena", "Corduenae", "kurdî", "Corduenus", "Corduena", "Corduenarum"}, scripts = {"Latn", "ku-Arab", "Armn", "Cyrl"}, family = "ira-wes", } m["kv"] = { "Komiense", "Q34114", "urj-prm", otherNames = {"Komiensis", "Komienses", "Komienses", "Komiense", "Komiensia", "Komiensis", "Коми кыв", "Komiensis", "Komiensi", "Komiensium", "Komi", "Komi-Zyryan"}, scripts = Cyrl, translit_module = "kv-translit", override_translit = true, } m["kw"] = { canonicalName = "Cornubice", otherNames = {"Cornubica", "Cornubicae", "Cornubici", "Cornubicum", "Cornubica", "Cornubicae", "Kernowek", "Cornubicus", "Cornubica", "Cornubicarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"cnx"}, } m["ky"] = { canonicalName = "Kyrgesse", otherNames = {"Kyrgessa", "Kyrgessae", "Kyrgessi", "Kyrgessum", "Kyrgessa", "Kyrgessae", "кыргызча", "Kyrgessus", "Kyrgessa", "Kyrgessarum", "Chirgisica", "Kirghiz", "Kirgiz"}, scripts = {"Cyrl", "Latn", "Arab"}, family = "trk-kip", translit_module = "ky-translit", } m["la"] = { canonicalName = "Latine", otherNames = {"Latina", "Latinae", "Latini", "Latinum", "Latina", "Latinae", "Latine", "Latinus", "Latina", "Latinarum"}, scripts = {"Latn"}, family = "itc", ancestors = {"itc-ola"}, entry_name = { from = {"[ĀĂ]", "[āă]", "[ĒĔ]", "[ēĕë]", "[ĪĬÏ]", "[īĭï]", "[ŌŎ]", "[ōŏ]", "[ŪŬÜ]", "[ūŭü]", "Ȳ", "ȳ", MACRON, BREVE, DIAER}, to = {"A", "a", "E", "e", "I", "i", "O", "o", "U", "u", "Y", "y"}}, } m["lb"] = { canonicalName = "Luxemburgice", otherNames = {"Luxemburgica", "Luxemburgicae", "Luxemburgici", "Luxemburgicum", "Luxemburgica", "Luxemburgicae", "Lëtzebuergesch", "Luxemburgicus", "Luxemburgica", "Luxemburgicarum"}, scripts = {"Latn"}, family = "gmw", ancestors = {"gmh"}, } m["lg"] = { canonicalName = "Luganda", otherNames = {"Ganda"}, scripts = {"Latn"}, family = "bnt", } m["li"] = { canonicalName = "Limburgice", otherNames = {"Limburgica", "Limburgicae", "Limburgici", "Limburgicum", "Limburgica", "Limburgicae", "Limburgs", "Limburgicus", "Limburgica", "Limburgicarum", "Limburgan", "Limburgian", "Limburgic"}, scripts = {"Latn"}, family = "gmw", ancestors = {"dum"}, } m["ln"] = { canonicalName = "Lingala", scripts = {"Latn"}, family = "bnt", } m["lo"] = { canonicalName = "Lao", otherNames = {"Laotian"}, scripts = {"Laoo"}, family = "tai-swe", translit_module = "lo-translit", } m["lt"] = { canonicalName = "Lithuanice", otherNames = {"Lithuanica", "Lithuanicae", "Lithuanici", "Lithuanicum", "Lithuanica", "Lithuanicae", "lietuvių", "Lithuanicus", "Lithuanica", "Lithuanicarum"}, scripts = {"Latn"}, family = "bat", ancestors = {"olt"}, entry_name = { from = {"[áãà]", "[ÁÃÀ]", "[éẽè]", "[ÉẼÈ]", "[íĩì]", "[ÍĨÌ]", "[ýỹ]", "[ÝỸ]", "ñ", "[óõò]", "[ÓÕÒ]", "[úũù]", "[ÚŨÙ]", ACUTE, GRAVE, TILDE}, to = {"a", "A", "e", "E", "i", "I", "y", "Y", "n", "o", "O", "u", "U"}} , } m["lu"] = { canonicalName = "Luba-Katanga", scripts = {"Latn"}, family = "bnt", } m["lv"] = { canonicalName = "Lettice", otherNames = {"Lettica", "Letticae", "Lettici", "Letticum", "Lettica", "Letticae", "latviešu", "Letticus", "Lettica", "Letticarum", "Lettonica", "Lettish", "Lett"}, scripts = {"Latn"}, family = "bat", } -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf m["ma"] = { canonicalName = "Magare", otherNames = {"Magara", "Magarae", "Magari", "Magarum", "Magara", "Magarae", "léngua magara", "Magarus", "Magara", "Magararum", "Magarian"}, scripts = {"Latn"}, family = "roa-itd", } m["mg"] = { canonicalName = "Madagascariense", otherNames = {"Madagascariensis", "Madagascarienses", "Madagascarienses", "Madagascariense", "Madagascariensia", "Madagascariensis", "malagasy", "Madagascariensis", "Madagascariensi", "Madagascariensium", "Betsimisaraka Malagasy", "Betsimisaraka", "Northern Betsimisaraka Malagasy", "Northern Betsimisaraka", "Southern Betsimisaraka Malagasy", "Southern Betsimisaraka", "Bara Malagasy", "Bara", "Masikoro Malagasy", "Masikoro", "Antankarana", "Antankarana Malagasy", "Plateau Malagasy", "Sakalava", "Tandroy Malagasy", "Tandroy", "Tanosy", "Tanosy Malagasy", "Tesaka", "Tsimihety", "Tsimihety Malagasy"}, scripts = {"Latn"}, family = "poz-bre", } m["mh"] = { canonicalName = "Marshallese", scripts = {"Latn"}, family = "poz-mic", sort_key = { from = {"ā" , "ļ" , "m̧" , "ņ" , "n̄" , "o̧" , "ō" , "ū" }, to = {"a~", "l~", "m~", "n~", "n~~", "o~", "o~~", "u~"}} , } m["mi"] = { canonicalName = "Maorice", otherNames = {"Maorica", "Maoricae", "Maorici", "Maoricum", "Maorica", "Maoricae", "Māori", "Maoricus", "Maorica", "Maoricarum"}, scripts = {"Latn"}, family = "poz-pol", } m["mk"] = { canonicalName = "Macedonice", otherNames = {"Macedonica", "Macedonicae", "Macedonici", "Macedonicum", "Macedonica", "Macedonicae", "Македонски јазик", "Macedonicus", "Macedonica", "Macedonicarum" }, scripts = {"Cyrl"}, family = "zls", translit_module = "mk-translit", entry_name = { from = {ACUTE}, to = {}}, } m["ml"] = { canonicalName = "Malayalam", scripts = {"Mlym"}, family = "dra", translit_module = "ml-translit", } m["mn"] = { canonicalName = "Mogolice", otherNames = {"Mogolica", "Mogolicae", "Mogolici", "Mogolicum", "Mogolica", "Mogolicae", "Монгол хэл", "Mogolicus", "Mogolica", "Mogolicarum", "Khalkha Mongolian"}, scripts = {"Cyrl", "Mong"}, family = "xgn", ancestors = {"cmg"}, translit_module = "mn-translit", } m["mr"] = { canonicalName = "Marathi", scripts = {"Deva", "Modi"}, family = "inc", ancestors = {"omr"}, translit_module = "hi-translit", } m["ms"] = { canonicalName = "Malaice", otherNames = {"Malaica", "Malaicae", "Malaici", "Malaicum", "Malaica", "Malaicae", "Bahasa Melayu", "Malaicus", "Malaica", "Malaicarum"}, scripts = {"Latn", "Arab"}, family = "poz-mly", } m["mt"] = { canonicalName = "Melitense", otherNames = {"Melitensis", "Melitenses", "Melitenses", "Melitense", "Melitensia", "Melitensis", "Malti", "Melitensis", "Melitensi", "Melitensium"}, scripts = {"Latn"}, family = "sem-arb", ancestors = {"sqr"}, } m["my"] = { canonicalName = "Birmanice", otherNames = {"Birmanica", "Birmanicae", "Birmanici", "Birmanicum", "Birmanica", "Birmanicae", "မ္ရန္‌မာစာ", "Birmanicus", "Birmanica", "Birmanicarum", "Burmese", "Myanmar"}, scripts = {"Mymr"}, family = "tbq-brm", ancestors = {"obr"}, translit_module = "my-translit", } m["na"] = { canonicalName = "Nauruane", otherNames = {"Nauruana", "Nauruanae", "Nauruani", "Nauruanum", "Nauruana", "Nauruanae", "Nauru", "Nauruanus", "Nauruana", "Nauruanarum"}, scripts = {"Latn"}, family = "poz-mic", } m["nb"] = { canonicalName = "Dano-Norvegice", otherNames = {"Dano-Norvegica", "Dano-Norvegicae", "Dano-Norvegici", "Dano-Norvegicum", "Dano-Norvegica", "Dano-Norvegicae", "Bokmål", "Dano-Norvegicus", "Dano-Norvegica", "Dano-Norvegicarum", "Norwegian", "Norsk"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, wikimedia_codes = {"no"}, } m["nd"] = { canonicalName = "Northern Ndebele", otherNames = {"North Ndebele"}, scripts = {"Latn"}, family = "bnt-ngu", } m["ne"] = { canonicalName = "Nepalense", otherNames = {"Nepalensis", "Nepalenses", "Nepalenses", "Nepalense", "Nepalensia", "Nepalensis", "नेपाली", "Nepalensis", "Nepalensi", "Nepalensium", "Nepalese"}, scripts = {"Deva"}, family = "inc", translit_module = "ne-translit", } m["ng"] = { canonicalName = "Ndonga", scripts = {"Latn"}, family = "bnt", } m["nl"] = { canonicalName = "Batave", otherNames = {"Batava", "Batavae", "Batavi", "Batavum", "Batava", "Batavae", "Nederlands", "Batavus", "Batava", "Batavarum", "Netherlandic", "Flemish"}, scripts = {"Latn"}, family = "gmw", ancestors = {"dum"}, sort_key = { from = {"[äáâå]", "[ëéê]", "[ïíî]", "[öóô]", "[üúû]", "ç", "ñ", "^-"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "n"}} , } m["nn"] = { canonicalName = "Neonorvegice", otherNames = {"Neonorvegica", "Neonorvegicae", "Neonorvegici", "Neonorvegicum", "Neonorvegica", "Neonorvegicae", "Nynorsk", "Neonorvegicus", "Neonorvegica", "Neonorvegicarum", "New Norwegian"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, } m["no"] = { canonicalName = "Norvegice", otherNames = {"Norvegica", "Norvegicae", "Norvegici", "Norvegicum", "Norvegica", "Norvegicae", "Norsk", "Norvegicus", "Norvegica", "Norvegicarum", "Norwegian"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, } m["nr"] = { canonicalName = "Southern Ndebele", otherNames = {"South Ndebele"}, scripts = {"Latn"}, family = "bnt-ngu", } m["nv"] = { canonicalName = "Navajo", scripts = {"nv-Latn"}, family = "apa", sort_key = { from = {"[áą]", "[éę]", "[íį]", "[óǫ]", "ń", "^n([djlt])", "ł" , "[ʼ’']", ACUTE}, to = {"a" , "e" , "i" , "o" , "n", "ni%1" , "l"}}, -- the copyright sign is used to guarantee that ł will always be sorted after all other words with l } m["ny"] = { canonicalName = "Chichewa", otherNames = {"Chicheŵa", "Chinyanja", "Nyanja", "Chewa"}, scripts = {"Latn"}, family = "bnt", entry_name = { from = {ACUTE}, to = {}}, } m["oc"] = { canonicalName = "Occitane", otherNames = {"Occitana", "Occitanae", "Occitani", "Occitanum", "Occitana", "Occitanae", "occitan", "Occitanus", "Occitana", "Occitanarum", "Provençal", "Auvergnat", "Auvernhat", "Gascon", "Languedocien", "Lengadocian", "Shuadit", "Chouhadite", "Chouhadit", "Chouadite", "Chouadit", "Shuhadit", "Judeo-Provençal", "Judeo-Provencal", "Judeo-Comtadin"}, scripts = {"Latn", "Hebr"}, family = "roa", ancestors = {"pro"}, sort_key = { from = {"[àá]", "[èé]", "[íï]", "[òó]", "[úü]", "ç", "([lns])·h"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "%1h" }} , } m["oj"] = { "Ojibwayense", "Q33875", "alg", otherNames = {"Ojibwayensis", "Ojibwayenses", "Ojibwayenses", "Ojibwayense", "Ojibwayensia", "Ojibwayensis", "Anishinaabemowin / ᐊᓂᔑᓈᐯᒧᐎᓐ", "Ojibwayensis", "Ojibwayensi", "Ojibwayensium"}, aliases = {"Ojibway", "Ojibwa"}, varieties = {{"Chippewa", "Ojibwemowin", "Southwestern Ojibwa"}}, scripts = {"Cans", "Latn"}, sort_key = { from = {"aa", "ʼ", "ii", "oo", "sh", "zh"}, to = {"a~", "h~", "i~", "o~", "s~", "z~"}} , } m["om"] = { canonicalName = "Oromo", otherNames = {"Orma", "Borana-Arsi-Guji Oromo", "West Central Oromo"}, scripts = {"Latn", "Ethi"}, family = "cus", } m["or"] = { canonicalName = "Oriya", otherNames = {"Odia", "Oorya"}, scripts = {"Orya"}, family = "inc", ancestors = {"pka"}, } m["os"] = { canonicalName = "Alane", otherNames = {"Ossete", "Ossetic", "Digor", "Iron"}, scripts = {"Cyrl", "Geor", "Latn"}, family = "ira", translit_module = "os-translit", ancestors = {"oos"}, entry_name = { from = {GRAVE, ACUTE}, to = {}} , } m["pa"] = { canonicalName = "Punjabi", otherNames = {"Panjabi"}, scripts = {"Guru", "Arab", "Deva"}, family = "inc", translit_module = "pa-translit", ancestors = {"psu"}, } m["pi"] = { canonicalName = "Pali", scripts = {"Latn", "Deva", "Sinh", "Mymr", "Khmr", "Thai"}, family = "inc", ancestors = {"bh"}, sort_key = { from = {"ā", "ī", "ū", "ḍ", "ḷ", "[ṁṃ]", "[ṇñṅ]", "ṭ"}, to = {"a", "i", "u", "d", "l", "m" , "n" , "t"}} , } m["pl"] = { canonicalName = "Polonice", otherNames = {"Polonica", "Polonicae", "Polonici", "Polonicum", "Polonica", "Polonicae", "język polski", "Polonicus", "Polonica", "Polonicarum"}, scripts = {"Latn"}, family = "zlw", ancestors = {"zlw-opl"}, sort_key = { from = {"[Ąą]", "[Ćć]", "[Ęę]", "[Łł]", "[Ńń]", "[Óó]", "[Śś]", "[Żż]", "[Źź]"}, to = { "a" .. u(0x10FFFF), "c" .. u(0x10FFFF), "e" .. u(0x10FFFF), "l" .. u(0x10FFFF), "n" .. u(0x10FFFF), "o" .. u(0x10FFFF), "s" .. u(0x10FFFF), "z" .. u(0x10FFFF), "z" .. u(0x10FFFE)}} , } m["ps"] = { canonicalName = "Afganice", otherNames = {"Afganica", "Afganicae", "Afganici", "Afganicum", "Afganica", "Afganicae", "پښتو", "Afganicus", "Afganica", "Afganicarum", "Pashtun", "Pushto", "Pashtu", "Central Pashto", "Northern Pashto", "Southern Pashto", "Pukhto", "Pakhto", "Pakkhto", "Afghani"}, scripts = {"ps-Arab"}, family = "ira-eas", } m["pt"] = { canonicalName = "Lusitane", otherNames = {"Lusitana", "Lusitanae", "Lusitani", "Lusitanum", "Lusitana", "Lusitanae", "português", "Lusitanus", "Lusitana", "Lusitanarum", "Modern Portuguese"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-opt"}, sort_key = { from = {"[àãáâä]", "[èẽéêë]", "[ìĩíï]", "[òóôõö]", "[üúùũ]", "ç", "ñ"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "n"}} , } m["qu"] = { canonicalName = "Quechua", otherNames = {"Quechua", "Quechuae", "Quechui", "Quechuum", "Quechua", "Quechuae", "Runasimi", "Quechuus", "Quechua", "Quechuarum", "Qhichwa simi"}, scripts = {"Latn"}, family = "qwe", } m["rm"] = { canonicalName = "Raetice", otherNames = {"Raetica", "Raeticae", "Raetici", "Raeticum", "Raetica", "Raeticae", "rumantsch", "Raeticus", "Raetica", "Raeticarum", "Romansh", "Romanche"}, scripts = {"Latn"}, family = "roa", } m["rn"] = { canonicalName = "Kirundi", scripts = {"Latn"}, family = "bnt", } m["ro"] = { canonicalName = "Dacoromane", otherNames = {"Dacoromana", "Dacoromanae", "Dacoromani", "Dacoromanum", "Dacoromana", "Dacoromanae", "româna", "Dacoromanus", "Dacoromana", "Dacoromanarum", "Daco-Romanian", "Roumanian", "Rumanian"}, scripts = {"Latn", "Cyrl"}, family = "roa", sort_key = { from = {"ă" , "â" , "î" , "ș" , "ț" }, to = {"a~", "a~~", "i~", "s~", "t~"}}, } m["ru"] = { canonicalName = "Ruthenice", otherNames = {"Ruthenica", "Ruthenicae", "Ruthenici", "Ruthenicum", "Ruthenica", "Ruthenicae", "русский язык", "Ruthenicus", "Ruthenica", "Ruthenicarum"}, scripts = {"Cyrl"}, family = "zle", translit_module = "ru-translit", sort_key = { from = {"ё"}, to = {"е" .. mw.ustring.char(0x10FFFF)}}, entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["rw"] = { canonicalName = "Kinyarwanda", otherNames = {"Rwanda"}, scripts = {"Latn"}, family = "bnt", } m["sa"] = { canonicalName = "Sanscrite", otherNames = {"Sanscrita", "Sanscritae", "Sanscriti", "Sanscritum", "Sanscrita", "Sanscritae", "संस्कृत", "Sanscritus", "Sanscrita", "Sanscritarum"}, scripts = {"Deva", "Beng", "Brah", "Gran", "Gujr", "Guru", "Khar", "Knda", "Mlym", "Mymr", "Orya", "Shrd", "Sinh", "Taml", "Telu", "Thai", "Tibt"}, family = "inc", translit_module = "sa-translit", } -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf m["sc"] = { "Sarde", "Q33976", "roa", otherNames = {"Sarda", "Sardae", "Sardi", "Sardum", "Sarda", "Sardae", "sarda", "Sardus", "Sarda", "Sardarum", "Campidanese", "Campidanese Sardinian", "Logudorese", "Logudorese Sardinian", "Nuorese", "Nuorese Sardinian"}, scripts = {"Latn"}, } m["sd"] = { canonicalName = "Sindhi", scripts = {"sd-Arab", "Deva"}, family = "inc", } -- otherNames is used for inflected forms: 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf m["se"] = { canonicalName = "Lapponica Septentrionali", otherNames = {"Lapponica Septentrionalis", "Lapponicae Septentrionales", "Lapponici Septentrionales", "Lapponicum Septentrionale", "Lapponica Septentrionalia", "Lapponicae Septentrionalis", "Davvisámegiella", "Lapponicus Septentrionalis", "Lapponica Septentrionali", "Lapponicarum Septentrionalium", "Samica septentrionalis", "North Sami", "Northern Saami", "North Saami"}, scripts = {"Latn"}, family = "smi", entry_name = { from = {"([đflmnŋrsšŧv])'%1"}, to = {"%1%1"} }, } m["sg"] = { canonicalName = "Sango", scripts = {"Latn"}, family = "crp", } m["sh"] = { canonicalName = "Servocroate", otherNames = {"Servocroata", "Servocroatae", "Servocroati", "Servocroatum", "Servocroata", "Servocroatae", "srpskohrvatski", "Servocroatus", "Servocroata", "Servocroatarum", "BCS", "Croato-Serbian", "Serbocroatian", "Bosnian", "Croatian", "Montenegrin", "Serbian"}, scripts = {"Latn", "Cyrl"}, family = "zls", entry_name = { from = {"[ȀÀȂÁĀ]", "[ȁàȃáā]", "[ȄÈȆÉĒ]", "[ȅèȇéē]", "[ȈÌȊÍĪ]", "[ȉìȋíī]", "[ȌÒȎÓŌ]", "[ȍòȏóō]", "[ȐȒŔ]", "[ȑȓŕ]", "[ȔÙȖÚŪ]", "[ȕùȗúū]", "Ѐ", "ѐ", "[ӢЍ]", "[ӣѝ]", "[Ӯ]", "[ӯ]", GRAVE, ACUTE, DGRAVE, INVBREVE, MACRON}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "Е", "е", "И" , "и", "У", "у" }}, wikimedia_codes = {"sh", "bs", "hr", "sr"}, } m["si"] = { canonicalName = "Sinhalese", otherNames = {"Singhalese", "Sinhala"}, scripts = {"Sinh"}, family = "inc", ancestors = {"pmh"}, translit_module = "si-translit", } m["sk"] = { canonicalName = "Slovace", otherNames = {"Slovaca", "Slovacae", "Slovaci", "Slovacum", "Slovaca", "Slovacae", "slovenčina", "Slovacus", "Slovaca", "Slovacarum"}, scripts = {"Latn"}, family = "zlw", sort_key = { from = {"[áä]", "é", "í", "[óô]", "ú", "ý", "ŕ", "ĺ"}, to = {"a" , "e", "i", "o" , "u", "y", "r", "l"}} , } m["sl"] = { canonicalName = "Slovene", otherNames = {"Slovena", "Slovenae", "Sloveni", "Slovenum", "Slovena", "Slovenae", "slovenščina", "Slovenus", "Slovena", "Slovenarum", "Slovenian"}, scripts = {"Latn"}, family = "zls", entry_name = { from = {"[ÁÀÂȂȀ]", "[áàâȃȁ]", "[ÉÈÊȆȄỆẸ]", "[éèêȇȅệẹə]", "[ÍÌÎȊȈ]", "[íìîȋȉ]", "[ÓÒÔȎȌỘỌ]", "[óòôȏȍộọ]", "[ŔȒȐ]", "[ŕȓȑ]", "[ÚÙÛȖȔ]", "[úùûȗȕ]", "ł", GRAVE, ACUTE, DGRAVE, INVBREVE, CIRC, DOTBELOW}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "l"}} , } m["sm"] = { canonicalName = "Samoane", otherNames = {"Samoana", "Samoanae", "Samoani", "Samoanum", "Samoana", "Samoanae", "gagana Sāmoa", "Samoanus", "Samoana", "Samoanarum"}, scripts = {"Latn"}, family = "poz-pol", } m["sn"] = { canonicalName = "Shona", scripts = {"Latn"}, family = "bnt", } m["so"] = { canonicalName = "Somali", scripts = {"Latn", "Arab", "Osma"}, family = "cus", entry_name = { from = {"[ÁÀÂ]", "[áàâ]", "[ÉÈÊ]", "[éèê]", "[ÍÌÎ]", "[íìî]", "[ÓÒÔ]", "[óòô]", "[ÚÙÛ]", "[úùû]", "[ÝỲ]", "[ýỳ]"}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "U" , "u", "Y", "y"}} , } m["sq"] = { canonicalName = "Illyrice", otherNames = {"Illyrica", "Illyricae", "Illyrici", "Illyricum", "Illyrica", "Illyricae", "shqipja", "Illyricus", "Illyrica", "Illyricarum"}, scripts = {"Latn", "Elba"}, family = "sqj", sort_key = { from = { '[âãä]', '[ÂÃÄ]', '[êẽë]', '[ÊẼË]', 'ĩ', 'Ĩ', 'õ', 'Õ', 'ũ', 'Ũ', 'ỹ', 'Ỹ', 'ç', 'Ç' }, to = { 'a', 'A', 'e', 'E', 'i', 'I', 'o', 'O', 'u', 'U', 'y', 'Y', 'c', 'C' } } , } m["sr"] = { canonicalName = "Service", otherNames = {"Servica", "Servicae", "Servici", "Servicum", "Servica", "Servicae", "српски / srpski", "Servicus", "Servica", "Servicarum"}, scripts = {"Latn", "Cyrl"}, family = "zls", translit_module = "sh-translit", entry_name = { from = {"[ȀÀȂÁĀ]", "[ȁàȃáā]", "[ȄÈȆÉĒ]", "[ȅèȇéē]", "[ȈÌȊÍĪ]", "[ȉìȋíī]", "[ȌÒȎÓŌ]", "[ȍòȏóō]", "[ȐȒŔ]", "[ȑȓŕ]", "[ȔÙȖÚŪ]", "[ȕùȗúū]", "Ѐ", "ѐ", "[ӢЍ]", "[ӣѝ]", "[Ӯ]", "[ӯ]", GRAVE, ACUTE, DGRAVE, INVBREVE, MACRON}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "Е", "е", "И" , "и", "У", "у" }}, -- wikimedia_codes = {"sh", "bs", "hr", "sr"}, } m["ss"] = { canonicalName = "Swazi", otherNames = {"Swati"}, scripts = {"Latn"}, family = "bnt-ngu", } m["st"] = { canonicalName = "Sotho Meridionali", otherNames = {"Sesotho", "Southern Sesotho", "Southern Sotho"}, scripts = {"Latn"}, family = "bnt", } m["su"] = { canonicalName = "Sondaice", scripts = {"Latn", "Sund"}, family = "poz-msa", translit_module = "su-translit", } m["sv"] = { canonicalName = "Suecice", otherNames = {"Suecica", "Suecicae", "Suecici", "Suecicum", "Suecica", "Suecicae", "svenska", "Suecicus", "Suecica", "Suecicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-osw"}, } m["sw"] = { canonicalName = "Suahelice", otherNames = {"Suahelica", "Suahelicae", "Suahelici", "Suahelicum", "Suahelica", "Suahelicae", "Kiswahili", "Suahelicus", "Suahelica", "Suahelicarum", "Settler Swahili", "KiSetla", "KiSettla", "Setla", "Settla", "Kitchen Swahili", "Kihindi", "Indian Swahili", "KiShamba", "Kishamba", "Field Swahili", "Kibabu", "Asian Swahili", "Kimanga", "Arab Swahili", "Kitvita", "Army Swahili"}, scripts = {"Latn", "Arab"}, family = "bnt", sort_key = { from = {"ng'", "^-"}, to = {"ngz"}} , } m["ta"] = { canonicalName = "Tamulice", otherNames = {"Tamulica", "Tamulicae", "Tamulici", "Tamulicum", "Tamulica", "Tamulicae", "தமிழ்", "Tamulicus", "Tamulica", "Tamulicarum", "Tamil"}, scripts = {"Taml"}, family = "dra", ancestors = {"oty"}, translit_module = "ta-translit", } m["te"] = { canonicalName = "Teluguice", scripts = {"Telu"}, family = "dra", translit_module = "te-translit", } m["tg"] = { canonicalName = "Tadzikice", otherNames = {"Tadzikica", "Tadzikicae", "Tadzikici", "Tadzikicum", "Tadzikica", "Tadzikicae", "Тоҷикӣ", "Tadzikicus", "Tadzikica", "Tadzikicarum", "Tajik", "Tadjik", "Tadzhik", "Tajiki", "Tajik Persian"}, scripts = {"Cyrl", "fa-Arab", "Latn"}, family = "ira-wes", ancestors = {"fa"}, translit_module = "tg-translit", sort_key = { from = {"Ё", "ё"}, to = {"Е" , "е"}} , entry_name = { from = {ACUTE}, to = {}} , } m["th"] = { canonicalName = "Siamense", otherNames = {"Siamensis", "Siamenses", "Siamenses", "Siamense", "Siamensia", "Siamensis", "ภาษาไทย", "Siamensis", "Siamensi", "Siamensium", "Thai"}, scripts = {"Thai"}, family = "tai-swe", translit_module = "th-translit", entry_name = { from = { "-" }, to = {}} , } m["ti"] = { canonicalName = "Tigrinya", scripts = {"Ethi"}, family = "sem-eth", translit_module = "Ethi-translit", } m["tk"] = { canonicalName = "Turcomannice", otherNames = {"Turcomannica", "Turcomannicae", "Turcomannici", "Turcomannicum", "Turcomannica", "Turcomannicae", "Türkmençe", "Turcomannicus", "Turcomannica", "Turcomannicarum", "Tүркменче", "Türkmen dili", "تورکمن ﺗﻴﻠی"}, scripts = {"Latn", "Cyrl"}, family = "trk-ogz", } m["tl"] = { canonicalName = "Tagale", otherNames = {"Tagala", "Tagalae", "Tagali", "Tagalum", "Tagala", "Tagalae", "Wikang Tagalog", "Tagalus", "Tagala", "Tagalarum"}, scripts = {"Latn", "Tglg"}, family = "phi", } m["tn"] = { canonicalName = "Tswana", otherNames = {"Setswana"}, scripts = {"Latn"}, family = "bnt", } m["to"] = { canonicalName = "Tongane", otherNames = {"Tongana", "Tonganae", "Tongani", "Tonganum", "Tongana", "Tonganae", "lea fakatonga", "Tonganus", "Tongana", "Tonganarum"}, scripts = {"Latn"}, family = "poz-pol", } m["tr"] = { canonicalName = "Turcice", otherNames = {"Turcica", "Turcicae", "Turcici", "Turcicum", "Turcica", "Turcicae", "Türkçe", "Turcicus", "Turcica", "Turcicarum"}, scripts = {"Latn"}, family = "trk-ogz", ancestors = {"ota"}, } m["ts"] = { canonicalName = "Tsonga", scripts = {"Latn"}, family = "bnt", } m["tt"] = { canonicalName = "Tatarice", otherNames = {"Tatarica", "Tataricae", "Tatarici", "Tataricum", "Tatarica", "Tataricae", "татарча / tatarça", "Tataricus", "Tatarica", "Tataricarum"}, scripts = {"Cyrl", "Latn", "Arab", "tt-Arab"}, family = "trk-kip", translit_module = "tt-translit", } m["ty"] = { canonicalName = "Tahitiane", otherNames = {"Tahitiana", "Tahitianae", "Tahitiani", "Tahitianum", "Tahitiana", "Tahitianae", "reo Mā’ohi", "Tahitianus", "Tahitiana", "Tahitianarum"}, scripts = {"Latn"}, family = "poz-pol", } m["ug"] = { canonicalName = "Uyghur", otherNames = {"Uigur", "Uighur", "Uygur"}, scripts = {"ug-Arab", "Latn", "Cyrl"}, family = "trk", ancestors = {"chg"}, translit_module = "ug-translit", } m["uk"] = { canonicalName = "Ucrainice", otherNames = {"Ucrainica", "Ucrainicae", "Ucrainici", "Ucrainicum", "Ucrainica", "Ucrainicae", "українська", "Ucrainicus", "Ucrainica", "Ucrainicarum"}, scripts = {"Cyrl"}, family = "zle", translit_module = "uk-translit", entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["ur"] = { canonicalName = "Urdu", otherNames = {"Urdu"}, scripts = {"ur-Arab"}, family = "inc", ancestors = {"psu"}, entry_name = { from = {u(0x064B), u(0x064C), u(0x064D), u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652)}, to = {}} , } m["uz"] = { canonicalName = "Usbece", otherNames = {"Northern Uzbek", "Southern Uzbek"}, scripts = {"Latn", "Cyrl", "fa-Arab"}, family = "trk", ancestors = {"chg"}, } m["ve"] = { canonicalName = "Venda", scripts = {"Latn"}, family = "bnt", } -- otherNames is used for inflected forms: 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf m["vi"] = { canonicalName = "Vietnamice", otherNames = {"Vietnamica", "Vietnamicae", "Vietnamici", "Vietnamicum", "Vietnamica", "Vietnamicae", "tiếng Việt", "Vietnamicus", "Vietnamica", "Vietnamicarum", "Annamese", "Annamite"}, scripts = {"Latn", "Hani"}, family = "mkh-vie", ancestors = {"mkh-mvi"}, } m["vo"] = { canonicalName = "Volapük", scripts = {"Latn"}, family = "art", } m["wa"] = { canonicalName = "Vallonice", otherNames = {"Vallonica", "Vallonicae", "Vallonici", "Vallonicum", "Vallonica", "Vallonicae", "walon", "Vallonicus", "Vallonica", "Vallonicarum"}, scripts = {"Latn"}, family = "roa", ancestors = {"fro"}, sort_key = { from = {"[áàâäå]", "[éèêë]", "[íìîï]", "[óòôö]", "[úùûü]", "[ýỳŷÿ]", "ç", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "c"}} , } m["wo"] = { canonicalName = "Wolof", otherNames = {"Gambian Wolof"}, -- the subsumed dialect 'wof' scripts = {"Latn", "Arab"}, family = "alv-sng", } m["xh"] = { canonicalName = "Xhosa", scripts = {"Latn"}, family = "bnt-ngu", } m["yi"] = { canonicalName = "Iudaeogermanice", otherNames = {"Iudaeogermanica", "Iudaeogermanicae", "Iudaeogermanici", "Iudaeogermanicum", "Iudaeogermanica", "Iudaeogermanicae", "יידיש", "Iudaeogermanicus", "Iudaeogermanica", "Iudaeogermanicarum", "Jiddisch"}, scripts = {"Hebr"}, family = "gmw", ancestors = {"gmh"}, translit_module = "yi-translit", } m["yo"] = { canonicalName = "Yoruba", scripts = {"Latn"}, family = "alv-von", } m["za"] = { canonicalName = "Zhuang", scripts = {"Latn", "Hani"}, family = "tai", } m["zh"] = { canonicalName = "Sinice", otherNames = {"Sinica", "Sinicae", "Sinici", "Sinicum", "Sinica", "Sinicae", "中文", "Sinicus", "Sinica", "Sinicarum"}, scripts = {"Hani"}, family = "sit", ancestors = {"ltc"}, } m["zu"] = { canonicalName = "Zuluane", otherNames = {"Zuluana", "Zuluanae", "Zuluani", "Zuluanum", "Zuluana", "Zuluanae", "isiZulu", "Zuluanus", "Zuluana", "Zuluanarum"}, scripts = {"Latn"}, family = "bnt-ngu", } return m g6v1978k8ddniao3m5xbj3p2g35m12s 220205 220202 2022-08-14T14:29:48Z YaganZ 4537 corr. Scribunto text/plain -- Module:languages/data2 -- imported from en.wiktionary -- 2022-08-14 -- V25 -- sh-translit module, last modified by Usor:YaganZ -- 2021-12-27 -- V24 -- bn, ma, sc experimental, last modified by Usor:YaganZ -- 2021-03-14 -- V23 -- +kv = kpv = Komiense, last modified by Usor:YaganZ -- 2020-12-06 -- V22 -- +cu, last modified by Usor:YaganZ -- 2020-10-14 -- V21 -- +oj, last modified by Usor:YaganZ -- 2020-07-16 -- V20 -- +genplf, last modified by Usor:YaganZ -- canonicalNames are translated into Latin adverbial, ablative or neuter forms (if available in Categoria:Formulae linguarum), -- missing entries added. -- otherNames 1-6 are used for mostly used inflected forms, 7=own name, 8=non-inflected form, 9-n are rarely used inflected forms: -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf local u = mw.ustring.char -- UTF-8 encoded strings for some commonly-used diacritics local GRAVE = u(0x0300) local ACUTE = u(0x0301) local CIRC = u(0x0302) local TILDE = u(0x0303) local MACRON = u(0x0304) local BREVE = u(0x0306) local DOTABOVE = u(0x0307) local DIAER = u(0x0308) local CARON = u(0x030C) local DGRAVE = u(0x030F) local INVBREVE = u(0x0311) local DOTBELOW = u(0x0323) local RINGBELOW = u(0x0325) local CEDILLA = u(0x0327) -- Puncuation to be used for standardChars field local PUNCTUATION = ' \!\#\$\%\&\*\+\,\-\.\/\:\;\<\=\>\?\@\^\_\`\|\~\'\(\)' local m = {} m["aa"] = { canonicalName = "Afarice", otherNames = {"Afarica", "Afaricae", "Afarici", "Afaricum", "Afarica", "Afaricae", "Qafaraf", "Afaricus", "Afarica", "Afaricarum", "Qafar"}, scripts = {"Latn"}, family = "cus", } m["ab"] = { canonicalName = "Abasce", otherNames = {"Abasca", "Abascae", "Abasci", "Abascum", "Abasca", "Abascae", "аҧсшәа", "Abascus", "Abasca", "Abascarum", "Abkhazian", "Abxazo"}, scripts = {"Cyrl", "Geor", "Latn"}, family = "cau-abz", translit_module = "ab-translit", entry_name = { from = {GRAVE, ACUTE}, to = {}} , } m["ae"] = { canonicalName = "Avestane", otherNames = {"Avestana", "Avestanae", "Avestani", "Avestanum", "Avestana", "Avestanae", "zend", "Avestanus", "Avestana", "Avestanarum", "Old Bactrian"}, scripts = {"Avst", "Gujr"}, family = "ira-eas", translit_module = "Avst-translit", } m["af"] = { canonicalName = "Africanice", otherNames = {"Africanica", "Africanicae", "Africanici", "Africanicum", "Africanica", "Africanicae", "Afrikaans", "Africanicus", "Africanica", "Africanicarum"}, scripts = {"Latn", "Arab"}, family = "gmw", ancestors = {"nl"}, sort_key = { from = {"[äáâà]", "[ëéêè]", "[ïíîì]", "[öóôò]", "[üúûù]", "[ÿýŷỳ]", "^-", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" }} , } m["ak"] = { canonicalName = "Akan", otherNames = {"Twi-Fante", "Twi", "Fante", "Fanti", "Asante", "Akuapem"}, scripts = {"Latn"}, family = "alv-kwa", } m["am"] = { canonicalName = "Aethiopice", otherNames = {"Aethiopica", "Aethiopicae", "Aethiopici", "Aethiopicum", "Aethiopica", "Aethiopicae", "አማርኛ", "Aethiopicus", "Aethiopica", "Aethiopicarum", "Amharica"}, scripts = {"Ethi"}, family = "sem-eth", translit_module = "Ethi-translit", } m["an"] = { canonicalName = "Aragonice", otherNames = {"Aragonica", "Aragonicae", "Aragonici", "Aragonicum", "Aragonica", "Aragonicae", "aragonés", "Aragonicus", "Aragonica", "Aragonicarum", "Aragonensis"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oan"}, } m["ar"] = { canonicalName = "Arabice", otherNames = {"Arabica", "Arabicae", "Arabici", "Arabicum", "Arabica", "Arabicae", "العربية", "Arabicus", "Arabica", "Arabicarum", "Modern Standard Arabic", "Standard Arabic", "Literary Arabic", "Classical Arabic"}, scripts = {"Arab"}, family = "sem-arb", entry_name = { from = {u(0x0671), u(0x064B), u(0x064C), u(0x064D), u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652), u(0x0670), u(0x0640)}, to = {u(0x0627)}}, translit_module = "ar-translit", } m["as"] = { canonicalName = "Assamice", otherNames = {"Assamica", "Assamicae", "Assamici", "Assamicum", "Assamica", "Assamicae", "অসমীয়া", "Assamicus", "Assamica", "Assamicarum", "Asamiya"}, scripts = {"Beng"}, family = "inc", ancestors = {"pka"}, } m["av"] = { canonicalName = "Avar", otherNames = {"Avaric"}, scripts = {"Cyrl"}, family = "cau-nec", ancestors = {"oav"}, translit_module = "av-translit", } m["ay"] = { canonicalName = "Aymare", otherNames = {"Southern Aymara", "Central Aymara"}, scripts = {"Latn"}, family = "sai-aym", } m["az"] = { canonicalName = "Atropatenice", otherNames = {"Atropatenica", "Atropatenicae", "Atropatenici", "Atropatenicum", "Atropatenica", "Atropatenicae", "Azərbaycan dili", "Atropatenicus", "Atropatenica", "Atropatenicarum", "Azerbaijani", "Azari", "Azeri Turkic", "Azerbaijani Turkic", "North Azerbaijani", "South Azerbaijani"}, scripts = {"Latn", "Cyrl", "fa-Arab"}, family = "trk-ogz", } m["ba"] = { canonicalName = "Baschkirice", otherNames = {"Baschkirica", "Baschkiricae", "Baschkirici", "Baschkiricum", "Baschkirica", "Baschkiricae", "башҡортса", "Baschkiricus", "Baschkirica", "Baschkiricarum", "Bashkir"}, scripts = {"Cyrl"}, family = "trk-kip", translit_module = "ba-translit", } m["be"] = { canonicalName = "Albaruthenice", otherNames = {"Albaruthenica", "Albaruthenicae", "Albaruthenici", "Albaruthenicum", "Albaruthenica", "Albaruthenicae", "беларуская мова", "Albaruthenicus", "Albaruthenica", "Albaruthenicarum", "Belorussian", "Belarusan", "Bielorussian", "Byelorussian", "Belarussian", "White Russian"}, scripts = {"Cyrl"}, family = "zle", translit_module = "be-translit", sort_key = { from = {"Ё", "ё"}, to = {"Е" , "е"}}, entry_name = { from = {"Ѐ", "ѐ", GRAVE, ACUTE}, to = {"Е", "е"}}, } m["bg"] = { canonicalName = "Bulgarice", otherNames = {"Bulgarica", "Bulgaricae", "Bulgarici", "Bulgaricum", "Bulgarica", "Bulgaricae", "български език", "Bulgaricus", "Bulgarica", "Bulgaricarum" }, scripts = {"Cyrl"}, family = "zls", translit_module = "bg-translit", entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["bh"] = { canonicalName = "Bihari", scripts = {"Deva"}, family = "inc", ancestors = {"pka"}, } m["bi"] = { canonicalName = "Bislama", scripts = {"Latn"}, family = "crp", ancestors = {"en"}, } m["bm"] = { canonicalName = "Bambara", otherNames = {"Bamanankan"}, scripts = {"Latn"}, family = "dmn", } m["bn"] = { "Bengale", "Q9610", "inc-eas", canonicalName = "Bengale", otherNames = {"Bengala", "Bengalae", "Bengali", "Bengalum", "Bengala", "Bengalae", "বাংলা", "Bengalus", "Bengala", "Bengalarum", "Bangla", "Bengali"}, scripts = {"Beng", "Newa"}, ancestors = {"inc-mbn"}, translit_module = "bn-translit", } m["bo"] = { canonicalName = "Tibetane", otherNames = {"Tibetana", "Tibetanae", "Tibetani", "Tibetanum", "Tibetana", "Tibetanae", "བོད་སྐད།", "Tibetanus", "Tibetana", "Tibetanarum"}, scripts = {"Tibt"}, family = "tbq", ancestors = {"xct"}, translit_module = "bo-translit", } m["br"] = { canonicalName = "Britonice", otherNames = {"Britonica", "Britonicae", "Britonici", "Britonicum", "Britonica", "Britonicae", "brezhoneg", "Britonicus", "Britonica", "Britonicarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"xbm"}, } m["bs"] = { canonicalName = "Bosnice", otherNames = {"Bosnica", "Bosnicae", "Bosnici", "Bosnicum", "Bosnica", "Bosnicae", "Bosnian", "bosanski jezik", "Bosnicus", "Bosnica", "Bosnicarum"}, scripts = {"Latn"}, family = "zlw", } m["ca"] = { canonicalName = "Catalane", otherNames = {"Catalana", "Catalanae", "Catalani", "Catalanum", "Catalana", "Catalanae", "català", "Catalanus", "Catalana", "Catalanarum", "Valencian"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oca"}, sort_key = { from = {"à", "[èé]", "[íï]", "[òó]", "[úü]", "ç", "l·l"}, to = {"a", "e" , "i" , "o" , "u" , "c", "ll" }} , } m["ce"] = { canonicalName = "Chechen", scripts = {"Cyrl"}, family = "cau-nkh", translit_module = "ce-translit", entry_name = { from = {MACRON}, to = {}}, } m["ch"] = { canonicalName = "Chamorre", otherNames = {"Chamoru"}, scripts = {"Latn"}, family = "poz-sus", } m["co"] = { canonicalName = "Corse", otherNames = {"Corsa", "Corsae", "Corsi", "Corsum", "Corsa", "Corsae", "corsu", "Corsus", "Corsa", "Corsarum", "Corsican"}, scripts = {"Latn"}, family = "roa", } m["cr"] = { canonicalName = "Cree", scripts = {"Cans", "Latn"}, family = "alg", translit_module = "cr-translit", } m["cs"] = { canonicalName = "Bohemice", otherNames = {"Bohemica", "Bohemicae", "Bohemici", "Bohemicum", "Bohemica", "Bohemicae", "čeština", "Bohemicus", "Bohemica", "Bohemicarum"}, scripts = {"Latn"}, family = "zlw", ancestors = {"zlw-ocs"}, sort_key = { from = {"á", "é", "í", "ó", "[úů]", "ý"}, to = {"a", "e", "i", "o", "u" , "y"}} , } m["cu"] = { "Slavica Antiqua", "Q35499", "zls", otherNames = {"Slavica Antiqua", "Slavicae Antiquae", "Slavici Antiqui", "Slavicum Antiquum", "Slavica Antiqua", "Slavicae Antiquae", "словѣньскъ ѩзыкъ", "Slavicus Antiquus", "Slavica Antiqua", "Slavicarum Antiquarum", "Old Church Slavic", "Old Church Slavonic"}, scripts = {"Cyrs", "Glag"}, translit_module = "Cyrs-Glag-translit", entry_name = { from = {u(0x0484)}, -- kamora to = {}}, sort_key = { from = {"оу", "є"}, to = {"у" , "е"}} , } m["cv"] = { canonicalName = "Chuvash", scripts = {"Cyrl"}, family = "trk-ogr", translit_module = "cv-translit", } m["cy"] = { canonicalName = "Cambrice", otherNames = {"Cambrica", "Cambricae", "Cambrici", "Cambricum", "Cambrica", "Cambricae", "Cymraeg", "Cambricus", "Cambrica", "Cambricarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"wlm"}, sort_key = { from = {"[âáàä]", "[êéèë]", "[îíìï]", "[ôóòö]", "[ûúùü]", "[ŵẃẁẅ]", "[ŷýỳÿ]", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "w" , "y" }} , } m["da"] = { canonicalName = "Danice", otherNames = {"Danica", "Danicae", "Danici", "Danicum", "Danica", "Danicae", "dansk", "Danicus", "Danica", "Danicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-oda"}, } m["de"] = { "Germanice", "Q188", "gmw", otherNames = {"Germanica", "Germanicae", "Germanici", "Germanicum", "Germanica", "Germanicae", "Deutsch", "Germanicus", "Germanica", "Germanicarum", "High German", "New High German", "Deutsch"}, -- the last name is indeed also used in English scripts = {"Latn", "Latf"}, ancestors = {"gmh"}, sort_key = { from = {"[äàáâå]", "[ëèéê]", "[ïìíî]", "[öòóô]", "[üùúû]", "ß" }, to = {"a" , "e" , "i" , "o" , "u" , "ss"}} , } m["dv"] = { canonicalName = "Dhivehi", otherNames = {"Divehi", "Mahal", "Mahl", "Maldivian"}, scripts = {"Thaa"}, family = "inc", ancestors = {"pmh"}, translit_module = "dv-translit", } m["dz"] = { canonicalName = "Dzongkha", scripts = {"Tibt"}, family = "tbq", ancestors = {"xct"}, translit_module = "bo-translit", } m["ee"] = { canonicalName = "Ewe", scripts = {"Latn"}, family = "alv", } m["el"] = { canonicalName = "Neograece", otherNames = {"Neograeca", "Neograecae", "Neograeci", "Neograecum", "Neograeca", "Neograecae", "Νέα Ελληνικά", "Neograecus", "Neograeca", "Neograecarum", "Modern Greek", "Neo-Hellenic"}, scripts = {"Grek"}, family = "grk", ancestors = {"grc"}, translit_module = "el-translit", sort_key = { -- Keep this synchronized with grc, cpg, pnt from = {"[ᾳάᾴὰᾲᾶᾷἀᾀἄᾄἂᾂἆᾆἁᾁἅᾅἃᾃἇᾇ]", "[έὲἐἔἒἑἕἓ]", "[ῃήῄὴῂῆῇἠᾐἤᾔἢᾒἦᾖἡᾑἥᾕἣᾓἧᾗ]", "[ίὶῖἰἴἲἶἱἵἳἷϊΐῒῗ]", "[όὸὀὄὂὁὅὃ]", "[ύὺῦὐὔὒὖὑὕὓὗϋΰῢῧ]", "[ῳώῴὼῲῶῷὠᾠὤᾤὢᾢὦᾦὡᾡὥᾥὣᾣὧᾧ]", "ῥ", "ς"}, to = {"α" , "ε" , "η" , "ι" , "ο" , "υ" , "ω" , "ρ", "σ"}} , } m["en"] = { canonicalName = "Anglice", otherNames = {"Anglica", "Anglicae", "Anglici", "Anglicum", "Anglica", "Anglicae", "English", "Anglicus", "Anglica", "Anglicarum", "Modern English", "New English", "Hawaiian Creole English", "Hawai'ian Creole English", "Hawaiian Creole", "Hawai'ian Creole", "Polari", "Yinglish"}, -- all but the first three are names and alt names of subsumed dialects which once had ISO codes scripts = {"Latn", "Shaw", "Dsrt"}, -- last two are rare but probably attested; entries in them might require community approval, but it's good for the script codes not to be orphans family = "gmw", ancestors = {"enm"}, sort_key = { from = {"[äàáâåā]", "[ëèéêē]", "[ïìíîī]", "[öòóôō]", "[üùúûū]", "æ" , "œ" , "[çč]", "ñ", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "ae", "oe", "c" , "n"}}, wikimedia_codes = {"en", "simple"}, standardChars = "A-Za-z0-9" .. PUNCTUATION .. u(0x2800) .. "-" .. u(0x28FF) } m["eo"] = { canonicalName = "Esperantice", otherNames = {"Esperantica", "Esperanticae", "Esperantici", "Esperanticum", "Esperantica", "Esperanticae", "Esperanto", "Esperanticus", "Esperantica", "Esperanticarum"}, scripts = {"Latn"}, family = "art", sort_key = { from = {"[áà]", "[éè]", "[íì]", "[óò]", "[úù]", "[ĉ]", "[ĝ]", "[ĥ]", "[ĵ]", "[ŝ]", "[ŭ]"}, to = {"a" , "e" , "i" , "o" , "u", "cĉ", "gĉ", "hĉ", "jĉ", "sĉ", "uĉ"}} , } m["es"] = { canonicalName = "Hispanice", otherNames = {"Hispanica", "Hispanicae", "Hispanici", "Hispanicum", "Hispanica", "Hispanicae", "español", "Hispanicus", "Hispanica", "Hispanicarum", "Castilian"}, scripts = {"Latn"}, family = "roa", ancestors = {"osp"}, sort_key = { from = {"á", "é", "í", "ó", "[úü]", "ç", "ñ"}, to = {"a", "e", "i", "o", "u" , "c", "n"}}, standardChars = "A-VXYZa-vxyz0-9ÁáÉéÍíÓóÚúÑñ¿¡" .. PUNCTUATION } m["et"] = { canonicalName = "Estonice", otherNames = {"Estonica", "Estonicae", "Estonici", "Estonicum", "Estonica", "Estonicae", "eesti keel", "Estonicus", "Estonica", "Estonicarum"}, scripts = {"Latn"}, family = "fiu-fin", } m["eu"] = { canonicalName = "Vasconice", otherNames = {"Vasconica", "Vasconicae", "Vasconici", "Vasconicum", "Vasconica", "Vasconicae", "Euskara", "Vasconicus", "Vasconica", "Vasconicarum"}, scripts = {"Latn"}, family = "euq", } m["fa"] = { canonicalName = "Persice", otherNames = {"Persica", "Persicae", "Persici", "Persicum", "Persica", "Persicae", "فارسی", "Persicus", "Persica", "Persicarum", "Farsi", "New Persian", "Modern Persian", "Western Persian", "Iranian Persian", "Eastern Persian", "Dari", "Aimaq", "Aimak", "Aymaq", "Eimak"}, scripts = {"fa-Arab"}, family = "ira-wes", ancestors = {"pal"}, entry_name = { from = {u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652)}, to = {}} , } m["ff"] = { canonicalName = "Fula", otherNames = {"Adamawa Fulfulde", "Bagirmi Fulfulde", "Borgu Fulfulde", "Central-Eastern Niger Fulfulde", "Fulani", "Fulfulde", "Maasina Fulfulde", "Nigerian Fulfulde", "Pular", "Pulaar", "Western Niger Fulfulde"}, -- Maasina, etc are dialects, subsumed into this code scripts = {"Latn"}, family = "alv-sng", } m["fi"] = { canonicalName = "Finnice", otherNames = {"Finnica", "Finnicae", "Finnici", "Finnicum", "Finnica", "Finnicae", "suomi", "Finnicus", "Finnica", "Finnicarum"}, scripts = {"Latn"}, family = "fiu-fin", entry_name = { from = {"ˣ"}, -- Used to indicate gemination of the next consonant to = {}}, sort_key = { from = {"[áàâã]", "[éèêẽ]", "[íìîĩ]", "[óòôõ]", "[úùûũ]", "[ýỳŷüű]", "[øõő]", "æ" , "œ" , "[čç]", "š", "ž", "ß" , "[':]"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "ö" , "ae", "oe", "c" , "s", "z", "ss"}} , } m["fj"] = { canonicalName = "Fidziane", otherNames = {"Fidziana", "Fidzianae", "Fidziani", "Fidzianum", "Fidziana", "Fidzianae", "?", "Fidzianus", "Fidziana", "Fidzianarum"}, scripts = {"Latn"}, family = "poz-occ", } m["fo"] = { canonicalName = "Faeroice", otherNames = {"Faeroica", "Faeroicae", "Faeroici", "Faeroicum", "Faeroica", "Faeroicae", "føroyskt", "Faeroicus", "Faeroica", "Faeroicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"non"}, } m["fr"] = { canonicalName = "Francogallice", otherNames = {"Francogallica", "Francogallicae", "Francogallici", "Francogallicum", "Francogallica", "Francogallicae", "français", "Francogallicus", "Francogallica", "Francogallicarum", "Modern French"}, scripts = {"Latn"}, family = "roa", ancestors = {"frm"}, sort_key = { from = {"[áàâä]", "[éèêë]", "[íìîï]", "[óòôö]", "[úùûü]", "[ýỳŷÿ]", "ç", "æ" , "œ" , "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "c", "ae", "oe"}}, standardChars = "A-Za-z0-9ÀÂÇÉÈÊËÎÏÔŒÛÙÜàâçéèêëîïôœûùü" .. PUNCTUATION } m["fy"] = { canonicalName = "Frisice", otherNames = {"Frisica", "Frisicae", "Frisici", "Frisicum", "Frisica", "Frisicae", "?", "Frisicus", "Frisica", "Frisicarum", "Western Frisian", "Frisian", "Frysk"}, scripts = {"Latn"}, family = "gmw-fri", ancestors = {"ofs"}, } m["ga"] = { canonicalName = "Hibernice", otherNames = {"Hibernica", "Hibernicae", "Hibernici", "Hibernicum", "Hibernica", "Hibernicae", "Gaeilge", "Hibernicus", "Hibernica", "Hibernicarum", "Irish Gaelic"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"á", "é", "í", "ó", "ú", "ý", "ḃ" , "ċ" , "ḋ" , "ḟ" , "ġ" , "ṁ" , "ṗ" , "ṡ" , "ṫ" }, to = {"a", "e", "i", "o", "u", "y", "bh", "ch", "dh", "fh", "gh", "mh", "ph", "sh", "th"}} , } m["gd"] = { canonicalName = "Gaelice", otherNames = {"Gaelica", "Gaelicae", "Gaelici", "Gaelicum", "Gaelica", "Gaelicae", "Gàidhlig", "Gaelicus", "Gaelica", "Gaelicarum", "Highland Gaelic", "Scots Gaelic", "Scottish"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"[áà]", "[éè]", "[íì]", "[óò]", "[úù]", "[ýỳ]"}, to = {"a" , "e" , "i" , "o" , "u" , "y" }} , } m["gl"] = { canonicalName = "Gallaice", otherNames = {"Gallaica", "Gallaicae", "Gallaici", "Gallaicum", "Gallaica", "Gallaicae", "galego", "Gallaicus", "Gallaica", "Gallaicarum", "Galician"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-opt"}, sort_key = { from = {"á", "é", "í", "ó", "ú"}, to = {"a", "e", "i", "o", "u"}} , } m["gn"] = { canonicalName = "Guaraní", scripts = {"Latn"}, family = "tup", } m["gu"] = { canonicalName = "Gujarati", scripts = {"Gujr"}, family = "inc", ancestors = {"inc-ogu"}, translit_module = "gu-translit", } m["gv"] = { canonicalName = "Manx", otherNames = {"Manx Gaelic"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"ç", "-"}, to = {"c"}} , } m["ha"] = { canonicalName = "Hausa", scripts = {"Latn", "Arab"}, family = "cdc-wst", } m["he"] = { canonicalName = "Hebraice", otherNames = {"Hebraica", "Hebraicae", "Hebraici", "Hebraicum", "Hebraica", "Hebraicae", "עִבְרִית", "Hebraicus", "Hebraica", "Hebraicarum", "Ivrit"}, scripts = {"Hebr", "Phnx"}, family = "sem-can", entry_name = { from = {"[" .. u(0x0591) .. "-" .. u(0x05BD) .. u(0x05BF) .. "-" .. u(0x05C5) .. u(0x05C7) .. "]"}, to = {}} , } m["hi"] = { canonicalName = "Hindice", otherNames = {"Hindica", "Hindicae", "Hindici", "Hindicum", "Hindica", "Hindicae", "हिन्दी", "Hindicus", "Hindica", "Hindicarum", "hindī"}, scripts = {"Deva"}, family = "inc", ancestors = {"inc-ohi"}, translit_module = "hi-translit", } m["ho"] = { canonicalName = "Hiri Motu", otherNames = {"Pidgin Motu", "Police Motu"}, scripts = {"Latn"}, family = "crp", ancestors = {"meu"}, } m["hr"] = { canonicalName = "Croate", otherNames = {"Croata", "Croatae", "Croati", "Croatum", "Croata", "Croatae", "hrvatski", "Croatus", "Croata", "Croatarum", "Croatian"}, scripts = {"Latn"}, family = "zlw", } m["ht"] = { canonicalName = "Haitiane", otherNames = {"Haitiana", "Haitianae", "Haitiani", "Haitianum", "Haitiana", "Haitianae", "kreyòl", "Haitianus", "Haitiana", "Haitianarum", "Creole", "Haitian"}, scripts = {"Latn"}, family = "crp", } m["hu"] = { canonicalName = "Hungarice", otherNames = {"Hungarica", "Hungaricae", "Hungarici", "Hungaricum", "Hungarica", "Hungaricae", "magyar", "Hungaricus", "Hungarica", "Hungaricarum"}, scripts = {"Latn"}, family = "fiu-ugr", ancestors = {"ohu"}, sort_key = { from = {"á", "é", "í", "ó", "ú", "ő", "ű"}, to = {"a", "e", "i", "o", "u", "ö", "ü"}} , } m["hy"] = { canonicalName = "Armenie", otherNames = {"Armenia", "Armeniae", "Armenii", "Armenium", "Armenia", "Armeniae", "Հայերէն", "Armenius", "Armenia", "Armeniarum", "Modern Armenian", "Eastern Armenian", "Western Armenian"}, scripts = {"Armn"}, family = "hyx", ancestors = {"axm"}, translit_module = "Armn-translit", sort_key = { from = {"ու", "և", "եւ"}, to = {"ւ", "եվ", "եվ"}}, entry_name = { from = {"՞", "՜", "՛", "՟", "և", "<sup>յ</sup>", "<sup>ի</sup>"}, to = {"", "", "", "", "եւ", "յ", "ի"}} , } m["hz"] = { canonicalName = "Herero", scripts = {"Latn"}, family = "bnt", } m["ia"] = { canonicalName = "Interlingua", otherNames = {"Interlingua"}, scripts = {"Latn"}, family = "art", } m["id"] = { canonicalName = "Indonesie", otherNames = {"Indonesia", "Indonesiae", "Indonesii", "Indonesium", "Indonesia", "Indonesiae", "Bahasa Indonesia", "Indonesius", "Indonesia", "Indonesiarum"}, scripts = {"Latn"}, family = "poz-mly", ancestors = {"ms"}, } m["ie"] = { canonicalName = "Interlingue", otherNames = {"Occidental"}, scripts = {"Latn"}, family = "art", } m["ig"] = { canonicalName = "Igbo", scripts = {"Latn"}, family = "nic-bco", } m["ii"] = { canonicalName = "Sichuan Yi", otherNames = {"Nuosu", "Nosu", "Northern Yi", "Liangshan Yi"}, scripts = {"Yiii"}, family = "tbq-lol", } m["ik"] = { canonicalName = "Inupiak", otherNames = {"Inupiaq", "Iñupiaq", "Inupiatun"}, scripts = {"Latn"}, family = "esx-inu", } m["io"] = { canonicalName = "Ido", scripts = {"Latn"}, family = "art", } m["is"] = { canonicalName = "Islandice", otherNames = {"Islandica", "Islandicae", "Islandici", "Islandicum", "Islandica", "Islandicae", "íslenska", "Islandica", "Islandica", "Islandicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"non"}, } m["it"] = { canonicalName = "Italice", otherNames = {"Italica", "Italicae", "Italici", "Italicum", "Italica", "Italicae", "italiano", "Italicus", "Italica", "Italicarum", "Italiana"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oit"}, sort_key = { from = {"[àáâäå]", "[èéêë]", "[ìíîï]", "[òóôö]", "[ùúûü]"}, to = {"a" , "e" , "i" , "o" , "u" }} , } m["iu"] = { canonicalName = "Inuktitut", otherNames = {"Eastern Canadian Inuktitut", "Eastern Canadian Inuit", "Western Canadian Inuktitut", "Western Canadian Inuit", "Western Canadian Inuktun", "Inuinnaq", "Inuinnaqtun", "Inuvialuk", "Inuvialuktun", "Nunavimmiutit", "Nunatsiavummiut", "Aivilimmiut", "Natsilingmiut", "Kivallirmiut", "Siglit", "Siglitun"}, scripts = {"Cans", "Latn"}, family = "esx-inu", translit_module = "iu-translit", } m["ja"] = { canonicalName = "Iaponice", otherNames = {"Iaponica", "Iaponicae", "Iaponici", "Iaponicum", "Iaponica", "Iaponicae", "日本語", "Iaponicus", "Iaponica", "Iaponicarum", "Nihongo", "Modern Japanese", "Nipponese"}, scripts = {"Jpan", "Latn", "Hira"}, family = "jpx", ancestors = {"ojp"}, } m["jv"] = { canonicalName = "Iavense", otherNames = {"Iavensis", "Iavenses", "Iavenses", "Iavense", "Iavensia", "Iavensis", "basa Jawa", "Iavensis", "Iavensi", "Iavensium"}, scripts = {"Latn", "Java"}, family = "poz-sus", translit_module = "jv-translit", ancestors = {"kaw"}, link_tr = true, } m["ka"] = { canonicalName = "Georgiane", otherNames = {"Georgiana", "Georgianae", "Georgiani", "Georgianum", "Georgiana", "Georgianae", "ქართული", "Georgianus", "Georgiana", "Georgianarum", "Kartvelian"}, scripts = {"Geor", "Geok"}, family = "ccs-gzn", ancestors = {"oge"}, translit_module = "Geor-translit", entry_name = { from = {"̂"}, to = {""}}, } m["kg"] = { canonicalName = "Kongo", otherNames = {"Kikongo", "Koongo", "Laari", "San Salvador Kongo", "Yombe"}, scripts = {"Latn"}, family = "bnt", } m["ki"] = { canonicalName = "Kikuyu", otherNames = {"Gikuyu", "Gĩkũyũ"}, scripts = {"Latn"}, family = "bnt", } m["kj"] = { canonicalName = "Kwanyama", otherNames = {"Kuanyama", "Oshikwanyama"}, scripts = {"Latn"}, family = "bnt", } m["kk"] = { "Kazachice", "Q9252", "trk-kno", otherNames = {"Kazachica", "Kazachicae", "Kazachici", "Kazachicum", "Kazachica", "Kazachicae", "Қазақ тілі", "Kazachicus", "Kazachica", "Kazachicarum"}, scripts = {"Cyrl", "Latn", "kk-Arab"}, translit_module = "kk-translit", override_translit = true, } m["kl"] = { canonicalName = "Groenlandice", otherNames = {"Groenlandica", "Groenlandicae", "Groenlandici", "Groenlandicum", "Groenlandica", "Groenlandicae", "Kalaallisut", "Groenlandicus", "Groenlandica", "Groenlandicarum"}, scripts = {"Latn"}, family = "esx-inu", } m["km"] = { canonicalName = "Khmer", otherNames = {"Cambodian"}, scripts = {"Khmr"}, family = "mkh", ancestors = {"mkh-mkm"}, translit_module = "km-translit", } m["kn"] = { canonicalName = "Kannada", scripts = {"Knda"}, family = "dra", translit_module = "kn-translit", } m["ko"] = { canonicalName = "Coreane", otherNames = {"Coreana", "Coreanae", "Coreani", "Coreanum", "Coreana", "Coreanae", "한국어", "Coreanus", "Coreana", "Coreanarum", "Modern Korean"}, scripts = {"Kore"}, family = "qfa-kor", ancestors = {"okm"}, translit_module = "ko-translit", } m["kr"] = { canonicalName = "Kanuri", otherNames = {"Kanembu", "Bilma Kanuri", "Central Kanuri", "Manga Kanuri", "Tumari Kanuri"}, scripts = {"Latn"}, family = "ssa", } m["ks"] = { canonicalName = "Caspirice", otherNames = {"Caspirica", "Caspiricae", "Caspirici", "Caspiricum", "Caspirica", "Caspiricae", "कॉशुर / کٲشُر", "Caspiricus", "Caspirica", "Caspiricarum", "Kashmiri"}, scripts = {"ks-Arab", "Deva"}, family = "inc-dar", } m["ku"] = { canonicalName = "Corduene", otherNames = {"Corduena", "Corduenae", "Cordueni", "Corduenum", "Corduena", "Corduenae", "kurdî", "Corduenus", "Corduena", "Corduenarum"}, scripts = {"Latn", "ku-Arab", "Armn", "Cyrl"}, family = "ira-wes", } m["kv"] = { "Komiense", "Q34114", "urj-prm", otherNames = {"Komiensis", "Komienses", "Komienses", "Komiense", "Komiensia", "Komiensis", "Коми кыв", "Komiensis", "Komiensi", "Komiensium", "Komi", "Komi-Zyryan"}, scripts = Cyrl, translit_module = "kv-translit", override_translit = true, } m["kw"] = { canonicalName = "Cornubice", otherNames = {"Cornubica", "Cornubicae", "Cornubici", "Cornubicum", "Cornubica", "Cornubicae", "Kernowek", "Cornubicus", "Cornubica", "Cornubicarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"cnx"}, } m["ky"] = { canonicalName = "Kyrgesse", otherNames = {"Kyrgessa", "Kyrgessae", "Kyrgessi", "Kyrgessum", "Kyrgessa", "Kyrgessae", "кыргызча", "Kyrgessus", "Kyrgessa", "Kyrgessarum", "Chirgisica", "Kirghiz", "Kirgiz"}, scripts = {"Cyrl", "Latn", "Arab"}, family = "trk-kip", translit_module = "ky-translit", } m["la"] = { canonicalName = "Latine", otherNames = {"Latina", "Latinae", "Latini", "Latinum", "Latina", "Latinae", "Latine", "Latinus", "Latina", "Latinarum"}, scripts = {"Latn"}, family = "itc", ancestors = {"itc-ola"}, entry_name = { from = {"[ĀĂ]", "[āă]", "[ĒĔ]", "[ēĕë]", "[ĪĬÏ]", "[īĭï]", "[ŌŎ]", "[ōŏ]", "[ŪŬÜ]", "[ūŭü]", "Ȳ", "ȳ", MACRON, BREVE, DIAER}, to = {"A", "a", "E", "e", "I", "i", "O", "o", "U", "u", "Y", "y"}}, } m["lb"] = { canonicalName = "Luxemburgice", otherNames = {"Luxemburgica", "Luxemburgicae", "Luxemburgici", "Luxemburgicum", "Luxemburgica", "Luxemburgicae", "Lëtzebuergesch", "Luxemburgicus", "Luxemburgica", "Luxemburgicarum"}, scripts = {"Latn"}, family = "gmw", ancestors = {"gmh"}, } m["lg"] = { canonicalName = "Luganda", otherNames = {"Ganda"}, scripts = {"Latn"}, family = "bnt", } m["li"] = { canonicalName = "Limburgice", otherNames = {"Limburgica", "Limburgicae", "Limburgici", "Limburgicum", "Limburgica", "Limburgicae", "Limburgs", "Limburgicus", "Limburgica", "Limburgicarum", "Limburgan", "Limburgian", "Limburgic"}, scripts = {"Latn"}, family = "gmw", ancestors = {"dum"}, } m["ln"] = { canonicalName = "Lingala", scripts = {"Latn"}, family = "bnt", } m["lo"] = { canonicalName = "Lao", otherNames = {"Laotian"}, scripts = {"Laoo"}, family = "tai-swe", translit_module = "lo-translit", } m["lt"] = { canonicalName = "Lithuanice", otherNames = {"Lithuanica", "Lithuanicae", "Lithuanici", "Lithuanicum", "Lithuanica", "Lithuanicae", "lietuvių", "Lithuanicus", "Lithuanica", "Lithuanicarum"}, scripts = {"Latn"}, family = "bat", ancestors = {"olt"}, entry_name = { from = {"[áãà]", "[ÁÃÀ]", "[éẽè]", "[ÉẼÈ]", "[íĩì]", "[ÍĨÌ]", "[ýỹ]", "[ÝỸ]", "ñ", "[óõò]", "[ÓÕÒ]", "[úũù]", "[ÚŨÙ]", ACUTE, GRAVE, TILDE}, to = {"a", "A", "e", "E", "i", "I", "y", "Y", "n", "o", "O", "u", "U"}} , } m["lu"] = { canonicalName = "Luba-Katanga", scripts = {"Latn"}, family = "bnt", } m["lv"] = { canonicalName = "Lettice", otherNames = {"Lettica", "Letticae", "Lettici", "Letticum", "Lettica", "Letticae", "latviešu", "Letticus", "Lettica", "Letticarum", "Lettonica", "Lettish", "Lett"}, scripts = {"Latn"}, family = "bat", } -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf m["ma"] = { canonicalName = "Magare", otherNames = {"Magara", "Magarae", "Magari", "Magarum", "Magara", "Magarae", "léngua magara", "Magarus", "Magara", "Magararum", "Magarian"}, scripts = {"Latn"}, family = "roa-itd", } m["mg"] = { canonicalName = "Madagascariense", otherNames = {"Madagascariensis", "Madagascarienses", "Madagascarienses", "Madagascariense", "Madagascariensia", "Madagascariensis", "malagasy", "Madagascariensis", "Madagascariensi", "Madagascariensium", "Betsimisaraka Malagasy", "Betsimisaraka", "Northern Betsimisaraka Malagasy", "Northern Betsimisaraka", "Southern Betsimisaraka Malagasy", "Southern Betsimisaraka", "Bara Malagasy", "Bara", "Masikoro Malagasy", "Masikoro", "Antankarana", "Antankarana Malagasy", "Plateau Malagasy", "Sakalava", "Tandroy Malagasy", "Tandroy", "Tanosy", "Tanosy Malagasy", "Tesaka", "Tsimihety", "Tsimihety Malagasy"}, scripts = {"Latn"}, family = "poz-bre", } m["mh"] = { canonicalName = "Marshallese", scripts = {"Latn"}, family = "poz-mic", sort_key = { from = {"ā" , "ļ" , "m̧" , "ņ" , "n̄" , "o̧" , "ō" , "ū" }, to = {"a~", "l~", "m~", "n~", "n~~", "o~", "o~~", "u~"}} , } m["mi"] = { canonicalName = "Maorice", otherNames = {"Maorica", "Maoricae", "Maorici", "Maoricum", "Maorica", "Maoricae", "Māori", "Maoricus", "Maorica", "Maoricarum"}, scripts = {"Latn"}, family = "poz-pol", } m["mk"] = { canonicalName = "Macedonice", otherNames = {"Macedonica", "Macedonicae", "Macedonici", "Macedonicum", "Macedonica", "Macedonicae", "Македонски јазик", "Macedonicus", "Macedonica", "Macedonicarum" }, scripts = {"Cyrl"}, family = "zls", translit_module = "mk-translit", entry_name = { from = {ACUTE}, to = {}}, } m["ml"] = { canonicalName = "Malayalam", scripts = {"Mlym"}, family = "dra", translit_module = "ml-translit", } m["mn"] = { canonicalName = "Mogolice", otherNames = {"Mogolica", "Mogolicae", "Mogolici", "Mogolicum", "Mogolica", "Mogolicae", "Монгол хэл", "Mogolicus", "Mogolica", "Mogolicarum", "Khalkha Mongolian"}, scripts = {"Cyrl", "Mong"}, family = "xgn", ancestors = {"cmg"}, translit_module = "mn-translit", } m["mr"] = { canonicalName = "Marathi", scripts = {"Deva", "Modi"}, family = "inc", ancestors = {"omr"}, translit_module = "hi-translit", } m["ms"] = { canonicalName = "Malaice", otherNames = {"Malaica", "Malaicae", "Malaici", "Malaicum", "Malaica", "Malaicae", "Bahasa Melayu", "Malaicus", "Malaica", "Malaicarum"}, scripts = {"Latn", "Arab"}, family = "poz-mly", } m["mt"] = { canonicalName = "Melitense", otherNames = {"Melitensis", "Melitenses", "Melitenses", "Melitense", "Melitensia", "Melitensis", "Malti", "Melitensis", "Melitensi", "Melitensium"}, scripts = {"Latn"}, family = "sem-arb", ancestors = {"sqr"}, } m["my"] = { canonicalName = "Birmanice", otherNames = {"Birmanica", "Birmanicae", "Birmanici", "Birmanicum", "Birmanica", "Birmanicae", "မ္ရန္‌မာစာ", "Birmanicus", "Birmanica", "Birmanicarum", "Burmese", "Myanmar"}, scripts = {"Mymr"}, family = "tbq-brm", ancestors = {"obr"}, translit_module = "my-translit", } m["na"] = { canonicalName = "Nauruane", otherNames = {"Nauruana", "Nauruanae", "Nauruani", "Nauruanum", "Nauruana", "Nauruanae", "Nauru", "Nauruanus", "Nauruana", "Nauruanarum"}, scripts = {"Latn"}, family = "poz-mic", } m["nb"] = { canonicalName = "Dano-Norvegice", otherNames = {"Dano-Norvegica", "Dano-Norvegicae", "Dano-Norvegici", "Dano-Norvegicum", "Dano-Norvegica", "Dano-Norvegicae", "Bokmål", "Dano-Norvegicus", "Dano-Norvegica", "Dano-Norvegicarum", "Norwegian", "Norsk"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, wikimedia_codes = {"no"}, } m["nd"] = { canonicalName = "Northern Ndebele", otherNames = {"North Ndebele"}, scripts = {"Latn"}, family = "bnt-ngu", } m["ne"] = { canonicalName = "Nepalense", otherNames = {"Nepalensis", "Nepalenses", "Nepalenses", "Nepalense", "Nepalensia", "Nepalensis", "नेपाली", "Nepalensis", "Nepalensi", "Nepalensium", "Nepalese"}, scripts = {"Deva"}, family = "inc", translit_module = "ne-translit", } m["ng"] = { canonicalName = "Ndonga", scripts = {"Latn"}, family = "bnt", } m["nl"] = { canonicalName = "Batave", otherNames = {"Batava", "Batavae", "Batavi", "Batavum", "Batava", "Batavae", "Nederlands", "Batavus", "Batava", "Batavarum", "Netherlandic", "Flemish"}, scripts = {"Latn"}, family = "gmw", ancestors = {"dum"}, sort_key = { from = {"[äáâå]", "[ëéê]", "[ïíî]", "[öóô]", "[üúû]", "ç", "ñ", "^-"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "n"}} , } m["nn"] = { canonicalName = "Neonorvegice", otherNames = {"Neonorvegica", "Neonorvegicae", "Neonorvegici", "Neonorvegicum", "Neonorvegica", "Neonorvegicae", "Nynorsk", "Neonorvegicus", "Neonorvegica", "Neonorvegicarum", "New Norwegian"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, } m["no"] = { canonicalName = "Norvegice", otherNames = {"Norvegica", "Norvegicae", "Norvegici", "Norvegicum", "Norvegica", "Norvegicae", "Norsk", "Norvegicus", "Norvegica", "Norvegicarum", "Norwegian"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, } m["nr"] = { canonicalName = "Southern Ndebele", otherNames = {"South Ndebele"}, scripts = {"Latn"}, family = "bnt-ngu", } m["nv"] = { canonicalName = "Navajo", scripts = {"nv-Latn"}, family = "apa", sort_key = { from = {"[áą]", "[éę]", "[íį]", "[óǫ]", "ń", "^n([djlt])", "ł" , "[ʼ’']", ACUTE}, to = {"a" , "e" , "i" , "o" , "n", "ni%1" , "l"}}, -- the copyright sign is used to guarantee that ł will always be sorted after all other words with l } m["ny"] = { canonicalName = "Chichewa", otherNames = {"Chicheŵa", "Chinyanja", "Nyanja", "Chewa"}, scripts = {"Latn"}, family = "bnt", entry_name = { from = {ACUTE}, to = {}}, } m["oc"] = { canonicalName = "Occitane", otherNames = {"Occitana", "Occitanae", "Occitani", "Occitanum", "Occitana", "Occitanae", "occitan", "Occitanus", "Occitana", "Occitanarum", "Provençal", "Auvergnat", "Auvernhat", "Gascon", "Languedocien", "Lengadocian", "Shuadit", "Chouhadite", "Chouhadit", "Chouadite", "Chouadit", "Shuhadit", "Judeo-Provençal", "Judeo-Provencal", "Judeo-Comtadin"}, scripts = {"Latn", "Hebr"}, family = "roa", ancestors = {"pro"}, sort_key = { from = {"[àá]", "[èé]", "[íï]", "[òó]", "[úü]", "ç", "([lns])·h"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "%1h" }} , } m["oj"] = { "Ojibwayense", "Q33875", "alg", otherNames = {"Ojibwayensis", "Ojibwayenses", "Ojibwayenses", "Ojibwayense", "Ojibwayensia", "Ojibwayensis", "Anishinaabemowin / ᐊᓂᔑᓈᐯᒧᐎᓐ", "Ojibwayensis", "Ojibwayensi", "Ojibwayensium"}, aliases = {"Ojibway", "Ojibwa"}, varieties = {{"Chippewa", "Ojibwemowin", "Southwestern Ojibwa"}}, scripts = {"Cans", "Latn"}, sort_key = { from = {"aa", "ʼ", "ii", "oo", "sh", "zh"}, to = {"a~", "h~", "i~", "o~", "s~", "z~"}} , } m["om"] = { canonicalName = "Oromo", otherNames = {"Orma", "Borana-Arsi-Guji Oromo", "West Central Oromo"}, scripts = {"Latn", "Ethi"}, family = "cus", } m["or"] = { canonicalName = "Oriya", otherNames = {"Odia", "Oorya"}, scripts = {"Orya"}, family = "inc", ancestors = {"pka"}, } m["os"] = { canonicalName = "Alane", otherNames = {"Ossete", "Ossetic", "Digor", "Iron"}, scripts = {"Cyrl", "Geor", "Latn"}, family = "ira", translit_module = "os-translit", ancestors = {"oos"}, entry_name = { from = {GRAVE, ACUTE}, to = {}} , } m["pa"] = { canonicalName = "Punjabi", otherNames = {"Panjabi"}, scripts = {"Guru", "Arab", "Deva"}, family = "inc", translit_module = "pa-translit", ancestors = {"psu"}, } m["pi"] = { canonicalName = "Pali", scripts = {"Latn", "Deva", "Sinh", "Mymr", "Khmr", "Thai"}, family = "inc", ancestors = {"bh"}, sort_key = { from = {"ā", "ī", "ū", "ḍ", "ḷ", "[ṁṃ]", "[ṇñṅ]", "ṭ"}, to = {"a", "i", "u", "d", "l", "m" , "n" , "t"}} , } m["pl"] = { canonicalName = "Polonice", otherNames = {"Polonica", "Polonicae", "Polonici", "Polonicum", "Polonica", "Polonicae", "język polski", "Polonicus", "Polonica", "Polonicarum"}, scripts = {"Latn"}, family = "zlw", ancestors = {"zlw-opl"}, sort_key = { from = {"[Ąą]", "[Ćć]", "[Ęę]", "[Łł]", "[Ńń]", "[Óó]", "[Śś]", "[Żż]", "[Źź]"}, to = { "a" .. u(0x10FFFF), "c" .. u(0x10FFFF), "e" .. u(0x10FFFF), "l" .. u(0x10FFFF), "n" .. u(0x10FFFF), "o" .. u(0x10FFFF), "s" .. u(0x10FFFF), "z" .. u(0x10FFFF), "z" .. u(0x10FFFE)}} , } m["ps"] = { canonicalName = "Afganice", otherNames = {"Afganica", "Afganicae", "Afganici", "Afganicum", "Afganica", "Afganicae", "پښتو", "Afganicus", "Afganica", "Afganicarum", "Pashtun", "Pushto", "Pashtu", "Central Pashto", "Northern Pashto", "Southern Pashto", "Pukhto", "Pakhto", "Pakkhto", "Afghani"}, scripts = {"ps-Arab"}, family = "ira-eas", } m["pt"] = { canonicalName = "Lusitane", otherNames = {"Lusitana", "Lusitanae", "Lusitani", "Lusitanum", "Lusitana", "Lusitanae", "português", "Lusitanus", "Lusitana", "Lusitanarum", "Modern Portuguese"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-opt"}, sort_key = { from = {"[àãáâä]", "[èẽéêë]", "[ìĩíï]", "[òóôõö]", "[üúùũ]", "ç", "ñ"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "n"}} , } m["qu"] = { canonicalName = "Quechua", otherNames = {"Quechua", "Quechuae", "Quechui", "Quechuum", "Quechua", "Quechuae", "Runasimi", "Quechuus", "Quechua", "Quechuarum", "Qhichwa simi"}, scripts = {"Latn"}, family = "qwe", } m["rm"] = { canonicalName = "Raetice", otherNames = {"Raetica", "Raeticae", "Raetici", "Raeticum", "Raetica", "Raeticae", "rumantsch", "Raeticus", "Raetica", "Raeticarum", "Romansh", "Romanche"}, scripts = {"Latn"}, family = "roa", } m["rn"] = { canonicalName = "Kirundi", scripts = {"Latn"}, family = "bnt", } m["ro"] = { canonicalName = "Dacoromane", otherNames = {"Dacoromana", "Dacoromanae", "Dacoromani", "Dacoromanum", "Dacoromana", "Dacoromanae", "româna", "Dacoromanus", "Dacoromana", "Dacoromanarum", "Daco-Romanian", "Roumanian", "Rumanian"}, scripts = {"Latn", "Cyrl"}, family = "roa", sort_key = { from = {"ă" , "â" , "î" , "ș" , "ț" }, to = {"a~", "a~~", "i~", "s~", "t~"}}, } m["ru"] = { canonicalName = "Ruthenice", otherNames = {"Ruthenica", "Ruthenicae", "Ruthenici", "Ruthenicum", "Ruthenica", "Ruthenicae", "русский язык", "Ruthenicus", "Ruthenica", "Ruthenicarum"}, scripts = {"Cyrl"}, family = "zle", translit_module = "ru-translit", sort_key = { from = {"ё"}, to = {"е" .. mw.ustring.char(0x10FFFF)}}, entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["rw"] = { canonicalName = "Kinyarwanda", otherNames = {"Rwanda"}, scripts = {"Latn"}, family = "bnt", } m["sa"] = { canonicalName = "Sanscrite", otherNames = {"Sanscrita", "Sanscritae", "Sanscriti", "Sanscritum", "Sanscrita", "Sanscritae", "संस्कृत", "Sanscritus", "Sanscrita", "Sanscritarum"}, scripts = {"Deva", "Beng", "Brah", "Gran", "Gujr", "Guru", "Khar", "Knda", "Mlym", "Mymr", "Orya", "Shrd", "Sinh", "Taml", "Telu", "Thai", "Tibt"}, family = "inc", translit_module = "sa-translit", } -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf m["sc"] = { "Sarde", "Q33976", "roa", otherNames = {"Sarda", "Sardae", "Sardi", "Sardum", "Sarda", "Sardae", "sarda", "Sardus", "Sarda", "Sardarum", "Campidanese", "Campidanese Sardinian", "Logudorese", "Logudorese Sardinian", "Nuorese", "Nuorese Sardinian"}, scripts = {"Latn"}, } m["sd"] = { canonicalName = "Sindhi", scripts = {"sd-Arab", "Deva"}, family = "inc", } -- otherNames is used for inflected forms: 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf m["se"] = { canonicalName = "Lapponica Septentrionali", otherNames = {"Lapponica Septentrionalis", "Lapponicae Septentrionales", "Lapponici Septentrionales", "Lapponicum Septentrionale", "Lapponica Septentrionalia", "Lapponicae Septentrionalis", "Davvisámegiella", "Lapponicus Septentrionalis", "Lapponica Septentrionali", "Lapponicarum Septentrionalium", "Samica septentrionalis", "North Sami", "Northern Saami", "North Saami"}, scripts = {"Latn"}, family = "smi", entry_name = { from = {"([đflmnŋrsšŧv])'%1"}, to = {"%1%1"} }, } m["sg"] = { canonicalName = "Sango", scripts = {"Latn"}, family = "crp", } m["sh"] = { canonicalName = "Servocroate", otherNames = {"Servocroata", "Servocroatae", "Servocroati", "Servocroatum", "Servocroata", "Servocroatae", "srpskohrvatski", "Servocroatus", "Servocroata", "Servocroatarum", "BCS", "Croato-Serbian", "Serbocroatian", "Bosnian", "Croatian", "Montenegrin", "Serbian"}, scripts = {"Latn", "Cyrl"}, family = "zls", entry_name = { from = {"[ȀÀȂÁĀ]", "[ȁàȃáā]", "[ȄÈȆÉĒ]", "[ȅèȇéē]", "[ȈÌȊÍĪ]", "[ȉìȋíī]", "[ȌÒȎÓŌ]", "[ȍòȏóō]", "[ȐȒŔ]", "[ȑȓŕ]", "[ȔÙȖÚŪ]", "[ȕùȗúū]", "Ѐ", "ѐ", "[ӢЍ]", "[ӣѝ]", "[Ӯ]", "[ӯ]", GRAVE, ACUTE, DGRAVE, INVBREVE, MACRON}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "Е", "е", "И" , "и", "У", "у" }}, wikimedia_codes = {"sh", "bs", "hr", "sr"}, } m["si"] = { canonicalName = "Sinhalese", otherNames = {"Singhalese", "Sinhala"}, scripts = {"Sinh"}, family = "inc", ancestors = {"pmh"}, translit_module = "si-translit", } m["sk"] = { canonicalName = "Slovace", otherNames = {"Slovaca", "Slovacae", "Slovaci", "Slovacum", "Slovaca", "Slovacae", "slovenčina", "Slovacus", "Slovaca", "Slovacarum"}, scripts = {"Latn"}, family = "zlw", sort_key = { from = {"[áä]", "é", "í", "[óô]", "ú", "ý", "ŕ", "ĺ"}, to = {"a" , "e", "i", "o" , "u", "y", "r", "l"}} , } m["sl"] = { canonicalName = "Slovene", otherNames = {"Slovena", "Slovenae", "Sloveni", "Slovenum", "Slovena", "Slovenae", "slovenščina", "Slovenus", "Slovena", "Slovenarum", "Slovenian"}, scripts = {"Latn"}, family = "zls", entry_name = { from = {"[ÁÀÂȂȀ]", "[áàâȃȁ]", "[ÉÈÊȆȄỆẸ]", "[éèêȇȅệẹə]", "[ÍÌÎȊȈ]", "[íìîȋȉ]", "[ÓÒÔȎȌỘỌ]", "[óòôȏȍộọ]", "[ŔȒȐ]", "[ŕȓȑ]", "[ÚÙÛȖȔ]", "[úùûȗȕ]", "ł", GRAVE, ACUTE, DGRAVE, INVBREVE, CIRC, DOTBELOW}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "l"}} , } m["sm"] = { canonicalName = "Samoane", otherNames = {"Samoana", "Samoanae", "Samoani", "Samoanum", "Samoana", "Samoanae", "gagana Sāmoa", "Samoanus", "Samoana", "Samoanarum"}, scripts = {"Latn"}, family = "poz-pol", } m["sn"] = { canonicalName = "Shona", scripts = {"Latn"}, family = "bnt", } m["so"] = { canonicalName = "Somali", scripts = {"Latn", "Arab", "Osma"}, family = "cus", entry_name = { from = {"[ÁÀÂ]", "[áàâ]", "[ÉÈÊ]", "[éèê]", "[ÍÌÎ]", "[íìî]", "[ÓÒÔ]", "[óòô]", "[ÚÙÛ]", "[úùû]", "[ÝỲ]", "[ýỳ]"}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "U" , "u", "Y", "y"}} , } m["sq"] = { canonicalName = "Illyrice", otherNames = {"Illyrica", "Illyricae", "Illyrici", "Illyricum", "Illyrica", "Illyricae", "shqipja", "Illyricus", "Illyrica", "Illyricarum"}, scripts = {"Latn", "Elba"}, family = "sqj", sort_key = { from = { '[âãä]', '[ÂÃÄ]', '[êẽë]', '[ÊẼË]', 'ĩ', 'Ĩ', 'õ', 'Õ', 'ũ', 'Ũ', 'ỹ', 'Ỹ', 'ç', 'Ç' }, to = { 'a', 'A', 'e', 'E', 'i', 'I', 'o', 'O', 'u', 'U', 'y', 'Y', 'c', 'C' } } , } m["sr"] = { canonicalName = "Service", otherNames = {"Servica", "Servicae", "Servici", "Servicum", "Servica", "Servicae", "српски / srpski", "Servicus", "Servica", "Servicarum"}, scripts = {"Latn", "Cyrl"}, family = "zls", translit_module = "sr-translit", entry_name = { from = {"[ȀÀȂÁĀ]", "[ȁàȃáā]", "[ȄÈȆÉĒ]", "[ȅèȇéē]", "[ȈÌȊÍĪ]", "[ȉìȋíī]", "[ȌÒȎÓŌ]", "[ȍòȏóō]", "[ȐȒŔ]", "[ȑȓŕ]", "[ȔÙȖÚŪ]", "[ȕùȗúū]", "Ѐ", "ѐ", "[ӢЍ]", "[ӣѝ]", "[Ӯ]", "[ӯ]", GRAVE, ACUTE, DGRAVE, INVBREVE, MACRON}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "Е", "е", "И" , "и", "У", "у" }}, wikimedia_codes = {"sh", "bs", "hr", "sr"}, } m["ss"] = { canonicalName = "Swazi", otherNames = {"Swati"}, scripts = {"Latn"}, family = "bnt-ngu", } m["st"] = { canonicalName = "Sotho Meridionali", otherNames = {"Sesotho", "Southern Sesotho", "Southern Sotho"}, scripts = {"Latn"}, family = "bnt", } m["su"] = { canonicalName = "Sondaice", scripts = {"Latn", "Sund"}, family = "poz-msa", translit_module = "su-translit", } m["sv"] = { canonicalName = "Suecice", otherNames = {"Suecica", "Suecicae", "Suecici", "Suecicum", "Suecica", "Suecicae", "svenska", "Suecicus", "Suecica", "Suecicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-osw"}, } m["sw"] = { canonicalName = "Suahelice", otherNames = {"Suahelica", "Suahelicae", "Suahelici", "Suahelicum", "Suahelica", "Suahelicae", "Kiswahili", "Suahelicus", "Suahelica", "Suahelicarum", "Settler Swahili", "KiSetla", "KiSettla", "Setla", "Settla", "Kitchen Swahili", "Kihindi", "Indian Swahili", "KiShamba", "Kishamba", "Field Swahili", "Kibabu", "Asian Swahili", "Kimanga", "Arab Swahili", "Kitvita", "Army Swahili"}, scripts = {"Latn", "Arab"}, family = "bnt", sort_key = { from = {"ng'", "^-"}, to = {"ngz"}} , } m["ta"] = { canonicalName = "Tamulice", otherNames = {"Tamulica", "Tamulicae", "Tamulici", "Tamulicum", "Tamulica", "Tamulicae", "தமிழ்", "Tamulicus", "Tamulica", "Tamulicarum", "Tamil"}, scripts = {"Taml"}, family = "dra", ancestors = {"oty"}, translit_module = "ta-translit", } m["te"] = { canonicalName = "Teluguice", scripts = {"Telu"}, family = "dra", translit_module = "te-translit", } m["tg"] = { canonicalName = "Tadzikice", otherNames = {"Tadzikica", "Tadzikicae", "Tadzikici", "Tadzikicum", "Tadzikica", "Tadzikicae", "Тоҷикӣ", "Tadzikicus", "Tadzikica", "Tadzikicarum", "Tajik", "Tadjik", "Tadzhik", "Tajiki", "Tajik Persian"}, scripts = {"Cyrl", "fa-Arab", "Latn"}, family = "ira-wes", ancestors = {"fa"}, translit_module = "tg-translit", sort_key = { from = {"Ё", "ё"}, to = {"Е" , "е"}} , entry_name = { from = {ACUTE}, to = {}} , } m["th"] = { canonicalName = "Siamense", otherNames = {"Siamensis", "Siamenses", "Siamenses", "Siamense", "Siamensia", "Siamensis", "ภาษาไทย", "Siamensis", "Siamensi", "Siamensium", "Thai"}, scripts = {"Thai"}, family = "tai-swe", translit_module = "th-translit", entry_name = { from = { "-" }, to = {}} , } m["ti"] = { canonicalName = "Tigrinya", scripts = {"Ethi"}, family = "sem-eth", translit_module = "Ethi-translit", } m["tk"] = { canonicalName = "Turcomannice", otherNames = {"Turcomannica", "Turcomannicae", "Turcomannici", "Turcomannicum", "Turcomannica", "Turcomannicae", "Türkmençe", "Turcomannicus", "Turcomannica", "Turcomannicarum", "Tүркменче", "Türkmen dili", "تورکمن ﺗﻴﻠی"}, scripts = {"Latn", "Cyrl"}, family = "trk-ogz", } m["tl"] = { canonicalName = "Tagale", otherNames = {"Tagala", "Tagalae", "Tagali", "Tagalum", "Tagala", "Tagalae", "Wikang Tagalog", "Tagalus", "Tagala", "Tagalarum"}, scripts = {"Latn", "Tglg"}, family = "phi", } m["tn"] = { canonicalName = "Tswana", otherNames = {"Setswana"}, scripts = {"Latn"}, family = "bnt", } m["to"] = { canonicalName = "Tongane", otherNames = {"Tongana", "Tonganae", "Tongani", "Tonganum", "Tongana", "Tonganae", "lea fakatonga", "Tonganus", "Tongana", "Tonganarum"}, scripts = {"Latn"}, family = "poz-pol", } m["tr"] = { canonicalName = "Turcice", otherNames = {"Turcica", "Turcicae", "Turcici", "Turcicum", "Turcica", "Turcicae", "Türkçe", "Turcicus", "Turcica", "Turcicarum"}, scripts = {"Latn"}, family = "trk-ogz", ancestors = {"ota"}, } m["ts"] = { canonicalName = "Tsonga", scripts = {"Latn"}, family = "bnt", } m["tt"] = { canonicalName = "Tatarice", otherNames = {"Tatarica", "Tataricae", "Tatarici", "Tataricum", "Tatarica", "Tataricae", "татарча / tatarça", "Tataricus", "Tatarica", "Tataricarum"}, scripts = {"Cyrl", "Latn", "Arab", "tt-Arab"}, family = "trk-kip", translit_module = "tt-translit", } m["ty"] = { canonicalName = "Tahitiane", otherNames = {"Tahitiana", "Tahitianae", "Tahitiani", "Tahitianum", "Tahitiana", "Tahitianae", "reo Mā’ohi", "Tahitianus", "Tahitiana", "Tahitianarum"}, scripts = {"Latn"}, family = "poz-pol", } m["ug"] = { canonicalName = "Uyghur", otherNames = {"Uigur", "Uighur", "Uygur"}, scripts = {"ug-Arab", "Latn", "Cyrl"}, family = "trk", ancestors = {"chg"}, translit_module = "ug-translit", } m["uk"] = { canonicalName = "Ucrainice", otherNames = {"Ucrainica", "Ucrainicae", "Ucrainici", "Ucrainicum", "Ucrainica", "Ucrainicae", "українська", "Ucrainicus", "Ucrainica", "Ucrainicarum"}, scripts = {"Cyrl"}, family = "zle", translit_module = "uk-translit", entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["ur"] = { canonicalName = "Urdu", otherNames = {"Urdu"}, scripts = {"ur-Arab"}, family = "inc", ancestors = {"psu"}, entry_name = { from = {u(0x064B), u(0x064C), u(0x064D), u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652)}, to = {}} , } m["uz"] = { canonicalName = "Usbece", otherNames = {"Northern Uzbek", "Southern Uzbek"}, scripts = {"Latn", "Cyrl", "fa-Arab"}, family = "trk", ancestors = {"chg"}, } m["ve"] = { canonicalName = "Venda", scripts = {"Latn"}, family = "bnt", } -- otherNames is used for inflected forms: 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf m["vi"] = { canonicalName = "Vietnamice", otherNames = {"Vietnamica", "Vietnamicae", "Vietnamici", "Vietnamicum", "Vietnamica", "Vietnamicae", "tiếng Việt", "Vietnamicus", "Vietnamica", "Vietnamicarum", "Annamese", "Annamite"}, scripts = {"Latn", "Hani"}, family = "mkh-vie", ancestors = {"mkh-mvi"}, } m["vo"] = { canonicalName = "Volapük", scripts = {"Latn"}, family = "art", } m["wa"] = { canonicalName = "Vallonice", otherNames = {"Vallonica", "Vallonicae", "Vallonici", "Vallonicum", "Vallonica", "Vallonicae", "walon", "Vallonicus", "Vallonica", "Vallonicarum"}, scripts = {"Latn"}, family = "roa", ancestors = {"fro"}, sort_key = { from = {"[áàâäå]", "[éèêë]", "[íìîï]", "[óòôö]", "[úùûü]", "[ýỳŷÿ]", "ç", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "c"}} , } m["wo"] = { canonicalName = "Wolof", otherNames = {"Gambian Wolof"}, -- the subsumed dialect 'wof' scripts = {"Latn", "Arab"}, family = "alv-sng", } m["xh"] = { canonicalName = "Xhosa", scripts = {"Latn"}, family = "bnt-ngu", } m["yi"] = { canonicalName = "Iudaeogermanice", otherNames = {"Iudaeogermanica", "Iudaeogermanicae", "Iudaeogermanici", "Iudaeogermanicum", "Iudaeogermanica", "Iudaeogermanicae", "יידיש", "Iudaeogermanicus", "Iudaeogermanica", "Iudaeogermanicarum", "Jiddisch"}, scripts = {"Hebr"}, family = "gmw", ancestors = {"gmh"}, translit_module = "yi-translit", } m["yo"] = { canonicalName = "Yoruba", scripts = {"Latn"}, family = "alv-von", } m["za"] = { canonicalName = "Zhuang", scripts = {"Latn", "Hani"}, family = "tai", } m["zh"] = { canonicalName = "Sinice", otherNames = {"Sinica", "Sinicae", "Sinici", "Sinicum", "Sinica", "Sinicae", "中文", "Sinicus", "Sinica", "Sinicarum"}, scripts = {"Hani"}, family = "sit", ancestors = {"ltc"}, } m["zu"] = { canonicalName = "Zuluane", otherNames = {"Zuluana", "Zuluanae", "Zuluani", "Zuluanum", "Zuluana", "Zuluanae", "isiZulu", "Zuluanus", "Zuluana", "Zuluanarum"}, scripts = {"Latn"}, family = "bnt-ngu", } return m klpfc337gv7mkgugkocjnofkerag2eu 220206 220205 2022-08-14T14:39:13Z YaganZ 4537 corr. Scribunto text/plain -- Module:languages/data2 -- imported from en.wiktionary -- 2022-08-14 -- V25 -- sh-translit module, last modified by Usor:YaganZ -- 2021-12-27 -- V24 -- bn, ma, sc experimental, last modified by Usor:YaganZ -- 2021-03-14 -- V23 -- +kv = kpv = Komiense, last modified by Usor:YaganZ -- 2020-12-06 -- V22 -- +cu, last modified by Usor:YaganZ -- 2020-10-14 -- V21 -- +oj, last modified by Usor:YaganZ -- 2020-07-16 -- V20 -- +genplf, last modified by Usor:YaganZ -- canonicalNames are translated into Latin adverbial, ablative or neuter forms (if available in Categoria:Formulae linguarum), -- missing entries added. -- otherNames 1-6 are used for mostly used inflected forms, 7=own name, 8=non-inflected form, 9-n are rarely used inflected forms: -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf local u = mw.ustring.char -- UTF-8 encoded strings for some commonly-used diacritics local GRAVE = u(0x0300) local ACUTE = u(0x0301) local CIRC = u(0x0302) local TILDE = u(0x0303) local MACRON = u(0x0304) local BREVE = u(0x0306) local DOTABOVE = u(0x0307) local DIAER = u(0x0308) local CARON = u(0x030C) local DGRAVE = u(0x030F) local INVBREVE = u(0x0311) local DOTBELOW = u(0x0323) local RINGBELOW = u(0x0325) local CEDILLA = u(0x0327) -- Puncuation to be used for standardChars field local PUNCTUATION = ' \!\#\$\%\&\*\+\,\-\.\/\:\;\<\=\>\?\@\^\_\`\|\~\'\(\)' local m = {} m["aa"] = { canonicalName = "Afarice", otherNames = {"Afarica", "Afaricae", "Afarici", "Afaricum", "Afarica", "Afaricae", "Qafaraf", "Afaricus", "Afarica", "Afaricarum", "Qafar"}, scripts = {"Latn"}, family = "cus", } m["ab"] = { canonicalName = "Abasce", otherNames = {"Abasca", "Abascae", "Abasci", "Abascum", "Abasca", "Abascae", "аҧсшәа", "Abascus", "Abasca", "Abascarum", "Abkhazian", "Abxazo"}, scripts = {"Cyrl", "Geor", "Latn"}, family = "cau-abz", translit_module = "ab-translit", entry_name = { from = {GRAVE, ACUTE}, to = {}} , } m["ae"] = { canonicalName = "Avestane", otherNames = {"Avestana", "Avestanae", "Avestani", "Avestanum", "Avestana", "Avestanae", "zend", "Avestanus", "Avestana", "Avestanarum", "Old Bactrian"}, scripts = {"Avst", "Gujr"}, family = "ira-eas", translit_module = "Avst-translit", } m["af"] = { canonicalName = "Africanice", otherNames = {"Africanica", "Africanicae", "Africanici", "Africanicum", "Africanica", "Africanicae", "Afrikaans", "Africanicus", "Africanica", "Africanicarum"}, scripts = {"Latn", "Arab"}, family = "gmw", ancestors = {"nl"}, sort_key = { from = {"[äáâà]", "[ëéêè]", "[ïíîì]", "[öóôò]", "[üúûù]", "[ÿýŷỳ]", "^-", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" }} , } m["ak"] = { canonicalName = "Akan", otherNames = {"Twi-Fante", "Twi", "Fante", "Fanti", "Asante", "Akuapem"}, scripts = {"Latn"}, family = "alv-kwa", } m["am"] = { canonicalName = "Aethiopice", otherNames = {"Aethiopica", "Aethiopicae", "Aethiopici", "Aethiopicum", "Aethiopica", "Aethiopicae", "አማርኛ", "Aethiopicus", "Aethiopica", "Aethiopicarum", "Amharica"}, scripts = {"Ethi"}, family = "sem-eth", translit_module = "Ethi-translit", } m["an"] = { canonicalName = "Aragonice", otherNames = {"Aragonica", "Aragonicae", "Aragonici", "Aragonicum", "Aragonica", "Aragonicae", "aragonés", "Aragonicus", "Aragonica", "Aragonicarum", "Aragonensis"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oan"}, } m["ar"] = { canonicalName = "Arabice", otherNames = {"Arabica", "Arabicae", "Arabici", "Arabicum", "Arabica", "Arabicae", "العربية", "Arabicus", "Arabica", "Arabicarum", "Modern Standard Arabic", "Standard Arabic", "Literary Arabic", "Classical Arabic"}, scripts = {"Arab"}, family = "sem-arb", entry_name = { from = {u(0x0671), u(0x064B), u(0x064C), u(0x064D), u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652), u(0x0670), u(0x0640)}, to = {u(0x0627)}}, translit_module = "ar-translit", } m["as"] = { canonicalName = "Assamice", otherNames = {"Assamica", "Assamicae", "Assamici", "Assamicum", "Assamica", "Assamicae", "অসমীয়া", "Assamicus", "Assamica", "Assamicarum", "Asamiya"}, scripts = {"Beng"}, family = "inc", ancestors = {"pka"}, } m["av"] = { canonicalName = "Avar", otherNames = {"Avaric"}, scripts = {"Cyrl"}, family = "cau-nec", ancestors = {"oav"}, translit_module = "av-translit", } m["ay"] = { canonicalName = "Aymare", otherNames = {"Southern Aymara", "Central Aymara"}, scripts = {"Latn"}, family = "sai-aym", } m["az"] = { canonicalName = "Atropatenice", otherNames = {"Atropatenica", "Atropatenicae", "Atropatenici", "Atropatenicum", "Atropatenica", "Atropatenicae", "Azərbaycan dili", "Atropatenicus", "Atropatenica", "Atropatenicarum", "Azerbaijani", "Azari", "Azeri Turkic", "Azerbaijani Turkic", "North Azerbaijani", "South Azerbaijani"}, scripts = {"Latn", "Cyrl", "fa-Arab"}, family = "trk-ogz", } m["ba"] = { canonicalName = "Baschkirice", otherNames = {"Baschkirica", "Baschkiricae", "Baschkirici", "Baschkiricum", "Baschkirica", "Baschkiricae", "башҡортса", "Baschkiricus", "Baschkirica", "Baschkiricarum", "Bashkir"}, scripts = {"Cyrl"}, family = "trk-kip", translit_module = "ba-translit", } m["be"] = { canonicalName = "Albaruthenice", otherNames = {"Albaruthenica", "Albaruthenicae", "Albaruthenici", "Albaruthenicum", "Albaruthenica", "Albaruthenicae", "беларуская мова", "Albaruthenicus", "Albaruthenica", "Albaruthenicarum", "Belorussian", "Belarusan", "Bielorussian", "Byelorussian", "Belarussian", "White Russian"}, scripts = {"Cyrl"}, family = "zle", translit_module = "be-translit", sort_key = { from = {"Ё", "ё"}, to = {"Е" , "е"}}, entry_name = { from = {"Ѐ", "ѐ", GRAVE, ACUTE}, to = {"Е", "е"}}, } m["bg"] = { canonicalName = "Bulgarice", otherNames = {"Bulgarica", "Bulgaricae", "Bulgarici", "Bulgaricum", "Bulgarica", "Bulgaricae", "български език", "Bulgaricus", "Bulgarica", "Bulgaricarum" }, scripts = {"Cyrl"}, family = "zls", translit_module = "bg-translit", entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["bh"] = { canonicalName = "Bihari", scripts = {"Deva"}, family = "inc", ancestors = {"pka"}, } m["bi"] = { canonicalName = "Bislama", scripts = {"Latn"}, family = "crp", ancestors = {"en"}, } m["bm"] = { canonicalName = "Bambara", otherNames = {"Bamanankan"}, scripts = {"Latn"}, family = "dmn", } m["bn"] = { "Bengale", "Q9610", "inc-eas", canonicalName = "Bengale", otherNames = {"Bengala", "Bengalae", "Bengali", "Bengalum", "Bengala", "Bengalae", "বাংলা", "Bengalus", "Bengala", "Bengalarum", "Bangla", "Bengali"}, scripts = {"Beng", "Newa"}, ancestors = {"inc-mbn"}, translit_module = "bn-translit", } m["bo"] = { canonicalName = "Tibetane", otherNames = {"Tibetana", "Tibetanae", "Tibetani", "Tibetanum", "Tibetana", "Tibetanae", "བོད་སྐད།", "Tibetanus", "Tibetana", "Tibetanarum"}, scripts = {"Tibt"}, family = "tbq", ancestors = {"xct"}, translit_module = "bo-translit", } m["br"] = { canonicalName = "Britonice", otherNames = {"Britonica", "Britonicae", "Britonici", "Britonicum", "Britonica", "Britonicae", "brezhoneg", "Britonicus", "Britonica", "Britonicarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"xbm"}, } m["bs"] = { canonicalName = "Bosnice", otherNames = {"Bosnica", "Bosnicae", "Bosnici", "Bosnicum", "Bosnica", "Bosnicae", "Bosnian", "bosanski jezik", "Bosnicus", "Bosnica", "Bosnicarum"}, scripts = {"Latn"}, family = "zlw", } m["ca"] = { canonicalName = "Catalane", otherNames = {"Catalana", "Catalanae", "Catalani", "Catalanum", "Catalana", "Catalanae", "català", "Catalanus", "Catalana", "Catalanarum", "Valencian"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oca"}, sort_key = { from = {"à", "[èé]", "[íï]", "[òó]", "[úü]", "ç", "l·l"}, to = {"a", "e" , "i" , "o" , "u" , "c", "ll" }} , } m["ce"] = { canonicalName = "Chechen", scripts = {"Cyrl"}, family = "cau-nkh", translit_module = "ce-translit", entry_name = { from = {MACRON}, to = {}}, } m["ch"] = { canonicalName = "Chamorre", otherNames = {"Chamoru"}, scripts = {"Latn"}, family = "poz-sus", } m["co"] = { canonicalName = "Corse", otherNames = {"Corsa", "Corsae", "Corsi", "Corsum", "Corsa", "Corsae", "corsu", "Corsus", "Corsa", "Corsarum", "Corsican"}, scripts = {"Latn"}, family = "roa", } m["cr"] = { canonicalName = "Cree", scripts = {"Cans", "Latn"}, family = "alg", translit_module = "cr-translit", } m["cs"] = { canonicalName = "Bohemice", otherNames = {"Bohemica", "Bohemicae", "Bohemici", "Bohemicum", "Bohemica", "Bohemicae", "čeština", "Bohemicus", "Bohemica", "Bohemicarum"}, scripts = {"Latn"}, family = "zlw", ancestors = {"zlw-ocs"}, sort_key = { from = {"á", "é", "í", "ó", "[úů]", "ý"}, to = {"a", "e", "i", "o", "u" , "y"}} , } m["cu"] = { "Slavica Antiqua", "Q35499", "zls", otherNames = {"Slavica Antiqua", "Slavicae Antiquae", "Slavici Antiqui", "Slavicum Antiquum", "Slavica Antiqua", "Slavicae Antiquae", "словѣньскъ ѩзыкъ", "Slavicus Antiquus", "Slavica Antiqua", "Slavicarum Antiquarum", "Old Church Slavic", "Old Church Slavonic"}, scripts = {"Cyrs", "Glag"}, translit_module = "Cyrs-Glag-translit", entry_name = { from = {u(0x0484)}, -- kamora to = {}}, sort_key = { from = {"оу", "є"}, to = {"у" , "е"}} , } m["cv"] = { canonicalName = "Chuvash", scripts = {"Cyrl"}, family = "trk-ogr", translit_module = "cv-translit", } m["cy"] = { canonicalName = "Cambrice", otherNames = {"Cambrica", "Cambricae", "Cambrici", "Cambricum", "Cambrica", "Cambricae", "Cymraeg", "Cambricus", "Cambrica", "Cambricarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"wlm"}, sort_key = { from = {"[âáàä]", "[êéèë]", "[îíìï]", "[ôóòö]", "[ûúùü]", "[ŵẃẁẅ]", "[ŷýỳÿ]", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "w" , "y" }} , } m["da"] = { canonicalName = "Danice", otherNames = {"Danica", "Danicae", "Danici", "Danicum", "Danica", "Danicae", "dansk", "Danicus", "Danica", "Danicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-oda"}, } m["de"] = { "Germanice", "Q188", "gmw", otherNames = {"Germanica", "Germanicae", "Germanici", "Germanicum", "Germanica", "Germanicae", "Deutsch", "Germanicus", "Germanica", "Germanicarum", "High German", "New High German", "Deutsch"}, -- the last name is indeed also used in English scripts = {"Latn", "Latf"}, ancestors = {"gmh"}, sort_key = { from = {"[äàáâå]", "[ëèéê]", "[ïìíî]", "[öòóô]", "[üùúû]", "ß" }, to = {"a" , "e" , "i" , "o" , "u" , "ss"}} , } m["dv"] = { canonicalName = "Dhivehi", otherNames = {"Divehi", "Mahal", "Mahl", "Maldivian"}, scripts = {"Thaa"}, family = "inc", ancestors = {"pmh"}, translit_module = "dv-translit", } m["dz"] = { canonicalName = "Dzongkha", scripts = {"Tibt"}, family = "tbq", ancestors = {"xct"}, translit_module = "bo-translit", } m["ee"] = { canonicalName = "Ewe", scripts = {"Latn"}, family = "alv", } m["el"] = { canonicalName = "Neograece", otherNames = {"Neograeca", "Neograecae", "Neograeci", "Neograecum", "Neograeca", "Neograecae", "Νέα Ελληνικά", "Neograecus", "Neograeca", "Neograecarum", "Modern Greek", "Neo-Hellenic"}, scripts = {"Grek"}, family = "grk", ancestors = {"grc"}, translit_module = "el-translit", sort_key = { -- Keep this synchronized with grc, cpg, pnt from = {"[ᾳάᾴὰᾲᾶᾷἀᾀἄᾄἂᾂἆᾆἁᾁἅᾅἃᾃἇᾇ]", "[έὲἐἔἒἑἕἓ]", "[ῃήῄὴῂῆῇἠᾐἤᾔἢᾒἦᾖἡᾑἥᾕἣᾓἧᾗ]", "[ίὶῖἰἴἲἶἱἵἳἷϊΐῒῗ]", "[όὸὀὄὂὁὅὃ]", "[ύὺῦὐὔὒὖὑὕὓὗϋΰῢῧ]", "[ῳώῴὼῲῶῷὠᾠὤᾤὢᾢὦᾦὡᾡὥᾥὣᾣὧᾧ]", "ῥ", "ς"}, to = {"α" , "ε" , "η" , "ι" , "ο" , "υ" , "ω" , "ρ", "σ"}} , } m["en"] = { canonicalName = "Anglice", otherNames = {"Anglica", "Anglicae", "Anglici", "Anglicum", "Anglica", "Anglicae", "English", "Anglicus", "Anglica", "Anglicarum", "Modern English", "New English", "Hawaiian Creole English", "Hawai'ian Creole English", "Hawaiian Creole", "Hawai'ian Creole", "Polari", "Yinglish"}, -- all but the first three are names and alt names of subsumed dialects which once had ISO codes scripts = {"Latn", "Shaw", "Dsrt"}, -- last two are rare but probably attested; entries in them might require community approval, but it's good for the script codes not to be orphans family = "gmw", ancestors = {"enm"}, sort_key = { from = {"[äàáâåā]", "[ëèéêē]", "[ïìíîī]", "[öòóôō]", "[üùúûū]", "æ" , "œ" , "[çč]", "ñ", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "ae", "oe", "c" , "n"}}, wikimedia_codes = {"en", "simple"}, standardChars = "A-Za-z0-9" .. PUNCTUATION .. u(0x2800) .. "-" .. u(0x28FF) } m["eo"] = { canonicalName = "Esperantice", otherNames = {"Esperantica", "Esperanticae", "Esperantici", "Esperanticum", "Esperantica", "Esperanticae", "Esperanto", "Esperanticus", "Esperantica", "Esperanticarum"}, scripts = {"Latn"}, family = "art", sort_key = { from = {"[áà]", "[éè]", "[íì]", "[óò]", "[úù]", "[ĉ]", "[ĝ]", "[ĥ]", "[ĵ]", "[ŝ]", "[ŭ]"}, to = {"a" , "e" , "i" , "o" , "u", "cĉ", "gĉ", "hĉ", "jĉ", "sĉ", "uĉ"}} , } m["es"] = { canonicalName = "Hispanice", otherNames = {"Hispanica", "Hispanicae", "Hispanici", "Hispanicum", "Hispanica", "Hispanicae", "español", "Hispanicus", "Hispanica", "Hispanicarum", "Castilian"}, scripts = {"Latn"}, family = "roa", ancestors = {"osp"}, sort_key = { from = {"á", "é", "í", "ó", "[úü]", "ç", "ñ"}, to = {"a", "e", "i", "o", "u" , "c", "n"}}, standardChars = "A-VXYZa-vxyz0-9ÁáÉéÍíÓóÚúÑñ¿¡" .. PUNCTUATION } m["et"] = { canonicalName = "Estonice", otherNames = {"Estonica", "Estonicae", "Estonici", "Estonicum", "Estonica", "Estonicae", "eesti keel", "Estonicus", "Estonica", "Estonicarum"}, scripts = {"Latn"}, family = "fiu-fin", } m["eu"] = { canonicalName = "Vasconice", otherNames = {"Vasconica", "Vasconicae", "Vasconici", "Vasconicum", "Vasconica", "Vasconicae", "Euskara", "Vasconicus", "Vasconica", "Vasconicarum"}, scripts = {"Latn"}, family = "euq", } m["fa"] = { canonicalName = "Persice", otherNames = {"Persica", "Persicae", "Persici", "Persicum", "Persica", "Persicae", "فارسی", "Persicus", "Persica", "Persicarum", "Farsi", "New Persian", "Modern Persian", "Western Persian", "Iranian Persian", "Eastern Persian", "Dari", "Aimaq", "Aimak", "Aymaq", "Eimak"}, scripts = {"fa-Arab"}, family = "ira-wes", ancestors = {"pal"}, entry_name = { from = {u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652)}, to = {}} , } m["ff"] = { canonicalName = "Fula", otherNames = {"Adamawa Fulfulde", "Bagirmi Fulfulde", "Borgu Fulfulde", "Central-Eastern Niger Fulfulde", "Fulani", "Fulfulde", "Maasina Fulfulde", "Nigerian Fulfulde", "Pular", "Pulaar", "Western Niger Fulfulde"}, -- Maasina, etc are dialects, subsumed into this code scripts = {"Latn"}, family = "alv-sng", } m["fi"] = { canonicalName = "Finnice", otherNames = {"Finnica", "Finnicae", "Finnici", "Finnicum", "Finnica", "Finnicae", "suomi", "Finnicus", "Finnica", "Finnicarum"}, scripts = {"Latn"}, family = "fiu-fin", entry_name = { from = {"ˣ"}, -- Used to indicate gemination of the next consonant to = {}}, sort_key = { from = {"[áàâã]", "[éèêẽ]", "[íìîĩ]", "[óòôõ]", "[úùûũ]", "[ýỳŷüű]", "[øõő]", "æ" , "œ" , "[čç]", "š", "ž", "ß" , "[':]"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "ö" , "ae", "oe", "c" , "s", "z", "ss"}} , } m["fj"] = { canonicalName = "Fidziane", otherNames = {"Fidziana", "Fidzianae", "Fidziani", "Fidzianum", "Fidziana", "Fidzianae", "?", "Fidzianus", "Fidziana", "Fidzianarum"}, scripts = {"Latn"}, family = "poz-occ", } m["fo"] = { canonicalName = "Faeroice", otherNames = {"Faeroica", "Faeroicae", "Faeroici", "Faeroicum", "Faeroica", "Faeroicae", "føroyskt", "Faeroicus", "Faeroica", "Faeroicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"non"}, } m["fr"] = { canonicalName = "Francogallice", otherNames = {"Francogallica", "Francogallicae", "Francogallici", "Francogallicum", "Francogallica", "Francogallicae", "français", "Francogallicus", "Francogallica", "Francogallicarum", "Modern French"}, scripts = {"Latn"}, family = "roa", ancestors = {"frm"}, sort_key = { from = {"[áàâä]", "[éèêë]", "[íìîï]", "[óòôö]", "[úùûü]", "[ýỳŷÿ]", "ç", "æ" , "œ" , "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "c", "ae", "oe"}}, standardChars = "A-Za-z0-9ÀÂÇÉÈÊËÎÏÔŒÛÙÜàâçéèêëîïôœûùü" .. PUNCTUATION } m["fy"] = { canonicalName = "Frisice", otherNames = {"Frisica", "Frisicae", "Frisici", "Frisicum", "Frisica", "Frisicae", "?", "Frisicus", "Frisica", "Frisicarum", "Western Frisian", "Frisian", "Frysk"}, scripts = {"Latn"}, family = "gmw-fri", ancestors = {"ofs"}, } m["ga"] = { canonicalName = "Hibernice", otherNames = {"Hibernica", "Hibernicae", "Hibernici", "Hibernicum", "Hibernica", "Hibernicae", "Gaeilge", "Hibernicus", "Hibernica", "Hibernicarum", "Irish Gaelic"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"á", "é", "í", "ó", "ú", "ý", "ḃ" , "ċ" , "ḋ" , "ḟ" , "ġ" , "ṁ" , "ṗ" , "ṡ" , "ṫ" }, to = {"a", "e", "i", "o", "u", "y", "bh", "ch", "dh", "fh", "gh", "mh", "ph", "sh", "th"}} , } m["gd"] = { canonicalName = "Gaelice", otherNames = {"Gaelica", "Gaelicae", "Gaelici", "Gaelicum", "Gaelica", "Gaelicae", "Gàidhlig", "Gaelicus", "Gaelica", "Gaelicarum", "Highland Gaelic", "Scots Gaelic", "Scottish"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"[áà]", "[éè]", "[íì]", "[óò]", "[úù]", "[ýỳ]"}, to = {"a" , "e" , "i" , "o" , "u" , "y" }} , } m["gl"] = { canonicalName = "Gallaice", otherNames = {"Gallaica", "Gallaicae", "Gallaici", "Gallaicum", "Gallaica", "Gallaicae", "galego", "Gallaicus", "Gallaica", "Gallaicarum", "Galician"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-opt"}, sort_key = { from = {"á", "é", "í", "ó", "ú"}, to = {"a", "e", "i", "o", "u"}} , } m["gn"] = { canonicalName = "Guaraní", scripts = {"Latn"}, family = "tup", } m["gu"] = { canonicalName = "Gujarati", scripts = {"Gujr"}, family = "inc", ancestors = {"inc-ogu"}, translit_module = "gu-translit", } m["gv"] = { canonicalName = "Manx", otherNames = {"Manx Gaelic"}, scripts = {"Latn"}, family = "cel-gae", ancestors = {"mga"}, sort_key = { from = {"ç", "-"}, to = {"c"}} , } m["ha"] = { canonicalName = "Hausa", scripts = {"Latn", "Arab"}, family = "cdc-wst", } m["he"] = { canonicalName = "Hebraice", otherNames = {"Hebraica", "Hebraicae", "Hebraici", "Hebraicum", "Hebraica", "Hebraicae", "עִבְרִית", "Hebraicus", "Hebraica", "Hebraicarum", "Ivrit"}, scripts = {"Hebr", "Phnx"}, family = "sem-can", entry_name = { from = {"[" .. u(0x0591) .. "-" .. u(0x05BD) .. u(0x05BF) .. "-" .. u(0x05C5) .. u(0x05C7) .. "]"}, to = {}} , } m["hi"] = { canonicalName = "Hindice", otherNames = {"Hindica", "Hindicae", "Hindici", "Hindicum", "Hindica", "Hindicae", "हिन्दी", "Hindicus", "Hindica", "Hindicarum", "hindī"}, scripts = {"Deva"}, family = "inc", ancestors = {"inc-ohi"}, translit_module = "hi-translit", } m["ho"] = { canonicalName = "Hiri Motu", otherNames = {"Pidgin Motu", "Police Motu"}, scripts = {"Latn"}, family = "crp", ancestors = {"meu"}, } m["hr"] = { canonicalName = "Croate", otherNames = {"Croata", "Croatae", "Croati", "Croatum", "Croata", "Croatae", "hrvatski", "Croatus", "Croata", "Croatarum", "Croatian"}, scripts = {"Latn"}, family = "zlw", } m["ht"] = { canonicalName = "Haitiane", otherNames = {"Haitiana", "Haitianae", "Haitiani", "Haitianum", "Haitiana", "Haitianae", "kreyòl", "Haitianus", "Haitiana", "Haitianarum", "Creole", "Haitian"}, scripts = {"Latn"}, family = "crp", } m["hu"] = { canonicalName = "Hungarice", otherNames = {"Hungarica", "Hungaricae", "Hungarici", "Hungaricum", "Hungarica", "Hungaricae", "magyar", "Hungaricus", "Hungarica", "Hungaricarum"}, scripts = {"Latn"}, family = "fiu-ugr", ancestors = {"ohu"}, sort_key = { from = {"á", "é", "í", "ó", "ú", "ő", "ű"}, to = {"a", "e", "i", "o", "u", "ö", "ü"}} , } m["hy"] = { canonicalName = "Armenie", otherNames = {"Armenia", "Armeniae", "Armenii", "Armenium", "Armenia", "Armeniae", "Հայերէն", "Armenius", "Armenia", "Armeniarum", "Modern Armenian", "Eastern Armenian", "Western Armenian"}, scripts = {"Armn"}, family = "hyx", ancestors = {"axm"}, translit_module = "Armn-translit", sort_key = { from = {"ու", "և", "եւ"}, to = {"ւ", "եվ", "եվ"}}, entry_name = { from = {"՞", "՜", "՛", "՟", "և", "<sup>յ</sup>", "<sup>ի</sup>"}, to = {"", "", "", "", "եւ", "յ", "ի"}} , } m["hz"] = { canonicalName = "Herero", scripts = {"Latn"}, family = "bnt", } m["ia"] = { canonicalName = "Interlingua", otherNames = {"Interlingua"}, scripts = {"Latn"}, family = "art", } m["id"] = { canonicalName = "Indonesie", otherNames = {"Indonesia", "Indonesiae", "Indonesii", "Indonesium", "Indonesia", "Indonesiae", "Bahasa Indonesia", "Indonesius", "Indonesia", "Indonesiarum"}, scripts = {"Latn"}, family = "poz-mly", ancestors = {"ms"}, } m["ie"] = { canonicalName = "Interlingue", otherNames = {"Occidental"}, scripts = {"Latn"}, family = "art", } m["ig"] = { canonicalName = "Igbo", scripts = {"Latn"}, family = "nic-bco", } m["ii"] = { canonicalName = "Sichuan Yi", otherNames = {"Nuosu", "Nosu", "Northern Yi", "Liangshan Yi"}, scripts = {"Yiii"}, family = "tbq-lol", } m["ik"] = { canonicalName = "Inupiak", otherNames = {"Inupiaq", "Iñupiaq", "Inupiatun"}, scripts = {"Latn"}, family = "esx-inu", } m["io"] = { canonicalName = "Ido", scripts = {"Latn"}, family = "art", } m["is"] = { canonicalName = "Islandice", otherNames = {"Islandica", "Islandicae", "Islandici", "Islandicum", "Islandica", "Islandicae", "íslenska", "Islandica", "Islandica", "Islandicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"non"}, } m["it"] = { canonicalName = "Italice", otherNames = {"Italica", "Italicae", "Italici", "Italicum", "Italica", "Italicae", "italiano", "Italicus", "Italica", "Italicarum", "Italiana"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-oit"}, sort_key = { from = {"[àáâäå]", "[èéêë]", "[ìíîï]", "[òóôö]", "[ùúûü]"}, to = {"a" , "e" , "i" , "o" , "u" }} , } m["iu"] = { canonicalName = "Inuktitut", otherNames = {"Eastern Canadian Inuktitut", "Eastern Canadian Inuit", "Western Canadian Inuktitut", "Western Canadian Inuit", "Western Canadian Inuktun", "Inuinnaq", "Inuinnaqtun", "Inuvialuk", "Inuvialuktun", "Nunavimmiutit", "Nunatsiavummiut", "Aivilimmiut", "Natsilingmiut", "Kivallirmiut", "Siglit", "Siglitun"}, scripts = {"Cans", "Latn"}, family = "esx-inu", translit_module = "iu-translit", } m["ja"] = { canonicalName = "Iaponice", otherNames = {"Iaponica", "Iaponicae", "Iaponici", "Iaponicum", "Iaponica", "Iaponicae", "日本語", "Iaponicus", "Iaponica", "Iaponicarum", "Nihongo", "Modern Japanese", "Nipponese"}, scripts = {"Jpan", "Latn", "Hira"}, family = "jpx", ancestors = {"ojp"}, } m["jv"] = { canonicalName = "Iavense", otherNames = {"Iavensis", "Iavenses", "Iavenses", "Iavense", "Iavensia", "Iavensis", "basa Jawa", "Iavensis", "Iavensi", "Iavensium"}, scripts = {"Latn", "Java"}, family = "poz-sus", translit_module = "jv-translit", ancestors = {"kaw"}, link_tr = true, } m["ka"] = { canonicalName = "Georgiane", otherNames = {"Georgiana", "Georgianae", "Georgiani", "Georgianum", "Georgiana", "Georgianae", "ქართული", "Georgianus", "Georgiana", "Georgianarum", "Kartvelian"}, scripts = {"Geor", "Geok"}, family = "ccs-gzn", ancestors = {"oge"}, translit_module = "Geor-translit", entry_name = { from = {"̂"}, to = {""}}, } m["kg"] = { canonicalName = "Kongo", otherNames = {"Kikongo", "Koongo", "Laari", "San Salvador Kongo", "Yombe"}, scripts = {"Latn"}, family = "bnt", } m["ki"] = { canonicalName = "Kikuyu", otherNames = {"Gikuyu", "Gĩkũyũ"}, scripts = {"Latn"}, family = "bnt", } m["kj"] = { canonicalName = "Kwanyama", otherNames = {"Kuanyama", "Oshikwanyama"}, scripts = {"Latn"}, family = "bnt", } m["kk"] = { "Kazachice", "Q9252", "trk-kno", otherNames = {"Kazachica", "Kazachicae", "Kazachici", "Kazachicum", "Kazachica", "Kazachicae", "Қазақ тілі", "Kazachicus", "Kazachica", "Kazachicarum"}, scripts = {"Cyrl", "Latn", "kk-Arab"}, translit_module = "kk-translit", override_translit = true, } m["kl"] = { canonicalName = "Groenlandice", otherNames = {"Groenlandica", "Groenlandicae", "Groenlandici", "Groenlandicum", "Groenlandica", "Groenlandicae", "Kalaallisut", "Groenlandicus", "Groenlandica", "Groenlandicarum"}, scripts = {"Latn"}, family = "esx-inu", } m["km"] = { canonicalName = "Khmer", otherNames = {"Cambodian"}, scripts = {"Khmr"}, family = "mkh", ancestors = {"mkh-mkm"}, translit_module = "km-translit", } m["kn"] = { canonicalName = "Kannada", scripts = {"Knda"}, family = "dra", translit_module = "kn-translit", } m["ko"] = { canonicalName = "Coreane", otherNames = {"Coreana", "Coreanae", "Coreani", "Coreanum", "Coreana", "Coreanae", "한국어", "Coreanus", "Coreana", "Coreanarum", "Modern Korean"}, scripts = {"Kore"}, family = "qfa-kor", ancestors = {"okm"}, translit_module = "ko-translit", } m["kr"] = { canonicalName = "Kanuri", otherNames = {"Kanembu", "Bilma Kanuri", "Central Kanuri", "Manga Kanuri", "Tumari Kanuri"}, scripts = {"Latn"}, family = "ssa", } m["ks"] = { canonicalName = "Caspirice", otherNames = {"Caspirica", "Caspiricae", "Caspirici", "Caspiricum", "Caspirica", "Caspiricae", "कॉशुर / کٲشُر", "Caspiricus", "Caspirica", "Caspiricarum", "Kashmiri"}, scripts = {"ks-Arab", "Deva"}, family = "inc-dar", } m["ku"] = { canonicalName = "Corduene", otherNames = {"Corduena", "Corduenae", "Cordueni", "Corduenum", "Corduena", "Corduenae", "kurdî", "Corduenus", "Corduena", "Corduenarum"}, scripts = {"Latn", "ku-Arab", "Armn", "Cyrl"}, family = "ira-wes", } m["kv"] = { "Komiense", "Q34114", "urj-prm", otherNames = {"Komiensis", "Komienses", "Komienses", "Komiense", "Komiensia", "Komiensis", "Коми кыв", "Komiensis", "Komiensi", "Komiensium", "Komi", "Komi-Zyryan"}, scripts = Cyrl, translit_module = "kv-translit", override_translit = true, } m["kw"] = { canonicalName = "Cornubice", otherNames = {"Cornubica", "Cornubicae", "Cornubici", "Cornubicum", "Cornubica", "Cornubicae", "Kernowek", "Cornubicus", "Cornubica", "Cornubicarum"}, scripts = {"Latn"}, family = "cel-bry", ancestors = {"cnx"}, } m["ky"] = { canonicalName = "Kyrgesse", otherNames = {"Kyrgessa", "Kyrgessae", "Kyrgessi", "Kyrgessum", "Kyrgessa", "Kyrgessae", "кыргызча", "Kyrgessus", "Kyrgessa", "Kyrgessarum", "Chirgisica", "Kirghiz", "Kirgiz"}, scripts = {"Cyrl", "Latn", "Arab"}, family = "trk-kip", translit_module = "ky-translit", } m["la"] = { canonicalName = "Latine", otherNames = {"Latina", "Latinae", "Latini", "Latinum", "Latina", "Latinae", "Latine", "Latinus", "Latina", "Latinarum"}, scripts = {"Latn"}, family = "itc", ancestors = {"itc-ola"}, entry_name = { from = {"[ĀĂ]", "[āă]", "[ĒĔ]", "[ēĕë]", "[ĪĬÏ]", "[īĭï]", "[ŌŎ]", "[ōŏ]", "[ŪŬÜ]", "[ūŭü]", "Ȳ", "ȳ", MACRON, BREVE, DIAER}, to = {"A", "a", "E", "e", "I", "i", "O", "o", "U", "u", "Y", "y"}}, } m["lb"] = { canonicalName = "Luxemburgice", otherNames = {"Luxemburgica", "Luxemburgicae", "Luxemburgici", "Luxemburgicum", "Luxemburgica", "Luxemburgicae", "Lëtzebuergesch", "Luxemburgicus", "Luxemburgica", "Luxemburgicarum"}, scripts = {"Latn"}, family = "gmw", ancestors = {"gmh"}, } m["lg"] = { canonicalName = "Luganda", otherNames = {"Ganda"}, scripts = {"Latn"}, family = "bnt", } m["li"] = { canonicalName = "Limburgice", otherNames = {"Limburgica", "Limburgicae", "Limburgici", "Limburgicum", "Limburgica", "Limburgicae", "Limburgs", "Limburgicus", "Limburgica", "Limburgicarum", "Limburgan", "Limburgian", "Limburgic"}, scripts = {"Latn"}, family = "gmw", ancestors = {"dum"}, } m["ln"] = { canonicalName = "Lingala", scripts = {"Latn"}, family = "bnt", } m["lo"] = { canonicalName = "Lao", otherNames = {"Laotian"}, scripts = {"Laoo"}, family = "tai-swe", translit_module = "lo-translit", } m["lt"] = { canonicalName = "Lithuanice", otherNames = {"Lithuanica", "Lithuanicae", "Lithuanici", "Lithuanicum", "Lithuanica", "Lithuanicae", "lietuvių", "Lithuanicus", "Lithuanica", "Lithuanicarum"}, scripts = {"Latn"}, family = "bat", ancestors = {"olt"}, entry_name = { from = {"[áãà]", "[ÁÃÀ]", "[éẽè]", "[ÉẼÈ]", "[íĩì]", "[ÍĨÌ]", "[ýỹ]", "[ÝỸ]", "ñ", "[óõò]", "[ÓÕÒ]", "[úũù]", "[ÚŨÙ]", ACUTE, GRAVE, TILDE}, to = {"a", "A", "e", "E", "i", "I", "y", "Y", "n", "o", "O", "u", "U"}} , } m["lu"] = { canonicalName = "Luba-Katanga", scripts = {"Latn"}, family = "bnt", } m["lv"] = { canonicalName = "Lettice", otherNames = {"Lettica", "Letticae", "Lettici", "Letticum", "Lettica", "Letticae", "latviešu", "Letticus", "Lettica", "Letticarum", "Lettonica", "Lettish", "Lett"}, scripts = {"Latn"}, family = "bat", } -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf m["ma"] = { canonicalName = "Magare", otherNames = {"Magara", "Magarae", "Magari", "Magarum", "Magara", "Magarae", "léngua magara", "Magarus", "Magara", "Magararum", "Magarian"}, scripts = {"Latn"}, family = "roa-itd", } m["mg"] = { canonicalName = "Madagascariense", otherNames = {"Madagascariensis", "Madagascarienses", "Madagascarienses", "Madagascariense", "Madagascariensia", "Madagascariensis", "malagasy", "Madagascariensis", "Madagascariensi", "Madagascariensium", "Betsimisaraka Malagasy", "Betsimisaraka", "Northern Betsimisaraka Malagasy", "Northern Betsimisaraka", "Southern Betsimisaraka Malagasy", "Southern Betsimisaraka", "Bara Malagasy", "Bara", "Masikoro Malagasy", "Masikoro", "Antankarana", "Antankarana Malagasy", "Plateau Malagasy", "Sakalava", "Tandroy Malagasy", "Tandroy", "Tanosy", "Tanosy Malagasy", "Tesaka", "Tsimihety", "Tsimihety Malagasy"}, scripts = {"Latn"}, family = "poz-bre", } m["mh"] = { canonicalName = "Marshallese", scripts = {"Latn"}, family = "poz-mic", sort_key = { from = {"ā" , "ļ" , "m̧" , "ņ" , "n̄" , "o̧" , "ō" , "ū" }, to = {"a~", "l~", "m~", "n~", "n~~", "o~", "o~~", "u~"}} , } m["mi"] = { canonicalName = "Maorice", otherNames = {"Maorica", "Maoricae", "Maorici", "Maoricum", "Maorica", "Maoricae", "Māori", "Maoricus", "Maorica", "Maoricarum"}, scripts = {"Latn"}, family = "poz-pol", } m["mk"] = { canonicalName = "Macedonice", otherNames = {"Macedonica", "Macedonicae", "Macedonici", "Macedonicum", "Macedonica", "Macedonicae", "Македонски јазик", "Macedonicus", "Macedonica", "Macedonicarum" }, scripts = {"Cyrl"}, family = "zls", translit_module = "mk-translit", entry_name = { from = {ACUTE}, to = {}}, } m["ml"] = { canonicalName = "Malayalam", scripts = {"Mlym"}, family = "dra", translit_module = "ml-translit", } m["mn"] = { canonicalName = "Mogolice", otherNames = {"Mogolica", "Mogolicae", "Mogolici", "Mogolicum", "Mogolica", "Mogolicae", "Монгол хэл", "Mogolicus", "Mogolica", "Mogolicarum", "Khalkha Mongolian"}, scripts = {"Cyrl", "Mong"}, family = "xgn", ancestors = {"cmg"}, translit_module = "mn-translit", } m["mr"] = { canonicalName = "Marathi", scripts = {"Deva", "Modi"}, family = "inc", ancestors = {"omr"}, translit_module = "hi-translit", } m["ms"] = { canonicalName = "Malaice", otherNames = {"Malaica", "Malaicae", "Malaici", "Malaicum", "Malaica", "Malaicae", "Bahasa Melayu", "Malaicus", "Malaica", "Malaicarum"}, scripts = {"Latn", "Arab"}, family = "poz-mly", } m["mt"] = { canonicalName = "Melitense", otherNames = {"Melitensis", "Melitenses", "Melitenses", "Melitense", "Melitensia", "Melitensis", "Malti", "Melitensis", "Melitensi", "Melitensium"}, scripts = {"Latn"}, family = "sem-arb", ancestors = {"sqr"}, } m["my"] = { canonicalName = "Birmanice", otherNames = {"Birmanica", "Birmanicae", "Birmanici", "Birmanicum", "Birmanica", "Birmanicae", "မ္ရန္‌မာစာ", "Birmanicus", "Birmanica", "Birmanicarum", "Burmese", "Myanmar"}, scripts = {"Mymr"}, family = "tbq-brm", ancestors = {"obr"}, translit_module = "my-translit", } m["na"] = { canonicalName = "Nauruane", otherNames = {"Nauruana", "Nauruanae", "Nauruani", "Nauruanum", "Nauruana", "Nauruanae", "Nauru", "Nauruanus", "Nauruana", "Nauruanarum"}, scripts = {"Latn"}, family = "poz-mic", } m["nb"] = { canonicalName = "Dano-Norvegice", otherNames = {"Dano-Norvegica", "Dano-Norvegicae", "Dano-Norvegici", "Dano-Norvegicum", "Dano-Norvegica", "Dano-Norvegicae", "Bokmål", "Dano-Norvegicus", "Dano-Norvegica", "Dano-Norvegicarum", "Norwegian", "Norsk"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, wikimedia_codes = {"no"}, } m["nd"] = { canonicalName = "Northern Ndebele", otherNames = {"North Ndebele"}, scripts = {"Latn"}, family = "bnt-ngu", } m["ne"] = { canonicalName = "Nepalense", otherNames = {"Nepalensis", "Nepalenses", "Nepalenses", "Nepalense", "Nepalensia", "Nepalensis", "नेपाली", "Nepalensis", "Nepalensi", "Nepalensium", "Nepalese"}, scripts = {"Deva"}, family = "inc", translit_module = "ne-translit", } m["ng"] = { canonicalName = "Ndonga", scripts = {"Latn"}, family = "bnt", } m["nl"] = { canonicalName = "Batave", otherNames = {"Batava", "Batavae", "Batavi", "Batavum", "Batava", "Batavae", "Nederlands", "Batavus", "Batava", "Batavarum", "Netherlandic", "Flemish"}, scripts = {"Latn"}, family = "gmw", ancestors = {"dum"}, sort_key = { from = {"[äáâå]", "[ëéê]", "[ïíî]", "[öóô]", "[üúû]", "ç", "ñ", "^-"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "n"}} , } m["nn"] = { canonicalName = "Neonorvegice", otherNames = {"Neonorvegica", "Neonorvegicae", "Neonorvegici", "Neonorvegicum", "Neonorvegica", "Neonorvegicae", "Nynorsk", "Neonorvegicus", "Neonorvegica", "Neonorvegicarum", "New Norwegian"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, } m["no"] = { canonicalName = "Norvegice", otherNames = {"Norvegica", "Norvegicae", "Norvegici", "Norvegicum", "Norvegica", "Norvegicae", "Norsk", "Norvegicus", "Norvegica", "Norvegicarum", "Norwegian"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-mno"}, } m["nr"] = { canonicalName = "Southern Ndebele", otherNames = {"South Ndebele"}, scripts = {"Latn"}, family = "bnt-ngu", } m["nv"] = { canonicalName = "Navajo", scripts = {"nv-Latn"}, family = "apa", sort_key = { from = {"[áą]", "[éę]", "[íį]", "[óǫ]", "ń", "^n([djlt])", "ł" , "[ʼ’']", ACUTE}, to = {"a" , "e" , "i" , "o" , "n", "ni%1" , "l"}}, -- the copyright sign is used to guarantee that ł will always be sorted after all other words with l } m["ny"] = { canonicalName = "Chichewa", otherNames = {"Chicheŵa", "Chinyanja", "Nyanja", "Chewa"}, scripts = {"Latn"}, family = "bnt", entry_name = { from = {ACUTE}, to = {}}, } m["oc"] = { canonicalName = "Occitane", otherNames = {"Occitana", "Occitanae", "Occitani", "Occitanum", "Occitana", "Occitanae", "occitan", "Occitanus", "Occitana", "Occitanarum", "Provençal", "Auvergnat", "Auvernhat", "Gascon", "Languedocien", "Lengadocian", "Shuadit", "Chouhadite", "Chouhadit", "Chouadite", "Chouadit", "Shuhadit", "Judeo-Provençal", "Judeo-Provencal", "Judeo-Comtadin"}, scripts = {"Latn", "Hebr"}, family = "roa", ancestors = {"pro"}, sort_key = { from = {"[àá]", "[èé]", "[íï]", "[òó]", "[úü]", "ç", "([lns])·h"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "%1h" }} , } m["oj"] = { "Ojibwayense", "Q33875", "alg", otherNames = {"Ojibwayensis", "Ojibwayenses", "Ojibwayenses", "Ojibwayense", "Ojibwayensia", "Ojibwayensis", "Anishinaabemowin / ᐊᓂᔑᓈᐯᒧᐎᓐ", "Ojibwayensis", "Ojibwayensi", "Ojibwayensium"}, aliases = {"Ojibway", "Ojibwa"}, varieties = {{"Chippewa", "Ojibwemowin", "Southwestern Ojibwa"}}, scripts = {"Cans", "Latn"}, sort_key = { from = {"aa", "ʼ", "ii", "oo", "sh", "zh"}, to = {"a~", "h~", "i~", "o~", "s~", "z~"}} , } m["om"] = { canonicalName = "Oromo", otherNames = {"Orma", "Borana-Arsi-Guji Oromo", "West Central Oromo"}, scripts = {"Latn", "Ethi"}, family = "cus", } m["or"] = { canonicalName = "Oriya", otherNames = {"Odia", "Oorya"}, scripts = {"Orya"}, family = "inc", ancestors = {"pka"}, } m["os"] = { canonicalName = "Alane", otherNames = {"Ossete", "Ossetic", "Digor", "Iron"}, scripts = {"Cyrl", "Geor", "Latn"}, family = "ira", translit_module = "os-translit", ancestors = {"oos"}, entry_name = { from = {GRAVE, ACUTE}, to = {}} , } m["pa"] = { canonicalName = "Punjabi", otherNames = {"Panjabi"}, scripts = {"Guru", "Arab", "Deva"}, family = "inc", translit_module = "pa-translit", ancestors = {"psu"}, } m["pi"] = { canonicalName = "Pali", scripts = {"Latn", "Deva", "Sinh", "Mymr", "Khmr", "Thai"}, family = "inc", ancestors = {"bh"}, sort_key = { from = {"ā", "ī", "ū", "ḍ", "ḷ", "[ṁṃ]", "[ṇñṅ]", "ṭ"}, to = {"a", "i", "u", "d", "l", "m" , "n" , "t"}} , } m["pl"] = { canonicalName = "Polonice", otherNames = {"Polonica", "Polonicae", "Polonici", "Polonicum", "Polonica", "Polonicae", "język polski", "Polonicus", "Polonica", "Polonicarum"}, scripts = {"Latn"}, family = "zlw", ancestors = {"zlw-opl"}, sort_key = { from = {"[Ąą]", "[Ćć]", "[Ęę]", "[Łł]", "[Ńń]", "[Óó]", "[Śś]", "[Żż]", "[Źź]"}, to = { "a" .. u(0x10FFFF), "c" .. u(0x10FFFF), "e" .. u(0x10FFFF), "l" .. u(0x10FFFF), "n" .. u(0x10FFFF), "o" .. u(0x10FFFF), "s" .. u(0x10FFFF), "z" .. u(0x10FFFF), "z" .. u(0x10FFFE)}} , } m["ps"] = { canonicalName = "Afganice", otherNames = {"Afganica", "Afganicae", "Afganici", "Afganicum", "Afganica", "Afganicae", "پښتو", "Afganicus", "Afganica", "Afganicarum", "Pashtun", "Pushto", "Pashtu", "Central Pashto", "Northern Pashto", "Southern Pashto", "Pukhto", "Pakhto", "Pakkhto", "Afghani"}, scripts = {"ps-Arab"}, family = "ira-eas", } m["pt"] = { canonicalName = "Lusitane", otherNames = {"Lusitana", "Lusitanae", "Lusitani", "Lusitanum", "Lusitana", "Lusitanae", "português", "Lusitanus", "Lusitana", "Lusitanarum", "Modern Portuguese"}, scripts = {"Latn"}, family = "roa", ancestors = {"roa-opt"}, sort_key = { from = {"[àãáâä]", "[èẽéêë]", "[ìĩíï]", "[òóôõö]", "[üúùũ]", "ç", "ñ"}, to = {"a" , "e" , "i" , "o" , "u" , "c", "n"}} , } m["qu"] = { canonicalName = "Quechua", otherNames = {"Quechua", "Quechuae", "Quechui", "Quechuum", "Quechua", "Quechuae", "Runasimi", "Quechuus", "Quechua", "Quechuarum", "Qhichwa simi"}, scripts = {"Latn"}, family = "qwe", } m["rm"] = { canonicalName = "Raetice", otherNames = {"Raetica", "Raeticae", "Raetici", "Raeticum", "Raetica", "Raeticae", "rumantsch", "Raeticus", "Raetica", "Raeticarum", "Romansh", "Romanche"}, scripts = {"Latn"}, family = "roa", } m["rn"] = { canonicalName = "Kirundi", scripts = {"Latn"}, family = "bnt", } m["ro"] = { canonicalName = "Dacoromane", otherNames = {"Dacoromana", "Dacoromanae", "Dacoromani", "Dacoromanum", "Dacoromana", "Dacoromanae", "româna", "Dacoromanus", "Dacoromana", "Dacoromanarum", "Daco-Romanian", "Roumanian", "Rumanian"}, scripts = {"Latn", "Cyrl"}, family = "roa", sort_key = { from = {"ă" , "â" , "î" , "ș" , "ț" }, to = {"a~", "a~~", "i~", "s~", "t~"}}, } m["ru"] = { canonicalName = "Ruthenice", otherNames = {"Ruthenica", "Ruthenicae", "Ruthenici", "Ruthenicum", "Ruthenica", "Ruthenicae", "русский язык", "Ruthenicus", "Ruthenica", "Ruthenicarum"}, scripts = {"Cyrl"}, family = "zle", translit_module = "ru-translit", sort_key = { from = {"ё"}, to = {"е" .. mw.ustring.char(0x10FFFF)}}, entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["rw"] = { canonicalName = "Kinyarwanda", otherNames = {"Rwanda"}, scripts = {"Latn"}, family = "bnt", } m["sa"] = { canonicalName = "Sanscrite", otherNames = {"Sanscrita", "Sanscritae", "Sanscriti", "Sanscritum", "Sanscrita", "Sanscritae", "संस्कृत", "Sanscritus", "Sanscrita", "Sanscritarum"}, scripts = {"Deva", "Beng", "Brah", "Gran", "Gujr", "Guru", "Khar", "Knda", "Mlym", "Mymr", "Orya", "Shrd", "Sinh", "Taml", "Telu", "Thai", "Tibt"}, family = "inc", translit_module = "sa-translit", } -- 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf, 7=ipsa, 8=nomsgm, 9=ablsgf, 10=genplf m["sc"] = { "Sarde", "Q33976", "roa", otherNames = {"Sarda", "Sardae", "Sardi", "Sardum", "Sarda", "Sardae", "sarda", "Sardus", "Sarda", "Sardarum", "Campidanese", "Campidanese Sardinian", "Logudorese", "Logudorese Sardinian", "Nuorese", "Nuorese Sardinian"}, scripts = {"Latn"}, } m["sd"] = { canonicalName = "Sindhi", scripts = {"sd-Arab", "Deva"}, family = "inc", } -- otherNames is used for inflected forms: 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf m["se"] = { canonicalName = "Lapponica Septentrionali", otherNames = {"Lapponica Septentrionalis", "Lapponicae Septentrionales", "Lapponici Septentrionales", "Lapponicum Septentrionale", "Lapponica Septentrionalia", "Lapponicae Septentrionalis", "Davvisámegiella", "Lapponicus Septentrionalis", "Lapponica Septentrionali", "Lapponicarum Septentrionalium", "Samica septentrionalis", "North Sami", "Northern Saami", "North Saami"}, scripts = {"Latn"}, family = "smi", entry_name = { from = {"([đflmnŋrsšŧv])'%1"}, to = {"%1%1"} }, } m["sg"] = { canonicalName = "Sango", scripts = {"Latn"}, family = "crp", } m["sh"] = { canonicalName = "Servocroate", otherNames = {"Servocroata", "Servocroatae", "Servocroati", "Servocroatum", "Servocroata", "Servocroatae", "srpskohrvatski", "Servocroatus", "Servocroata", "Servocroatarum", "BCS", "Croato-Serbian", "Serbocroatian", "Bosnian", "Croatian", "Montenegrin", "Serbian"}, scripts = {"Latn", "Cyrl"}, family = "zls", entry_name = { from = {"[ȀÀȂÁĀ]", "[ȁàȃáā]", "[ȄÈȆÉĒ]", "[ȅèȇéē]", "[ȈÌȊÍĪ]", "[ȉìȋíī]", "[ȌÒȎÓŌ]", "[ȍòȏóō]", "[ȐȒŔ]", "[ȑȓŕ]", "[ȔÙȖÚŪ]", "[ȕùȗúū]", "Ѐ", "ѐ", "[ӢЍ]", "[ӣѝ]", "[Ӯ]", "[ӯ]", GRAVE, ACUTE, DGRAVE, INVBREVE, MACRON}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "Е", "е", "И" , "и", "У", "у" }}, wikimedia_codes = {"sh", "bs", "hr", "sr"}, } m["si"] = { canonicalName = "Sinhalese", otherNames = {"Singhalese", "Sinhala"}, scripts = {"Sinh"}, family = "inc", ancestors = {"pmh"}, translit_module = "si-translit", } m["sk"] = { canonicalName = "Slovace", otherNames = {"Slovaca", "Slovacae", "Slovaci", "Slovacum", "Slovaca", "Slovacae", "slovenčina", "Slovacus", "Slovaca", "Slovacarum"}, scripts = {"Latn"}, family = "zlw", sort_key = { from = {"[áä]", "é", "í", "[óô]", "ú", "ý", "ŕ", "ĺ"}, to = {"a" , "e", "i", "o" , "u", "y", "r", "l"}} , } m["sl"] = { canonicalName = "Slovene", otherNames = {"Slovena", "Slovenae", "Sloveni", "Slovenum", "Slovena", "Slovenae", "slovenščina", "Slovenus", "Slovena", "Slovenarum", "Slovenian"}, scripts = {"Latn"}, family = "zls", entry_name = { from = {"[ÁÀÂȂȀ]", "[áàâȃȁ]", "[ÉÈÊȆȄỆẸ]", "[éèêȇȅệẹə]", "[ÍÌÎȊȈ]", "[íìîȋȉ]", "[ÓÒÔȎȌỘỌ]", "[óòôȏȍộọ]", "[ŔȒȐ]", "[ŕȓȑ]", "[ÚÙÛȖȔ]", "[úùûȗȕ]", "ł", GRAVE, ACUTE, DGRAVE, INVBREVE, CIRC, DOTBELOW}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "l"}} , } m["sm"] = { canonicalName = "Samoane", otherNames = {"Samoana", "Samoanae", "Samoani", "Samoanum", "Samoana", "Samoanae", "gagana Sāmoa", "Samoanus", "Samoana", "Samoanarum"}, scripts = {"Latn"}, family = "poz-pol", } m["sn"] = { canonicalName = "Shona", scripts = {"Latn"}, family = "bnt", } m["so"] = { canonicalName = "Somali", scripts = {"Latn", "Arab", "Osma"}, family = "cus", entry_name = { from = {"[ÁÀÂ]", "[áàâ]", "[ÉÈÊ]", "[éèê]", "[ÍÌÎ]", "[íìî]", "[ÓÒÔ]", "[óòô]", "[ÚÙÛ]", "[úùû]", "[ÝỲ]", "[ýỳ]"}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "U" , "u", "Y", "y"}} , } m["sq"] = { canonicalName = "Illyrice", otherNames = {"Illyrica", "Illyricae", "Illyrici", "Illyricum", "Illyrica", "Illyricae", "shqipja", "Illyricus", "Illyrica", "Illyricarum"}, scripts = {"Latn", "Elba"}, family = "sqj", sort_key = { from = { '[âãä]', '[ÂÃÄ]', '[êẽë]', '[ÊẼË]', 'ĩ', 'Ĩ', 'õ', 'Õ', 'ũ', 'Ũ', 'ỹ', 'Ỹ', 'ç', 'Ç' }, to = { 'a', 'A', 'e', 'E', 'i', 'I', 'o', 'O', 'u', 'U', 'y', 'Y', 'c', 'C' } } , } m["sr"] = { canonicalName = "Service", otherNames = {"Servica", "Servicae", "Servici", "Servicum", "Servica", "Servicae", "српски / srpski", "Servicus", "Servica", "Servicarum"}, scripts = {"Latn", "Cyrl"}, family = "zls", translit_module = "sh-translit", entry_name = { from = {"[ȀÀȂÁĀ]", "[ȁàȃáā]", "[ȄÈȆÉĒ]", "[ȅèȇéē]", "[ȈÌȊÍĪ]", "[ȉìȋíī]", "[ȌÒȎÓŌ]", "[ȍòȏóō]", "[ȐȒŔ]", "[ȑȓŕ]", "[ȔÙȖÚŪ]", "[ȕùȗúū]", "Ѐ", "ѐ", "[ӢЍ]", "[ӣѝ]", "[Ӯ]", "[ӯ]", GRAVE, ACUTE, DGRAVE, INVBREVE, MACRON}, to = {"A" , "a" , "E" , "e" , "I" , "i" , "O" , "o" , "R" , "r" , "U" , "u" , "Е", "е", "И" , "и", "У", "у" }}, wikimedia_codes = {"sh", "bs", "hr", "sr"}, } m["ss"] = { canonicalName = "Swazi", otherNames = {"Swati"}, scripts = {"Latn"}, family = "bnt-ngu", } m["st"] = { canonicalName = "Sotho Meridionali", otherNames = {"Sesotho", "Southern Sesotho", "Southern Sotho"}, scripts = {"Latn"}, family = "bnt", } m["su"] = { canonicalName = "Sondaice", scripts = {"Latn", "Sund"}, family = "poz-msa", translit_module = "su-translit", } m["sv"] = { canonicalName = "Suecice", otherNames = {"Suecica", "Suecicae", "Suecici", "Suecicum", "Suecica", "Suecicae", "svenska", "Suecicus", "Suecica", "Suecicarum"}, scripts = {"Latn"}, family = "gmq", ancestors = {"gmq-osw"}, } m["sw"] = { canonicalName = "Suahelice", otherNames = {"Suahelica", "Suahelicae", "Suahelici", "Suahelicum", "Suahelica", "Suahelicae", "Kiswahili", "Suahelicus", "Suahelica", "Suahelicarum", "Settler Swahili", "KiSetla", "KiSettla", "Setla", "Settla", "Kitchen Swahili", "Kihindi", "Indian Swahili", "KiShamba", "Kishamba", "Field Swahili", "Kibabu", "Asian Swahili", "Kimanga", "Arab Swahili", "Kitvita", "Army Swahili"}, scripts = {"Latn", "Arab"}, family = "bnt", sort_key = { from = {"ng'", "^-"}, to = {"ngz"}} , } m["ta"] = { canonicalName = "Tamulice", otherNames = {"Tamulica", "Tamulicae", "Tamulici", "Tamulicum", "Tamulica", "Tamulicae", "தமிழ்", "Tamulicus", "Tamulica", "Tamulicarum", "Tamil"}, scripts = {"Taml"}, family = "dra", ancestors = {"oty"}, translit_module = "ta-translit", } m["te"] = { canonicalName = "Teluguice", scripts = {"Telu"}, family = "dra", translit_module = "te-translit", } m["tg"] = { canonicalName = "Tadzikice", otherNames = {"Tadzikica", "Tadzikicae", "Tadzikici", "Tadzikicum", "Tadzikica", "Tadzikicae", "Тоҷикӣ", "Tadzikicus", "Tadzikica", "Tadzikicarum", "Tajik", "Tadjik", "Tadzhik", "Tajiki", "Tajik Persian"}, scripts = {"Cyrl", "fa-Arab", "Latn"}, family = "ira-wes", ancestors = {"fa"}, translit_module = "tg-translit", sort_key = { from = {"Ё", "ё"}, to = {"Е" , "е"}} , entry_name = { from = {ACUTE}, to = {}} , } m["th"] = { canonicalName = "Siamense", otherNames = {"Siamensis", "Siamenses", "Siamenses", "Siamense", "Siamensia", "Siamensis", "ภาษาไทย", "Siamensis", "Siamensi", "Siamensium", "Thai"}, scripts = {"Thai"}, family = "tai-swe", translit_module = "th-translit", entry_name = { from = { "-" }, to = {}} , } m["ti"] = { canonicalName = "Tigrinya", scripts = {"Ethi"}, family = "sem-eth", translit_module = "Ethi-translit", } m["tk"] = { canonicalName = "Turcomannice", otherNames = {"Turcomannica", "Turcomannicae", "Turcomannici", "Turcomannicum", "Turcomannica", "Turcomannicae", "Türkmençe", "Turcomannicus", "Turcomannica", "Turcomannicarum", "Tүркменче", "Türkmen dili", "تورکمن ﺗﻴﻠی"}, scripts = {"Latn", "Cyrl"}, family = "trk-ogz", } m["tl"] = { canonicalName = "Tagale", otherNames = {"Tagala", "Tagalae", "Tagali", "Tagalum", "Tagala", "Tagalae", "Wikang Tagalog", "Tagalus", "Tagala", "Tagalarum"}, scripts = {"Latn", "Tglg"}, family = "phi", } m["tn"] = { canonicalName = "Tswana", otherNames = {"Setswana"}, scripts = {"Latn"}, family = "bnt", } m["to"] = { canonicalName = "Tongane", otherNames = {"Tongana", "Tonganae", "Tongani", "Tonganum", "Tongana", "Tonganae", "lea fakatonga", "Tonganus", "Tongana", "Tonganarum"}, scripts = {"Latn"}, family = "poz-pol", } m["tr"] = { canonicalName = "Turcice", otherNames = {"Turcica", "Turcicae", "Turcici", "Turcicum", "Turcica", "Turcicae", "Türkçe", "Turcicus", "Turcica", "Turcicarum"}, scripts = {"Latn"}, family = "trk-ogz", ancestors = {"ota"}, } m["ts"] = { canonicalName = "Tsonga", scripts = {"Latn"}, family = "bnt", } m["tt"] = { canonicalName = "Tatarice", otherNames = {"Tatarica", "Tataricae", "Tatarici", "Tataricum", "Tatarica", "Tataricae", "татарча / tatarça", "Tataricus", "Tatarica", "Tataricarum"}, scripts = {"Cyrl", "Latn", "Arab", "tt-Arab"}, family = "trk-kip", translit_module = "tt-translit", } m["ty"] = { canonicalName = "Tahitiane", otherNames = {"Tahitiana", "Tahitianae", "Tahitiani", "Tahitianum", "Tahitiana", "Tahitianae", "reo Mā’ohi", "Tahitianus", "Tahitiana", "Tahitianarum"}, scripts = {"Latn"}, family = "poz-pol", } m["ug"] = { canonicalName = "Uyghur", otherNames = {"Uigur", "Uighur", "Uygur"}, scripts = {"ug-Arab", "Latn", "Cyrl"}, family = "trk", ancestors = {"chg"}, translit_module = "ug-translit", } m["uk"] = { canonicalName = "Ucrainice", otherNames = {"Ucrainica", "Ucrainicae", "Ucrainici", "Ucrainicum", "Ucrainica", "Ucrainicae", "українська", "Ucrainicus", "Ucrainica", "Ucrainicarum"}, scripts = {"Cyrl"}, family = "zle", translit_module = "uk-translit", entry_name = { from = {"Ѐ", "ѐ", "Ѝ", "ѝ", GRAVE, ACUTE}, to = {"Е", "е", "И", "и"}}, } m["ur"] = { canonicalName = "Urdu", otherNames = {"Urdu"}, scripts = {"ur-Arab"}, family = "inc", ancestors = {"psu"}, entry_name = { from = {u(0x064B), u(0x064C), u(0x064D), u(0x064E), u(0x064F), u(0x0650), u(0x0651), u(0x0652)}, to = {}} , } m["uz"] = { canonicalName = "Usbece", otherNames = {"Northern Uzbek", "Southern Uzbek"}, scripts = {"Latn", "Cyrl", "fa-Arab"}, family = "trk", ancestors = {"chg"}, } m["ve"] = { canonicalName = "Venda", scripts = {"Latn"}, family = "bnt", } -- otherNames is used for inflected forms: 1=nomsgf, 2=nomplf, 3=nomplm, 4=nomsgn, 5=nompln, 6=gensgf m["vi"] = { canonicalName = "Vietnamice", otherNames = {"Vietnamica", "Vietnamicae", "Vietnamici", "Vietnamicum", "Vietnamica", "Vietnamicae", "tiếng Việt", "Vietnamicus", "Vietnamica", "Vietnamicarum", "Annamese", "Annamite"}, scripts = {"Latn", "Hani"}, family = "mkh-vie", ancestors = {"mkh-mvi"}, } m["vo"] = { canonicalName = "Volapük", scripts = {"Latn"}, family = "art", } m["wa"] = { canonicalName = "Vallonice", otherNames = {"Vallonica", "Vallonicae", "Vallonici", "Vallonicum", "Vallonica", "Vallonicae", "walon", "Vallonicus", "Vallonica", "Vallonicarum"}, scripts = {"Latn"}, family = "roa", ancestors = {"fro"}, sort_key = { from = {"[áàâäå]", "[éèêë]", "[íìîï]", "[óòôö]", "[úùûü]", "[ýỳŷÿ]", "ç", "'"}, to = {"a" , "e" , "i" , "o" , "u" , "y" , "c"}} , } m["wo"] = { canonicalName = "Wolof", otherNames = {"Gambian Wolof"}, -- the subsumed dialect 'wof' scripts = {"Latn", "Arab"}, family = "alv-sng", } m["xh"] = { canonicalName = "Xhosa", scripts = {"Latn"}, family = "bnt-ngu", } m["yi"] = { canonicalName = "Iudaeogermanice", otherNames = {"Iudaeogermanica", "Iudaeogermanicae", "Iudaeogermanici", "Iudaeogermanicum", "Iudaeogermanica", "Iudaeogermanicae", "יידיש", "Iudaeogermanicus", "Iudaeogermanica", "Iudaeogermanicarum", "Jiddisch"}, scripts = {"Hebr"}, family = "gmw", ancestors = {"gmh"}, translit_module = "yi-translit", } m["yo"] = { canonicalName = "Yoruba", scripts = {"Latn"}, family = "alv-von", } m["za"] = { canonicalName = "Zhuang", scripts = {"Latn", "Hani"}, family = "tai", } m["zh"] = { canonicalName = "Sinice", otherNames = {"Sinica", "Sinicae", "Sinici", "Sinicum", "Sinica", "Sinicae", "中文", "Sinicus", "Sinica", "Sinicarum"}, scripts = {"Hani"}, family = "sit", ancestors = {"ltc"}, } m["zu"] = { canonicalName = "Zuluane", otherNames = {"Zuluana", "Zuluanae", "Zuluani", "Zuluanum", "Zuluana", "Zuluanae", "isiZulu", "Zuluanus", "Zuluana", "Zuluanarum"}, scripts = {"Latn"}, family = "bnt-ngu", } return m jykbfcw55hb8eu2wo59i1qhsb4hzxbc Formula:nexus-d 10 24577 220198 204848 2022-08-14T13:13:55Z YaganZ 4537 curatura V3 wikitext text/x-wiki <!-- Formula:nexus-d -- 2022-08-14 -- V3 -- --->{{#if: {{{1|}}}<!-- lingua ---->| {{#switch: {{{1}}}<!-- ------->| la=[[Auxilium:Lingua {{quod-n|la|nomsgf|ext}}|{{quod-n|la|adv|ext}}]]: <!-- --------->{{#if: {{{div|}}}<!-- ---------->|'''{{nexus-a|{{{1|la}}}|{{{div}}}|{{{3|{{{2|{{PAGENAME}}}}}}}}|{{{div}}}}}'''<!-- ---------->|'''{{la-nx|{{{2|{{PAGENAME}}}}}|{{{3|{{{2|{{PAGENAME}}}}}}}}}}'''<!-- --------->}}<!--if div. ------->| #default=[[Auxilium:Lingua {{quod-n|{{{1}}}|nomsgf|ext}}|{{quod-n|{{{1}}}|adv|ext}}]]: <!-- ------------------>{{exsistit | {{{2|{{PAGENAME}}}}}_({{{1}}})<!-- ------------------->| '''[[{{{2|{{PAGENAME}}}}}_({{{1}}})|{{{3|{{{2|{{PAGENAME}}}}}}}}]]'''<!-- ------------------->| '''[[{{{2|{{PAGENAME}}}}}#{{quod-n|{{{1}}}|adv|ext}}|{{{3|{{{2|{{PAGENAME}}}}}}}}]]'''<!-- ------------------>}}<!-- ------>}}<!-- switch. ------>{{#ifeq:{{{x|}}}|{{{x|-}}}<!-- ------->| &nbsp;({{xlit|{{#if: {{{x}}} | {{{x}}} | {{{1}}} }}|{{{2|{{PAGENAME}}}}}}})<!-- ------>}}<!-- ifequ x. ---->| '''[[{{{2|{{PAGENAME}}}}}|{{{3|{{{2|{{PAGENAME}}}}}}}}]]'''<!-- --->}}<!-- if 1. --><noinclude> [[Categoria:Formulae nectentes|nexus-d]] {{documentatio}} </noinclude> 02p9264l57e1k17p5fm2ieyd2fdvq56 Module:sh-translit 828 25978 220203 152103 2022-08-14T14:01:23Z YaganZ 4537 curatura Scribunto text/plain local export = {} local tt = {} tt["Cyrl"] = { ["А"]='A', ["а"]='a', ["Б"]='B', ["б"]='b', ["В"]='V', ["в"]='v', ["Г"]='G', ["г"]='g', ["Д"]='D', ["д"]='d', ["Ђ"]='Đ', ["ђ"]='đ', ["Е"]='E', ["е"]='e', ["Ж"]='Ž', ["ж"]='ž', ["З"]='Z', ["з"]='z', ["И"]='I', ["и"]='i', ["Ј"]='J', ["ј"]='j', ["К"]='K', ["к"]='k', ["Л"]='L', ["л"]='l', ["Љ"]='Lj', ["љ"]='lj', ["М"]='M', ["м"]='m', ["Н"]='N', ["н"]='n', ["Њ"]='Nj', ["њ"]='nj', ["О"]='O', ["о"]='o', ["П"]='P', ["п"]='p', ["Р"]='R', ["р"]='r', ["С"]='S', ["с"]='s', ["Т"]='T', ["т"]='t', ["Ћ"]='Ć', ["ћ"]='ć', ["У"]='U', ["у"]='u', ["Ф"]='F', ["ф"]='f', ["Х"]='H', ["х"]='h', ["Ц"]='C', ["ц"]='c', ["Ч"]='Č', ["ч"]='č', ["Џ"]='Dž', ["џ"]='dž', ["Ш"]='Š', ["ш"]='š', --letters with diacritics ["Ѐ"]='È', ["ѐ"]='è', ["Ѝ"]='Ì', ["ѝ"]='ì', ["Ӣ"]='Ī', ["ӣ"]='ī', ["Ӯ"]='Ū', ["ӯ"]='ū', -- proposed Montenegrin letters ["Ć"]='Ś', ["ć"]='ś' }; tt["Latn"] = { --Digraphs ["Lj"]='Љ', ["lj"]='љ', ["Nj"]='Њ', ["nj"]='њ', ["Dž"]='Џ', ["dž"]='џ', ["A"]='А', ["a"]='а', ["B"]='Б', ["b"]='б', ["V"]='В', ["v"]='в', ["G"]='Г', ["g"]='г', ["D"]='Д', ["d"]='д', ["Đ"]='Ђ', ["đ"]='ђ', ["E"]='Е', ["e"]='е', ["Ž"]='Ж', ["ž"]='ж', ["Z"]='З', ["z"]='з', ["I"]='И', ["i"]='и', ["J"]='Ј', ["j"]='ј', ["K"]='К', ["k"]='к', ["L"]='Л', ["l"]='л', ["M"]='М', ["m"]='м', ["N"]='Н', ["n"]='н', ["O"]='О', ["o"]='о', ["P"]='П', ["p"]='п', ["R"]='Р', ["r"]='р', ["S"]='С', ["s"]='с', ["T"]='Т', ["t"]='т', ["Ć"]='Ћ', ["ć"]='ћ', ["U"]='У', ["u"]='у', ["F"]='Ф', ["f"]='ф', ["H"]='Х', ["h"]='х', ["C"]='Ц', ["c"]='ц', ["Č"]='Ч', ["č"]='ч', ["Š"]='Ш', ["š"]='ш', --letters with diacritics ["È"]='Ѐ', ["è"]='ѐ', ["Ì"]='Ѝ', ["ì"]='ѝ', ["Ī"]='Ӣ', ["ī"]='ӣ', ["Ū"]='Ӯ', ["ū"]='ӯ', ["Á"]='А́', ["á"]='а́', ["À"]='А̀', ["à"]='а̀', ["Ā"]='А̄', ["ā"]='а̄', ["Ȁ"]='А̏', ["ȁ"]='а̏', ["Ȃ"]='А̑', ["ȃ"]='а̑', ["É"]='Е́', ["é"]='е́', ["Ē"]='Е̄', ["ē"]='е̄', ["Ȅ"]='Е̏', ["ȅ"]='е̏', ["Ȇ"]='Е̑', ["ȇ"]='е̑', ["Í"]='И́', ["í"]='и́', ["Ȉ"]='И̏', ["ȉ"]='и̏', ["Ȋ"]='И̑', ["ȋ"]='и̑', ["Ó"]='О́', ["ó"]='о́', ["Ò"]='О̀', ["ò"]='о̀', ["Ō"]='О̄', ["ō"]='о̄', ["Ȍ"]='О̏', ["ȍ"]='о̏', ["Ȏ"]='О̑', ["ȏ"]='о̑', ["Ŕ"]='Р́', ["ŕ"]='р́', ["Ȑ"]='Р̏', ["ȑ"]='р̏', ["Ȓ"]='Р̑', ["ȓ"]='р̑', ["Ú"]='У́', ["ú"]='у́', ["Ù"]='У̀', ["ù"]='у̀', ["Ȕ"]='У̏', ["ȕ"]='у̏', ["Ȗ"]='У̑', ["ȗ"]='у̑', -- proposed Montenegrin letters ["Ź"]='З́', ["ź"]='з́', ["Ś"]='Ć', ["ś"]='ć', -- backtick needs to be removed so that "nad`živeti" returns "надживети" ["`"]="" }; function export.tr(text, lang, sc) if (sc == "Latn") then text = mw.ustring.gsub(text, '[dDnNlL][jž]', tt[sc]) end return mw.ustring.toNFC(mw.ustring.gsub(text, '.', tt[sc])) end return export quwjyduck778zozyadxqbq1rv2a62w8 Module:sr-translit2 828 26662 220207 154797 2022-08-14T14:47:56Z YaganZ 4537 test redirect Scribunto text/plain REDIRECT [[Module:mk-translit]] -- Module:sr-translit -- Transliteration of Serbian Cyrillic to Latin. -- Derived from the transliteration module of Serbo-Croatian to Latin: Module:sh-transit. -- Last modified 2016-10-04 by YaganZ local export = {} local tt = { ["А"]='A', ["а"]='a', ["Б"]='B', ["б"]='b', ["В"]='V', ["в"]='v', ["Г"]='G', ["г"]='g', ["Д"]='D', ["д"]='d', ["Ђ"]='Đ', ["ђ"]='đ', ["Е"]='E', ["е"]='e', ["Ж"]='Ž', ["ж"]='ž', ["З"]='Z', ["з"]='z', ["И"]='I', ["и"]='i', ["Ј"]='J', ["ј"]='j', ["К"]='K', ["к"]='k', ["Л"]='L', ["л"]='l', ["Љ"]='Lj', ["љ"]='lj', ["М"]='M', ["м"]='m', ["Н"]='N', ["н"]='n', ["Њ"]='Nj', ["њ"]='nj', ["О"]='O', ["о"]='o', ["П"]='P', ["п"]='p', ["Р"]='R', ["р"]='r', ["С"]='S', ["с"]='s', ["Т"]='T', ["т"]='t', ["Ћ"]='Ć', ["ћ"]='ć', ["У"]='U', ["у"]='u', ["Ф"]='F', ["ф"]='f', ["Х"]='H', ["х"]='h', ["Ц"]='C', ["ц"]='c', ["Ч"]='Č', ["ч"]='č', ["Џ"]='Dž', ["џ"]='dž', ["Ш"]='Š', ["ш"]='š', -- letters with diacritics ["ѐ"]='è', ["Ѐ"]='È', ["ѝ"]='ì', ["Ѝ"]='Ì', ["ӣ"]='ī', ["Ӣ"]='Ī', ["ӯ"]='ū', ["Ӯ"]='Ū', ["а́"]='á', ["А́"]='Á', ["а̀"]='à', ["А̀"]='À', ["а̄"]='ā', ["А̄"]='Ā', ["а̏"]='ȁ', ["А̏"]='Ȁ', ["а̑"]='ȃ', ["А̑"]='Ȃ', ["е́"]='é', ["Е́"]='É', ["е̄"]='ē', ["Е̄"]='Ē', ["е̏"]='ȅ', ["Е̏"]='Ȅ', ["е̑"]='ȇ', ["Е̑"]='Ȇ', ["и́"]='í', ["И́"]='Í', ["и̏"]='ȉ', ["И̏"]='Ȉ', ["и̑"]='ȋ', ["И̑"]='Ȋ', ["о́"]='ó', ["О́"]='Ó', ["о̀"]='ò', ["О̀"]='Ò', ["о̄"]='ō', ["О̄"]='Ō', ["о̏"]='ȍ', ["О̏"]='Ȍ', ["о̑"]='ȏ', ["О̑"]='Ȏ', ["р́"]='ŕ', ["Р́"]='Ŕ', ["р̀"]='r̀', ["Р̀"]='R̀', ["р̄"]='r̄', ["Р̄"]='R̄', ["р̏"]='ȑ', ["Р̏"]='Ȑ', ["р̑"]='ȓ', ["Р̑"]='Ȓ', ["у́"]='ú', ["У́"]='Ú', ["у̀"]='ù', ["У̀"]='Ù', ["у̏"]='ȕ', ["У̏"]='Ȕ', ["у̑"]='ȗ', ["У̑"]='Ȗ', -- proposed Montenegrin letters ["З́"]='Ź', ["з́"]='ź', ["Ć"]='Ś', ["ć"]='ś' }; function export.tr(text, lang, sc) return (mw.ustring.gsub(f, '.', tt)) end return export 73n7ybgdkijus3at5okx1gthoj74536 220209 220207 2022-08-14T14:51:26Z YaganZ 4537 YaganZ movit paginam [[Module:sr-translit]] ad [[Module:sr-translit2]] sine redirectione: not working Scribunto text/plain REDIRECT [[Module:mk-translit]] -- Module:sr-translit -- Transliteration of Serbian Cyrillic to Latin. -- Derived from the transliteration module of Serbo-Croatian to Latin: Module:sh-transit. -- Last modified 2016-10-04 by YaganZ local export = {} local tt = { ["А"]='A', ["а"]='a', ["Б"]='B', ["б"]='b', ["В"]='V', ["в"]='v', ["Г"]='G', ["г"]='g', ["Д"]='D', ["д"]='d', ["Ђ"]='Đ', ["ђ"]='đ', ["Е"]='E', ["е"]='e', ["Ж"]='Ž', ["ж"]='ž', ["З"]='Z', ["з"]='z', ["И"]='I', ["и"]='i', ["Ј"]='J', ["ј"]='j', ["К"]='K', ["к"]='k', ["Л"]='L', ["л"]='l', ["Љ"]='Lj', ["љ"]='lj', ["М"]='M', ["м"]='m', ["Н"]='N', ["н"]='n', ["Њ"]='Nj', ["њ"]='nj', ["О"]='O', ["о"]='o', ["П"]='P', ["п"]='p', ["Р"]='R', ["р"]='r', ["С"]='S', ["с"]='s', ["Т"]='T', ["т"]='t', ["Ћ"]='Ć', ["ћ"]='ć', ["У"]='U', ["у"]='u', ["Ф"]='F', ["ф"]='f', ["Х"]='H', ["х"]='h', ["Ц"]='C', ["ц"]='c', ["Ч"]='Č', ["ч"]='č', ["Џ"]='Dž', ["џ"]='dž', ["Ш"]='Š', ["ш"]='š', -- letters with diacritics ["ѐ"]='è', ["Ѐ"]='È', ["ѝ"]='ì', ["Ѝ"]='Ì', ["ӣ"]='ī', ["Ӣ"]='Ī', ["ӯ"]='ū', ["Ӯ"]='Ū', ["а́"]='á', ["А́"]='Á', ["а̀"]='à', ["А̀"]='À', ["а̄"]='ā', ["А̄"]='Ā', ["а̏"]='ȁ', ["А̏"]='Ȁ', ["а̑"]='ȃ', ["А̑"]='Ȃ', ["е́"]='é', ["Е́"]='É', ["е̄"]='ē', ["Е̄"]='Ē', ["е̏"]='ȅ', ["Е̏"]='Ȅ', ["е̑"]='ȇ', ["Е̑"]='Ȇ', ["и́"]='í', ["И́"]='Í', ["и̏"]='ȉ', ["И̏"]='Ȉ', ["и̑"]='ȋ', ["И̑"]='Ȋ', ["о́"]='ó', ["О́"]='Ó', ["о̀"]='ò', ["О̀"]='Ò', ["о̄"]='ō', ["О̄"]='Ō', ["о̏"]='ȍ', ["О̏"]='Ȍ', ["о̑"]='ȏ', ["О̑"]='Ȏ', ["р́"]='ŕ', ["Р́"]='Ŕ', ["р̀"]='r̀', ["Р̀"]='R̀', ["р̄"]='r̄', ["Р̄"]='R̄', ["р̏"]='ȑ', ["Р̏"]='Ȑ', ["р̑"]='ȓ', ["Р̑"]='Ȓ', ["у́"]='ú', ["У́"]='Ú', ["у̀"]='ù', ["У̀"]='Ù', ["у̏"]='ȕ', ["У̏"]='Ȕ', ["у̑"]='ȗ', ["У̑"]='Ȗ', -- proposed Montenegrin letters ["З́"]='Ź', ["з́"]='ź', ["Ć"]='Ś', ["ć"]='ś' }; function export.tr(text, lang, sc) return (mw.ustring.gsub(f, '.', tt)) end return export 73n7ybgdkijus3at5okx1gthoj74536 devet (sl) 0 50015 220212 220096 2022-08-14T15:08:11Z YaganZ 4537 YaganZ movit paginam [[devet]] ad [[devet (sl)]]: disambiguatio wikitext text/x-wiki {{caput|sl|devet|devet}} =={{-lingua-|sl|devet}}== {{progressor|retro2=8|osem|devet|deset|porro2=10|sl}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|Sl-devet.ogg|[dɛˈveːt]||sl|devet}} <!-- Exemplum Slovenum apellationis --> :{{syllabae|de|vet|morph=devet}} <!-- Exemplum syllabificationis --> ==={{cardinalis|sl}}=== '''devet''' #{{la-nx|novem}} || Octo et alius. Numerus IX. ==={{collatae}}=== *{{sl-nx|deveti}} *{{sl-nx|devetiški}} ncpm8p71godtus73dqqj1zff7qhyn4i deset (sl) 0 50016 220215 220097 2022-08-14T15:19:14Z YaganZ 4537 YaganZ movit paginam [[deset]] ad [[deset (sl)]]: disambiguatio wikitext text/x-wiki {{caput|sl|deset|deset}} =={{-lingua-|sl|deset}}== {{progressor|retro2=9|devet|deset|enajst|porro2=11|sl}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|Sl-deset.ogg|[dɛˈseːt]||sl|deset}} <!-- Exemplum Slovenum apellationis --> :{{syllabae|de|set|morph=deset}} <!-- Exemplum syllabificationis --> ==={{cardinalis|sl}}=== '''deset''' #{{la-nx|decem}} || Novem et alius. Numerus X. ==={{collatae}}=== *{{sl-nx|deseti}} *{{sl-nx|desetiški}} hhntaq61brmftw2psdqlruf5mzcwkuk šeststo (sl) 0 50056 220229 220178 2022-08-14T21:20:02Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|sl|šeststo|šeststo}} =={{-lingua-|sl|šeststo}}== {{progressor|retro2=500|petsto|šeststo|sedemsto|porro2=700|sl}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|Sl-šeststo.ogg|[ˈʃeːst.sto], [ˈʃeːsto]||sl|šeststo}} <!-- Exemplum Slovenum apellationis --> :{{syllabae|šest|sto|morph=šest-sto}} <!-- Exemplum syllabificationis --> ==={{formae}}=== <!-- Formae aliae --> *šę̑ststọ ==={{cardinalis|sl}}=== '''šeststo''' #{{la-nx|sescentī}} || {{la-nx|quīngentī nōnāgintā novem|Quingenti nonaginta novem}} et alius. Numerus DC. ==={{derivatae}}=== *{{sl-nx|šeststoti}} bryapp0kknhpc00kz7e08ozs9b37c45 trideset 0 50061 220218 220183 2022-08-14T17:20:04Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|trideset}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|bg|тридесет|x=}} *{{nexus-d|mk|триесет|x=}} *{{nexus-d|pl|trzydzieści}} *{{nexus-d|ru|тридцать|x=}} *{{nexus-d|sr|тридесет|x=mk}} *{{nexus-d|sk|tridsať}} *{{nexus-d|uk|тридцять|x=}} 33ac0c6ewk3xejruqbpwvnc6u54mrpd dvesto 0 50062 220220 220184 2022-08-14T18:01:28Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|dvesto}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|bg|двеста|x=}} *{{nexus-d|hr|dvjesto}} *{{nexus-d|mk|двесте|x=}} *{{nexus-d|ru|двести|x=}} *{{nexus-d|sh|dvjesto}} *{{nexus-d|sr|двјесто|x=mk}} *{{nexus-d|uk|двісті|x=}} 1m6uxmhhu23jv3aokc0oo54ykpvv3pe tristo 0 50063 220221 220185 2022-08-14T18:14:48Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|tristo}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|be|трыста|x=}} *{{nexus-d|cs|třista}} *{{nexus-d|bg|триста|x=}} *{{nexus-d|mk|триста|x=}} *{{nexus-d|ru|триста|x=}} *{{nexus-d|sr|тристо|x=mk}} *{{nexus-d|uk|тристо|x=}} gmaromzoxzo3jgr28ldbz0ye60wh0ve petsto 0 50064 220222 220186 2022-08-14T18:29:50Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|petsto}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|sk|päťsto}} *{{nexus-d|sr|петсто|x=mk}} g9kssemgibq8jo3a611lsabwbghf29l šeststo 0 50065 220223 220187 2022-08-14T20:20:29Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|šeststo}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|sk|šesťsto}} *{{nexus-d|sr|шестсто|x=mk}} chn8wrhozjjq5jsm4bp7euagyhx50bi sedemsto 0 50066 220224 220188 2022-08-14T20:23:32Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|sedemsto}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|hr|sedamsto}} *{{nexus-d|sr|седамсто|x=mk}} *{{nexus-d|sh|sedamsto}} be7mvpwlzkp0v59zd0ics04xheewa8l osemsto 0 50067 220225 220189 2022-08-14T20:24:46Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|osemsto}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|sk}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|hr|osamsto}} *{{nexus-d|sr|осамсто|x=mk}} *{{nexus-d|sh|osamsto}} b2wji4i5fn8e6lgimwdjboobo2qay63 devetsto 0 50068 220226 220190 2022-08-14T20:30:22Z YaganZ 4537 curatura wikitext text/x-wiki {{caput|mul|devetsto}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|sr|деветсто|x=mk}} *{{nexus-d|sk|deväťsto}} lfa7e2apae9jdeenx9tozgk2b1cuef7 šest 0 50070 220194 2022-08-14T12:09:41Z YaganZ 4537 pagina nova wikitext text/x-wiki {{caput|mul|šest}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|cs}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|bg|шест}} ({{xlit|bg|шест}}) *{{nexus-d|pl|sześć}} *{{nexus-d|ru|шесть}} ({{xlit|ru|шесть}}) *{{nexus-d|sk|šesť}} *{{nexus-d|uk|шість}} ({{xlit|uk|шість}}) pbzbgqt4mqb14t4nvlmpjqioxcxh6ym Module:sr-translit 828 50071 220210 2022-08-14T14:54:09Z YaganZ 4537 pagina nova Scribunto text/plain local export = {} local tt = { ["А"]='A', ["а"]='a', ["Б"]='B', ["б"]='b', ["В"]='V', ["в"]='v', ["Г"]='G', ["г"]='g', ["Ѓ"]='Ǵ', ["ѓ"]='ǵ', ["Д"]='D', ["д"]='d', ["Е"]='E', ["е"]='e', ["Ѐ"]='È', ["ѐ"]='è', ["Ж"]='Ž', ["ж"]='ž', ["З"]='Z', ["з"]='z', ["Ѕ"]='Dz', ["ѕ"]='dz', ["И"]='I', ["и"]='i', ["Ѝ"]='Ì', ["ѝ"]='ì', ["Ј"]='J', ["ј"]='j', ["К"]='K', ["к"]='k', ["Л"]='L', ["л"]='l', ["Љ"]='Lj', ["љ"]='lj', ["М"]='M', ["м"]='m', ["Н"]='N', ["н"]='n', ["Њ"]='Nj', ["њ"]='nj', ["О"]='O', ["о"]='o', ["П"]='P', ["п"]='p', ["Р"]='R', ["р"]='r', ["С"]='S', ["с"]='s', ["Т"]='T', ["т"]='t', ["Ќ"]='Ḱ', ["ќ"]='ḱ', ["У"]='U', ["у"]='u', ["Ф"]='F', ["ф"]='f', ["Х"]='H', ["х"]='h', ["Ц"]='C', ["ц"]='c', ["Ч"]='Č', ["ч"]='č', ["Џ"]='Dž', ["џ"]='dž', ["Ш"]='Š', ["ш"]='š', }; function export.tr(text, lang, sc) return (mw.ustring.gsub(text, '.', tt)) end return export 2f57f496nxz4m6snyvcit9rkcsq2kd8 devet 0 50072 220213 2022-08-14T15:08:11Z YaganZ 4537 YaganZ movit paginam [[devet]] ad [[devet (sl)]]: disambiguatio wikitext text/x-wiki #REDIRECT [[devet (sl)]] m8wn24dc57c58a1k5myalc1zqiqxwr1 220214 220213 2022-08-14T15:17:16Z YaganZ 4537 pagina nova, discretiva wikitext text/x-wiki {{caput|mul|devet}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|cs|devět}} *{{nexus-d|bg|девет|x=}} *{{nexus-d|ru|девять|x=}} *{{nexus-d|sr|девет|x=mk}} *{{nexus-d|sl|deväť}} *{{nexus-d|uk|дев'ять|x=}} 697dbrj2wmmu32cbncf04r3crkk2gyu deset 0 50073 220216 2022-08-14T15:19:14Z YaganZ 4537 YaganZ movit paginam [[deset]] ad [[deset (sl)]]: disambiguatio wikitext text/x-wiki #REDIRECT [[deset (sl)]] 6btab7tcvicdjb60knpmrg1qkbi7k92 220217 220216 2022-08-14T15:26:15Z YaganZ 4537 pagina nova, discretiva wikitext text/x-wiki {{caput|mul|deset}} <!-- {{subst:PAGENAME}} --> {{discretiva}} *{{nexus-d|cs}} *{{nexus-d|hr}} *{{nexus-d|sh}} *{{nexus-d|sl}} ==={{similes}}=== *{{nexus-d|bg|десет|x=}} *{{nexus-d|ru|десять|x=}} *{{nexus-d|sr|десет|x=mk}} *{{nexus-d|sk|desať}} *{{nexus-d|uk|десять|x=}} 5vra2scr7ymgcs8o501gy793c8n97zo dva tisoč 0 50074 220227 2022-08-14T20:56:59Z YaganZ 4537 pagina nova wikitext text/x-wiki {{caput|sl|dva tisoč}} <!-- {{subst:PAGENAME}} --> =={{-lingua-|sl|dva tisoč}}== {{progressor|retro2=1000|tisoč|dva tisoč|tri tisoč|porro2=3000|sl}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|Sl-dva tisoč.ogg|/ˈdʋaː tisɔt͡ʃ/||sl}} <!-- Exemplum Slovenum apellationis --> :{{syllabae|dva ti|soč|morph=dva tisoč}} <!-- Exemplum syllabificationis --> ==={{cardinalis|sl}}=== '''dva tisoč''' #{{la-nx|duo mīlia}} || {{la-nx|mīlle nōngentī nōnāgintā novem|Mille nongenti nonaginta novem}} et alius. Numerus MM. ==={{derivatae}}=== *{{sl-nx|dvatisoči}} p2j3e71lcr5hery83sdbz9lk9lc6dmv 220228 220227 2022-08-14T21:07:04Z YaganZ 4537 corr. wikitext text/x-wiki {{caput|sl|dva tisoč}} <!-- {{subst:PAGENAME}} --> =={{-lingua-|sl|dva tisoč}}== {{progressor|retro2=1000|tisoč|dva tisoč|tri tisoč|porro2=3000|sl}} ==={{appellatio}}=== <!-- Appellatio et syllabificatio --> :{{Audio|Sl-dva tisoč.ogg|/ˈdʋaː ˈtiːsɔt͡ʃ/||sl}} <!-- Exemplum Slovenum apellationis --> :{{syllabae|dva ti|soč|morph=dva tisoč}} <!-- Exemplum syllabificationis --> ==={{cardinalis|sl}}=== '''dva tisoč''' #{{la-nx|duo mīlia}} || {{la-nx|mīlle nōngentī nōnāgintā novem|Mille nongenti nonaginta novem}} et alius. Numerus MM. ==={{derivatae}}=== *{{sl-nx|dvatisoči}} e6rpxopargatzxrwlj4mv5t9mfklzo3 Categoria:Formae affines Finnicae per declinationem 14 50075 220231 2022-08-15T10:06:28Z YaganZ 4537 pagina nova wikitext text/x-wiki {{Latn-categorytoc}} [[Categoria:Formae affines {{quod-n|fi|nomplf|ext}}|declinatio]] [[Categoria:Formae affines per declinationem|{{quod-n|fi|nomplf|ext}}]] r4kdwp004zcl4wqr27q8uzitbfnylki