This module contains definitions and metadata for exceptional language codes: those that consist of two- or three-letter sequences separated by hyphens. See Wiktionary:Languages for more information.
This module must not be used directly in other modules or templates. The data should be accessed through Module:languages. For the corresponding extra data, see Module:languages/data/exceptional/extra.
The following errors were detected by Module:data consistency check:
nb
) has Middle Norwegian language (gmq-mno
) set as an ancestor, but is not in the West Scandinavian family (gmq-wes
).nb
) has Danish language (da
) set as an ancestor, but is not in the East Scandinavian family (gmq-eas
).hns
) has Bhojpuri language (bho
) set as an ancestor, but is not in the Bihari family (inc-bih
).hns
) has Awadhi language (awa
) set as an ancestor, but is not in the Eastern Hindi family (inc-hie
).alv-gtm-pro
) does not have the expected name "Proto-Ghana-Togo Mountain", even though it is the proto-language of the Ghana-Togo Mountain languages (alv-gtm
).auf-pro
) does not have the expected name "Proto-Arauan", even though it is the proto-language of the Arauan languages (auf
).awd-amc-pro
) has a proto-language code associated with the invalid code awd-amc
.awd-kmp-pro
) has a proto-language code associated with the invalid code awd-kmp
.awd-pro
) does not have the expected name "Proto-Arawakan", even though it is the proto-language of the Arawakan languages (awd
).awd-prw-pro
) has a proto-language code associated with the invalid code awd-prw
.awd-taa-pro
) does not have the expected name "Proto-Ta-Arawakan", even though it is the proto-language of the Ta-Arawakan languages (awd-taa
).dru-pro
) has a proto-language code associated with Rukai (dru
), which is not a family.euq-pro
) does not have the expected name "Proto-Vasconic", even though it is the proto-language of the Vasconic languages (euq
).gmq-pro
) does not have the expected name "Proto-North Germanic", even though it is the proto-language of the North Germanic languages (gmq
).inc-krn-pro
) does not have the expected name "Proto-KRNB lects", even though it is the proto-language of the KRNB lects (inc-krn
).mis-hkl
, is repeated in the table of aliases
.nai-chu-pro
) does not have the expected name "Proto-Chumashan", even though it is the proto-language of the Chumashan languages (nai-chu
).nai-mdu-pro
) does not have the expected name "Proto-Maiduan", even though it is the proto-language of the Maiduan languages (nai-mdu
).nai-miz-pro
) does not have the expected name "Proto-Mixe-Zoquean", even though it is the proto-language of the Mixe-Zoquean languages (nai-miz
).nai-pom-pro
) does not have the expected name "Proto-Pomoan", even though it is the proto-language of the Pomoan languages (nai-pom
).omq-maz-pro
) does not have the expected name "Proto-Mazatecan", even though it is the proto-language of the Mazatecan languages (omq-maz
).os-pro
) has a proto-language code associated with Ossetian (os
), which is not a family.poz-swa-pro
) does not have the expected name "Proto-North Sarawakan", even though it is the proto-language of the North Sarawakan languages (poz-swa
).sal-pro
) does not have the expected name "Proto-Salishan", even though it is the proto-language of the Salishan languages (sal
).smi-pro
) does not have the expected name "Proto-Sami", even though it is the proto-language of the Sami languages (smi
).tbq-kuk-pro
) does not have the expected name "Proto-Kukish", even though it is the proto-language of the Kukish languages (tbq-kuk
).xsc-sak-pro
) does not have the expected name "Proto-Sakan", even though it is the proto-language of the Sakan languages (xsc-sak
).xsc-sar-pro
) has a proto-language code associated with the invalid code xsc-sar
.lzh-lit
) has a canonical name that is not unique; it is also used by the code lzh
.preprocess_links
for ??? (th-new
) is invalid.inc-old
) has no child families or languages.lzh-lit
, is wrong; it should be Literary Chinese.lzh-lit
, is wrong; it should be Literary Chinese.ira-mid
and the canonical name Middle Iranian should be removed; they are not found in Module:families/data.ira-old
and the canonical name Old Iranian should be removed; they are not found in Module:families/data.ira-mid
and the canonical name Middle Iranian should be removed; they are not found in Module:families/data.ira-old
and the canonical name Old Iranian should be removed; they are not found in Module:families/data.Every entry in the table must contain the following indexed fields:
1
2
Q
and ends with decimal digits. Set to nil
if not known/present. This replaces the older wikipedia_article
property, which can still be used to link to specific sections or language editions.3
4
Language:findBestScript
method in Module:languages. This function goes down the list of scripts and counts how many characters in the text belong to each script. If all the characters belong to one script, that script will be returned; otherwise, the script with the most characters will be returned. Thus, script detection will be faster if the most frequently used scripts are first in the list. If none of the characters match any of the listed scripts, then the None
script is returned (even if the characters would match a script not listed). Translingual (mul
) and Undetermined (und
) have the special value "All"
, which means they are treated as having every script. This value should not be set for any other language codes."Latn, Brai, Shaw, Dsrt"
.type
regular
- This value is the default, so it doesn't need to be specified. It indicates that the is attested according to WT:CFI and therefore permitted in the main namespace. There may also be reconstructed terms for the language, which are placed in the Reconstruction namespace and must be prefixed with * to indicate a reconstruction.reconstructed
- This language is not attested according to CFI, and therefore is allowed only in the Reconstruction namespace. All terms in this language are reconstructed, and must be prefixed with *.appendix-constructed
- This language is attested but does not meet the additional requirements set out for constructed languages (WT:CFI#Constructed languages). Its entries must therefore be in the Appendix namespace, but they are not reconstructed and therefore should not have * prefixed in links.ancestors
enm
(Middle English); ang
(Old English, the ancestor of Middle English), gem-pro
(Proto-Germanic, the ancestor of Old English), and ine-pro
(Proto-Indo-European, the ancestor of Proto-Germanic) are not listed.gem-pro
) belongs to the Indo-European (ine
) family, and its direct ancestor is Proto-Indo-European (ine-pro
). Because Proto-Indo-European is the proto-language of the Indo-European languages, Proto-Germanic does not need an ancestors
table; Proto-Indo-European will be automatically returned as its ancestor by the getAncestors
function."cr, fr"
.wikimedia_codes
"en, simple"
.interwiki_langs
in Module:translations/data; and the wiktprefix
field of the `metadata` variable in MediaWiki:Gadget-TranslationAdder-Data.js. FIXME: Unify this data.wikipedia_article
translit
isTransliterated
value set to false
in Module:scripts/data. This is used by transliterate
in Module:languages.link_tr
true
to link the language's transliteration. For instance, Gothic has entries in Gothic script and entries for transliterations: 𐌷𐌻𐌰𐌹𐌱𐍃 (hlaibs). Otherwise, this can be a comma-separated list of script codes, which means that links are only applied to terms using those scripts.override_translit
true
to make the automatic transliteration override an any given manual transliteration. Otherwise, this can be a comma-separated list of script codes, which means that the override is only applied to terms using those scripts.display_text
ӏ
, used in Cyrillic in many Caucasian languages, is frequently entered as I
, or even Latin l
or I
. As this is an ongoing issue (even among native speakers), the easiest way to solve the problem is to automatically correct the display form for those languages. This is used by makeDisplayText
in Module:languages.entry_name
ру́сский
→ русский
), or macrons from Latin or Old English words (ōs
→ os
), as these are not used in the normal written form of these languages. This is used by makeEntryName
in Module:languages.sort_key
"у" .. p
. Another character could be inserted straight after by using "у" .. p
(and so on).makeSortKey
in Module:languages.dotted_dotless_i
true
for languages that distinguish between the dotted and dotless I (such as some Turkic languages).translit
, display_text
, entry_name
and sort_key
all use the same syntax, which is designed to be as flexible as possible:
"sa-translit"
refers to Module:sa-translit.from
, to
, remove_diacritics
and remove_exceptions
relate to text substitution (see below).1
can be used as a fallback, which will be used if no specific behaviour is defined for that script.1
if you want to avoid this. It is not possible to process the output of a script-specific module with another module, however: this should be done (for example) with a tail call in the first module.text, lang, sc
, where text
is the input text (usually the page name or input by the user), lang
is the language code (not the language object), and sc
is the script code (not the script object). For performance reasons, they should only be used when it is not possible to achieve the desired result via text substitution.from
and to
keys.remove_diacritics
(and optionally remove_exceptions
).from
is paired with to
, and both of them must be tables that are organised pairwise: each element in from
is a pattern to identify which characters in the term to replace, while the corresponding element in to
defines what to replace them with (as arguments to mw.ustring.gsub
).false
or nil
), then any matching characters are removed altogether. This means that the from
list can be longer than the to
list, and an empty replacement will be assumed for any elements in from
that have no counterpart in to
.mw.ustring.gsub
function. See the Scribunto reference manual for more information. Note that patterns make double substitutions a viable way to achieve more complex results. See the Latin sortkey for Mandarin (cmn
) as an example of this.remove_diacritics
is a string which contains characters that will be removed after the text is decomposed. For instance, if remove_diacritics
is a combining acute accent, all acute accents will be stripped, even if they are part of precomposed characters (such as á or ά). Despite the name, the characters to be stripped need not be diacritics: for instance, including an apostrophe would remove all apostrophes (though be careful with hyphens, which must be be escaped as %-
).remove_diacritics
is given, then it is possible to specify a remove_exceptions
table, which prevents specific characters from having their diacritics stripped. For instance, if remove_diacritics
is a combining diaeresis, but remove_exceptions
contains "ё"
, then any instances of ё
will remain unchanged. On the other hand, an instance of ӱ
would still become у
(unless "ӱ"
is also added to remove_exceptions
).aliases
, varieties
, otherNames
family
3
.scripts
4
.local m_lang = require("Module:languages")
local m_langdata = require("Module:languages/data")
local u = require("Module:string utilities").char
local c = m_langdata.chars
local p = m_langdata.puaChars
local s = m_langdata.shared
local m = {}
m = {
"Proto-Khasian",
116773216,
"aav-khs",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nicobarese",
116773793,
"aav-nic",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Pnar-Khasi-Lyngngam",
116773259,
"aav-pkl",
"Latn",
type = "reconstructed",
}
m = { -- mkh-pro will merge into this
"Proto-Austroasiatic",
116773186,
"aav",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Afroasiatic",
269125,
"afa",
"Latn",
type = "reconstructed",
}
m = {
"Agawam",
nil,
"alg-eas",
"Latn",
}
m = {
"Proto-Algonquian",
7251834,
"alg",
"Latn",
type = "reconstructed",
sort_key = {remove_diacritics = "·"},
}
m = {
"Amasi",
4740400,
"nic-grs",
"Latn",
entry_name = {remove_diacritics = c.grave .. c.acute .. c.circ .. c.tilde .. c.macron},
}
m = {
"Baïnounk Gubëeher",
17002646,
"alv-bny",
"Latn",
}
m = {
"Proto-Bua",
116773723,
"alv-bua",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Cangin",
116773726,
"alv-cng",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Edoid",
116773206,
"alv-edo",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Fali",
116773754,
"alv-fli",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Gbe",
116773208,
"alv-gbe",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Guang",
116773757,
"alv-gng",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Central Togo",
116773732,
"alv-gtm",
"Latn",
type = "reconstructed",
}
m = {
"Gwara",
16945580,
"nic-pla",
"Latn",
}
m = {
"Proto-Heiban",
116773760,
"alv-hei",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Idomoid",
116773764,
"alv-ido",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Igboid",
116773765,
"alv-igb",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kwa",
116773780,
"alv-kwa",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mumuye",
116773791,
"alv-mum",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nupoid",
116773795,
"alv-nup",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Atlantic-Congo",
116732838,
"alv",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Edekiri",
nil,
"alv-edk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Yoruba",
nil,
"alv-yor",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Yoruboid",
116773824,
"alv-yrd",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Volta-Niger",
116773820,
"alv-von",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Apachean",
116773135,
"apa",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Algic",
18389588,
"aql",
"Latn",
type = "reconstructed",
sort_key = {remove_diacritics = "·"},
}
m = {
"Adûni",
1232159,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Belter Creole",
108055510,
"art",
"Latn",
type = "appendix-constructed",
sort_key = {
remove_diacritics = c.acute,
from = {"ɒ"},
to = {"a"},
},
}
m = {
"Bolak",
2909283,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Black Speech",
686210,
"art",
"Latn, Teng",
type = "appendix-constructed",
}
m = {
"Communicationssprache",
35227,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Dothraki",
2914733,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Eloi",
nil,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Goa'uld",
19823,
"art",
"Latn, Egyp, Mero",
type = "appendix-constructed",
}
m = {
"Lapine",
6488195,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Mandalorian",
54289,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Mundolinco",
851355,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Na'vi",
316939,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"High Valyrian",
64483808,
"art",
"Latn",
type = "appendix-constructed",
}
m = {
"Nicola",
20609,
"ath-nor",
"Latn",
}
m = {
"Proto-Athabaskan",
104841722,
"ath",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Arawa",
116773706,
"auf",
"Latn",
type = "reconstructed",
}
m = {
"Alungul",
16827670,
"aus-pmn",
"Latn",
}
m = {
"Andjingith",
4754509,
"aus-pmn",
"Latn",
}
m = {
"Angkula",
16828520,
"aus-pmn",
"Latn",
}
m = {
"Proto-Arnhem",
116773720,
"aus-arn",
"Latn",
type = "reconstructed",
}
m = {
"Barranbinya",
4863220,
"aus-pmn",
"Latn",
}
m = {
"Barunggam",
4865914,
"aus-pmn",
"Latn",
}
m = {
"Proto-Central New South Wales",
116773199,
"aus-cww",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Daly",
116773743,
"aus-dal",
"Latn",
type = "reconstructed",
}
m = {
"Guwar",
6652138,
"aus-pam",
"Latn",
}
m = {
"Little Swanport",
6652138,
nil,
"Latn",
}
m = {
"Mbiywom",
6799701,
"aus-pmn",
"Latn",
}
m = {
"Ngkoth",
7022405,
"aus-pmn",
"Latn",
}
m = {
"Proto-Nyulnyulan",
116773797,
"aus-nyu",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Pama-Nyungan",
33942,
"aus-pam",
"Latn",
type = "reconstructed",
}
m = {
"Tulua",
16938541,
"aus-pam",
"Latn",
}
m = {
"Uwinymil",
7903995,
"aus-arn",
"Latn",
}
m = {
"Proto-Iwaidjan",
116773767,
"aus-wdj",
"Latn",
type = "reconstructed",
}
m = {
"Wong-gie",
nil,
"aus-pam",
"Latn",
}
m = {
"Wulguru",
8039196,
"aus-dyb",
"Latn",
}
m = { -- contrast nny
"Yangkaal",
3913770,
"aus-tnk",
"Latn",
}
m = {
"Proto-Amuesha-Chamicuro",
nil,
"awd",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kampa",
nil,
"awd",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Paresi-Waura",
nil,
"awd",
"Latn",
type = "reconstructed",
}
m = {
"Amarizana",
16827787,
"awd",
"Latn",
}
m = {
"Anauyá",
16828252,
"awd",
"Latn",
}
m = {
"Apolista",
16916645,
"awd",
"Latn",
}
m = {
"Cabre",
16850160,
"awd",
"Latn",
}
m = {
"Guinau",
3504087,
"awd",
"Latn",
}
m = {
"Cariay",
16920253,
"awd",
"Latn",
}
m = {
"Kawishana",
6379993,
"awd-nwk",
"Latn",
}
m = {
"Kustenau",
5196293,
"awd",
"Latn",
}
m = {
"Manao",
6746920,
"awd",
"Latn",
}
m = {
"Marawan",
6755108,
"awd",
"Latn",
}
m = {
"Maipure",
6736872,
"awd",
"Latn",
}
m = {
"Mariaté",
16910017,
"awd-nwk",
"Latn",
}
m = {
"Proto-Nawiki",
116773234,
"awd-nwk",
"Latn",
type = "reconstructed",
}
m = {
"Paikoneka",
128807835,
"awd",
"Latn",
}
m = {
"Pasé",
7143168,
"awd-nwk",
"Latn",
}
m = {
"Proto-Arawak",
97573478,
"awd",
"Latn",
type = "reconstructed",
}
m = {
"Shebayo",
7492248,
"awd",
"Latn",
}
m = {
"Proto-Ta-Arawak",
116773282,
"awd-taa",
"Latn",
type = "reconstructed",
}
m = {
"Wainumá",
16910017,
"awd-nwk",
"Latn",
}
m = {
"Yumana",
8061062,
"awd-nwk",
"Latn",
}
m = {
"Cazcan",
5055514,
"azc",
"Latn",
}
m = {
"Proto-Cupan",
116773738,
"azc-cup",
"Latn",
type = "reconstructed",
}
m = {
"Kitanemuk",
3197558,
"azc-tak",
"Latn",
}
m = {
"Proto-Nahuan",
7251860,
"azc-nah",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Numic",
116773247,
"azc-num",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Uto-Aztecan",
96400333,
"azc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Takic",
116773283,
"azc-tak",
"Latn",
type = "reconstructed",
}
m = {
"Tataviam",
743736,
"azc",
"Latn",
}
m = {
"Proto-Berber",
2855698,
"ber",
"Latn",
type = "reconstructed",
}
m = {
"Fogaha",
107610173,
"ber",
"Latn",
}
m = {
"Zuwara",
4117169,
"ber",
"Latn",
}
m = {
"Balong",
93935237,
"bnt-bbo",
"Latn",
}
m = {
"Boma Nkuu",
nil,
"bnt",
"Latn",
}
m = {
"Boma Yumu",
nil,
"bnt",
"Latn",
}
m = {
"Bwala",
128810345,
"bnt-tek",
"Latn",
}
m = {
"Chimwiini",
4958328,
"bnt-swh",
"Latn",
}
m = {
"Indanga",
51412803,
"bnt",
"Latn",
}
m = {
"Lala (South Africa)",
6480154,
"bnt-ngu",
"Latn",
}
m = {
"Mpiin",
93937013,
"bnt-bdz",
"Latn",
}
m = {
"Mpuono", -- not to be confused with Mbuun zmp
36056,
"bnt",
"Latn",
}
m = {
"Proto-Nguni",
961559,
"bnt-ngu",
"Latn",
type = "reconstructed",
sort_key = {remove_diacritics = c.grave .. c.acute .. c.circ .. c.caron},
}
m = {
"Phuthi",
33796,
"bnt-ngu",
"Latn",
entry_name = {remove_diacritics = c.grave .. c.acute},
}
m = {
"Proto-Bantu",
3408025,
"bnt",
"Latn",
type = "reconstructed",
sort_key = "bnt-pro-sortkey",
}
m = {
"South Boma",
nil,
"bnt",
"Latn",
}
m = {
"Proto-Sotho-Tswana",
116773278,
"bnt-sts",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Batak",
116773191,
"btk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Abkhaz-Abaza",
7251831,
"cau-abz",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Andian",
nil,
"cau-and",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Avaro-Andian",
116773187,
"cau-ava",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Circassian",
7251838,
"cau-cir",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Dargwa",
116773205,
"cau-drg",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Lezghian",
116773223,
"cau-lzg",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Northeast Caucasian",
116773244,
"cau-nec",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nakh",
108032840,
"cau-nkh",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Northwest Caucasian",
7251861,
"cau-nwc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Tsezian",
116773287,
"cau-tsz",
"Latn",
type = "reconstructed",
}
m = {
"Atanques",
4812783,
"cba",
"Latn",
}
m = {
"Catío Chibcha",
7083619,
"cba",
"Latn",
}
m = {
"Dorasque",
5297532,
"cba",
"Latn",
}
m = {
"Duit",
3041061,
"cba",
"Latn",
}
m = {
"Huetar",
35514,
"cba",
"Latn",
}
m = {
"Nutabe",
7070405,
"cba",
"Latn",
}
m = {
"Proto-Chibchan",
116773203,
"cba",
"Latn",
type = "reconstructed",
}
m = {
"Proto-North Caucasian",
116773237,
"ccn",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kartvelian",
2608203,
"ccs",
"Latn",
type = "reconstructed",
entry_name = {
from = {"q̣", "p̣", "ʓ", "ċ"},
to = {"q̇", "ṗ", "ʒ", "c̣"}
},
}
m = {
"Proto-Georgian-Zan",
23808119,
"ccs-gzn",
"Latn",
type = "reconstructed",
entry_name = {
from = {"q̣", "p̣", "ʓ", "ċ"},
to = {"q̇", "ṗ", "ʒ", "c̣"}
},
}
m = {
"Proto-Central Chadic",
116773197,
"cdc-cbm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Masa",
116773789,
"cdc-mas",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Chadic",
116773201,
"cdc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Caddoan",
116773725,
"cdd",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Brythonic",
1248800,
"cel-bry",
"Latn, Grek",
sort_key = "cel-bry-pro-sortkey",
}
m = {
"Gallaecian",
3094789,
"cel-his",
}
m = {
"Gaulish",
29977,
"cel",
"Latn, Grek, Ital",
entry_name = {remove_diacritics = c.macron .. c.breve .. c.diaer},
}
m = {
"Proto-Celtic",
653649,
"cel",
"Latn",
type = "reconstructed",
sort_key = "cel-pro-sortkey",
}
m = {
"Proto-Chimakuan",
116773734,
"chi",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mari",
116773788,
"chm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Chamic",
114793834,
"cmc",
"Latn",
type = "reconstructed",
}
m = {
"Basque-Icelandic Pidgin",
810378,
"crp",
"Latn",
ancestors = "eu",
}
m = {
"West Greenlandic Pidgin",
17036301,
"crp",
"Latn",
ancestors = "kl",
}
m = {
"Maroon Spirit Language",
1093206,
"crp",
"Latn",
ancestors = "en",
}
m = {
"Macau Pidgin Portuguese",
128804537,
"crp",
"Hant, Latn",
ancestors = "pt",
sort_key = {Hant = "Hani-sortkey"},
}
m = {
"Russenorsk",
505125,
"crp",
"Cyrl, Latn",
ancestors = "nn, ru",
translit = {Cyrl = "ru-translit"},
}
m = {
"Samoan Plantation Pidgin",
7409948,
"crp",
"Latn",
ancestors = "en",
}
m = {
"Solombala English",
7558525,
"crp",
"Cyrl, Latn",
ancestors = "en, ru",
translit = {Cyrl = "ru-translit"},
}
m = {
"Taimyr Pidgin Russian",
16930506,
"crp",
"Cyrl",
ancestors = "ru",
translit = "ru-translit",
}
m = {
"Proto-Bongo-Bagirmi",
116773722,
"csu-bba",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mangbetu",
116773786,
"csu-maa",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Central Sudanic",
116773730,
"csu",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Sara",
116773809,
"csu-sar",
"Latn",
type = "reconstructed",
}
m = {
"Ashraaf",
4805855,
"cus-som",
"Latn",
}
m = {
"Proto-Highland East Cushitic",
116773761,
"cus-hec",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Somaloid",
nil,
"cus-som",
"Latn",
type = "reconstructed",
}
m = {
"Proto-South Cushitic",
126081567,
"cus-sou",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Cushitic",
116773204,
"cus",
"Latn",
type = "reconstructed",
}
m = {
"Dama (Sierra Leone)",
19601574,
"dmn",
"Latn",
}
m = {
"Beary",
1089116,
"qfa-mix",
"Mlym, Knda",
ancestors = "ml, tcy",
translit = {
Mlym = "ml-translit",
Knda = "kn-translit",
},
}
m = {
"Proto-Central Dravidian",
nil,
"dra-cen",
"Latn",
type = "reconstructed",
}
m = {
"Middle Kannada",
128810572,
"dra-kan",
"Knda",
translit = "kn-translit",
}
m = {
"Proto-North Dravidian",
124433593,
"dra-nor",
"Latn",
type = "reconstructed",
}
m = {
"Old Kannada",
15723156,
"dra-kan",
"Knda",
translit = "kn-translit",
}
m = {
"Old Telugu",
126720868,
"dra-tel",
"Telu",
translit = "te-translit",
}
m = {
"Proto-Dravidian",
1702853,
"dra",
"Latn",
type = "reconstructed",
}
m = {
"Proto-South Dravidian I",
104847952, -- Wikipedia's "Proto-South Dravidian" is Proto-South Dravidian I in this scheme.
"dra-sdo",
"Latn",
type = "reconstructed",
}
m = {
"Proto-South Dravidian II",
128885257,
"dra-sdt",
"Latn",
type = "reconstructed",
}
m = {
"Proto-South Dravidian",
128886121,
"dra-sou",
"Latn",
type = "reconstructed",
}
m = {
"Demotic",
36765,
"egx",
"Latn, Egyd, Polyt",
translit = {
Polyt = "grc-translit",
},
entry_name = {
Polyt = s,
},
sort_key = {
Latn = {
remove_diacritics = "'%-%s",
from = {"ꜣ", "j", "e", "ꜥ", "y", "w", "b", "p", "f", "m", "n", "r", "l", "ḥ", "ḫ", "h̭", "ẖ", "h", "š", "s", "q", "k", "g", "ṱ", "ṯ", "t", "ḏ", "%.", "⸗"},
to = {p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p, p}
},
Polyt = s,
},
}
m = {
"Proto-Mande",
116773785,
"dmn",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Western Mande",
116773822,
"dmn-mdw",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Rukai",
116773807,
"map",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Eskimo",
7251842,
"esx-esk",
"Latn",
type = "reconstructed",
}
m = {
"Inuktun",
1671647,
"esx-inu",
"Latn",
}
m = {
"Inuinnaqtun",
28070,
"esx-inu",
"Latn",
}
m = {
"Proto-Inuit",
60785588,
"esx-inu",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Eskimo-Aleut",
7251843,
"esx",
"Latn",
type = "reconstructed",
}
m = {
"Tunumiisut",
15665389,
"esx-inu",
"Latn",
}
m = {
"Proto-Basque",
938011,
"euq",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Gbaya",
nil,
"gba",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Germanic",
669623,
"gem",
"Latn",
type = "reconstructed",
sort_key = "gem-pro-sortkey",
}
m = {
"Burgundian",
47625,
"gme",
"Latn",
}
m = {
"Crimean Gothic",
36211,
"gme",
"Latn",
}
m = {
"Gutnish",
1256646,
"gmq",
"Latn",
ancestors = "gmq-ogt",
}
m = {
"Jamtish",
35512,
"gmq-eas",
"Latn",
}
m = {
"Middle Norwegian",
3417070,
"gmq-wes",
"Latn",
}
m = {
"Old Danish",
12330003,
"gmq-eas",
"Latn, Runr",
entry_name = {remove_diacritics = c.macron},
}
m = {
"Old Gutnish",
1133488,
"gmq",
"Latn",
ancestors = "non",
}
m = {
"Old Swedish",
2417210,
"gmq-eas",
"Latn, Runr",
entry_name = {remove_diacritics = c.macron},
}
m = {
"Proto-Norse",
1671294,
"gmq",
"Runr",
translit = "Runr-translit",
}
m = {
"Scanian",
768017,
"gmq-eas",
"Latn",
}
m = {
"Bergish",
329030,
"gmw-frk",
"Latn",
}
m = {
"Central Franconian",
572197,
"gmw-hgm",
"Latn",
ancestors = "gmh",
wikimedia_codes = "ksh",
}
m = {
"East Central German",
499344, -- subsumes Q699284, Q152965
"gmw-hgm",
"Latn",
ancestors = "gmh",
}
m = {
"Fingallian",
3072588,
"gmw-ian",
"Latn",
}
m = {
"Gottscheerish",
533109,
"gmw-hgm",
"Latn",
ancestors = "bar",
}
m = {
"Jersey Dutch",
1687911,
"gmw-frk",
"Latn",
ancestors = "nl",
}
m = {
"Middle Scots",
3327000,
"gmw-ang",
"Latn",
ancestors = "enm-esc",
}
m = {
"Proto-West Germanic",
78079021,
"gmw",
"Latn",
-- type = "reconstructed",
-- largely but not entirely reconstructed (like Proto-Norse); see April '24 BP, set back to reconstructed (?) if 'anti-asterisk' is added
sort_key = "gmw-pro-sortkey",
}
m = {
"Rhine Franconian",
707007,
"gmw-hgm",
"Latn",
ancestors = "gmh",
}
m = {
"Sathmar Swabian",
2223059,
"gmw-hgm",
"Latn",
ancestors = "swg",
}
m = {
"Transylvanian Saxon",
260942,
"gmw-hgm",
"Latn",
ancestors = "gmw-cfr",
}
m = {
"Volga German",
312574,
"gmw-hgm",
"Latn",
ancestors = "gmw-rfr",
}
m = {
"Zipser German",
205548,
"gmw-hgm",
"Latn",
ancestors = "gmh",
}
m = {
"Classical Guaraní",
17478065,
"tup-gua",
"Latn",
ancestors = "gn",
}
m = {
"Calabrian Greek",
1146398,
"grk",
"Latn",
ancestors = "grk-ita",
}
m = {
"Italiot Greek",
19720507,
"grk",
"Latn, Grek",
ancestors = "gkm",
entry_name = {remove_diacritics = c.caron .. c.diaerbelow .. c.brevebelow},
sort_key = s,
}
m = {
"Mariupol Greek",
4400023,
"grk",
"Cyrl, Latn, Grek",
ancestors = "gkm",
translit = "grk-mar-translit",
override_translit = true,
entry_name = "grk-mar-entryname",
sort_key = s,
}
m = {
"Proto-Hellenic",
1231805,
"grk",
"Latn",
type = "reconstructed",
sort_key = {
from = {"", "", "", "", "", "ď", "ľ", "ň", "ř", "ʰ", "ʷ", c.acute, c.macron},
to = {"a", "e", "i", "o", "u", "d", "l", "n", "r", "¯h", "¯w"}
},
}
m = {
"Proto-Hmong",
116773210,
"hmn",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mien",
116773229,
"hmx-mie",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Hmong-Mien",
7251846,
"hmx",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Armenian",
3848498,
"hyx",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nuristani",
116773248,
"iir-nur",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Indo-Iranian",
966439,
"iir",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Ijoid",
116773766,
"ijo",
"Latn",
type = "reconstructed",
}
m = {
"Apabhramsa",
616419,
"inc-mid",
"Deva, Shrd, Sidd",
ancestors = "pra",
translit = {
Deva = "sa-translit",
Shrd = "Shrd-translit",
Sidd = "Sidd-translit",
},
}
m = {
"Ashokan Prakrit",
104854379,
"inc-mid",
"Brah, Khar",
ancestors = "sa",
translit = {
Brah = "Brah-translit",
Khar = "Khar-translit",
},
}
m = {
"Kamarupi Prakrit",
6356097,
"inc-bas",
"Brah, Sidd",
translit = {
Brah = "Brah-translit",
Sidd = "Sidd-translit",
},
}
m = {
"Kholosi",
24952008,
"inc-snd",
"Latn",
}
m = {
"Proto-Kamta",
128816843,
"inc-bas",
"Latn",
ancestors = "inc-kam",
type = "reconstructed",
}
m = {
"Middle Assamese",
128806836,
"inc-bas",
"as-Beng",
ancestors = "inc-oas",
translit = "inc-mas-translit",
}
m = {
"Middle Bengali",
113559927,
"inc-bas",
"Beng",
ancestors = "inc-obn",
translit = "inc-mbn-translit",
}
m = {
"Middle Gujarati",
24907429,
"inc-wes",
"Deva",
ancestors = "inc-ogu",
}
m = {
"Middle Odia",
128810882,
"inc-eas",
"Orya",
ancestors = "inc-oor",
}
m = {
"Early Assamese",
85758237,
"inc-bas",
"as-Beng",
ancestors = "inc-kam",
translit = "inc-oas-translit",
}
m = {
"Old Awadhi",
nil,
"inc-hie",
"Deva, Kthi, ur-Arab",
entry_name = {
from = {"هٔ", "ۂ"}, -- character "ۂ" code U+06C2 to "ه" and "هٔ" (U+0647 + U+0654) to "ه"
to = {"ہ", "ہ"},
remove_diacritics = c.fathatan .. c.dammatan .. c.kasratan .. c.fatha .. c.damma .. c.kasra .. c.shadda .. c.sukun .. c.nunghunna .. c.superalef
},
translit = {
Deva = "sa-translit",
Kthi = "sa-Kthi-translit",
= "inc-ohi-translit",
},
}
m = {
"Old Bengali",
113559926,
"inc-bas",
"Beng",
}
m = {
"Old Gujarati",
24907427,
"inc-wes",
"Deva",
translit = "sa-translit",
}
m = {
"Old Hindi",
48767781,
"inc-hiw",
"Deva, ur-Arab",
entry_name = {
from = {"هٔ", "ۂ"}, -- character "ۂ" code U+06C2 to "ه" and "هٔ" (U+0647 + U+0654) to "ه"
to = {"ہ", "ہ"},
remove_diacritics = c.fathatan .. c.dammatan .. c.kasratan .. c.fatha .. c.damma .. c.kasra .. c.shadda .. c.sukun .. c.nunghunna .. c.superalef
},
translit = {
Deva = "sa-translit",
= "inc-ohi-translit",
},
}
m = {
"Old Odia",
128807801,
"inc-eas",
"Orya",
}
m = {
"Old Punjabi",
115270971,
"inc-pan",
"Guru, pa-Arab",
translit = {
Guru = "inc-opa-Guru-translit",
= "pa-Arab-translit",
},
entry_name = {remove_diacritics = c.fathatan .. c.dammatan .. c.kasratan .. c.fatha .. c.damma .. c.kasra .. c.shadda .. c.sukun},
}
m = {
"Proto-Indo-Aryan",
23808344,
"inc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Anatolian",
7251833,
"ine-ana",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Balto-Slavic",
1703347,
"ine-bsl",
"Latn",
type = "reconstructed",
sort_key = {
from = {"", "", "", "", "", c.acute, c.macron, "ˀ"},
to = {"a", "e", "i", "o", "u"}
},
}
m = {
"Kalašma",
122770439,
"ine-ana",
"Xsux",
}
m = {
"Paeonian",
2705672,
"ine",
"Polyt",
translit = "grc-translit",
entry_name = s,
sort_key = s,
}
m = {
"Proto-Indo-European",
37178,
"ine",
"Latn",
type = "reconstructed",
sort_key = {
from = {"", "", "", "", "", "ĺ", "ḿ", "ń", "ŕ", "ǵ", "ḱ", "ʰ", "ʷ", "₁", "₂", "₃", c.ringbelow, c.acute, c.macron},
to = {"a", "e", "i", "o", "u", "l", "m", "n", "r", "g'", "k'", "¯h", "¯w", "1", "2", "3"}
},
}
m = {
"Proto-Tocharian",
37029,
"ine-toc",
"Latn",
type = "reconstructed",
}
m = {
"Old Median",
36461,
"xme",
"Grek, Latn",
}
m = {
"Middle Median",
12836150,
"xme",
"Latn",
}
m = {
"Kermanic",
129850,
"xme",
"fa-Arab, Latn",
ancestors = "xme-mid",
}
m = {
"Tafreshi",
nil,
"xme",
"fa-Arab, Latn",
ancestors = "xme-mid",
}
m = {
"Proto-Tatic",
122973870,
"xme-ttc",
"Latn",
ancestors = "xme-mid",
}
m = {
"Kalasuri",
nil,
"xme-ttc",
ancestors = "xme-ttc-nor",
}
m = {
"Kilit",
3612452,
"xme-ttc",
"Cyrl", -- and fa-Arab?
}
m = {
"Old Tati",
434697,
"xme-ttc",
"fa-Arab, Latn",
}
m = {
"Proto-Komisenian",
116773777,
"ira-kms",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Medo-Parthian",
116773227,
"ira-mpr",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Pathan",
116773255,
"ira-pat",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Iranian",
4167865,
"ira",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Zaza-Gorani",
116775031,
"ira-zgr",
"Latn",
type = "reconstructed",
}
m = { -- to be removed once entries using it have been updated
"Proto-Ossetic", -- see ]
116773249,
"xsc",
"Latn",
ancestors = "xln",
type = "reconstructed",
}
m = {
"Proto-Scythian",
116773273,
"xsc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Sarmatian",
116773249,
"xsc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Saka-Wakhi",
116773267,
"xsc-skw",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Saka",
116773264,
"xsc-sak",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Shughni-Yazghulami-Munji",
116773813,
"ira-sym",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Sanglechi-Ishkashimi",
116773808,
"ira-sgi",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Munji-Yidgha",
116773792,
"ira-mny",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Shughni-Yazghulami",
116773812,
"ira-shy",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Shughni-Roshani",
116773811,
"ira-shr",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Sogdic",
116773276,
"ira-sgc",
"Latn",
type = "reconstructed",
}
m = {
"Vanji",
3398419,
"ira-shy",
"Latn",
}
m = {
"Erie",
5388365,
"iro-nor",
"Latn",
}
m = {
"Mingo",
128531,
"iro-nor",
"Latn",
ietf_subtag = "i-mingo", -- grandfathered IETF tag
}
m = {
"Proto-North Iroquoian",
116773242,
"iro-nor",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Iroquoian",
7251852,
"iro",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Italic",
17102720,
"itc",
"Latn",
type = "reconstructed",
}
m = {
"Hachijō",
5637049,
"jpx",
"Jpan",
ancestors = "ojp-eas",
translit = s,
display_text = s,
entry_name = s,
sort_key = s,
}
m = {
"Proto-Japonic",
3924309,
"jpx",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Ryukyuan",
56349069,
"jpx-ryu",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Karen",
85794783,
"kar",
"Latn",
type = "reconstructed",
}
m = {
"Eastern Khanty",
30304622,
"kca",
"Cyrl",
translit = "kca-translit",
override_translit = true,
}
m = {
"Northern Khanty",
30304527,
"kca",
"Cyrl",
translit = "kca-translit",
override_translit = true,
}
m = {
"Proto-Khanty",
127505171,
"kca",
"Latn",
type = "reconstructed",
}
m = {
"Southern Khanty",
30304618,
"kca",
"Cyrl",
translit = "kca-translit",
override_translit = true,
}
m = {
"Proto-Khoe",
116773218,
"khi-kho",
"Latn",
type = "reconstructed",
}
m = {
"ǃKung",
32904,
"khi-kxa",
"Latn",
}
m = {
"Early Modern Korean",
756014,
"qfa-kor",
"Kore",
ancestors = "okm",
translit = "okm-translit",
entry_name = s,
}
m = {
"Proto-Kru",
116773778,
"kro",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kurdish",
116773221,
"ku",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Atayalic",
116773151,
"map-ata",
"Latn",
type = "reconstructed",
}
m = {
"Banyumasan",
33219,
"map",
"Latn",
}
m = {
"Proto-Austronesian",
49230,
"map",
"Latn",
type = "reconstructed",
}
m = {
"Kelantan Peranakan Hokkien",
108794818,
"qfa-mix",
ancestors = "nan-hbl, sou, mfa",
}
m = {
"Isaurian",
16956868,
nil,
-- "Xsux, Hluw, Latn",
}
m = {
"Jie",
124424186,
nil,
"Hani",
sort_key = "Hani-sortkey",
}
m = {
"Jizhao",
45242758,
"qfa-bej",
"Latn",
}
m = {
"Kassite",
35612,
nil,
"Xsux",
}
m = {
"Mimi of Decorse",
6862206,
nil,
"Latn",
}
m = {
"Mimi of Nachtigal",
6862207,
nil,
"Latn",
}
m = {
"Philistine",
2230924,
nil,
"Phnx",
}
m = {
"Rouran",
48816637,
"qfa-xgx",
"Hani, Latn",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Tangwang",
7683179,
"qfa-mix",
"Latn",
ancestors = "cmn, sce",
}
m = {
"Tuyuhun",
48816625,
"qfa-xgx",
"Hani, Latn",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Tuoba",
48816629,
"qfa-xgx",
"Hani, Latn",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Wuhuan",
118976867,
"qfa-xgx",
"Hani, Latn",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Xianbei",
4448647,
"qfa-xgx",
"Hani, Latn",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Mongghul",
53765528,
"mjg",
"Latn", -- also Mong, Cyrl ?
}
m = {
"Mangghuer",
56285392,
"mjg",
"Latn", -- also Mong, Cyrl ?
}
m = {
"Proto-Aslian",
55630680,
"mkh-asl",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Bahnaric",
116773189,
"mkh-ban",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Katuic",
116773772,
"mkh-kat",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Khmuic",
116773774,
"mkh-khm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Khmeric",
55630684,
"mkh-kmr",
"Latn",
type = "reconstructed",
}
m = {
"Middle Mon",
121337926,
"mkh-mnc",
"Latn, Mymr", --and also Pallava
ancestors = "omx",
}
m = {
"Proto-Monic",
116773231,
"mkh-mnc",
"Latn",
type = "reconstructed",
}
m = {
"Middle Vietnamese",
9199,
"mkh-vie",
"Hani, Latn",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Proto-Palaungic",
104847372,
"mkh-pal",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Pearic",
116773804,
"mkh-pea",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Pakanic",
116773803,
"mkh-pkn",
"Latn",
type = "reconstructed",
}
m = { --This will be merged into 2015 aav-pro.
"Proto-Mon-Khmer",
7251859,
"mkh",
"Latn",
type = "reconstructed",
}
m = { -- To be removed.
"Thai Mon",
nil,
"mkh-mnc",
"Mymr, Thai",
ancestors = "mkh-mmn",
sort_key = {
from = {"", "ျ", "ြ", "ွ", "ှ", "ၞ", "ၟ", "ၠ", "ၚ", "ဿ", "", "()()ฺ?"},
to = {"", "္ယ", "္ရ", "္ဝ", "္ဟ", "္န", "္မ", "္လ", "င", "သ္သ", "", "%2%1"}
},
}
m = {
"Proto-Vietic",
109432616,
"mkh-vie",
"Latn",
type = "reconstructed",
}
m = {
"Central Mansi",
128810384,
"mns",
"Cyrl",
translit = "mns-translit",
override_translit = true,
}
m = {
"Northern Mansi",
30304537,
"mns",
"Cyrl",
translit = "mns-translit",
override_translit = true,
}
m = {
"Proto-Mansi",
128883093,
"mns",
"Latn",
type = "reconstructed",
}
m = {
"Southern Mansi",
30304629,
"mns",
"Cyrl",
translit = "mns-translit",
override_translit = true,
}
m = {
"Proto-Munda",
105102373,
"mun",
"Latn",
type = "reconstructed",
}
m = { -- the stage after ''emy''
"Ch'olti'",
873995,
"myn",
"Latn",
}
m = {
"Proto-Mayan",
3321532,
"myn",
"Latn",
type = "reconstructed",
}
m = {
"Alazapa",
128810233,
nil,
"Latn",
}
m = {
"Bayogoula",
1563704,
nil,
"Latn",
}
m = {
"Calusa",
51782,
nil,
"Latn",
}
m = {
"Chiquimulilla",
25339627,
"nai-xin",
"Latn",
}
m = {
"Proto-Chumash",
116773736,
"nai-chu",
"Latn",
type = "reconstructed",
}
m = {
"Ciguayo",
20741700,
nil,
"Latn",
}
m = {
"Proto-Chinookan",
116773735,
"nai-ckn",
"Latn",
type = "reconstructed",
}
m = {
"Guazacapán",
19572028,
"nai-xin",
"Latn",
}
m = {
"Hitchiti",
1542882,
"nai-mus",
"Latn",
}
m = {
"Ipai",
3027474,
"nai-yuc",
"Latn",
}
m = {
"Jutiapa",
nil,
"nai-xin",
"Latn",
}
m = {
"Jumaytepeque",
25339626,
"nai-xin",
"Latn",
}
m = {
"Kathlamet",
6376639,
"nai-ckn",
"Latn",
}
m = {
"Proto-Kalapuyan",
116773771,
"nai-klp",
"Latn",
type = "reconstructed",
}
m = {
"Konomihu",
3198734,
"nai-shs",
"Latn",
}
m = {
"Kumeyaay",
4910139,
"nai-yuc",
"Latn",
}
m = {
"Macoris",
21070851,
nil,
"Latn",
}
m = {
"Proto-Maidun",
116773784,
"nai-mdu",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mixe-Zoque",
7251858,
"nai-miz",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Muskogean",
116775368,
"nai-mus",
"Latn",
type = "reconstructed",
}
m = {
"Naolan",
6964594,
nil,
"Latn",
}
m = {
"New River Shasta",
7011254,
"nai-shs",
"Latn",
}
m = {
"Okwanuchu",
3350126,
"nai-shs",
"Latn",
}
m = {
"Pericú",
3375369,
nil,
"Latn",
}
m = {
"Picuris",
7191257,
"nai-kta",
"Latn",
}
m = {
"Proto-Plateau Penutian",
116773806,
"nai-plp",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Pomo",
116773262,
"nai-pom",
"Latn",
type = "reconstructed",
}
m = {
"Quinigua",
36360,
nil,
"Latn",
}
m = { -- NB 'sio-pro' "Proto-Siouan" which is Proto-Western Siouan
"Proto-Siouan-Catawban",
116773275,
"nai-sca",
"Latn",
type = "reconstructed",
}
m = {
"Sinacantán",
24190249,
"nai-xin",
"Latn",
}
m = {
"Salvadoran Lenca",
3229434,
"nai-len",
"Latn",
}
m = {
"Sahaptin",
3833015,
"nai-shp",
"Latn",
}
m = {
"Tapachultec",
7684401,
"nai-miz",
"Latn",
}
m = {
"Tawasa",
7689233,
nil,
"Latn",
}
m = {
"Tequistlatec",
2964454,
"nai-tqn",
"Latn",
}
m = {
"Tipai",
3027471,
"nai-yuc",
"Latn",
}
m = {
"Proto-Totozoquean",
116773285,
"nai-tot",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Tsimshianic",
nil,
"nai-tsi",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Utian",
116773290,
"nai-utn",
"Latn",
type = "reconstructed",
}
m = {
"Waikuri",
3118702,
nil,
"Latn",
}
m = {
"Western Jicaque",
3178610,
"nai-jcq",
"Latn",
}
m = {
"Yupiltepeque",
25339628,
"nai-xin",
"Latn",
}
m = {
"Datian Min",
19855572,
"zhx-nan",
"Hants",
generate_forms = "zh-generateforms",
sort_key = "Hani-sortkey",
}
m = {
"Hokkien",
1624231,
"zhx-nan",
"Hants, Latn, Bopo, Kana",
wikimedia_codes = "zh-min-nan",
generate_forms = "zh-generateforms",
sort_key = {
Hani = "Hani-sortkey",
Kana = "Kana-sortkey"
},
}
m = {
"Hailufeng Min",
120755728,
"zhx-nan",
"Hants",
generate_forms = "zh-generateforms",
sort_key = "Hani-sortkey",
}
m = {
"Longyan Min",
6674568,
"zhx-nan",
"Hants",
generate_forms = "zh-generateforms",
sort_key = "Hani-sortkey",
}
m = {
"Teochew",
36759,
"zhx-nan",
"Hants",
generate_forms = "zh-generateforms",
translit = "zh-translit",
sort_key = "Hani-sortkey",
}
m = {
"Zhenan Min",
3846710,
"zhx-nan",
"Hants",
generate_forms = "zh-generateforms",
sort_key = "Hani-sortkey",
}
m = {
"Sanxiang Min",
7420769,
"zhx-nan",
"Hants",
generate_forms = "zh-generateforms",
sort_key = "Hani-sortkey",
}
m = {
"German Low German",
25433,
"gmw-lgm",
"Latn",
ancestors = "nds",
ietf_subtag = "nds-DE", -- should we make this the actual code?
wikimedia_codes = "nds",
}
m = {
"Dutch Low Saxon",
516137,
"gmw-lgm",
"Latn",
ancestors = "nds",
ietf_subtag = "nds-NL", -- should we make this the actual code?
}
m = {
"Proto-Trans-New Guinea",
85794785,
"ngf",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Benue-Congo",
116773194,
"nic-bco",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Bantoid",
116773190,
"nic-bod",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Eastern Oti-Volta",
116773753,
"nic-eov",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Gurunsi",
116773759,
"nic-gns",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Grassfields",
116773755,
"nic-grf",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Gur",
116773758,
"nic-gur",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Jukunoid",
116773769,
"nic-jkn",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Lower Cross River",
116773782,
"nic-lcr",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Ogoni",
116773799,
"nic-ogo",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Oti-Volta",
116773802,
"nic-ovo",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Plateau",
116773805,
"nic-plt",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Niger-Congo",
108000748,
"nic",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Ubangian",
116773818,
"nic-ubg",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Upper Cross River",
116773819,
"nic-ucr",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Volta-Congo",
116773293,
"nic-vco",
"Latn",
type = "reconstructed",
}
m = {
"Haraza",
19572059,
"nub",
"Arab, Latn",
}
m = {
"Proto-Nubian",
116773246,
"nub",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Chatino",
116773202,
"omq-cha",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mazatec",
116773790,
"omq-maz",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mixtecan",
21573423,
"omq-mix",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mixtec",
21573424,
"omq-mxt",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Oto-Pamean",
116773251,
"omq-otp",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Oto-Manguean",
33669,
"omq",
"Latn",
type = "reconstructed",
}
m = {
"San Juan Quiahije Chatino",
17003130,
"omq-cha",
"Latn",
}
m = {
"Teposcolula Mixtec",
nil,
"omq-mxt",
"Latn",
}
m = {
"Teojomulco Chatino",
25340451,
"omq-cha",
"Latn",
}
m = {
"Proto-Trique",
116773817,
"omq-tri",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Zapotecan",
116773297,
"omq-zap",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Zapotec",
116773296,
"omq-zpc",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Aroid",
116773721,
"omv-aro",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Dizoid",
116773750,
"omv-diz",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Omotic",
116773800,
"omv",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Otomi",
5908710,
"oto-otm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Otomian",
116773252,
"oto",
"Latn",
type = "reconstructed",
}
m = {
"Kómnzo",
18344310,
"paa-yam",
"Latn",
}
m = {
"Kuwani",
6449056,
"paa",
"Latn",
}
m = {
"Proto-North Halmahera",
116773241,
"paa-nha",
"Latn",
type = "reconstructed"
}
m = {
"Nungon",
128807788,
"paa",
"Latn",
}
m = {
"Dinapigue Agta",
16945774,
"phi",
"Latn",
}
m = {
"Proto-Kalamian",
116773213,
"phi-kal",
"Latn",
type = "reconstructed",
}
m = {
"Nagtipunan Agta",
16966111,
"phi",
"Latn",
}
m = {
"Proto-Philippine",
18204898,
"phi",
"Latn",
type = "reconstructed",
}
m = {
"Abai",
19570729,
"poz-san",
"Latn",
}
m = {
"Baliledo",
4850912,
"poz",
"Latn",
}
m = {
"Proto-Bungku-Tolaki",
116773724,
"poz-btk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Central-Eastern Malayo-Polynesian",
2269883,
"poz-cet",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Halmahera-Cenderawasih",
116773209,
"poz-hce",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Lampungic",
116773222,
"poz-lgx",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Malayo-Chamic",
116773225,
"poz-mcm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Micronesian",
111939079,
"poz-mic",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Malayic",
98057728,
"poz-mly",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Malayo-Sumbawan",
116773226,
"poz-msa",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Oceanic",
141741,
"poz-oce",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Eastern Polynesian",
113988745,
"poz-pep",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nuclear Polynesian",
113988746,
"poz-pnp",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Polynesian",
1658709,
"poz-pol",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Malayo-Polynesian",
3832960,
"poz",
"Latn",
type = "reconstructed",
}
m = {
"Sarawak Malay",
4251702,
"poz-mly",
"Latn, ms-Arab",
}
m = {
"Proto-South Sulawesi",
116773279,
"poz-ssw",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Sunda-Sulawesi",
116773281,
"poz-sus",
"Latn",
type = "reconstructed",
}
m = {
"Proto-North Sarawak",
116773243,
"poz-swa",
"Latn",
type = "reconstructed",
}
m = {
"Terengganu Malay",
4207412,
"poz-mly",
"Latn, ms-Arab",
}
m = {
"Proto-Eastern Malayo-Polynesian",
2269883,
"pqe",
"Latn",
type = "reconstructed",
}
m = {
"Niya Prakrit",
11991601,
"inc-mid",
"Khar",
ancestors = "inc-ash",
translit = "Khar-translit",
}
m = {
"Proto-Great Andamanese",
116773756,
"qfa-adm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Be-Tai",
116773193,
"qfa-bet",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Chukotko-Kamchatkan",
7251837,
"qfa-cka",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Hurro-Urartian",
116773211,
"qfa-hur",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kadu",
116773770,
"qfa-kad",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kam-Sui",
55630682,
"qfa-kms",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Koreanic",
467883,
"qfa-kor",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kra",
7251854,
"qfa-kra",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Hlai",
7251845,
"qfa-lic",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Be",
116773192,
"qfa-onb",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Ongan",
116773801,
"qfa-ong",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kra-Dai",
104901616,
"qfa-tak",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Yeniseian",
27639,
"qfa-yen",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Yukaghir",
116773294,
"qfa-yuk",
"Latn",
type = "reconstructed",
}
m = {
"Kichwa",
1740805,
"qwe",
"Latn",
ancestors = "qu",
}
m = {
"Proto-Quechuan",
5575757,
"qwe",
"Latn",
type = "reconstructed",
}
m = {
"Angevin",
56782,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Bourbonnais-Berrichon",
2899128,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Bourguignon",
508332,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Champenois",
430018,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Franc-Comtois",
510561,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Gallo",
37300,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Gallo-Italic of Basilicata",
3094838,
"roa-git",
"Latn",
}
m = {
"Gallo-Italic of Sicily",
2629019,
"roa-git",
"Latn",
}
m = {
"Leonese",
34108,
"roa-ibe",
"Latn",
ancestors = "roa-ole",
}
m = {
"Lorrain",
671198,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Navarro-Aragonese",
2736184,
"roa-ibe",
"Latn",
}
m = {
"Old Catalan",
15478520,
"roa-ocr",
"Latn",
sort_key = {
from = {"à", "", "", "", "", "ç", "·"},
to = {"a", "e", "i", "o", "u", "c"}
},
}
m = {
"Old Leonese",
125977465,
"roa-ibe",
"Latn",
}
m = {
"Old Galician-Portuguese",
1072111,
"roa-ibe",
"Latn",
entry_name = {remove_diacritics = c.grave .. c.acute .. c.circ},
}
m = {
"Orléanais",
28497058,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Poitevin-Saintongeais",
514123,
"roa-oil",
"Latn",
sort_key = s,
}
m = {
"Tarantino",
695526,
"roa-itd",
"Latn",
ancestors = "nap",
wikimedia_codes = "roa-tara",
}
m = {
"Allentiac",
19570789,
"sai-hrp",
"Latn",
}
m = { -- not to be confused with 'cbc' or 'ano'
"Andoquero",
16828359,
"sai-wit",
"Latn",
}
m = {
"Ayomán",
16937754,
"sai-jir",
"Latn",
}
m = {
"Baenan",
3401998,
nil,
"Latn",
}
m = {
"Bagua",
5390321,
nil,
"Latn",
}
m = {
"Betoi",
926551,
"qfa-iso",
"Latn",
}
m = {
"Proto-Boran",
nil,
"sai-bor",
"Latn",
}
m = {
"Cacán",
945482,
nil,
"Latn",
}
m = {
"Caranqui",
2937753,
"sai-bar",
"Latn",
}
m = {
"Proto-Cariban",
116773196,
"sai-car",
"Latn",
type = "reconstructed",
}
m = {
"Catacao",
5051136,
"sai-ctc",
"Latn",
}
m = {
"Proto-Cerrado",
116773200,
"sai-cer",
"Latn",
type = "reconstructed",
}
m = {
"Chirino",
5390321,
nil,
"Latn",
}
m = {
"Chaná",
5072718,
"sai-crn",
"Latn",
}
m = {
"Chapacura",
5072884,
"sai-cpc",
"Latn",
}
m = {
"Charrua",
5086680,
"sai-crn",
"Latn",
}
m = {
"Churuya",
5118339,
"sai-guh",
"Latn",
}
m = {
"Proto-Central Jê",
116773198,
"sai-cje",
"Latn",
type = "reconstructed",
}
m = {
"Comechingon",
6644203,
nil,
"Latn",
}
m = {
"Chono",
5104704,
nil,
"Latn",
}
m = {
"Cañari",
5055572,
nil,
"Latn",
}
m = {
"Coeruna",
6425639,
"sai-wit",
"Latn",
}
m = {
"Colán",
5141893,
"sai-ctc",
"Latn",
}
m = {
"Copallén",
5390321,
nil,
"Latn",
}
m = {
"Coroado Puri",
24191321,
"sai-mje",
"Latn",
}
m = {
"Catuquinaru",
16858455,
nil,
"Latn",
}
m = {
"Culli",
2879660,
nil,
"Latn",
}
m = {
"Cueva",
5192644,
nil,
"Latn",
}
m = {
"Esmeralda",
3058083,
nil,
"Latn",
}
m = {
"Ewarhuyana",
16898104,
nil,
"Latn",
}
m = {
"Gamela",
5403661,
nil,
"Latn",
}
m = {
"Gayón",
5528902,
"sai-jir",
"Latn",
}
m = {
"Guamo",
5613495,
nil,
"Latn",
}
m = {
"Guachí",
5613172,
"sai-guc",
"Latn",
}
m = {
"Güenoa",
5626799,
"sai-crn",
"Latn",
}
m = {
"Haush",
3128376,
"sai-cho",
"Latn",
}
m = {
"Proto-Jê",
116773212,
"sai-jee",
"Latn",
type = "reconstructed",
}
m = {
"Jeikó",
6176527,
"sai-mje",
"Latn",
}
m = {
"Jirajara",
6202966,
"sai-jir",
"Latn",
}
m = { -- contrast xoo, kzw, sai-xoc
"Katembri",
6375925,
nil,
"Latn",
}
m = {
"Malalí",
6741212,
nil,
"Latn",
}
m = {
"Maratino",
6755055,
nil,
"Latn",
}
m = {
"Matanawi",
6786047,
nil,
"Latn",
}
m = {
"Mocana",
3402048,
nil,
"Latn",
}
m = {
"Menien",
16890110,
"sai-mje",
"Latn",
}
m = {
"Millcayac",
19573012,
"sai-hrp",
"Latn",
}
m = {
"Malibu",
3402048,
nil,
"Latn",
}
m = {
"Masakará",
6782426,
"sai-mje",
"Latn",
}
m = {
"Mucuchí",
6931290,
nil,
"Latn",
}
m = {
"Muellama",
16886936,
"sai-bar",
"Latn",
}
m = {
"Muzo",
6644203,
nil,
"Latn",
}
m = {
"Maynas",
16919393,
nil,
"Latn",
}
m = {
"Natú",
9006749,
nil,
"Latn",
}
m = {
"Proto-Northern Jê",
116773245,
"sai-nje",
"Latn",
type = "reconstructed",
}
m = {
"Opón",
7099152,
"sai-car",
"Latn",
}
m = {
"Otomaco",
16879234,
"sai-otm",
"Latn",
}
m = {
"Palta",
3042978,
nil,
"Latn",
}
m = {
"Pamigua",
5908689,
"sai-otm",
"Latn",
}
m = {
"Paratió",
16890038,
nil,
"Latn",
}
m = {
"Panzaleo",
3123275,
nil,
"Latn",
}
m = {
"Puruhá",
3410994,
nil,
"Latn",
}
m = {
"Patagón",
128807870,
nil,
"Latn",
}
m = {
"Purukotó",
7261622,
"sai-pem",
"Latn",
}
m = {
"Payaguá",
7156643,
"sai-guc",
"Latn",
}
m = {
"Pykobjê",
98113977,
"sai-nje",
"Latn",
}
m = {
"Quimbaya",
7272043,
nil,
"Latn",
}
m = {
"Quitemo",
7272651,
"sai-cpc",
"Latn",
}
m = {
"Rabona",
6644203,
nil,
"Latn",
}
m = {
"Ramanos",
16902824,
nil,
"Latn",
}
m = {
"Sácata",
5390321,
nil,
"Latn",
}
m = {
"Sanaviron",
16895999,
nil,
"Latn",
}
m = {
"Sapará",
7420922,
"sai-car",
"Latn",
}
m = {
"Sechura",
7442912,
nil,
"Latn",
}
m = {
"Sinúfana",
7525275,
nil,
"Latn",
}
m = {
"Proto-Southern Jê",
116773814,
"sai-sje",
"Latn",
type = "reconstructed",
}
m = {
"Tabancale",
5390321,
nil,
"Latn",
}
m = {
"Tallán",
16910468,
nil,
"Latn",
}
m = {
"Tapayuna",
30719984,
"sai-nje",
"Latn",
}
m = {
"Proto-Taranoan",
116773816,
"sai-tar",
"Latn",
type = "reconstructed",
}
m = {
"Teushen",
3519243,
nil,
"Latn",
}
m = {
"Timote",
7806995,
nil,
"Latn",
}
m = {
"Taparita",
7684460,
"sai-otm",
"Latn",
}
m = {
"Tarairiú",
7685313,
nil,
"Latn",
}
m = {
"Waitaká",
16918610,
nil,
"Latn",
}
m = {
"Wayumara",
7960726,
"sai-car",
"Latn",
}
m = {
"Proto-Witotoan",
116773823,
"sai-wit",
"Latn",
type = "reconstructed",
}
m = {
"Wanham",
16879440,
"sai-cpc",
"Latn",
}
m = { -- contrast xoo, kzw, sai-kat
"Xocó",
12953620,
nil,
"Latn",
}
m = {
"Yao (South America)",
16979655,
"sai-ven",
"Latn",
}
m = { -- not the same family as 'suy'
"Yarumá",
3505859,
"sai-pek",
"Latn",
}
m = {
"Yuri",
2669157,
"sai-tyu",
"Latn",
}
m = {
"Yupua",
8061430,
"sai-tuc",
"Latn",
}
m = {
"Yurumanguí",
1281291,
nil,
"Latn",
}
m = {
"Proto-Salish",
116773269,
"sal",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Daju",
116773739,
"sdv-daj",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Eastern Jebel",
116773751,
"sdv-eje",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nilotic",
116773794,
"sdv-nil",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nyima",
116773796,
"sdv-nyi",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Taman",
116773815,
"sdv-tmn",
"Latn",
type = "reconstructed",
}
m = {
"Northern Selkup",
30304565,
"sel",
"Cyrl",
translit = "sel-nor-translit",
}
m = {
"Proto-Selkup",
128884235,
"sel",
"Latn",
type = "reconstructed",
}
m = {
"Southern Selkup",
30304639,
"sel",
"Cyrl",
translit = "sel-sou-translit",
}
m = {
"Ammonite",
279181,
"sem-can",
"Phnx",
translit = "Phnx-translit",
}
m = {
"Amorite",
35941,
"sem-nwe",
"Xsux, Latn",
}
m = {
"Chaha",
35543,
"sem-eth",
"Ethi",
translit = "Ethi-translit",
}
m = {
"Dadanitic",
21838040,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Dumaitic",
128810397,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Hasaitic",
3541433,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Hismaic",
22948260,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Muher",
33743,
"sem-eth",
"Latn",
}
m = {
"Proto-Semitic",
1658554,
"sem",
"Latn",
type = "reconstructed",
}
m = {
"Safaitic",
472586,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Old South Arabian",
35025,
"sem-osa",
"Sarb",
translit = "Sarb-translit",
}
m = {
"Taymanitic",
24912301,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Thamudic",
843030,
"sem-cen",
"Narb",
translit = "Narb-translit",
}
m = {
"Proto-West Semitic",
98021726,
"sem-wes",
"Latn",
type = "reconstructed",
}
m = { -- NB this is not Proto-Siouan-Catawban 'nai-sca-pro'
"Proto-Siouan",
34181,
"sio",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Bai",
nil,
"sit-bai",
"Latn",
type = "reconstructed",
}
m = {
"Bokar",
4938727,
"sit-tan",
"Latn, Tibt",
translit = {Tibt = "Tibt-translit"},
override_translit = true,
display_text = {Tibt = s},
entry_name = {Tibt = s},
sort_key = {Tibt = "Tibt-sortkey"},
}
m = {
"Caijia",
5017528,
"sit-cln",
"Latn"
}
m = {
"Chairel",
5068066,
"sit-luu",
"Latn",
}
m = {
"Proto-Hrusish",
116773762,
"sit-hrs",
"Latn",
type = "reconstructed",
}
m = {
"Japhug",
3162245,
"sit-rgy",
"Latn",
}
m = {
"Proto-Kham",
116773773,
"sit-kha",
"Latn",
type = "reconstructed",
}
m = {
"Lizu",
6660653,
"sit-qia",
"Latn", -- and Ersu Shaba
}
m = {
"Longjia",
17096251,
"sit-cln",
"Latn"
}
m = {
"Luren",
16946370,
"sit-cln",
"Latn"
}
m = {
"Proto-Luish",
116773783,
"sit-luu",
"Latn",
type = "reconstructed",
}
m = {
"Puiron",
7259048,
"sit-zem",
}
m = {
"Proto-Sino-Tibetan",
24839178,
"sit",
"Latn",
type = "reconstructed",
}
m = {
"Situ",
19840830,
"sit-rgy",
"Latn",
}
m = {
"Proto-Tamangic",
117469295,
"sit-tam",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Tani",
116773284,
"sit-tan",
"Latn", -- needs verification
type = "reconstructed",
}
m = {
"Tangam",
17041370,
"sit-tan",
"Latn",
}
m = {
"Tosu",
7827899,
"sit-qia",
"Latn", -- also Ersu Shaba
}
m = {
"Tshobdun",
19840950,
"sit-rgy",
"Latn",
}
m = {
"Zbu",
19841106,
"sit-rgy",
"Latn",
}
m = {
"Proto-Slavic",
747537,
"sla",
"Latn",
type = "reconstructed",
entry_name = {
remove_diacritics = c.grave .. c.acute .. c.tilde .. c.macron .. c.dgrave .. c.invbreve,
remove_exceptions = {'ś'},
},
sort_key = {
from = {"č", "ď", "ě", "ę", "ь", "ľ", "ň", "ǫ", "ř", "š", "ś", "ť", "ъ", "ž"},
to = {"c²", "d²", "e²", "e³", "i²", "l²", "nj", "o²", "r²", "s²", "s³", "t²", "u²", "z²"},
}
}
m = {
"Proto-Samic",
7251862,
"smi",
"Latn",
type = "reconstructed",
sort_key = {
from = {"ā", "č", "δ", "", "ŋ", "ń", "ō", "š", "θ", "%(+%)"},
to = {"a", "c²", "d", "e", "n²", "n³", "o", "s²", "t²"}
},
}
m = {
"Proto-Songhay",
116773277,
"son",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Albanian",
18210846,
"sqj",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Kuliak",
116773779,
"ssa-klk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Koman",
116773775,
"ssa-kom",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Nilo-Saharan",
116773236,
"ssa",
"Latn",
type = "reconstructed",
}
m = {
"Forest Nenets",
1295107,
"syd",
"Cyrl",
translit = "syd-fne-translit",
entry_name = {remove_diacritics = c.grave .. c.acute .. c.macron .. c.breve .. c.dotabove},
}
m = {
"Proto-Samoyedic",
7251863,
"syd",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Tai",
6583709,
"tai",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Southwestern Tai",
116773280,
"tai-swe",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Bodo-Garo",
116773195,
"tbq-bdg",
"Latn",
type = "reconstructed",
}
m = {
"Bailang",
2879843,
"tbq-lob",
"Hani",
sort_key = "Hani-sortkey",
}
m = {
"Gokhy",
5578069,
"tbq-sil",
"Latn",
}
m = {
"Proto-Kuki-Chin",
116773220,
"tbq-kuk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Lalo",
116773781,
"tbq-lal",
"Latn",
type = "reconstructed",
}
m = {
"Laze",
17007626,
"sit-nas",
"Latn",
}
m = {
"Proto-Lolo-Burmese",
116773224,
"tbq-lob",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Loloish",
7251855,
"tbq-lol",
"Latn",
type = "reconstructed",
}
m = {
"Milang",
6850761,
"sit-gsi",
"Deva, Latn",
}
m = {
"Moran",
6909216,
"tbq-bdg",
"Latn",
}
m = {
"Ngochang",
56582,
"tbq-brm",
"Latn",
}
-- tbq-pro is now etymology-only
m = {
"Dukhan",
12809273,
"trk-ssb",
"Latn, Cyrl, Mong",
translit = {Mong = "Mong-translit"},
display_text = {Mong = s},
entry_name = {Mong = s},
}
m = {
"Old Anatolian Turkish",
7083390,
"trk-ogz",
"ota-Arab",
entry_name = { = "ar-entryname"},
}
m = {
"Proto-Turkic",
3657773,
"trk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Tupi-Guarani",
116773288,
"tup-gua",
"Latn",
type = "reconstructed",
}
m = {
"Kabishiana",
15302988,
"tup",
"Latn",
}
m = {
"Proto-Tupian",
10354700,
"tup",
"Latn",
type = "reconstructed",
}
m = {
"Alchuka",
113553616,
"tuw-jrc",
"Latn, Hans",
sort_key = {Hans = "Hani-sortkey"},
}
m = {
"Bala",
86730632,
"tuw-jrc",
"Latn, Hans",
sort_key = {Hans = "Hani-sortkey"},
}
m = {
"Kyakala",
118875708,
"tuw-jrc",
"Latn, Hans",
sort_key = {Hans = "Hani-sortkey"},
}
m = {
"Kili",
6406892,
"tuw-ewe",
"Cyrl",
}
m = {
"Proto-Tungusic",
85872335,
"tuw",
"Latn",
type = "reconstructed",
}
m = {
"Solon",
30004,
"tuw-ewe",
}
m = {
"Proto-Finnic",
11883720,
"urj-fin",
"Latn",
type = "reconstructed",
}
m = {
"Old Komi",
86679962,
"urj-prm",
"Perm, Cyrs",
translit = "urj-koo-translit",
sort_key = {Cyrs = s},
}
m = {
"Kukkuzi",
107410460,
"urj-fin",
"Latn",
ancestors = "vot",
}
m = {
"Komi-Yazva",
2365210,
"urj-prm",
"Cyrl",
translit = "kv-translit",
override_translit = true,
entry_name = {remove_diacritics = c.acute},
}
m = {
"Proto-Mordvinic",
116773232,
"urj-mdv",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Permic",
116773257,
"urj-prm",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Uralic",
288765,
"urj",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Ugric",
156631,
"urj-ugr",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Na-Dene",
116773233,
"xnd",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Mongolic",
2493677,
"xgn",
"Latn",
type = "reconstructed",
sort_key = {
from = {"č", "i", "ï", "ǰ", "ŋ", "ö", "š", "ü"},
to = {"c", "i" .. p, "i", "j", "n" .. p, "o" .. p, "s" .. p, "u" .. p},
},
}
m = {
"Buena Vista Yokuts",
4985474,
"yok",
"Latn",
}
m = {
"Delta Yokuts",
70923266,
"yok",
"Latn",
}
m = {
"Gashowu",
3098708,
"yok",
"Latn",
}
m = {
"Kings River Yokuts",
6413014,
"yok",
"Latn",
}
m = {
"Northern Valley Yokuts",
85789777,
"yok",
"Latn",
}
m = {
"Palewyami",
2387391,
"yok",
"Latn",
}
m = {
"Southern Valley Yokuts",
12642473,
"yok",
"Latn",
}
m = {
"Tule-Kaweah Yokuts",
7851988,
"yok",
"Latn",
}
m = {
"Proto-Yupik",
116773295,
"ypk",
"Latn",
type = "reconstructed",
}
m = {
"Proto-Min",
19646347,
"zhx-min",
"Latn",
type = "reconstructed",
}
m = {
"Shaozhou Tuhua",
1920769,
"zhx",
"Nshu, Hants",
generate_forms = "zh-generateforms",
sort_key = {Hani = "Hani-sortkey"},
}
m = {
"Sichuanese",
2278732,
"zhx-man",
"Hants",
generate_forms = "zh-generateforms",
translit = "zh-translit",
sort_key = "Hani-sortkey",
}
m = {
"Taishanese",
2208940,
"zhx-yue",
"Hants",
generate_forms = "zh-generateforms",
translit = "zh-translit",
sort_key = "Hani-sortkey",
}
m = {
"Old Novgorodian",
162013,
"zle",
"Cyrs, Glag",
translit = {Cyrs = "Cyrs-translit", Glag = "Glag-translit"},
entry_name = {Cyrs = s},
sort_key = {Cyrs = s},
}
m = {
"Old Ruthenian",
13211,
"zle",
"Cyrs",
ancestors = "orv",
translit = "zle-ort-translit",
entry_name = {
remove_diacritics = s.remove_diacritics,
remove_exceptions = {"Ї", "ї"}
},
sort_key = s,
}
m = {
"Old Czech",
593096,
"zlw",
"Latn",
}
m = {
"Old Polish",
149838,
"zlw-lch",
"Latn",
entry_name = {remove_diacritics = c.ringabove},
}
m = {
"Old Slovak",
12776676,
"zlw",
"Latn",
}
m = {
"Slovincian",
36822,
"zlw-pom",
"Latn",
entry_name = "zlw-slv-entryname"
}
return m_lang.finalizeLanguageData(m_lang.addDefaultTypes(m, true))