@Wyang, this is perfect! I'll try to implement in {{pi-alt}}
. —Aryamanarora (मुझसे बात करो) 15:50, 1 January 2016 (UTC)
@Wyang. What is the Thai transliteration based on?
E.g. อ in Thai proper is a silent consonant at the beginning of a word and ะ is a short vowel "a", a syllable "ra" should be "ระ" (not just ร) but a standalone syllable "a" should be "อะ". --Anatoli T. (обсудить/вклад) 09:58, 2 January 2016 (UTC)
@Wyang see bhātar and an-. Thanks! —Aryamanarora (मुझसे बात करो) 16:51, 31 January 2016 (UTC)
Bengali script form is breaking at kattar. I think it's because of the nuqta? —Aryaman (मुझसे बात करो) 00:00, 5 October 2017 (UTC)
@Wyang, it seems that the tables are showing Brahmi label but no Brahmi script. Can this be fixed? —*i̯óh₁nC 07:46, 4 December 2017 (UTC)
Aryaman Hi, is the Bengali version a modified version or traditional? Because traditionally Bengalis didn't use ৱ for va, they used ব for both. ৱ (wo) is an Assamese letter and it was derived from its earlier version: র (wo/va) which is still used in Mithilakshar. Sagir Ahmed Msa (talk) 14:00, 19 February 2018 (UTC)
@Octahedron80 I think just like Sanskrit, Pali also didn't use র for ra and ৱ for va. Are there any evidence? Even ৱ is a variant of র in Bengali. It's common to see ৱ used as ra in Bengali and other languages of Bengal. I think the Pali module should follow the Sanskrit module for Bengali alphabet where ব is used for both ba and va. I have seen a lot of writings from Bengal where ব is used for both b and v. Unfortunately I can't identify the languages. I've never seen writings from Bengal where ব is ba and ৱ is va. Msasag (talk) 16:59, 22 November 2018 (UTC)
What we need is evidence of Bengali script usage for Pali - preferably sources fit for use in quotations. It seems that at one time the Bengali script used the 'vva' conjunct ব্ব for va in Sanskrit. At https://www.nirvanapeace.com/buddhism-philosophy/buddhist-philosophy/121, we can find ব U+09AC for ba and ৰ U+09F0 for va! The example is বহুং ৰে সরণং যন্তি, পব্বতানি ৰনানি চ = "bahuṃ ve saraṇaṃ yanti, pabbatāni vanāni ca". RichardW57 (talk) 00:11, 23 November 2018 (UTC)
@RichardW57 Hi, the ৱ for va is not used in any traditional texts in the Bengal region. ব is used for both ba and va in Sanskrit, Bengali etc. Adding ৱ from Assamese is neologism. Please add sources that it is not and that ৱ for va is traditionally used in Bengal. Msasag (talk) 11:09, 26 December 2018 (UTC)
@Msasag: What traditional usage is there of Eastern Nagari for Pali?
I'm not confident that Pali and Sanskrit writing usage coincide. For example, I have seen a claim that to use niggahita for homorganic nasals before stops in Pali is simply wrong; whereas that is the standard Indian Devanagari practice for Sanskrit. (European Devanagari for Sanskrit uses the nasal consonant instead.)
The Thai script spelling we use for Pali uses a character (namely, phinthu) that was added in the 19th century. What do we have for modern Pali use in the Eastern Nagari script? All I could dig out from the Internet was U+09F0 BENGALI LETTER RA WITH MIDDLE DIAGONAL for va; my best guess is that should be interpreted as a substitute for the character formerly known as BENGALI LETTER VA WITH LOWER DIAGONAL because too many fonts and code charts show it with a dot rather than an extra stroke. The scanty evidence I have seen says that ba and va are different in modern Eastern Nagari script usage. Vidyābhūṣaṇ (1898) says that U+09F1 is acceptable for writing the sound or in English loanwords, so the letter is not such a stranger to Bengali usage.
Modern usage in books should trump traditional usage when it dominates it for Pali. However, I believe we need to allow for systematic alternatives in spelling; I intend to automate alternatives for the Tai Tham script. I expect at least a few alternatives will be needed for the Sinhalese script. (I have already automated Thai script alternatives in the conjugation of the middle voice of the verb.) --RichardW57 (talk) 16:43, 26 December 2018 (UTC)
@RichardW57 But U+09F1 was never used in Bengali for v/wa of English loanwords, ওয় is used instead. Like ওয়াশিংটন for 'Washington'. And ৱ is used for 'ra' instead. You can search আমাৱ and see a lot of results of Bengali texts using ৱ as 'ro'. In many fonts U+09B0 has a line instead of dot and in handwritings as well. And similarly in other variants of Eastern Nagari like Assamese, Mithilakshar, ৱ and র are variants of each other for wa/va sound. In Charyapada র is used for va (and ব for ba and ra. But sometimes ৰ for ra). Also in Bengali both র and ৰ were used for ra. And ব for ba and va is still used (check the Bengali alphabet, including different variants of some characters used in this 1778 book: https://books.google.co.in/books/about/A_Grammar_of_the_Bengal_Language.html?id=bttGAAAAcAAJ&printsec=frontcover&source=kp_read_button&redir_esc=y ). So it's better to use the traditional ব or the modern ওয়/ওঅ. Msasag (talk) 19:12, 27 December 2018 (UTC)
@Msasag Your arguments make a lot of sense for devising a transliteration to use on bn.wiktionary.org if there is no standard scheme for writing Pali in the script or transliterating it to the script. However, our primary aim here is to record the spelling used for Pali texts when written in the Eastern Nagari script. The scanty evidence I have seen for usage in the script shows that va is distinguished from ba and ra. What evidence have you on the actual usage for Pali in the Eastern Nagari script?
I tried to expand my knowledge by fishing on Google with various spellings of the exact phrase "evaṃ me sutaṃ":
"এৱং মে সুতং" (using U+09F1 BENGALI LETTER RA WITH LOWER DIAGONAL) got 0 hits! "এবং মে সুতং" (using U+09AC BENGALI LETTER BA) got 10 hits. "এৰং মে সুতং" (using U+09F0 BENGALI LETTER RA WITH MIDDLE DIAGONAL) got 20 hits. "এরং মে সুতং" (using U+09B0 BENGALI LETTER RA) got 0 hits
There was nothing in Google books, alas.
So, there is a case for *adding* the spelling with U+09AC as an alternative. I'm not sure how strong this is; the lack of a source for quotations is a handicap. @AryamanA @Wyang.
Should we take this to the Beer Parlour? --RichardW57 (talk) 23:02, 27 December 2018 (UTC)
{{ping:Octahedron80}: I've been watching the ongoing vandalism with horror. However, if we're going to use ৰ U+09F0 BENGAL LETTER RA WITH MIDDLE DIAGONAL for 'v', I thing the form before other consonants needs not to be <U+09F0, U+200C, U+09CD>, but <U+09F0, U+09CD, U+200C>. Test case: 'vyi', which one would naively encode as <U+09F0, U+200C, U+09CD, U+09AF, U+09BF> yields:
1) <U+09F0, U+09CD, U+09AF, U+09BF> is ৰ্যি - glyphs: i, 'v', modified ya 2) <U+09F0, U+200C, U+09CD, U+09AF, U+09BF> is ৰ্যি - glyphs: i, 'v', modified ya 3) <U+09F0, U+09CD, U+200D, U+09AF, U+09BF> is ৰ্যি - glyphs: i, 'v', virama, modified ya 4) <U+09F0, U+09CD, U+200C, U+09AF, U+09BF> is ৰ্যি - glyphs: 'v', virama, i, ya
Test case: 'vvi', which one would naively encode as <U+09F0, U+200C, U+09CD, U+09AF, U+09BF> yields
5) <U+09F0, U+09CD, U+09F0, U+09BF> is ৰ্ৰি - glyphs 'v', virama, i, 'v' 6) <U+09F0, U+200C, U+09CD, U+09F0, U+09BF> is ৰ্ৰি - glyphs 'v', virama, i, 'v' 7) <U+09F0, U+09CD, U+200D, U+09F0, U+09BF> is ৰ্ৰি - glyphs i, 'v', virama, 'v' 8) <U+09F0, U+09CD, U+200C, U+09F0, U+09BF> is ৰ্ৰি - glyphs 'v', virama, i, 'v'
The combinations 'vy' and 'vv' are the only clusters I can thing of that begin with 'v'.
Do you see something different? I see no repha! --RichardW57 (talk) 15:49, 31 May 2020 (UTC)
9) <U+09F0, U+09CD, U+09B9, U+09C7> is ৰ্হে - glyphs: 'v', virama, e, h 10) <U+09F0, U+200C, U+09CD, U+09B9, U+09C7> is ৰ্হে - glyphs: 'v', virama, e, h 11) <U+09F0, U+09CD, U+200D, U+09B9, U+09C7> is ৰ্হে - glyphs: e, 'v', virma, h 12) <U+09F0, U+09CD, U+200C, U+09B9, U+09C7> is ৰ্হে - glyphs: 'v', virama, e, ha
Again, no sign of repha! It may be worth noting that the renderer is HarfBuzz - I wouldn't guarantee all systems to be the same. I'm not sure of the font - the system seems to think it can fake a half-form by sticking a virama in, which is why the vowel ends up on the left in sequences 3, 7 and 11. In sequences 1 and 2, the special form of 'y' (ya phalaa?) indicates a non-initial consonant, so the vowel can be moved to the font. Otherwise, no form of consonant combination can be achieved (sometimes due to ZWNJ), so we get two orthographic syllables (as fallback in 5 and 9, by command in the others), so the vowel moves to after the 'v'. --RichardW57 (talk) 17:30, 31 May 2020 (UTC)
Now for 13 to 15, with Nirmala UI and either IE11 or the HarfBuzz renderer, I get:
13: i, 'v', ya phalaa, as desired. 14: i, 'vv' conjunct - what we want, but this is a very rare and strange sequence. 15: 'v', virama, e, h. I rather we had a conjunct, but if it's not available, this is an acceptable fallback.
Overall, the Unicode connections sequence <U+200D, U+09CD> seems to give the best results if we have to use U+09F0. --RichardW57 (talk) 02:41, 1 June 2020 (UTC)
@Octahedron80, AryamanA: We've just had a discussion on Template talk:pi-alt about the Bengali script equivalent of <v> when writing Pali. We now have a book ambiguously using U+09AC BENGALI LETTER BA and a book using U+09F0 BENGALI LETTER RA WITH MIDDLE DIAGONAL as this module already does - see the other discussion page for links. I intend to check what can be extracted from the online presence of the first book; at first glance its spelling looked a little inconsistent. For now, I'd say no action is needed, but we may find we will benefit from generating an extra transliteration. There is, I think, no action needed for transliteration to Latin; we certainly cannot distinguish initial <b> and <v> if the writing system does not. --RichardW57 (talk) 18:49, 17 May 2021 (UTC)
AryamanA Hi, if Rajbansi includes Koch Rajbongsi (Assam), Rangpuri (Bangladesh) and Kamatapuri (West Bengal) then it needs a similar template like this, with 3 alphabets. Koch Rajbongsi alphabet is similar to Assamese alphabet, means it has ৰ (ro) and ৱ (wo); Rangpuri uses a Bengali based alphabet, means it has র (ro) and ও (ó) for "r" and "w" respectively. Sagir Ahmed Msa (talk) 14:08, 19 February 2018 (UTC)
There are some problems similar to the one solved by replacing ᩈ᩠ᩈ <HIGH SA, SAKOT, HIGH SA> by ᩔ <GREAT SA>. In Pali, the subscript forms of RA, LA, LOW PA (Pali <b>) and HIGH RATHA (Pali <ṭh>) are not formed by prefixing SAKOT. Instead, special subscript consonants, MEDIAL RA, MEDIAL LA and SIGN LOW PA OR HIGH RATHA, are used instead. Thus, instead of ᨠ᩠ᩁ ᩃ᩠ᩃ ᨻ᩠ᨻ ᨭ᩠ᨮ, we have ᨠᩕ ᩃᩖ ᨻᩛ ᨭᩛ. (<SAKOT, LA> may be used occasionally to represent <al>.)
It is safest to restrict the use of SIGN LOW PA OR HIGH RATHA to the combinations ṭṭh ṇṭh bb mb.
Pali <p> is represented by ᨷ U+1A37 BA in the west (Northern Thailand and the Shan States), but ᨸ U+1A38 PA in Laos and Northeast Thailand. Thus the Tai Tham alternative form for pāpa is wrong for someone who uses Tai Tham in a Northern Thai context. -- RichardW57 (talk) 21:04, 5 September 2018 (UTC)
{{pi-alt}}
should generate them automatically. Or will you write a bot to report inconsistencies in what is passed to {{pi-alt}}
? The other sane alternatives are to:
{{pi-alt}}
, since there is so little duplication. The alternative script pages are just glorified redirects and a bot could be used to make them and handle their {{pi-alt}}
templates. I think having all these modules, as you have been suggesting and implementing for a while now, is overkill, and also it's neither editor- nor user-friendly (how would a new editor understand how to add forms in the data module? what if a user wants to see where the data is stored?). I don't edit much Pali anymore so I have taken a laissez-faire approach, but if I was more involved with Pali I would have been a bit more vocal about my disapproval. I don't think this is the right direction to go with Pali entries. —AryamanA (मुझसे बात करें • योगदान) 03:28, 30 September 2018 (UTC)
{{pi-alt}}
is the first step towards making a bot for automatically creating alternative script pages. However, the data on what alternative script pages to create would have to be stored somewhere. Where? A big step was the automation of {{pi-alt}}
, but I think it needs to handle automatic variation *within* scripts. This variation is mostly between writing systems. (Incidentally, is the writing system of the old PTS dictionary considered too rare to fully accommodate?) However, not all variation is of this nature. We also have variable Sanskritisation (or apparent Sankritisation), as in {{l|pi|tatra} v. tattha and {{l|pi|ti} v. {{l|pi|tri}. There is also local variation. Francis Mason reports in his grammar that the Burmese used -smi, -smiṃ, -smī and -smīṃ indiscriminately as the locative singular ending. The last of them is a fairly fundamental breach of the laws of Pali phonology! I think fully grammatical tasmiṃ deserves a lemma-like entry of its own, but how far those Burmese variants deserve propagation (as opposed to cross-referencing) across scripts is another matter. A few words can be supported manually - though expect groans when Pali-capable Lao is added. (The addition to Unicode of the consonants restored or invented by the Buddhist Institute in the 1930's has started.){{pi-alt}}
and any associated data module, that is what documentation pages are for. ({{pi-alt/documentation}}
does need some improvement, but the associated error-checking helps a lot.) Now, a template like pi-alt-pāpa to generate the alternative forms of pāpa (or should that be the less clear pi-alt-paapa?) should be fairly clear to the user. At present, to understand {{pi-alt}}
one has to drill down to the module. A bot that made the invocations consistent could lead to edit wars, as has happened on Wikipedia with Thai minority languages and bots that enforce the WTT-type rules of Thai orthography. In this case, a bot should make suggestions, not changes. Otherwise, trouble could arise when the deletion of an alternative form is requested, as I must get round to for a Devanagari Pali entry with -ghgh-. RichardW57 (talk) 07:30, 30 September 2018 (UTC)@Octahedron80, @AryamanA: The digraph in 'dh' in the sequence 'dhy' is not recognised. This was shown in the cross-script declension test Module:pi-decl/noun/testcases/documentation for bodhi, which has oblique singular bodhyā if it follows the paradigms. The oblique singulars of बोधि and โพธิ are showing up as बोद्ह्या and โพทฺหฺยา in the 'expected' column, which shows a transliteration of the Roman script case form. - RichardW57 (talk) 20:11, 29 September 2018 (UTC)
Well, I can't find bodhyā, but there are several words with 'dhy'. The commonest seems to osadhyo, which occurs as the unassimilated syncopated vocative plural of osadhī 'medicinal plant in:
in Verse 519 of the Vessantara Jataka (Verse 2174 of the Mahanipato). Thus, we have a bug to be fixed. - RichardW57 (talk) 00:00, 30 September 2018 (UTC)
onset
on line 332 into onset
and glide
, but it would depend on what similar cases there are. Wyang (talk) 10:47, 1 October 2018 (UTC)
Fancy stuff like 'medials', glide handling converting homorganic nasals to niggahita and phonetic syllables goes in the script specific bits. The Thai script has complication with words like 'guyho' - majority คุโยฺห in usage, but majority คุยฺโห in mentions - if we stick to forms with phinthu, and I'm not sure you want to call Pali <h> a 'glide'. RichardW57 (talk) 12:34, 1 October 2018 (UTC)
This should be transliterated as ဝေါ, but is being transliterated as ဝော, which truly looks like 'te' in my browser. - RichardW57 (talk) 20:11, 29 September 2018 (UTC)
@Octahedron80, Wyang, AryamanA, RichardW57: Khmer and Burmese sources for Khmer ព្រហ្មញ្ញ (prummañ, “brahmin”) and Burmese ပုဏ္ဏား (punna:, “brahmin”) use Pali etymologies but in different scripts Pali brahmañña (“brahmin”) for Khmer and Pali ဗြာဟ္မဏ (brāhmaṇa, “brahmin”) for Burmese, which is their usual convention. It should be the same Pali word. Is there an inline template call for converting Pali to different scripts or back to Roman? --Anatoli T. (обсудить/вклад) 08:07, 1 October 2018 (UTC)
{{bor|my|pi|}}
. --Anatoli T. (обсудить/вклад) 08:31, 1 October 2018 (UTC)Look at this one: @ "9 Attributes of the Buddha". It shows that <da> + <vowel aa> in Tai Tham and Myanmar do not act same behavior; Myanmar use tall aa ဒါ but Tai Tham use normal aa ᨴᩣ. Northern Thai also does not use tall aa in this case. The reason is Tai Tham <a> ᩋ (compare ᨴᩣ) does not look like Myanmar <a> အ (compare ဒာ). (As I said recently, the aa form will be considered by the top of consonant stack.) So gyadā in testcases must be ᨣ᩠ᨿᨴᩣ instead of ᨣ᩠ᨿᨴᩤ.
According to the page, Tai Tham uses ᨸ (with tail) instead of ᨷ against Myanmar ပ. Just to ensure that Pali use both ᨷ/ᨸ in writings.
May be not related, but I have this book to check Northern Thai vocabulary. It contains little error but useable. --Octahedron80 (talk) 02:58, 2 October 2018 (UTC)
My longer term aim for this module is that, at least when servicing {{pi-alt}}
, it will return a list of alternative transliterations for the non-Roman script. It may be appropriate to do that through an explicit target argument. One complication is that not all options are simultaneously present - ᩈᨻ᩠ᨻᩣ might not be a possible transliteration of sabbā, though ᩈᨻᩛᩣ, ᩈᨻᩛᩤ and ᩈᨻ᩠ᨻᩤ will all be found. RichardW57 (talk) 09:42, 2 October 2018 (UTC)
@Wyang: Is there any rationale to the transliteration of consonant clusters (back) to the Sinhalese script? It's not the way Mazard writes them.
@Wyang, @AryamanA, @Octahedron80: I've now amassed quite a collection of Sinhala examples (see Category:Pali_terms_with_inconsistent_transliterations) to suggest that clusters should generally be made out of touching letters. Dow we need to accept any clusters made with al-lacuna without ZWJ? It affects the inflection of the '(m/v)ant' stems. RichardW57m (talk) 12:57, 22 October 2018 (UTC)
@Wyang, @AryamanA, @Octahedron80: I've changed transliteration so that Sinhala consonant clusters use touching consonants or conjuncts. The conjuncts are used for -r and -y as the second element, and the 6 exceptional clusters kv, tth, tv, nth, nd, ndh. It may be that we will also need repha, though there seem to be very few examples in mainstream Pali. RichardW57 (talk) 22:48, 8 November 2018 (UTC)
@AryamanA, @Octahedron80: I've found another exceptional cluster, 'nv' that is also written as a conjunct. If we take the source of the quote for the Sinhala script form of upagacchati, it's found at Verse 37. (It's Verse 31 at tipitaka.org.) It's another one we don't have font for - Iskoola Pota seems to have overlooked it (neither conjunct nor touching ligatures) and the list of conjuncts and touching forms in the introduction of the book also omits it. I'm mentioning it here so I don't forget it. It also needs to be handled properly in inflection - please no global substitutions. --RichardW57 (talk) 11:46, 4 June 2020 (UTC)
We may now have a complete list for Pali - there are 6 plausible combinations out of , unless 'kth' also occurs. --RichardW57 (talk) 11:46, 4 June 2020 (UTC)
I'm bending the LKLUG font to support Pali. (It's from GNU, so I can publish it when I've finished.) I've already added a set of touching forms, but the conjunct forms will be more effort - I'm going to have assemble each ligature. The conjuncts nd and ndr are already in LKLUG. It's worse than it sounds, for when I make the conjunct for kv, I've also got to add ligatures for kvi, kvī, kvu and kvū. --RichardW57 (talk) 11:46, 4 June 2020 (UTC)
From what I'm seeing the commoner writing for Pali 'y' seems to be ຍ rather than ຢ. This seems to be the case even when only the Lao lao consonants are used, and so ຍ represents both 'y' and 'ñ'. Does this correspond to others' observations? @Octahedron80 I've added a parameter |y=
to the inflection templates to handle the endings. -- RichardW57 (talk) 22:31, 28 June 2019 (UTC)
The rule
doesn't always work, even in writing systems that normally use tall AA after those base consonants. I propose we gather together examples so that we can make an evidence-based refinement to the rule. Note that inflection tables already allow fine-tuning between round AA and tall AA. Please ensure that the examples come from texts where tall AA is used when there is no subscript. --RichardW57 (talk) 13:36, 4 April 2020 (UTC)
Latin | Example |
---|---|
gg | |
ggh | |
dd | |
ddh | ᨻᩩᨴ᩠ᨵᩤᨶ (buddhāna) (should also found a good round AA counterexample) --RichardW57 (talk) 13:36, 4 April 2020 (UTC) |
dhy | |
pp | |
pph | ᨷᨷ᩠ᨹᩣᩈ (papphāsa) --RichardW57 (talk) 13:36, 4 April 2020 (UTC) |
vy |
There's a specific vowel to denote the "iṃ". For example:
In Khmer, Thai and Lao, no vowel stakes on another vowel. Also, it's not appear well on some device. Hence, it should be កឹ, กึ, and ກຶ, respectively. — Pichnat Thong • 16:47, 20 August 2020 (UTC)
We have, I believe, three quotations giving in Thai script with implicit vowels, all from the same book. An example is
@Pichnat Thong, Octahedron80: What is printed in the book there? Sara ue or <sara i, nikkhahit>? I transcribed it as the latter, but I am worried that I am wrong. Within the book, the precise typeface seems only to be used for Pali. (It could have been chosen because it handles marks below well.) --RichardW57 (talk) 17:43, 22 August 2020 (UTC)
I have added Mon, Old Shan, New Shan variations of Burmese script. You can look at 'variations' table to see what is changed between them. --Octahedron80 (talk) 02:56, 5 December 2020 (UTC)
Conversion to Old Shan was generating hybrid forms (e.g. ၺႃယဿ compared to standard Burmese script ဉာယဿ), which is wrong for servicing {{pi-alt}}
. I've changed it so that words with GREAT SA will be left in the Burmese form rather than converted to Old Shan. This does not affect the conversion of words to the New Shan variant. --RichardW57m (talk) 13:12, 19 August 2021 (UTC)
@Octahedron80, Theknightwho: Following the discussion at User talk:RichardW57#ကီ, I am commenting out the conversion of Mon Pali to ီ rather than to ိံ until we get evidence of it good enough to defend an entry. (Corrected text.) --RichardW57 (talk) 01:10, 26 November 2023 (UTC)
@Erutuon, Msasag, Octahedron80, RichardW57m
How could the Chakma script be added to the module? --Apisite (talk) 13:02, 15 October 2021 (UTC)
@Erutuon, Msasag, Octahedron80, RichardW57m
The consonant vaa for Pali has been added. --Apisite (talk) 13:48, 16 October 2021 (UTC)
Isn't Cyrillic more urgent? I'm not sure how many dead bodies would be required for Chakma. The CFI issue is important. How many durable instances do we have? Are people likely to just come across Pali in Chakma script? The perceived need for the letter VAA for base letter 'v' is odd, when one can see that WAA is being used for subscript 'v'. --RichardW57 (talk) 19:51, 16 October 2021 (UTC)
The technical issues that immediately spring to mind are:
For transliteration from Chakma, the following questions also arise:
The only texts I have available for analysis are those in the Unicode proposals. --RichardW57 (talk) 19:51, 16 October 2021 (UTC)
@Apisite: Do you have any other Pali texts to offer? --RichardW57m (talk) 12:01, 18 October 2021 (UTC)
(Notifying RichardW57): , @Erutuon, Msasag, Octahedron80, Apisite: For transliteration from the Chakma script, I would recommend creating Module ccp-translit, with a switch on language for differences between Chakma and Pali. The use of U+11102 CHAKMA SIGN VISARGA for Pali ā is quite remarkable. --RichardW57m (talk) 14:42, 18 October 2021 (UTC)
@Apisite: Here's an awkward question. Is subscript 'v' encoded as U+11131 CHAKMA O MARK or as <U+11134 CHAKMA MAAYYAA, U+11124 CHAKMA LETTER WAA>? --RichardW57 (talk) 21:56, 18 October 2021 (UTC)
@Apisite: I've added Chakma to this module. The limited data I have is consistent with 𑄠𑄢𑄣𑅇 (yrlv) being subscripted in clusters and all other consonants, including 𑄚 (n), not being subscripted. Until such time as Chakma is added to the script list for Pali in Module:languages/data2, {{link}}
and {{mention}}
will require |sc=
for transliteration to occur - but you didn't ask about these! --RichardW57 (talk) 03:48, 22 October 2021 (UTC)
RichardW57 (talk) 03:48, 22 October 2021 (UTC)
@Apisite: I've now added inflection capability for Chakma. For {{pi-decl-noun}}
you need to add new argument |sc=Cakm
and old argument |ending=
ceases to be optional. For {{pi-conj-special}}
, you need to add new argument |sc=Cakm
. --RichardW57 (talk) 17:23, 23 October 2021 (UTC)
{{ping|Apisite}: I've now knocked up the inverse transliteration, from Chakma to Latin, for Pali. It's Module:Cakm-translit. Unfortunately, there are some unexplained discrepancies between the automated and manual transliterations for Chakma. Does that 'transliteration' depend on invisible phonetic facts? --RichardW57 (talk) 16:49, 20 October 2021 (UTC)
Fixing vital punctuation and re-asking:
@Apisite: I've now knocked up the inverse transliteration, from Chakma to Latin, for Pali. It's Module:Cakm-translit. Unfortunately, there are some unexplained discrepancies between the automated and manual transliterations for Chakma. Does that 'transliteration' depend on invisible phonetic facts? --RichardW57m (talk) 16:39, 13 May 2022 (UTC)
@RichardW57m We'll have to make a poster, that says this: "Help wanted from any people with knowledge of Pali written in the Chakma script." --Apisite (talk) 00:40, 2 July 2023 (UTC)