Alar is an open source Kannada dictionary made by V. Krishna that has became a very popular online source and described as "comprehensive" by Deccan Herald, as well as received positive feedback from Kannada news sources (see links on the bottom of this page). Even better, its corpus is licensed under ODbL 1.0 (source), which allows for sharing, creating, and adapting the database (including for commercial use) given attribution, share-alike and keeping it open (althoug you can also add DRM to it if you also offer a free version), and is available as a single YAML file on Github. (side note: Fenakhay brought up some issues on Discord using this link from OpenStreetMap, however I think this applies to a specific situation on OSM where users can change the license of their contributions, while all contribution to WMF wikis are GDFL/CC-BY-SA dual licensed per meta:Licensing update, and the "produced works" should still be compatible with the project as even static images (as opposed to dynmaps or pure data) of WMF maps are still compatible with OSM's licensing as detailed here).
With over 150k Kannada entries with corresponding English definitions, it would significantly enhance our current coverage of the language, which has only 1122 lemmas as of present in Wiktionary. So, given these circumstances, can we use a tool to run through the file, incorporate these entries into Wiktionary, and properly attribute them to Alar/the data source? I think that, if we could, it would prove to be very useful, given the scope and pre-formatting of the entries' data as well as, more uniquely, how recent it is (not often is there open data like this for recent works, its usually for works that are old enough to be in the public domain). MSG17 (talk) 02:31, 5 May 2021 (UTC)
On se, starting at Serbo-Croatian, all I see "Lua error: not enough memory". If anything that's important and Lua-related was changed recently, maybe it would be a good idea to change it back. -Xbony2 (talk) 14:06, 5 May 2021 (UTC)
Is there a limit to the number of branches in a #switch function? — Saltmarsh. 06:47, 7 May 2021 (UTC)
Alagoas is defined in the Portuguese section as "A state in the the Northeast Region of Brazil. Capital: Maceió." with an extra "the." Problem is, when I tried to edit it to remove the typo, I instead got a template saying "place|pt|state|macroregion/pt:Northeast Region|c/pt:Brazil|capital=pt:Maceió."
How do I fix this??
EDIT: Amapá has the same problem as well!
TheTheRemover (talk) 19:20, 11 May 2021 (UTC)
I listened to a Russian video and thought they said "можедельник", so I looked up this word here in English Wiktionary, but the search (Special:Search/можедельник) found nothing. It turns out there is an entry for можжевельник, which was the right word (juniper bush), I had only two letters wrong. But even a search for "можевельник" (just one letter missing, Special:Search/можевельник) gives no results at all. Is this really a state-of-the-art search function in the year 2021? Now, the same search in Russian Wikipedia (w:ru:Special:Search/можедельник) does find the entry. What's different? --LA2 (talk) 08:53, 16 May 2021 (UTC)
#wikimedia-tech
on Freenode, or on mw:Help talk:CirrusSearch. Perhaps there are language-specific search suggestion rules and the search suggestions here are English-based and the ones on the Russian Wikipedia are Russian-based? Just a guess. — Eru·tuon 05:10, 17 May 2021 (UTC){{sumerogram of}}
and {{akkadogram of}}
I have created these two small form-of templates, as they are greatly used in Akkadian entries. However, most definitions are Sumerogram for". Should this be addressed or is of good enough? Cheers, Swaare (talk) 09:41, 17 May 2021 (UTC)
I've encountered a presentation issue with Pali verbs in some non-Roman scripts. Because emboldening does not work well in some scripts, enlargement is being used as an alternative or supplement. This is seen in both headwords and quotations and usage examples. Unfortunately, Pali verb roots should only be given in the Roman scripts; there is no benefit in giving them in other scripts, because they are not words. (The Pali names of roots end in vowels; their English names mostly don't.) How can I induce Module:headword to recognise that they are in the Roman script. --RichardW57m (talk) 12:48, 18 May 2021 (UTC)
|f1sc=
of {{head}}
. I implemented the solution in {{pi-verb}}
on 20 February 2023. --RichardW57m (talk) 10:09, 16 March 2023 (UTC)I don't know if a similar consideration potentially applies to Sanskrit - perhaps they do intend to have dozens of manually translated and maintained pages for Sanskrit roots. It can be argued that Sanskrit roots are translingual. --RichardW57m (talk) 12:48, 18 May 2021 (UTC)
Do we control character folding for searches for Wikimedia-implemented searches of Wiktionary? For example, to find a Roman text word, I can generally find it without having to enter the diacritics. I assume this is done by some form of character folding. Do we control this mechanism? Could we set it up to not distinguish U+1004 MYANAMR LETTER NGA and U+105A MYANMAR LETTER MON NGA? In some (font-dependent) contexts, these two characters are indistinguishable. In the Default Unicode Collation Element Table (DUCET), and presumably therefore to ICU, they are defined to be as distinct as 'a' and 'b' are. Experimentation has shown that the search for a word encoded with one of the characters will not find a word encoded with the other. --RichardW57m (talk) 13:18, 18 May 2021 (UTC)
Have we, and if so how, solved the problem of multiple encodings for Malayalam chillus? --RichardW57m (talk) 13:18, 18 May 2021 (UTC)
Wiktionary would greatly benefit from an editing App as the literature search needed is very simple to add content to entries. Word translations and all can be conveniently added from a smartphone.
A mobile app is very much crucial and needed for Wiktionary. The old Wiktionary mobile app had 1 million+ downloads.
So please make a Wiktionary lite app/progressive web application based on monobook responsive skin or similar. Wiktionaries are losing huge number of potential edits due to the absence of an app. Thank you! Vis M (talk) 21:54, 18 May 2021 (UTC)
Hey- could you look at this LCCN for me? On the Dabancheng page, I've got a quotation from a 1993 book, and I want to add the LCCN link. When you click on LCCN: 93-83190 it says: "Your search found no results." But on this page in the book (near the top of the page) it clearly says "Library of Congress Catalog Card Number: 93-83190". What gives??? --Geographyinitiative (talk) 18:44, 19 May 2021 (UTC)
For some weeks now (on my screen at least) the "select target languages" option appears not to be working. However, on closer inspection today, it is working but the choice buttons are invisible. So you can make a choice, but only by guessing/clicking where the button might be. — Saltmarsh. 10:48, 21 May 2021 (UTC)
neither the template nor the module of sw-conj has been edited lately, so i assume that this is yet another instance of code changes lurking behind a template that isnt easy to see.
basically, i want to know if this is working as intended. don't get me wrong .... i love the way it looks now .... but i suspect Im in the minority. also, is it just sw-conj or have other templates also been expanded?
see Template:sw-conj/documentation#Example_usage for an example of what it looks like now.
thanks,
—Soap— 12:48, 22 May 2021 (UTC)
Hello,
I tried all possible combinations of noinclude's and includeonly's, but I don't seem to be able to prevent the documentation page of my template to be added to categories because of the examples shown on it. How can I present example keeping only the text output but not include the categories? Thx. Sitaron (talk) 20:10, 22 May 2021 (UTC)
In isopiptesis, the article was translated by Shoemaker, but it reads as if the entire journal was translated by Shoemaker. Can this be fixed? This, that and the other (talk) 10:48, 23 May 2021 (UTC)
I am currently working on etymologies of village names in Romania and the Category:ro:Villages in Romania category is getting a bit crowded, it would help to categorize them by county and, as far as I can tell, this would require a change in Module:place/shared-data, which is protected.
The county dictionary should look like this:
export.romanian_counties = {
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
= {},
}
Bogdan (talk) 12:20, 24 May 2021 (UTC)
This is an idea I have been thinking about - can categories for each language, on a page with more than one language, be shown under each language instead of being grouped at the bottom of the page. For instance, the page for Arne has no less than six languages. Or is this a discussion for the Beer Parlour? DonnanZ (talk) 20:30, 24 May 2021 (UTC)
We have for many categories a box showing "Recent additions to the category", and another for "Oldest pages ordered by last edit". Past experience had taught me the latter box is rarely accurate, and I wouldn't mind if this feature was removed, as both boxes on the right-hand side of the page can interfere with the page layout. An example of this is Category:en:Places in Dorset, England, where the "Pages in category" listing is all in one column, instead of two, because the two boxes on the right drop below the start of the listing; the list of subcategories isn't long enough to compensate. Removal of the second box, "Oldest pages ordered by last edit", would cure the problem. Unless there are other bright ideas around. DonnanZ (talk) 21:01, 24 May 2021 (UTC)
I just cleaned up a few Proto-West-Germanic entries that were sitting unnoticed in categories for Proto-Germanic, and it occurred to me that it would be fairly easy to flag such errors automatically: the language name is always right there between "Reconstruction:" and the the page name, and it's always the canonical name- our templates couldn't link to it otherwise.
Thoughts? Chuck Entz (talk) 09:11, 25 May 2021 (UTC)
{{reconstructed}}
could also parse the wikitext and look for templates (like {{der}}
or {{inh}}
) where the language code doesn't match the title of the page. But it's a bit more indirect, requires writing out all the template-specific behavior, and I'd rather not make the poor overworked servers do this for us. Looking at the categories will at least catch many uses of the wrong language code in templates because they often add categories. — Eru·tuon 05:25, 26 May 2021 (UTC)
Why is the category Category:English lemmas added to UserːEntity137/Sandbox, a user page? Does the template not check the namespace? J3133 (talk) 15:06, 29 May 2021 (UTC)
Category:English terms spelled with ː
. — Eru·tuon 17:52, 29 May 2021 (UTC)Right now, for Tibetan འགས་པ ('gas pa), it auto-generates the incorrect Wylie transliteration 'ags pa. The correct form should be 'gas pa, as confirmed by the Tibetan Living Dictionary, which means there is a bug in Module:bo-translit. Can anyone fix this? RcAlex36 (talk) 02:44, 30 May 2021 (UTC)