Hello, you have come here looking for the meaning of the word Wiktionary:Grease pit. In DICTIOUS you will not only get to know all the dictionary meanings for the word Wiktionary:Grease pit, but we will also tell you about its etymology, its characteristics and you will know how to say Wiktionary:Grease pit in singular and plural. Everything you need to know about the word Wiktionary:Grease pit you have here. The definition of the word Wiktionary:Grease pit will help you to be more precise and correct when speaking or writing your texts. Knowing the definition ofWiktionary:Grease pit, as well as those of other words, enriches your vocabulary and provides you with more and better linguistic resources.
This is an area to complement the Beer parlour and Tea room. Its purpose is specifically for discussing the future development of the English Wiktionary, both as a dictionary and thesaurus and as a website.
The Grease pit is a place to discuss technical issues such as templates, Lua modules, CSS, JavaScript, the MediaWiki software, extensions to it, abuse filters, Toolforge, etc. It is also the second-best place, after the Beer parlor, to think in non-technical ways about how to make the best, free, open online dictionary of “all words in all languages”.
Others have understood this page to explain the “how” of things, while the Beer parlour addresses the “why”.
Permanent notice
Tips and tricks about customization or personalization of CSS and JS files are listed at WT:CUSTOM.
Latest comment: 1 month ago1 comment1 person in discussion
If a qualifier is used but not for the preceding form, it is added to the wrong form; e.g., if |mpqual= and |mpqual3= are used but not |mpqual2=, |mpqual3= is added to the second plural (|mp2=). J3133 (talk) 07:47, 1 December 2024 (UTC)Reply
Why do in-line references after tlb get put on a separate line?
Latest comment: 1 month ago3 comments3 people in discussion
E.g. see gōian. I don't think this is ideal formatting, is it? I tried putting the reference inside the template to avoid this, but that breaks the link, so it isn't a valid solution. Urszag (talk) 11:06, 2 December 2024 (UTC)Reply
Latest comment: 1 month ago1 comment1 person in discussion
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Starting this week, Wikimedia wikis no longer support connections using old RSA-based HTTPS certificates, specifically rsa-2048. This change is to improve security for all users. Some older, unsupported browser or smartphone devices will be unable to connect; Instead, they will display a connectivity error. See the HTTPS Browser Recommendations page for more-detailed information. All modern operating systems and browsers are always able to reach Wikimedia projects.
Starting December 16, Flow/Structured Discussions pages will be automatically archived and set to read-only at the following wikis: arwiki, cawiki, frwiki, mediawikiwiki, orwiki, wawiki, wawiktionary, wikidatawiki, zhwiki. This is done as part of StructuredDiscussions deprecation work. If you need any assistance to archive your page in advance, please contact Trizek (WMF).
This month the Chart extension was deployed to production and is now available on Commons and Testwiki. With the security review complete, pilot wiki deployment is expected to start in the first week of December. You can see a working version on Testwiki and read the November project update for more details.
View all 23 community-submitted tasks that were resolved last week. For example, a bug with the "Download as PDF" system was fixed.
Updates for technical contributors
In late February, temporary accounts will be rolled out on at least 10 large wikis. This deployment will have a significant effect on the community-maintained code. This is about Toolforge tools, bots, gadgets, and user scripts that use IP address data or that are available for logged-out users. The Trust and Safety Product team wants to identify this code, monitor it, and assist in updating it ahead of the deployment to minimize disruption to workflows. The team asks technical editors and volunteer developers to help identify such tools by adding them to this list. In addition, review the updated documentation to learn how to adjust the tools. Join the discussions on the project talk page or in the dedicated thread on the Wikimedia Community Discord server (in English) for support and to share feedback.
Latest comment: 1 month ago1 comment1 person in discussion
I need to add some information there so that categories I created show up on the category tree. I’d appreciate it if someone could either tweak permissions so I can edit the page or paste the information I need to add in my stead. Ping @Benwing2 so he can be aware I made this post.
Below is the code I need added.
raw_categories = {
description = "The following terms were officially used in Brazilian Portuguese — but not European Portuguese — until December 2015, when the period of transition between the previous spelling standard and the ] ended.",
parents = {
{name = "Portuguese forms superseded by AO1990", is_label = true, lang = pt},
{name = "Brazilian Portuguese forms", raw = true},
},
}
raw_categories = {
description = "The following terms were officially used in European Portuguese — but not Brazilian Portuguese — until October 2015, when the period of transition between the previous spelling reform and the ] ended in Cape Verde. Portugal’s transition period ended in May 2015.",
parents = {
{name = "Portuguese forms superseded by AO1990", is_label = true, lang = pt},
{name = "European Portuguese forms", raw = true},
},
}
A template to handle large amounts of references in Further reading
Latest comment: 1 month ago1 comment1 person in discussion
I've been pondering - for at least Polish, the Further reading section can get very full - each template however is often used for something on a page, be it to support something such as Middle Polish or dialectal pronunciations/definitions, or obsolete definitions, etc., and also it's generally all the most commonly referenced material within Polish linguistics. This can become unwieldy, so I was thinking, what if we had a template such as {{bibliography}} that had some similar features of {{reflist}}, i.e. it can control the size of the references and such, but also local grouping, i.e. I could create categories "Middle Polish" and have the appropriate templates nested under that, or "Dialectal:" and then a subgrouping "Masovian", etc. etc. Vininn126 (talk) 09:58, 5 December 2024 (UTC)Reply
More advanced support of noun class notation?
Latest comment: 1 month ago16 comments4 people in discussion
The Module:gender and number currently gives a large array of possible ways of specifying gender (such as m or f by sense), while it seems to disallow any class specification more complex than class foo. For many languages with noun classes, this is enough I guess, but there are cases where I’d like to give more information (such as in Swahili).
One issue is that there are different conventions, and they’re horribly confusing. The pan-Bantu convention, for example, is to number all classes, singular and plural separately. So *kɪ̀- and its descendants is class 7, *bì- is class 8, etc. In some modern Bantu languages the classes neatly pair up in singular and plural, so for example in Luganda, those two are jointly called class IV (see here for example).
I don’t think there’s a set convention for roman vs. arabic numerals. Our Appendix:Swahili noun classes uses roman numerals for the Proto-Bantu classes, for one. This means that any drive-by user who sees something labeled class IV can’t know whether that’s Proto-Bantu class 4, or some language-specific conflated class or other, or something else entirely.
One solution is using noun prefixes: class ki- etc. But that can be ambiguous. Or we could write class ki-(VII) and the like, but that gets clunky. Another is to put links to the appendix, as {{sw-noun}} currently does. This is not possible using the {{head}} template, I think. ({{sw-noun}} is currently hard-coded, no Lua, no {{head}} for inflections or class specifications, etc. I’ve started working on a version using our current infrastructure.)
To me the best solution would be more informative mouseovers: so we write class VII, and on mouseover instead of the rather unhelpful “noun class VII”, we’d see “class ki- (VII)”, or something of the like. But that would require modifying Module:gender and number in ways that go way beyond my abilities. Add to this the existence of Swahili nouns that have different class agreement by sense, and I hope you understand my pickle. ☺ MuDavid 栘𩿠 (talk) 03:05, 6 December 2024 (UTC)Reply
@MuDavid This would require adding language-specific info to Module:gender and number, but there is precedent for doing this as there are other modules with lang-specific info attached. To handle Swahili nouns with different class agreement by sense, you'd create multiple headwords, one per class agreement (they don't have to be in separate Etymology sections if it doesn't make sense to do so). Benwing2 (talk) 21:49, 7 December 2024 (UTC)Reply
Given there’s many languages with class agreement, and they tend to have many classes, I guess the lang-specific info had better go into separate data pages, no? MuDavid 栘𩿠 (talk) 02:45, 9 December 2024 (UTC)Reply
Today I discovered there are words that can take two different agreement classes without any change in meaning, apparently based on whim of the speaker. I think I could hack together a solution in Module:sw-utilities, but I also think extended functionality in Module:gender and number would be better. MuDavid 栘𩿠 (talk) 03:43, 13 December 2024 (UTC)Reply
@MuDavid Headwords support multiple genders, so this wouldn't be a problem. As for adding language-specific classes, can you give me a list of the Swahili noun classes and how you want them to appear? Keep in mind that mouseovers/tooltips don't work on mobile, so we might want a different display for phones. Benwing2 (talk) 03:53, 13 December 2024 (UTC)Reply
The left column in this table is a almost full list. There’s some words with mixed agreement m(I)/n(IX), such as rafiki.
I feared you’d bring up mobile at some point. @tbm, do you have any ideas?
On desktop I think my preferred version would be to show the roman numeral (as is currently the case), and give a mouseover with extra information, such as “agreement in ki class(VII)” for example (but without the link/markup as that doesn’t work?).
Is a completely different display on phones possible? If so, something like “ki(VII)” would be my preferred output (removing the word “class” to reduce clutter as a tradeoff for showing the prefix as well). If not, I’d vote for what I described for desktop above (to be consistent with other Bantu languages in look); if our convention isn’t clear, visitors can look up the adjective they need to decline to see it all, like: if you open the inflection table at -angu, there’s a full list with the roman numerals given. The extra information from the mouseover isn’t vital.
@MuDavid Yes, it should be possible to display different content on mobile phones. @Ioaxxere @Surjection sorry for the ping again; it looks like the way to do this is to conditionalize on the screen width, per and . Different sets recommend a max width of 800-1000 pixels for the mobile cutoff. For some reason we don't have nice CSS classes already provided to make this easy; any objections to me adding this, or other thoughts? Benwing2 (talk) 09:33, 13 December 2024 (UTC)Reply
I'm not sure why we are talking about viewport width when MediaWiki has a fully separate mobile site/frontend/skin which you can easily check via CSS or any similar means. However, I'm not very fond of the idea of showing different text on mobile than on desktop. I've always considered it disappointing that mobile browsers never implemented any way to show tooltips. — SURJECTION/ T / C / L /10:29, 13 December 2024 (UTC)Reply
Something else I noticed: if a gender is missing, the entry goes into the category “Requests for gender in lang entries”. But if it’s a class that’s missing (through |g=c?) then it isn’t. I for one think it should. @Tbm, do you agree? MuDavid 栘𩿠 (talk) 03:38, 11 December 2024 (UTC)Reply
Latest comment: 1 month ago1 comment1 person in discussion
The template {{bnt-verb}} seems to have a problem with infinitive generation: it only gives “*kʊ̀”. See here for example; it should be “*kʊ̀béeka” (I think). It used to work, and no edits to the template were made since then, so the problem must be “backstage”, and I don’t know how to figure this out or solve it myself. MuDavid 栘𩿠 (talk) 03:10, 6 December 2024 (UTC)Reply
etymology-only languages bug
Latest comment: 1 month ago2 comments2 people in discussion
It was a known bug that was fixed within 35 minutes but involved potentially thousands of entries, so it may take a while to clear comppletely. Chuck Entz (talk) 20:15, 6 December 2024 (UTC)Reply
"Category:Pages with entries"
Latest comment: 1 month ago18 comments5 people in discussion
@Taylor 49 @Benwing2 Even after a few months, the category is still populating, because those pages haven't been refreshed in the cache yet. I suspect pages are prioritised by the server based on daily/monthly page views, which will be 0 for many non-lemma pages. You can find all pages which aren't yet in the category with the search -incategory:"Pages with entries", selecting the mainspace and Reconstruction namespace (note the hyphen at the beginning, which makes it a negative search). Searching that just now gives 23,649 pages. Theknightwho (talk) 21:58, 7 December 2024 (UTC)Reply
Yuck, I didn't realize there were still headword templates coded like that but it makes sense, there are a lot of dusty corners in Wiktionary. Benwing2 (talk) 21:52, 7 December 2024 (UTC)Reply
I can see that 5 of the listed "dusty" templates are already deleted, after only 6 hours. @Theknightwho: Where is the caching implemented? To me it looks like that the page reads, parses and categorizes itself on every word class for every language, that can be over 100 times for some words. What ensures that this task is done only one time per page? AFAIK mw:Extension:Variables is banned on WMF wikis. Taylor 49 (talk) 19:43, 8 December 2024 (UTC)Reply
@Taylor 49 Calls to getContent() are cached automatically in MediaWiki, so the actual database call happens only once per page. Furthermore, the heavy-duty page parsing is loaded using mw.loadData() through Module:headword/data, and data loaded this way is cached and executed only once per page. The code on line 749 of Module:headword only calls process_page() if the pagename is specified using the `pagename` field and isn't the same as the actual pagename (which generally only happens on test and documentation pages); otherwise it simply uses the precomputed processed page. There are probably other levels of caching as well that @Theknightwho knows more about. Benwing2 (talk) 20:23, 8 December 2024 (UTC)Reply
@Benwing2 @IP 115 @Theknightwho I did some systematic purging, fixed the sign language headword templates (admittedly somewhat crudely), and added an invocation of {{head}} inside {{cuns}}. This has reduced the number of pages in the search from >20,000 to <100.
Yes, indeed. This is a question for WT:RFDO really, but it is redundant.
There is one issue: Greek comparative adjective forms tend to be presented as non-lemma forms of the positive adjective, instead of non-lemma forms of the comparative adjective. Compare:
@This, that and the other In order to convert to the "even more useful" form you mentioned above, the bot would have to determine what the comparative adjective lemma is, which isn't currently specified in the call to {{el-form-of-nounadj}}. Do you know how hard that is? Can it be inferred from the term pagetitle itself? Benwing2 (talk) 21:45, 7 December 2024 (UTC)Reply
@Octahedron80 @This, that and the other @Saltmarsh I have a script ready to convert {{el-form-of-nounadj}} to {{infl of|el}}. I found it easier to derive the comparative/superlative base lemma directly from the non-lemma pagetitle by chopping off the ending and replacing it with -ος. Saltmarsh, in almost all cases the resulting text is shorter and easier to type. For example:
@Sarri.greek, Benwing2 Can I stress that in spite of my resistance I am not adverse to change. Common formats will bring a unified feel to Wiktionary. Nevertheless {{el-form-of-nounadj}} and similar language specific templates have the advantage of speeding up data entry, and more importantly making editing easier for new editors (one look at the Documentation for {{inflection of}}, which refers onward to modules etc, puts me off and it certainly do it for newbies. Experienced editors cannot see how things look to newcomers, and life here has become much more technical since I started. Some of this could be overcome by rewriting Wiktionary:About Greek - a job I could do without.
One question which my look at the documentation hasn't helped me with. Can {{inflection of}} accomplish mixed cases and genders as {{el-form-of-nounadj}} does?
# {{el-form-of-nounadj|μικρός|g=n|c=nav|n=s}}
gives:
1. Nominative, accusative and vocative neuter singular form of μικρός (mikrós).
If I have a problem (or lack the patience) reading the documentation new editors certainly will.
Dear @Saltmarsh, the Wiktionary:About Greek is ok, I do not see anything that needs correcting. The universal system for writing heads, inflections etc covers everything, I think. Now, the robots are doing what we used to do manually in the earlier days of wiktionary, so, please do not worry. ‑‑Sarri.greek♫I12:55, 10 December 2024 (UTC)Reply
PS to @WingerBot, Benwing, notifying my admin @Saltmarsh. Whatttt is this? στήριξα in ancient and modern Greek?? The way things are written are different in grc and el? The point is to make them similar? Why is grc like that? Why should an editor have to learn 2 or even 3 ways to write one and the same thing? Please unify them. If not unified, what is the point of changing only one of them? ‑‑Sarri.greek♫I15:55, 10 December 2024 (UTC)Reply
@Saltmarsh The equivalent of {{el-form-of-nounadj|μικρός|g=n|c=nav|n=s}} using {{infl of}} is {{infl of|el|μικρός||nom//acc//voc|n|s}}. I assume this is what you mean by "mixed cases and genders"; you just put the different cases and genders together and separate by //. In general, if you are not sure how something works, check out the examples first; Example 2 under {{inflection of}} demonstrates exactly this usage. As for having a separate way of doing things for every language, that simply won't work; it leads to siloed language communities, making it hard or impossible for new users to contribute, and you get a non-unified look that makes the whole site look amateurish. Greek in fact is particularly bad because there's a whole ecosystem of weird templates like {{el-example}} that do general things in a Greek-specific way and really aren't necessary. Furthermore, documentation for individual-language-specific templates tends to be worse, and worse-maintained, that the language-agnostic documentation. For example, the documentation for WT:About Greek doesn't actually say anything about Greek templates, and the documentation at Wiktionary:Greek_headword-line_templates was completely out-of-date until I fixed it up to some extent last night.
@Sarri.greek I assume you are referring to the headword templates. Ancient Greek is already using {{inflection of}} for its definition lines and will be unified with Modern Greek as soon as I make the change to obsolete {{el-form-of-nounadj}}. I will do the Ancient Greek headword templates next; they are no more necessary than the Modern Greek ones we just eliminated.
Latest comment: 1 month ago5 comments3 people in discussion
This gadget is on by default for everyone. However, I can't tell what it actually does. I tried comparing a few pages relating to the revision deletion log with it turned on and off and never noticed a difference. Is it nonfunctional? Ioaxxere (talk) 01:53, 8 December 2024 (UTC)Reply
Based on Kephir's edit summary when creating it ("display revdel log on inaccessible revision/diff pages") and its description ("Display excerpts from the revision deletion log when trying to view deleted revisions and diffs between them"), I gather it pulled the relevant revision deletion event (who suppressed the text, edit summary, or username of a given revision) out of the log and displayed it when someone tried to view a revision or diff which had been hidden, to prevent people being confused like this about a situation where information is missing like the username here. (We still have a gadget which does something vaguely similar, pulling up a more informative excerpt of the block log on blocked users' contrib pages.) As far as I can tell, it doesn't work anymore; probably some MW change broke it. In theory it'd be useful, but it might no longer be possible to make it work: which admins have deleted any revisions of a given page is logged publicly, but does anything let non-admins see which admins deleted which exact revisions? - -sche(discuss)06:05, 9 December 2024 (UTC)Reply
I believe the info is extractable from the API, for example . In the first log ID listed at the present moment (logid 53474838), we can see that Surjection hid the edit summary of revisions 82802609 and 82802607. This API response looks identical when I open it in an incognito window as a logged-out user. This, that and the other (talk) 10:17, 9 December 2024 (UTC)Reply
Neat. Well, if someone wants to make a version of this gadget that works, I would say it'd be "somewhat useful, but not necessary enough to be a priority". (In the meantime, to Ioaxxere's point, it seems like we could turn this gadget off, since it doesn't seem to do anything at all...?) - -sche(discuss)17:59, 9 December 2024 (UTC)Reply
@-sche: Good to know — in that case I'll remove it from the list of default gadgets and if there's no interest in fixing it after some time then we might want to remove it entirely. Ioaxxere (talk) 05:50, 11 December 2024 (UTC)Reply
Latest comment: 1 month ago2 comments2 people in discussion
This is an old template that shows up in Wiktionary:Todo/Lists/Template language code does not match header because it's not just used in Crimean Tatar entries. It's very simple: it takes positional parameters that are the endings for 5 cases (genitive, dative, accusative, locative, and ablative), and displays {{lang|crh|{{PAGENAME}}}} labeled as the nominative, and the other cases consisting of the same followed by the relevant positional parameter. It doesn't categorize, and it doesn't link to anything (except for "{{m|crh||{{PAGENAME}}}}" in the header, which of course doesn't do anything).
It's used in Turkish, Tatar, and Zazaki entries as well as Crimean Tatar ones, with what looks like 641 transclusions (there's at least one false positive due to {{desctree}}). Is this a problem? Or should we just add it to the excluded-template page for the Todo list? Chuck Entz (talk) 01:40, 9 December 2024 (UTC)Reply
@Chuck Entz This should be split into language-specific variants. For one thing, it (now) has actual links in it that use the crh code, meaning it will link to the wrong language for any other language. Benwing2 (talk) 08:07, 13 December 2024 (UTC)Reply
Temporary Accounts: technical help needed
Latest comment: 1 month ago1 comment1 person in discussion
As part of the rollout of Temporary Accounts, we are working to ensure that tools, gadgets, bots, user scripts, AbuseFilters, and any other community-owned code continue to work smoothly.
What are Temporary Accounts?
Temporary accounts are a new type of account for unregistered editors. Instead of showing IP addresses publicly, these accounts will assign temporary user IDs to logged-out editors. Tools that rely on IP addresses or workflows for logged-out users might need updates to function correctly. Temporary accounts are already live on some pilot wikis, with more pilot deployments planned for February and full rollout on all wikis in May.
How you can help
Check if code (whether on Toolforge or on wiki, that is: tools, gadgets, bots, or user scripts) you created or frequently use works on the wikis where temporary accounts are already active. Here is the list of content wikis, and here is the list of beta cluster and test wikis with temporary accounts.
If you notice a tool that might be impacted, we encourage you to try updating it based on our developer documentation guide.
Take a look at Abuse Filters used on your wiki. Any filter using IPs via user_name will no longer be able to do so. Those filters need to be updated to use user_unnamed_ip instead. A comment from our engineers: "The main use case should be if you try something like ip_in_range(s). Things that map to usernames should be broadly ok, as they’ll continue to map to temporary account names." If you have more questions about AbuseFilter, you may add a comment on the Phabricator ticket T369611.
If you find any issues or have comments or questions, let us know on the project talk page. You can also join the dedicated Discord thread for support and to share feedback to the team.
Latest comment: 1 month ago1 comment1 person in discussion
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Weekly highlight
Technical documentation contributors can find updated resources, and new ways to connect with each other and the Wikimedia Technical Documentation Team, at the Documentation hub on MediaWiki.org. This page links to: resources for writing and improving documentation, a new #wikimedia-techdocs IRC channel on libera.chat, a listing of past and upcoming documentation events, and ways to request a documentation consultation or review. If you have any feedback or ideas for improvements to the documentation ecosystem, please contact the Technical Documentation Team.
Updates for editors
Later this week, Edit Check will be relocated to a sidebar on desktop. Edit check is the feature for new editors to help them follow policies and guidelines. This layout change creates space to present people with new Checks that appear while they are typing. The initial results show newcomers encountering Edit Check are 2.2 times more likely to publish a new content edit that includes a reference and is not reverted.
The Chart extension, which enables editors to create data visualizations, was successfully made available on MediaWiki.org and three pilot wikis (Italian, Swedish, and Hebrew Wikipedias). You can see a working examples on Testwiki and read the November project update for more details.
Translators in wikis where the mobile experience of Content Translation is available, can now discover articles in Wikiproject campaigns of their interest from the "All collection" category in the articles suggestion feature. Wikiproject Campaign organizers can use this feature, to help translators to discover articles of interest, by adding the <page-collection> </page-collection> tag to their campaign article list page on Meta-wiki. This will make those articles discoverable in the Content Translation tool. For more detailed information on how to use the tool and tag, please refer to the step-by-step guide.
The Nuke feature, which enables administrators to mass delete pages, now has a multiselect filter for namespace selection. This enables users to select multiple specific namespaces, instead of only one or all, when fetching pages for deletion.
The Nuke feature also now provides links to the userpage of the user whose pages were deleted, and to the pages which were not selected for deletion, after page deletions are queued. This enables easier follow-up admin-actions. Thanks to Chlod and the Moderator Tools team for both of these improvements.
The Editing Team is working on making it easier to populate citations from archive.org using the Citoid tool, the auto-filled citation generator. They are asking communities to add two parameters preemptively, archiveUrl and archiveDate, within the TemplateData for each citation template using Citoid. You can see an example of a change in a template, and a list of all relevant templates.
Last week, all wikis had problems serving pages to logged-in users and some logged-out users for 30–45 minutes. This was caused by a database problem, and investigation is ongoing.
View all 19 community-submitted tasks that were resolved last week. For example, a bug in the Add Link feature has been fixed. Previously, the list of sections which are excluded from Add Link was partially ignored in certain cases.
Updates for technical contributors
Codex, the design system for Wikimedia, now has an early-stage implementation in PHP. It is available for general use in MediaWiki extensions and Toolforge apps through Composer, with use in MediaWiki core coming soon. More information is available in the documentation. Thanks to Doğu for the inspiration and many contributions to the library.
Wikimedia REST API users, such as bot operators and tool maintainers, may be affected by ongoing upgrades. On December 4, the MediaWiki Interfaces team began rerouting page/revision metadata and rendered HTML content endpoints on testwiki from RESTbase to comparable MediaWiki REST API endpoints. The team encourages active users of these endpoints to verify their tool's behavior on testwiki and raise any concerns on the related Phabricator ticket before the end of the year, as they intend to roll out the same change across all Wikimedia projects in early January. These changes are part of the work to replace the outdated RESTBase system.
The 2024 Developer Satisfaction Survey is seeking the opinions of the Wikimedia developer community. Please take the survey if you have any role in developing software for the Wikimedia ecosystem. The survey is open until 3 January 2025, and has an associated privacy statement.
Any possibility of allowing users to manage multiple books across sessions?
Latest comment: 1 month ago3 comments2 people in discussion
I recently tried to use the book creation function to compile a list of entries for personal use and was shocked to find out it isn't saved across sessions. Can i ask what the rational for this is?
If there is no underlying obstacle to this, i'd kindly request allowing users to compile and manage any number of books they desire. I've invested an ungodly amount of my time searching for this exact functionality across platforms and this is the only one that actually works as i intend. It's such a shame to see it ruined by such an apparently simple defect. 1ifemare (talk) 06:06, 10 December 2024 (UTC)Reply
@1ifemare I'm sorry to hear you had a bad experience with the book creator. When you're ready to save your book, you can click "Show book" and save the book into your userspace. I just saved a test book at User:This, that and the other/Books/test.
That's the tragedy. It was the absolute best experience i had with a tool capable of creating a custom online personal dictionary - out of dozens of others i scoured the web for and thoroughly tested, just to find they all have crippling limitations, or just outright don't offer the desired functionality.
I tried saving a book, but there doesn't seem to be a away to re-open it as a book inside the creator software. There also doesn't see to be a way to delete saved books, unless i missed something. 1ifemare (talk) 13:47, 10 December 2024 (UTC)Reply
palettize Galician, Portuguese, etc tables
Latest comment: 1 month ago10 comments5 people in discussion
On social media, I saw someone complaining that we were mistaken to list distinto as the short past participle of Portuguese distinguir, and in the process of looking into that, I noticed that all the conjugation tables on that page are bright-on-bright and could stand to be palettized, so I'm reporting that for tracking purposes. (Do we have a page or category for tracking "things that need to be palettized"?) - -sche(discuss)09:12, 10 December 2024 (UTC)Reply
@-sche I don't know of any central place for tracking these. If there is such a place, @Ioaxxere would know. I took a whack at palettizing the Spanish table; in the process the colors got a little more saturated and less pastel. Dunno if anyone will complain. I posted before and after side-by-side comparisons on Discord in the #general channel, but AFAIK you're not on Discord and I don't know how to post the pictures here at Wiktionary. Benwing2 (talk) 07:56, 11 December 2024 (UTC)Reply
@-sche @Ioaxxere Ugh, I managed to palettize all the major Romance languages + Latin (not yet Proto-Romance though) as well as all the minor ones up through R. There are so many minor Romance languages out there, each with their own slightly different table. Ioaxxere, all the tables go 100% across the screen, which might not be ideal. Please take a look at Module:roa-verb/style.css; maybe there is a fairly simple way of fixing this. Unfortunately I don't understand the nuances of CSS well enough to know what to do here. Benwing2 (talk) 10:00, 12 December 2024 (UTC)Reply
As for the second point, NavFrames go all the way across the screen by default, but you can limit it with max-width: 40em or something like that. But that would make the Romance conjugation tables extremely cramped so I don't think there's any reason to do that. Ioaxxere (talk) 00:56, 15 December 2024 (UTC)Reply
Thanks for converting the templates, and apologies for using an opaque term, respectively. (It was shorter than ~"convert to using Palette.css colors".) - -sche(discuss)22:19, 12 December 2024 (UTC)Reply
I guessed at a meaning like that, but there's always the risk that GP discussions with consequences to the less tech savvy will be opaque to them. BTW, I didn't find use in Google Books of palettize/ise except a modest amount possibly in the sense in our entry and as many as a misspelling of palletize/ise. DCDuring (talk) 23:45, 12 December 2024 (UTC)Reply
head with added diacritics
Latest comment: 1 month ago1 comment1 person in discussion
Subject: For languages, where _1. the written words in books are without diacritics and _2. they are also presented in wiktionary and other dictionaries with extra diacritics.
At the moment, en.wiktionary presents the word type_1 at the title of the page. But only type_2 as headword, Template:head. This is very confusing for people who are not aquainted with a particular language: what is the word they should use and copypaste? e.g. italian with added accents, russian with added accents, ancient greek with added prosodies (breve and long), arabic (with vowel diacritics) etc. I think the head should be clearly presented both in its pure spelling as we see it in printed books and in parenthesis with its altered/auxiliary/fiddled spelling+tooltip (with added diacritics / with added so-and-so diacritics). Whenever I need to copy a word, I have to go to the title to copypaste it. For languages I do not speak, I never know how it is supposed to be written. Please help. Thank you. ‑‑Sarri.greek♫I16:29, 10 December 2024 (UTC)Reply
Categories for year of attestation
Latest comment: 18 days ago6 comments6 people in discussion
It could be, if the editorial certainty about first uses were good. But categories would nudge us to shoehorn our adequately vague datings into them. And none might even apply to non-European languages even if we tried earnestly with the effort I consider disproportionate; I believe that we won’t get together a handful in Arabic, which I edit; people from the censored Chinese internet should voice their opinion about what they typically know about the creation ranges of non-artificial terms, if this isn’t verboten at their location as well, and whether this would be useful there. We shouldn’t base such a decision solely on English, and many American even admins around here are competent to see other languages. Fay Freak (talk) 17:40, 10 December 2024 (UTC)Reply
@Angrythewikipedian For reference, @Vininn126 wants to add something similar based on {{etydate}}, but (a) per-language and (b) based on ranges of a century or so, not individual years (which are much harder to attest and are "slippery" because it's common to find a quote that pushes back the earliest attestation from what it was previously thought to be). Benwing2 (talk) 08:00, 13 December 2024 (UTC)Reply
Add language codes to Module:wikimedia languages/data
Latest comment: 1 month ago2 comments2 people in discussion
The following language codes and corresponding should be added as they currently don't appear in many places despite having valid ISO codes:
'bfw' => 'Bonda/Remosam/ବଣ୍ଡା',
'gju' => 'गुज्जरी/Gujari/Gojri',
'hoc' => 'Ho/𑢹𑣉𑣉 𑣎𑣋𑣜',
'kgg' => 'Kusunda/Gemehaq gipan'
70.55.18.3003:38, 11 December 2024 (UTC)Reply
I suspect that if both wikis want these to be crosslinked, they would both need to create redirects from the other's form. (We used to have to do this to accommodate fr.Wikt and en.Wikt using different apostrophes; that is now handled by considering the different forms of apostrophe interchangeable in the automatic linking function itself, but I don't think the same can be done for capital vs lowercase letters.) So sv.Wikt would create a redirect at ] (to ]), as they appear to have already done years ago, and we would do the reverse, creating a redirect at ]. I would hope that if official sv.Wikt policy is to capitalize and include punctuation, everyone here would be fine with allowing redirects from sv.Wikt's forms, to enable interwiki linking, but perhaps others can speak up. - -sche(discuss)16:15, 11 December 2024 (UTC)Reply
is a full sentence, therefore it ends with a dot and begins with a capital letter. On the page sv:crack
(vardagligt) /rent/ kokain
is NOT a full sentence and therefore it does NOT end with a dot and does NOT begin with a capital letter.
There is a very sane and useful point with the Swedish system: a definition is a replacement of the word defined:
The train was hauled by an old locomotive.
Definition style of Swedish wiktionary:
locomotive -- rail vehicle equpipped with a motor intended to pull trains
After replacement:
The train was hauled by an old rail vehicle equpipped with a motor intended to pull trains. (works)
Definition style of English wiktionary:
locomotive -- A rail vehicle equpipped with a motor intended to pull trains.
After replacement:
The train was hauled by an old A rail vehicle equpipped with a motor intended to pull trains.. (broken)
Also, if uppercase and dot are to get dropped for the sake of simplicity, then the comma must get dropped too. Then an eye for an eye, a tooth for a tooth is a technically incorrect/dysfunctional URL and must be moved to an eye for an eye a tooth for a tooth. It is inconsequent and useless to keep commas and at same time drop dots in proverbs.
The English Wiktionary uses full sentences in the same way a physical dictionary would. While the Swedish Wikt formats them in a way where you can replace the word in the sentence for the definition given, the English Wikt formatting works perfectly for actually answering "what is an X?" or "what does X mean?"
{{suffixsee}} etc. don't link to language subsections
Latest comment: 1 month ago5 comments4 people in discussion
List-based templates powered by Module:affix/templates such as {{suffixsee}}, {{prefixsee}}, etc. create lists where the entries don't link to the appropriate language subsection, but rather to the entire page. For example:
You would expect e.g. "spil" to link to spil#Dutch, but it just links to spil. Even further, you would expect it to link to the appropriate sense, as indicated by the id.
@Stujul The problem here is that the above list is not actually generated directly by the code in Module:affix/templates; rather, it's invoking the categorytree parser function, which is a special feature of MediaWiki itself that provides this functionality but unfortunately (AFAIK) doesn't support tweaking the generated links to include the appropriate anchros. I think we'd have to do this using a gadget (which should be fairly easy to write; pinging @Ioaxxere, @Surjection or @Erutuon if they happen to have time on their hands). Maybe alternatively we could generate the list ourselves and stick it in a NavBox or something; unfortunately I'm not sure we have dynamic access to the contents of a given category. (We do have access to the number of pages in a category, but that is a different ball of wax.) Benwing2 (talk) 07:58, 13 December 2024 (UTC)Reply
I don't think we can access category members. Implementing a gadget for this is doable, but needs some thought, since we wouldn't want to duplicate catfix code. — SURJECTION/ T / C / L /10:30, 13 December 2024 (UTC)Reply
Latest comment: 1 month ago3 comments3 people in discussion
I can see why the entry creator wanted to do it this way, but our template infrastructure can't handle slashes in pagenames. Also pinging @Fay Freak as the only regular editor I can think of who might know anything about Proto-Cushitic. Chuck Entz (talk) 15:50, 11 December 2024 (UTC)Reply
Because abuse filters have no way to tell whether an external link added by a new or anonymous user is to a reliable journal. They don't have access to templates or modules, and they can't afford complicated logic because they run every time any page is accessed anywhere on Wiktionary, so they have to be very fast. They also can't access any information about anonymous editors except for their IP addresses, so they have no way to tell you're not a one-time hit-and-run spambot- and trust me, they catch lots of spam edits all the time. It's a shame that they also stop a few occasional good edits in the process, but there's not much we can do about that. At any rate, I was able to see your attempted edit in the logs, so I went ahead and created the template for you. Chuck Entz (talk) 03:27, 13 December 2024 (UTC)Reply
Latest comment: 1 month ago2 comments1 person in discussion
Ive been working at the Module:ine-nominals a bit, hopefully nothing too offensive, adding a few Categories for previously not-recognized stem types (nouns of which types were classified as root nouns), but there remain a few entries in the root noun categories which are in fact formally indistinguishable from root nouns, like *h₃dónts (*h₃dent- would be a perfecly fine root afaik) and only morphological analysis tells us this contains a -ónts. Therefore I would like to request a new parameter stem type which primarily interacts with the table.insert(data.categories operator to override any erroneous placement in the else-category (that is, root nouns).
The reason I'm asking is because adding a parameter is above my technical competence, I hope someone with Lua-knowledge can easily solve this.
If you have problems understanding my request, do ask. Suryaratha03 (talk) 19:30, 13 December 2024 (UTC)Reply
@This, that and the other: from what I understand, the template is supposed to ignore templates like {{l}} and {{vern}}, and markup like “]”, but an alphabetical sort should place “impeevishedobsolete” before “impeevishobsolete” so I’m wondering why that doesn’t happen. — Sgconlaw (talk) 04:44, 14 December 2024 (UTC)Reply
@This, that and the other: sorry, my mistake—I see what is happening now. The template is taking the qualifier “obsolete” into account when sorting, whereas if there were no qualifiers impeevish would appear before impeevished. I agree that qualifiers should be ignored when sorting. — Sgconlaw (talk) 05:17, 14 December 2024 (UTC)Reply
@Sgconlaw: Agreed, but {{col3-u}} comes in handy when all else fails. I have a problem with St for saint, which gets tangled up with everything else beginning with St when sorting. Our templates can't cope with that. DonnanZ (talk) 17:21, 14 December 2024 (UTC)Reply
┌────────────────────────────────────────────────────────────────────────────────────────────────────┘ @Fenakhay has pointed out a useful solution, which is to type "impeevish<qq:obsolete>". It seems that tags in angle brackets are ignored by the template for sorting purposes. Thanks! — Sgconlaw (talk) 18:32, 15 December 2024 (UTC)Reply
Mobile view
Latest comment: 1 month ago8 comments3 people in discussion
I seem to have selected mobile view somehow by accident ( on Friday the 13th, of all days). I can enlarge the page text and the width, but everything else has shrunk, including images and search box. How can I reset it for my full screen? Wikipedia is OK, no problem. DonnanZ (talk) 15:06, 15 December 2024 (UTC)Reply
Well, you can go to Mobile view and then switch back from that to Desktop. But I still get small text, which I can enlarge with the spectacle view at the top, as well as page size. But I didn't knowingly click on Mobile view, I probably clicked on Control by accident (instead of Shift) and a letter key. Editing this page has to be small text, not large, as before. DonnanZ (talk) 18:12, 15 December 2024 (UTC)Reply
@Donnanz: what device are you editing on? If on a Mac, you can usually increase or decrease the text size of a browser window by pressing Command+ or Command- (no need to also press the Shift key). On a PC the key corresponding to Command is either Ctrl or Alt—offhand I can't remember which. — Sgconlaw (talk) 18:30, 15 December 2024 (UTC)Reply
I have a PC built by my local computer shop years ago, now rebuilt by them with a solid state hard drive, and now with a wide screen, and with standard UK keyboard. I'm not sure if I have to press another key with Ctrl or Alt. DonnanZ (talk) 18:58, 15 December 2024 (UTC)Reply
Latest comment: 1 month ago4 comments4 people in discussion
Category pages such as Category:Proto-Italic lemmas now give a list of forms such as "Reconstruction:Proto-Italicad, Reconstruction:Proto-Italicad-" etc., where "Reconstruction:Proto-Italic" is smooshed together with the reconstructed term. This seems like a bug: for readability, I'd expect at least a "/" separating them, although probably in this context it would be reasonable to just leave out "Reconstruction:Proto-Italic/" altogether and use an asterisk in its place. Urszag (talk) 17:05, 15 December 2024 (UTC)Reply
Label and categories for concepts originating in Japan
Latest comment: 1 month ago1 comment1 person in discussion
there are several words for Japanese concepts in English and other foreign languges that require categorisation or recategorisation, as they are mostly just found inside the general Category:Japan. below are some recommendations for labels and categories that should be created:
Latest comment: 1 month ago5 comments3 people in discussion
I have noticed that the menu list of languages on top of each search item has suddenly disappeared? What happened and why? Is there a technical problem? I'm quite devastated, as those listings are very helpful and time saving, that you get an overview of all languages involved and may easily go straight to your preferences instead of having to scroll through every single language in the list. I hope it's just a technical issue, that it's soon to be in place again? Bemland (talk) 08:07, 16 December 2024 (UTC)Reply
I'm pretty sure they are talking about the table of contents as the interwikis were on the left (not the top) before going to the top right.
It's not a technical issue. This was forced on Wiktionary by the WMF some time ago. Not sure where the table of contents went after the switch but if you go to your user preferences (top right), you can switch back to the old skin (Vector 2010) and the table of contents will come back. 115.188.138.10513:08, 16 December 2024 (UTC)Reply
The top right is for Wiktionary in other languages. On the left any other languages in English Wiktionary are listed (e.g. France), as long as you haven't hidden the Main menu and Contents. In addition, you can "Switch to old look" in the Main menu. DonnanZ (talk) 13:58, 16 December 2024 (UTC)Reply
Thank you both very much, it worked fine reversing the skin back to 2010 version and that list of content top left returned, grateful of that! Hard to understand why its considered better without that list, but good that it's reversible. Still, newcomers don't know about that option, so maybe one should give some short info on that? Best wishes to you! Bemland (talk) 21:58, 16 December 2024 (UTC)Reply
desctree errors in case of multiple etymologies
Latest comment: 1 month ago2 comments2 people in discussion
As pointed out by Rua five years ago, the desctree template simply picks the first descendants section it can find, which can be wrong when there are multiple sections. One example (before I copy/pasted the correct descendants) was at Latin ferus for the Old French descendants tree, old version here.
I've also noticed that desctree picks any alternative forms (using 'alt') it can find, also when this is in another etymology section than the descendants. Example at ferrum: Old French 'fier' is not an alternative form of "iron".
Just an idea, but would some interaction with (the id's of) the etymon template be possible? Exarchus (talk) 14:23, 16 December 2024 (UTC)Reply
@Exarchus: Yes, this is a situation I've predicted because no template can be smart enough to actually understand a complex entry like the one you linked. I was working towards integrating the descendant trees with {{etymon}} but there wasn't a ton of enthusiasm and it is very complex to implement. Ioaxxere (talk) 14:37, 16 December 2024 (UTC)Reply
Multiple inline examples
Latest comment: 27 days ago5 comments5 people in discussion
I wish to request a method to create multiple inline examples in examples inside the {{ux}} and {{co}} templates.
below is an example from time (permalink), where editors simply hacked it with non-breaking spaces. this is likely not great as it is a static width and may be not be understood semantically by screen readers.
{{ux|en|Roman '''times'''; the '''time''' of the dinosaurs; how things were at that '''time'''; how things were in those '''times'''}}
Roman times; the time of the dinosaurs; how things were at that time; how things were in those times
TBH, I much prefer just keeping multiple examples like that on separate lines. Sure, it takes up more vertical space, but I think it looks cleaner and is far easier to skim and digest the different examples. Andrew Sheedy (talk) 22:34, 20 December 2024 (UTC)Reply
I don’t mind if several short collocations are placed on the same line as it saves space, but don’t think we should allow for, say, more than three, or allow a string of collocations that flows on to a second line. I have been doing this in some cases by adding three em spaces between the collocations. — Sgconlaw (talk) 20:05, 21 December 2024 (UTC)Reply
Latest comment: 1 month ago1 comment1 person in discussion
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Weekly highlight
Interested in improving event management on your home wiki? The CampaignEvents extension offers organizers features like event registration management, event/wikiproject promotion, finding potential participants, and more - all directly on-wiki. If you are an organizer or think your community would benefit from this extension, start a discussion to enable it on your wiki today. To learn more about how to enable this extension on your wiki, visit the deployment status page.
Updates for editors
Users of the iOS Wikipedia App in Italy and Mexico on the Italian, Spanish, and English Wikipedias, can see a personalized Year in Review with insights based on their reading and editing history.
Users of the Android Wikipedia App in Sub-Saharan Africa and South Asia can see the new Rabbit Holes feature. This feature shows a suggested search term in the Search bar based on the current article being viewed, and a suggested reading list generated from the user’s last two visited articles.
The global reminder bot is now active and running on nearly 800 wikis. This service reminds most users holding temporary rights when they are about to expire, so that they can renew should they want to. See the technical details page for more information.
The next issue of Tech News will be sent out on 13 January 2025 because of the end of year holidays. Thank you to all of the translators, and people who submitted content or feedback, this year.
View all 27 community-submitted tasks that were resolved last week. For example, a bug was fixed in the Android Wikipedia App which had caused translatable SVG images to show the wrong language when they were tapped.
Updates for technical contributors
There is no new MediaWiki version next week. The next deployments will start on 14 January.
Latest comment: 1 month ago3 comments3 people in discussion
These are chronologically-based "families" that are used to get around near-complete lack of information on specific Iranian lects from these time periods. until sometime yesterday, the {{auto cat}} modules allowed derivation categories based on them, but now we have 51 of them in CAT:E. I would note that the etymology templates still accept them. Chuck Entz (talk) 14:41, 18 December 2024 (UTC)Reply
This was mostly my fault; I removed the entries from the canonical name-code mappings, because Module:data consistency check said these codes did not actually exist. Turns out that module just doesn't support the setup. I restored them some time ago and the errors are going away. — SURJECTION/ T / C / L /22:18, 18 December 2024 (UTC)Reply
IMO we should just be using (perhaps "etymology-only") language codes for this, like we do for substrates, etc, particularly because we're de facto treating it as a language when we reconstruct terms in it and have entries say something is from "Middle Iranian *foobar". Cf this and the replies by others. - -sche(discuss)06:27, 19 December 2024 (UTC)Reply
triple braces flagging bug?
Latest comment: 30 days ago11 comments6 people in discussion
If you look at the code there, there are several instances of triple braces to show either 'color' or 'colour', or 'gray'/'grey'. But I can't tell you more about how this should be fixed. Exarchus (talk) 13:56, 21 December 2024 (UTC)Reply
@Exarchus: ah, I see. Those were already in the page before I worked on it. It seems like they are supposed to allow for alternative displays of the spellings of certain words depending on readers' preferences, but I don't know if they actually work. Can someone confirm this? — Sgconlaw (talk) 16:05, 21 December 2024 (UTC)Reply
I couldn't make that structure work in my sandbox. I suspect that this may be something that works on other wikis and/or once worked here. Instances should probably be replaced by wikitext that function properly. DCDuring (talk) 18:28, 21 December 2024 (UTC)Reply
It was a case where someone was improperly copying a template, {{gmh-decl-pronoun}}, into an entry. There should never be triple braced template arguments in entries because entries aren't transcluded as templates, and template arguments have no useful effect otherwise. I'm not sure why the template was copied into the entry. Ideally it would be used in the entry. — Eru·tuon20:29, 21 December 2024 (UTC)Reply
@DCDuring They work everywhere, but there's not much point in them if the page isn't intended to be transcluded like a template. The reason they get flagged by the abuse filter is because they strongly suggest someone is blindly copying a template somewhere they shouldn't be (i.e. exactly what happened here). Theknightwho (talk) 21:26, 21 December 2024 (UTC)Reply
Well, they work in the sense that all parameters are empty, and syntax that provides a value when a parameter is empty will always provide that value. Of course, the parameter-empty value without all the template syntax produces exactly the same result and you don't need to know template code. With the colors appendix, the code was set to provide the UK variants if the parameters for the US variants were empty, so the code accomplished nothing: "Col{{{or|our}}}" is just a very complicated way way of writing "Colour". If you had a page in your userspace with the code "{{Appendix:Colors/Colour grouping|or=or}}", it would display "color" instead of "colour" in those places, but I'm not sure it's worth the unreadble wikitext on the appendix page. Chuck Entz (talk) 21:59, 21 December 2024 (UTC)Reply
Latest comment: 21 days ago13 comments3 people in discussion
Hi. Could someone take a peek at the column formatting on this appendix please. Whole sentences have been split rather than maintained in their correct column. Thanks. -- ALGRIF talk11:10, 20 December 2024 (UTC)Reply
Edit: I think the column template used originally is deprecated. There should be two columns; one with the entry and the other with notes or special examples as needed. I cannot find how to do that now. Acceptable alternative would be simply to make the entries in 2 cols. Thanks. -- ALGRIF talk12:15, 20 December 2024 (UTC)Reply
@Sgconlaw: If I can fix the table format myself, I'm quite happy to do it. However, I do not clearly understand in the Templates how to go about it. Perhaps I'm looking in the wrong place. I dunno. Help of some sort is required, please. Thanks. -- ALGRIF talk12:36, 28 December 2024 (UTC)Reply
FWIW, it would be nice if the items were all in a single sortable table, with the grammatical features that led to separate tables being a column. That would allow one to see all the grammatical structures for a single verb to be visible in one screen. It may well be too hard to accomplish this gracefully. DCDuring (talk) 18:41, 29 December 2024 (UTC)Reply
I'm saying that the table titles ('Followed by a to-infinitive', 'In the passive voice followed by a to-infinitive', 'Followed by a gerund-participle', 'Followed by a to-infinitive or a gerund-participle' ('No difference in meaning', 'Difference in meaning'), 'Followed by a bare infinitive', 'Followed by and') can be deployed as column entries under the column heading "Followed by" (to-infinitive, gerund-participle, bare infinitive, and). Does that make it clearer?
With a sortable table, it would be possible to recover most of the existing structure immediately by sorting on the attribute I propose. The balance ('(no) diff. in meaning', and 'to-inf. or gerund-participle') would be apparent by sorting on the verb. If necessary, whether or not there is a change in meaning can be shown by giving glosses for verbs where the issue arises, possibly in the 'Remarks' column. DCDuring (talk) 00:11, 30 December 2024 (UTC)Reply
@DCDuring: oh, I see. Theoretically, it is possible to combine all the sections into a single table but, given the amount of data, visually the columns would become very narrow and the text within each column would begin to break across lines in weird ways. (There is markup to make a table sortable by columns, though offhand I don’t remember what it is as I haven’t used it for a while. It’s explained at w:Help:Table.) I guess one can experiment and see if the output looks acceptable. — Sgconlaw (talk) 05:05, 30 December 2024 (UTC)Reply
mw:Help:Sortable tables has the details. I found it only modestly more complicated than the various other table templates. IMHO, it is better suited to the kind of content we sometimes have in Appendix space than the table types we use in principal namespace. DCDuring (talk) 14:36, 30 December 2024 (UTC)Reply
It is true that such tables are not great for long text, but as can be seen from item 2 in this table content stays in the column in which it belongs. If long text is in only a few rows, it could be consigned to a footnote. DCDuring (talk) 14:43, 30 December 2024 (UTC)Reply
@Sgconlaw: I've set up a sample table for you on the page that you can add information to. -- Many thanks. I think I can work with this formatting. TAIAP it does what I am looking to do. ALGRIF talk11:04, 30 December 2024 (UTC)Reply
I'm requesting permission to edit Module:ja-pron. To solve the issue of Japanese pitch accent in compound words, I've written a proof-of-concept template, Template:ja-acc-multi, that relies on ja-pron (similarly to how Module:ja-ojad relies on ja-pron), and right now its output looks like:
is there a preferred way of dealing with this at wiktionary? do we no-link it with |nolink=1, or do we do something like |head=]]? ragweed theatertalk, user23:33, 23 December 2024 (UTC)Reply
With any headword template (I think) you can use "head=" to customize appearance and linking whenever there is a good reason to do so {{en-noun|head=socio-political}} might do what you want. (We wouldn't usually put in a space, but it does clarify matters a bit.) DCDuring (talk) 03:03, 24 December 2024 (UTC)Reply
"template loop"
Latest comment: 24 days ago4 comments4 people in discussion
Warning: This page calls Template:l which causes a template loop (an infinite recursive call).
It sounds quite disastrous, but actually the page appears to edit and load normally. I don't know what the message refers to (I can't see any use of such a template) or how to fix it. Mihia (talk) 20:23, 26 December 2024 (UTC)Reply
Template:l The page is NOT broken. This type of error can apperar in some cases when editing and previewing a section, or you make the page transclude itself. It it not malicious until you really break a heavily-used template in that way ;-) Taylor 49 (talk) 21:39, 27 December 2024 (UTC)Reply
I wanted to make per-PoS bullet points in an ety section for clarity, and have the cognates left-aligned with the bulleted text. Mihia (talk) 21:53, 29 December 2024 (UTC)Reply
@Mihia: I think there should be a discussion on whether this is a good idea first. Maybe you could give an example of what you propose. (I have a feeling that a collapsible table created using {{rel-top}} and {{rel-bottom}} cannot by itself be indented. One way to do so would be to enclose the cognates table in another table and adjust the width of the leftmost column to achieve an apparent indentation, but this is quite fiddly and the visual result may vary depending on the reader's browser, which is not desirable.) — Sgconlaw (talk) 22:20, 29 December 2024 (UTC)Reply
I wouldn't have thought that using bullet points required a prior discussion. It's not exactly an outlandish idea that we don't already employ freely. I don't really want to mess around with embedding in hand-crafted tables. If it can't be done easily or neatly then I won't bother. Mihia (talk) 22:33, 29 December 2024 (UTC)Reply
@Mihia: maybe a workaround is to have only one cognates table, but subdivide the table using headings. For example, you can type "; Nouns" or "* '''Nouns'''" within the table—in the latter case, the cognates themselves would then be preceded by two asterisks. I'm not sure how this works with the automatic column formatting in the table, though. — Sgconlaw (talk) 23:13, 29 December 2024 (UTC)Reply
Thanks, combining cognates into one table wouldn't work -- or, I mean, wouldn't be applicable to what I was attempting, which is to create one bulleted section for each PoS containing the explanatory text followed by the cognates for that PoS. Mihia (talk) 18:22, 30 December 2024 (UTC)Reply
@Mihia To indent, you just have to use an extra asterisk. I took the liberty of editing your sample table to illustrate what that would look like. Hope that helps. Andrew Sheedy (talk) 07:10, 3 January 2025 (UTC)Reply
Thanks, but this is not what I am trying to do. Obviously I am not explaining it clearly. I want the whole of the table to be indented, leaving the contents themselves unchanged, so that the left-hand outer border of the table is left-aligned with indented bulleted text. I am trying to achieve what you might hope the following would achieve, except it doesn't work:
Latest comment: 17 days ago10 comments4 people in discussion
@Benwing2: Are Roman script roots intentionally required to contain special characters? For example, when I display terms with root su with ID hear, {{rootsee|pi|pi|su (hear)}} happily displays
The ID-less invocation at bhar was similarly broken by the rewriting for {{rootsee}}.
In terms of code, my problem is a block in export.rootsee in Module:rootsee that prefixes Latin script roots consisting only of letters with '-' for Navaho and suffixes with '-' otherwise. Would it be in order to make Pali an exception for which no hyphen is added?--RichardW57 (talk) 18:10, 30 December 2024 (UTC)Reply
@AryamanA, Svartava: Notifying those known to have worked on templates recording Pali roots ({{pi-verb}}, {{pi-root}}. I am intending to propose that we only record the roots of words for the Wiktionary-primary script, which for almost all words will be the Roman script. This will affect the usage of {{pi-verb}}. (There are a very few words which will not occur in the Roman script.) --RichardW57 (talk) 08:57, 31 December 2024 (UTC)Reply
@RichardW57: This seems to be because of the convention of Latin script reconstructions in PII, PIE, etc. where root entries end with a hyphen. I've added |nohyphen= as temporary but inconvenient solution; see Module:rootsee. Svārtava (tɕ) 09:22, 31 December 2024 (UTC)Reply
Reconstructions get asterisks, and the current logic seems to allow for their omission in reconstructed languages. I'm assuming that hyphens are common in attested Roman script languages - if only there were time to adequately document programming assumptions! --RichardW57 (talk) 09:32, 31 December 2024 (UTC)Reply
@Benwing2, Svartava: If we can't control it by source language, I think a better option than |nohyphen= would be |hyphen= with values 'before', 'after' (default) and 'none'. Perhaps |hyphen_pos= would be better - it's a trade-off between obscurity and verbosity. --RichardW57 (talk) 15:38, 2 January 2025 (UTC)Reply
Latest comment: 19 days ago1 comment1 person in discussion
Every Latvian declension table on here is ordered incorrectly where Accusative is placed after Nominative. Accusative should be between Dative and Instrumental. I've already gone through and fixed (or made edit requests) on several other regional Wiktionaries that had copied from here, but here all of them are protected. I initially made a post on the Noun declension talk page, but here is the list of tables that should be fixed:
Update, seems like I now had enough edits with this to become autoconfirmed, so I was able to fix them myself now! –EdnessP (talk)
{{vern}} and Old High German (two unrelated matters)
Latest comment: 19 days ago1 comment1 person in discussion
I've noticed that {{vern}} automatically converts ' apostrophes to curly ’ apostrophes in the links it forms on Wikipedia.
E.g. devil's bit scabious leads to the nonexistent page Devil’s bit scabious while non-curly Devil's bit scabious does exist as a redirect on Wikipedia.
Is there some way that {{vern}} could be agnostic about whether the Wikipedia page it connects to has a curly or non-curly apostrophe? This is likely to be a perennial issue with vernacular names of organisms that have apostrophes.
I've also noticed that Jberkel's list of requested items for Gothic indicates Bardilo and Bardzila as sources for -ilo. It looks to me like this is due to an Old High German word appearing in the etymologies of both. Could this mean that somewhere in the templates/modules for OHG, the parameter got has been used instead of goh? I've noticed Cornish terms get sorted into the equivalent Welsh list for precisely this reason.
I don't know what color-scheme: light helps. In addition, the filter should do better work with the colors, e.g. something like invert(1) brightness(55%) contrast(250%) hue-rotate(180deg). — SURJECTION/ T / C / L /21:58, 1 January 2025 (UTC)Reply
@Surjectioncolor scheme: light prevents dark mode overrides from happening. It's not strictly necessary here as the style theme-night has been disabled at MediaWiki:Wikimedia-styles-exclude. Also, your filter (invert(1) brightness(55%) contrast(250%) hue-rotate(180deg)) seems to work a lot better so you (or the deciding IA) can add that instead Matrix (talk) 13:10, 2 January 2025 (UTC)Reply
The problem with the English branch of the tree was due to our Middle English entry having only one of the three etymologies that MED showed, so that {{desctree}} drew from that one. I added a second etymology and used {{etymid}} to direct {{desctree}} to the right one. That fixed the English branch- I hope that was the only one. Chuck Entz (talk) 06:48, 4 January 2025 (UTC)Reply
Support for showing both name and pseudonym of quote authors
Latest comment: 11 days ago10 comments4 people in discussion
Quite a lot of entries have quotes from people writing/speaking pseudonymously, and many of those authors' names are widely publicly known (sometimes more widely known than a particular pseudonym, sometimes not as widely known as the pseudonym); even in cases where the name isn't widely known, it'd still be useful to know that the quote is pseudonymous. Currently, our quote templates have just a plain |author= parameter (and the |author1=, |author2=, etc., parameters, but those are set up for adding additional authors, not for adding additional names of a single author, and they themselves have the same problem |author= itself does in the event that one of those coauthors happens to be writing or speaking pseudonymously), forcing us to either (author examples chosen to have one with a Wikipedia article under his pseudonym, one with a WP article under his real name, and one with no WP article, for illustration's sake):
List only the pseudonym (frex |author=w:qntm, |author={{w|James Madison|Publius}}, |author=Drachinifel), omitting the name entirely (except as a link target iff the author has a WP article under their real name);
List only the real name (frex |author={{w|qntm|Sam Hughes}}, |author=w:James Madison, |author=Alexander Pocklington), omitting the pseudonym entirely (except as a link target iff the author has a WP article under their pseudonym);
List either the pseudonym or the real name, depending on which is better known (frex |author=w:qntm or |author={{w|qntm|Sam Hughes}}, |author=w:James Madison, |author=Drachinifel), a determination which is not necessarily easy or simple to make and which would lead to inconsistent treatment of different authors (and would require editing the quote's |author= parameter if which name is better known changes);
List the pseudonym and manually add the real name in parentheses (frex |author={{w|qntm|qntm (Sam Hughes)}}, |author={{w|James Madison|Publius (James Madison)}}, |author=Drachinifel (Alexander Pocklington)), which is clunky for whoever's filling out the quote template (especially if linking to a WP article on the pseudonymous author, due to the need to manually pipe the link in question) and prevents the use of w: syntax for linking to any WP article about the author (forcing the use of the bulkier {{w|blah blah blah}} syntax);
List the real name and manually add the pseudonym in parentheses (frex |author={{w|qntm|Sam Hughes (qntm)}}, |author={{w|James Madison|James Madison (Publius)}}, |author=Alexander Pocklington (Drachinifel)), which has the same problems as option 4; or
List either the pseudonym or the real name, depending on which is better known, and manually add the other in parentheses (frex |author={{w|qntm|qntm (Sam Hughes)}} or |author={{w|qntm|Sam Hughes (qntm)}}, |author={{w|James Madison|James Madison (Publius)}}, |author=Drachinifel (Alexander Pocklington)), which combines the problems of options 3 and 4.
@Benwing2, Whoop whoop pull up: I think it would be fine to have a |pseudonym= parameter. Note the following:
Sometimes it is known that a name is a pseudonym, but the real name is not known: "John Doe ".
Sometimes, Wikipedia has an article under the pseudonym (generally when the author is known under it), and sometimes the article is under the author's real name. Thus, the link to the Wikipedia article could be to either the real name or pseudonym.
There are instances where two or more authors write together using a single pseudonym: "John Doe ".
There may be situations where it is suspected, though not known for sure, that a name is a pseudonym. We may want to indicate it as "Richard Roe ".
In China PRC media, an editor named huaxia is very frequently listed as editor-see . This is a patriotic pseudonym based from the ancient name of China in all likelihood. How would this template deal with these cites? Geographyinitiative (talk) 23:36, 6 January 2025 (UTC)Reply
@Whoop whoop pull up @Sgconlaw If/when I get around to implementing this, I will probably implement it as an inline modifier attached to the author, rather than a separate parameter. The reason for this is that the |author= parameter (as well as several other parameters like |editor=, |tlr=/|translator=, |coauthors=, etc.) can take multiple semicolon-separated authors, each with attached inline modifiers, so it gets tricky to have a separate |pseudonym= parameter along with e.g. multiple authors in |author=. I'm not sure exactly how it would work but it will be implemented in the generic author-handling code so it applies to all author-like parameters. Maybe it will be something like |author=w:Stephen King<pseudonym:Richard Bachman> to display "Stephen King " or |author=w:George Sand<realname:Amantine Lucile Dupin> to display "George Sand " or something. To support cases where the real name isn't known, you could write |author=w:Banksy<realname:-> to display "Banksy " or |author=w:Elena Ferrante<realname:?> to explicitly display "Elena Ferrante " or similar. The value of the realname: and pseudonym: parameters will likely use the same syntax as authors themselves, so you could write e.g. |author=w:H. Bustos Domecq<realname:w:Jorge Luis Borges; w:Adolfo Bioy Casares> to display "H. Bustos Domecq ", with multiple real names, each linked to Wikipedia, while the pseudonym is also linked. The case of a likely pseudonym could be indicated as |author=w:Richard Roe<pseudonym?> or something. If the desired author name isn't the same as the Wikipedia article, this syntax would require you to write a piped link like |author=w:]<real name:w:William Everett Cook> or similar. This latter syntax is a bit awkward but hopefully it won't occur so often. Benwing2 (talk) 02:09, 9 January 2025 (UTC)Reply
Latest comment: 8 days ago3 comments1 person in discussion
(Notifying Aryaman): , @svartava, Benwing2, AryamanA: I'm currently documenting some Pali roots. In the course of this, I've found that different authors uses different names for the same root. An immediate case in point is the root of pāpuṇāti, for which the native name (at least in one edition of the Dhatupatha) is apa, influencing Warder and Buddhadatta to call it ap , while Duroiselle and Collins, possibly under the influence of its Sanskrit forbear आप्(āp), call it āp. I therefore want one of cat:Pali terms belonging to the root āp and cat:Pali terms belonging to the root ap to function as a soft link to the other. I have put appropriate text in the former, but what should I do to preserve it? (This has been covered before, but I'm not good at finding past posts. Besides, it is conceivable that a hard link might be the appropriate solution.) As things stand now, the category and its content will be deleted because the category is empty. --RichardW57 (talk) 17:24, 5 January 2025 (UTC)Reply
As there's been no response and the edit to the category is now 6 days old, note that the content after {{auto cat}} in the former is:
:{{m|pi|āp|pos=root}} is another name for {{m|pi|ap|pos=root}}, and any items placed in this category should instead be placed in ].
: --RichardW57 (talk) 11:16, 12 January 2025 (UTC)Reply
Number conversion to Devanagari not working in quotations module
Latest comment: 15 days ago4 comments3 people in discussion
This is something that used to work, but now it doesn't. Take for example the quotation at सम्राज्: when I click on "6.68.9", it should go here, but instead it goes here, so without the Devanagari numbers. It looks like this is related to @Theknightwho's changes from December 15 at the Quotations module. Exarchus (talk) 20:29, 5 January 2025 (UTC)Reply
@Exarchus This is fixed. In this case, that change actually exposed a latent bug that was already there: .convert can be followed by a function (e.g. .numToDeva), which should be called like a method (i.e. with self as the first implicit argument). That wasn't happening, but someone had implemented a kludge to get around that in this one specific case; however, other conversions would still have been broken (e.g. .numToRoman). Now, they should all work. Theknightwho (talk) 21:32, 5 January 2025 (UTC)Reply
Latest comment: 7 days ago3 comments2 people in discussion
I'd like to propose a value for use in {{lb}}, etc. I've checked at Template:label/list and it's not there. Is this the place to do it? (The value is "italicised," "italicized," etc., by the way, for use with qualifiers like "usually" or "sometimes," e.g. for words that are naturalised but still often treated as foreign terms, like sic.) Cameron.coombe (talk) 23:52, 7 January 2025 (UTC)Reply
You can use an arbitrary label in {{lb}}. We could add this as a recognized label but the only reason to do it is either if these terms should be categorized or if we want the label to link to somewhere in the glossary with an explanation of what the term "italicized" means. Benwing2 (talk) 22:51, 13 January 2025 (UTC)Reply
There are several abuse filters on this wiki that exist to preemptively stop bad edits from being published. One of those is recurring characters, because 99% of the time, if someone is posting "hg" a dozen times in a row, it's vandalism. I can create a blank user page that you can then amend. —Justin (koavf)❤T☮C☺M☯22:22, 8 January 2025 (UTC)Reply
double-formating
Latest comment: 11 days ago4 comments3 people in discussion
Here's the full list. There are fewer matches than I expected. Overall it looks like a mix of stray formatting, intended formatting that's mistakenly separated with a space, and several intentional uses of * as an asterisk and not a formatter. It's probably faster to identify and cleanup the mistakes by hand than to automate it with a bot. JeffDoozan (talk) 19:54, 9 January 2025 (UTC)Reply
Accelerated inflected forms bug for Latin with la-adecl
Latest comment: 7 days ago8 comments2 people in discussion
While editing praedīves, a Latin adjective with identical masculine and feminine inflected forms, I noticed the green link for "praedīvite" in the inflection table incorrectly omitted the "f" marker (creaing a page with "infl of|la|praedīves||abl|m//n|s"). I see the same when I try the green link in the inflection table at compatibilis for the form compatibilem (it generates "infl of|la|compatibilis||acc|m|s" which should be "infl of|la|compatibilis||acc|m//f|s"). This is presumably a recent bug since I don't see this error on existing pages. I would guess it has something to do with Module:la-adj/table; I'm not sure if I caused it by my edit here. Urszag (talk) 12:46, 10 January 2025 (UTC)Reply
@Urszag Testing the accelerator code is not so easy. I would suggest temporarily reverting your code to see whether that fixes the issue; make sure you null-save the test page after the revert. Benwing2 (talk) 01:42, 12 January 2025 (UTC)Reply
Thanks! I tried manually reverting my edit but it didn't seem to fix it. I couldn't do a full rollback since further edits had been made since. @This, that and the other, do you see anything else in that module that might be causing this, or do you have an idea of which other modules might be responsible?--Urszag (talk) 12:52, 12 January 2025 (UTC)Reply
Looks like you're right that it has been broken for multiple years. I just tried searching for examples of "ablative masculine/neuter plural", and I see it was not working right in 2021 when stellantibus was created. I don't know if there's an easy way to find and fix all of the forms that are erroneously labeled like this.--Urszag (talk) 17:42, 12 January 2025 (UTC)Reply
I can search through the dump for particular sorts of use cases with particular sorts of endings, if you can help me enumerate them. They would e.g. be adjective forms in -em that are labeled as just accusative masculine singular, or probably any form that is labeled masculine/neuter. Benwing2 (talk) 17:48, 12 January 2025 (UTC)Reply
Certain forms will be accurately labeled as masculine/neuter, such as second-declension genitive forms (singular in -ī, plural in -ōrum), or dative/ablative singular forms ending in -ō. Forms that can be assumed to be erroneous would include:
any with Adjective or Participle headers labelled as "dative/ablative masculine/neuter plural" (since masculine, neuter and feminine dative/ablative adjectives are nearly always identical in form, with only a handful of exceptions such as ambōbus).
If it's possible to check for specific declension endings, third-declension forms are likely to be erroneous when marked "genitive masculine/neuter" (singular ending in "-is" or plural ending in -(i)um") or as "dative/ablative masculine/neuter singular", "dative masculine/neuter singular", or "ablative masculine/neuter singular" (ending in -ī or -e). Likewise, as you mentioned, third-declension forms marked as "accusative masculine singular" in -em or "nominative/accusative/vocative masculine plural" in -ēs can be assumed to be erroneous.--Urszag (talk) 18:41, 12 January 2025 (UTC)Reply
@Urszag Thanks. I think this is all fixed now. In the process I found and fixed a bunch of random mistakes in the forms generated by SemperBlottoBot (which made tons of random mistakes with no obvious pattern; I don't know how a bot managed to do this). Benwing2 (talk) 22:38, 13 January 2025 (UTC)Reply
Latest comment: 7 days ago3 comments3 people in discussion
My edit was constructive but I triggered bio abuse filter. I replaced some words with ellipses but I still triggered the filter. What words does it filter? 36.85.216.15410:59, 13 January 2025 (UTC)Reply
\b's won't help. The filter is extremely effective at detecting what it's supposed to. I tried now to improve its detection of valid dictionary entries so that it should try to let them through. — SURJECTION/ T / C / L /07:13, 14 January 2025 (UTC)Reply
Seeing that all the other ones (GenAm, InE, AuE, RP) are listed in the labels module, I've gone ahead and added these two over to there! This should hopefully serve to avoid this sort of confusion in the future. MedK1 (talk) 23:12, 13 January 2025 (UTC)Reply
Latest comment: 7 days ago7 comments3 people in discussion
The other day I came across this page on Wikpedia with a non-exhaustive, frankly quite short list of terms with ligatures on them. The page was asking for editors to fill it with more examples. I saw it and thought: "Why, this is right up Wiktionary's alley!"
Imagine my surprise when it turns out that in our case, we keep these similar cases in several separate categories. It makes sense, but someone interested in getting a list of words with Æ is likely interested in knowing about words with, say, Œ as well. And then logically comes the question "what other ligatures are there in English?"
I think we could a) improve our current navegability, b) help out Wikipedia and c) bring some more clicks to Wiktionary by making a little umbrella category encompassing thesethreecats. and any other ligatures I missed, perhaps in a cat called "English terms spelled with ligatures".
I believe it'd be equally useful for other languages where ligatures both exist and are unusual, and that an expert in {{auto cat}} could accomplish this fairly easily. Alas, I am not one such expert, and though I tried very hard to look through the documentation and figure out what it was I had to do to get this done by myself, my experience there was completely fruitless and frankly quite frustrating.
I initially thought the othercat parameter (said to be limitless in the documentation) could be useful, but apparently it's not used with auto cat. I attempted looking through the code as well, only to be linked to this long-obsoleted "letter cat" template...
But I digress. It's best to just leave this to those who know best. A warning in the category edit page directed me to GP, so please help me out here!!
@MedK1|othercat= in {{auto cat}} is specifically used with |lect=1. We could create a category Category:English terms spelled with ligatures and just manually put those three cats into this category by adding ] to the end of each category definition. If this is a good idea though, it might make sense to do it for all languages, which would require an ability to figure out whether a given character is a ligature (I'm not sure how easy this is to do in Unicode). Benwing2 (talk) 22:48, 13 January 2025 (UTC)Reply
I do think it's a good idea, and I agree that it makes sense to do it for other languages as well. Your guess is far better than mine when it comes to ease of coding that part though. Wikipedia has a list of those, the page has no maintenance tags and a reference says no more will be added, so perhaps the list is actually exhaustive and the trick would be to check for any of these characters one-by-one, by 'feeding' all these codes into the code or something...
Do you think that until (or even if) that is figured out, a good temporary option would be to indeed manually categorize the three pages into the new category? If so, how would the body text for the new category be handled? Would it be placed into {{auto cat}} so it too can have a parent category? Thanks so much for the response!! MedK1 (talk) 23:05, 13 January 2025 (UTC)Reply
Yes, if we want a terms spelled with ligatures category, it should be added to the category tree. But let's wait a bit to see if anyone else has any input. Benwing2 (talk) 23:31, 13 January 2025 (UTC)Reply
BTW you are right that Unicode is not going to add more ligatures; they discourage precomposed ligatures in general (and for good reason). However we need to be choosy about what counts as a ligature; e.g. I don't think it would be helpful to treat w as a ligature of vv. Benwing2 (talk) 23:33, 13 January 2025 (UTC)Reply
@Benwing2: There are syllabic scripts like Devanagari that use ligatures a lot- I wonder if categories are a good idea for languages that use them. I suppose, though, that a distinction might be made between precomposed ligatures such as Æ and font-generated ones such as त्र (त् + र). Chuck Entz (talk) 06:07, 14 January 2025 (UTC)Reply
Latest comment: 7 days ago1 comment1 person in discussion
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Weekly highlight
The Single User Login system is being updated over the next few months. This is the system which allows users to fill out the login form on one Wikimedia site and get logged in on all others at the same time. It needs to be updated because of the ways that browsers are increasingly restricting cross-domain cookies. To accommodate these restrictions, login and account creation pages will move to a central domain, but it will still appear to the user as if they are on the originating wiki. The updated code will be enabled this week for users on test wikis. This change is planned to roll out to all users during February and March. See the SUL3 project page for more details and a timeline.
Updates for editors
On wikis with PageAssessments installed, you can now filter search results to pages in a given WikiProject by using the inproject: keyword. (These wikis: Arabic Wikipedia, English Wikipedia, English Wikivoyage, French Wikipedia, Hungarian Wikipedia, Nepali Wikipedia, Turkish Wikipedia, Chinese Wikipedia)
One new wiki has been created: a Wikipedia in Tigre (w:tig:)
View all 35 community-submitted tasks that were resolved last week. For example, there was a bug with updating a user's edit-count after making a rollback edit, which is now fixed.
Updates for technical contributors
Wikimedia REST API users, such as bot operators and tool maintainers, may be affected by ongoing upgrades. Starting the week of January 13, we will begin rerouting some page content endpoints from RESTbase to the newer MediaWiki REST API endpoints for all wiki projects. This change was previously available on testwiki and should not affect existing functionality, but active users of the impacted endpoints may raise issues directly to the MediaWiki Interfaces Team in Phabricator if they arise.
Toolforge tool maintainers can now share their feedback on Toolforge UI, an initiative to provide a web platform that allows creating and managing Toolforge tools through a graphic interface, in addition to existing command-line workflows. This project aims to streamline active maintainers’ tasks, as well as make registration and deployment processes more accessible for new tool creators. The initiative is still at a very early stage, and the Cloud Services team is in the process of collecting feedback from the Toolforge community to help shape the solution to their needs. Read more and share your thoughts about Toolforge UI.
For tool and library developers who use the OAuth system: The identity endpoint used for OAuth 1 and OAuth 2 returned a JSON object with an integer in its sub field, which was incorrect (the field must always be a string). This has been fixed; the fix will be deployed to Wikimedia wikis on the week of January 13.
Many wikis currently use Cite CSS to render custom footnote markers in Parsoid output. Starting January 20 these rules will be disabled, but the developers ask you to not clean up your MediaWiki:Common.css until February 20 to avoid issues during the migration. Your wikis might experience some small changes to footnote markers in Visual Editor and when using experimental Parsoid read mode, but if there are changes these are expected to bring the rendering in line with the legacy parser output.
Latest comment: 6 days ago3 comments3 people in discussion
Hello. I'm new to the world of Wiktionary. I have a question about the structure of definitions. At the moment I'm specifically concentrating on English simple nouns (not compound, hyphenated, nor proper.) I'm using a post-processed dump of the English wiktionary data (2025-01-13) from kaikii.org. I count 481,098 unique such nouns; but there are 489,401 noun entries. There are 6579 simple English nouns with multiple entries for the same pos. That's just 1.4% of total nouns. For example, "swop" has 2 entries as Nouns. I note the POS entry is under the Etymology entry. Is this standard practice in wiktionary? If so, I would expect to see more of these multiple entries but I'm no expert in these matters.
I can only compare to a couple of other resources I have used. In WordNet, every word-pos combination exists as a single lemma, so "swop" would have exactly one lemma in WordNet with multiple senses and synsets. I also frequently use the online Merriam Webster dictionary. In that dictionary, there also seems to be multiple entries for the same word-pos (again, I've only been focusing on simple nouns) for certain words. Sometimes there is a single header for "noun", and multiple senses are listed by number, as with "love." Other times, there are multiple Noun entries such as with "asp."
So my question is, what are the rules for determining when a simple English noun gets one Noun header with multiple sense definitions, versus when there are multiple Noun headers?
@Killeroonie: Hi, welcome! Yes, per WT:EL and similar to MW, we generally separate entries by their etymology. For example, at English lead, the noun/verb relating to the element are separated from the noun/verb related to "guiding, directing, etc.", since they have different etymologies. That means that yes there can and often will be multiple of the same POS headers for the same "entry", if the etymologies are different. AG202 (talk) 17:20, 14 January 2025 (UTC)Reply
I think careful examination of the English-language sections that have multiple etymologies will help clarify the groupings of definitions by etymology and some issues that remain. All of the noun definitions under Etymology 1 ("element") involve the material or an extension of meaning to different materials from earlier practice IRL (eg, a pencil lead). The verb definitions under Etymology 1 also clearly involve the use of the material, either in current or historical practice. Under Etymology 2 the definitions seem to have evolved from a verb definition. As to unresolved issues, in the case of ], we have, under Etymology 3, the definition "mispelling of led". As led is an inflected form of the verb lead under Etymology 2, one might expect it to appear under Etymology 2 with the same definition. OTOH, misconstruction and other errors could be considered different etymological processes. DCDuring (talk) 18:22, 14 January 2025 (UTC)Reply
"noun form" in "head|en|noun form"?
Latest comment: 4 days ago7 comments4 people in discussion
For the word "treen", the source for the first Noun entry is
@Killeroonie I can give you the short answer: if you write {{head|en|something}}, the entry will be categorised into the "English somethings" category. (In your case that category will be Category:English noun forms.) It will also look up "something" in a list to decide whether to additionally categorise the entry into "English lemmas" or "English non-lemma forms". There's nothing more to it.
Why is an obscure word like "treen" categorized as "noun form" when a commonly used word like "lead" is not? Is this category for obscure words? If so, it's not named very descriptively, is it? lol :-)
@Killeroonie It's because treen is a plural, so a form of a noun (that noun being tree), whereas lead is a noun in its own right - a lemma, as we call it.
In fact, if you study the category list carefully, you'll see that treen is also in Category:English nouns, thanks to the noun senses under Etymologies 2 and 3, which are lemmas (not forms of other nouns).
Newbie question - what's the point of keeping track of these forms as top level entries? I can see how this would be useful as a reverse lookup for finding basic noun forms (lemmas?) for irregular morphologies. (Look at me learning the lingo! :)
@Killeroonie It's a reverse look-up, yeah. It's especially useful when a non-lemma form looks identical to some other word, like in the treen example, as it's a way to acknowledge that both exist; a more everyday example is felt, where it can be a noun (a type of fabric), a verb (to cover something with felt) - both lemmas - or the past form of feel - a non-lemma. Theknightwho (talk) 21:43, 15 January 2025 (UTC)Reply
I did a big spree through de and all the templates on that page, including the ones that you don't see until you uncollapse them, are now dark mode-compliant! I am sure there are still many things to clean up, but this is a small step towards improving the quality of our inflection tables. This, that and the other (talk) 09:54, 16 January 2025 (UTC)Reply
Latest comment: 3 days ago4 comments3 people in discussion
There are ~20,000 entries using this template and ~4000 with manual IPA. There should be vanishingly few cases where the template's automatic output isn't correct. Ultimateria (talk) 02:09, 17 January 2025 (UTC)Reply
Can you point me to some pages with manual IPA, and let me know if there are any things to watch out for when trying to convert to use {{eo-IPA}}? The first pass is to try using {{eo-IPA}} and see if the output is the same, but I imagine there are lots of places with wrong manual IPA. For that matter, there appear to be lots of places with wrong {{eo-IPA}} output due to incorrect respelling. For example, abandonismoj has {{eo-IPA|abandon|ismoj}} which yields
which seems clearly wrong (and is in conflict with abandonismo). Are there *ANY* places where the automatic syllable division algorithm gets it wrong, necessitating manual respelling? If not, we can just remove all such cases. Benwing2 (talk) 07:53, 17 January 2025 (UTC)Reply
Here's a small variety: antaŭi, luu, solvo, lekcii, postvivi, armas, sep, elpensi, distingi. I would skip any entries consisting of a single letter and anything with a hyphen just to be safe. I see at malami the syllable division is different and I'm not sure if this is correct based on the prefix mal-; it seems plausible where the division at e.g. deklami does not. Unfortunately I'm not sure who to ping for Esperanto... Ultimateria (talk) 17:47, 17 January 2025 (UTC)Reply
Syllabification seems to be a somewhat contentious issue. Is it necessary to display it? I understand that it's one of the few things not shown by the writing system (so if we can display it accurately, that would be useful), but I'm not sure how effectively we can ensure that we actually display it accurately rather than incorrectly. I found these two discussions online: Hyphenation and syllable and morpheme boundaries, Does pronounciation always respect morpheme boundaries?. Van Oostendorp 1999:75 thinks that because pacama is a compound, it is syllabified "as /pac.a.ma/ not */pa.ca.ma/" (but Van Oostendorp 1999:74 does indicate non-morphological syllabification before a vowel-initial suffix, as in the word urbestro /ur.bes.tro/).--Urszag (talk) 18:32, 17 January 2025 (UTC)Reply
The translation adder and script-specific language subheaders
Latest comment: 2 days ago3 comments2 people in discussion
Going through User:JeffDoozan/lists/translations/by error/wrong language code, I've been finding lots of cases like Serbo-Croatian Cyrillic added to Mongolian Cyrillic, and even Uzbek Latin added to Latin (not the script, the language). I found similar cases for other languages.
I'm talking about languages that have a main header in the language section, with indented subheaders for the different script, e.g.:
Latin:
Mongolian:
Cyrillic:
Latin:
Serbo-Croatian:
Cyrillic:
Latin:
I think what's happening is that the translation adder has a string for the subheader, "Cyrillic:" or "Latin:", and looks for that string in every line of the wikitext that starts with "*". Once it finds an instance, it doesn't check what the main header is. It also doesn't check whether the line starts with "* " or "*: ", so I've found things like "* Latin: {{t|la}}, {{t|uz}}".
I've noticed that the false positive is always before the real target, so it apparently starts at the top and works down. That means that languages higher in the alphabetical order such as Serbo-Croatian and Uzbek are mostly the ones added to the lines for languages that are lower in the alphabetical order.
Aside from having translations under the header for the wrong language, it hides them from people who are looking under the correct header. It's not that uncommon to have the same term added twice- the second time apparently by hand. I believe it also means that the presence of a false positive would keep the translation adder from discovering the absence of a correct header and/or subheader, and thus prevent it from adding one.
I think the way to solve this would be to have the translation adder keep track of both headers and subheaders for each line, so that it would know that "* Latin:" is a main header rather than a subheader, "* Mongolian:" is the next main header, and that the "*: Cyrillic:" on the next line belongs to the "* Mongolian:" main header. It would then switch the main header at the "* Serbo-Croatian:" line and thus know that the "*: Cyrillic:" on the next line belongs to it.
Of course, that's easy for me to say, because I don't have to write the code, but maybe those who work on such things could add it to their "when I get around to it" lists to work on later. Thanks! Chuck Entz (talk) 02:00, 18 January 2025 (UTC)Reply
Just to check, do we know when these errors were introduced, i.e. is the translation-adder still doing this or are these old errors? This was a problem for a long time and is why some languages used to contrast "Cyrillic:" with "Roman:" rather than with "Latin:" script (you probably recall as well as I do). When I try to test right now, I notice that when I add a sh translation here, it isn't put in on the Latin-language line or the Latin-script-Serbo-Croatian-language line, but directly on the Serbo-Croatian line, which seems unexpected but in a different way (?). ¯\_(ツ)_/¯ - -sche(discuss)05:34, 18 January 2025 (UTC)Reply
Attempted to Remove a Request for an Entry that I Made That Is Now Clearly Unnecessary But Got a WTNoD Warning
Latest comment: 2 days ago3 comments3 people in discussion
Sorry, I was trying to remove a request I had made for an entry for a word that wasn't in Wiktionary, that after consulting with some native speakers, turned out was just a common typo. I was given some kind of "WTNoD" warning or something that said it was "harmful" and I couldn't do it. Not sure how best to go about getting it removed. Thank you. 98.127.78.4307:18, 18 January 2025 (UTC)Reply
Latest comment: 1 day ago1 comment1 person in discussion
Due to the latest change in Module:script utilities, the legacy Middle Korean hyphenation trick where a double hyphen "--" shifts the hyphen one letter rightwards in the transliteration is broken.
That is,
{{m|okm|ᄉᆡ--미}}
should be shown as "ᄉᆡ미 soym-i" but an extra hyphen is shown in the Hangul text: ᄉᆡ-미(soym-i). See the quotation in 긋다 for an actual example.
I implemented a solution for this issue in my sandbox and added some more fixes for Middle Korean:
For the new solution for Middle Korean hyphenation, where '>' signifies a shift the hyphen to the right, we additionally need to delete the '>' character. For example, "ᄉᆡ->미" becomes "ᄉᆡ미 soym-i".
The letter 'g' is used to signify the G /ɣ/ phoneme in Middle Korean, which is not consistently reflected in the Hangul script. For example, "달g아" becomes "달아 talGa". This apparently had already been implemented since 2021 but was never used because it was not properly handled in Module:script utilities.
This fix changes the behavior in Middle Korean so that '--' does not show a hyphen, and two additional characters '>' and 'g' are deleted.
I find it to be quite good and has captured the data from each word entry on the wiki. I know that plural forms for loan words in English are a jumbled mess. But I would like to use a general algorithm to generate plural forms for Latin, Greek, and French loan words to minimize the size of my exception lists.
Are there any tags (Latin, Greek, French) for such words in the wiki data that I can use to give me a hint of possible "standard" rules for generating a plural form for words from these languages?
Also, I'm sure this problem has been solved. I'm not looking for suggestions of existing code, as I am doing this as a hobby project for myself because it's fun. :)
Latest comment: 6 hours ago1 comment1 person in discussion
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Updates for editors
Administrators can mass-delete multiple pages created by a user or IP address using Extension:Nuke. It previously only allowed deletion of pages created in the last 30 days. It can now delete pages from the last 90 days, provided it is targeting a specific user or IP address.
On wikis that use the Patrolled edits feature, when the rollback feature is used to revert an unpatrolled page revision, that revision will now be marked as "manually patrolled" instead of "autopatrolled", which is more accurate. Some editors that use filters on Recent Changes may need to update their filter settings.
View all 31 community-submitted tasks that were resolved last week. For example, the Visual Editor's "Insert link" feature did not always suggest existing pages properly when an editor started typing, which has now been fixed.
Updates for technical contributors
The Structured Discussion extension (also known as Flow) is being progressively removed from the wikis. This extension is unmaintained and causes issues. It will be replaced by DiscussionTools, which is used on any regular talk page. The last group of wikis (Catalan Wikiquote, Wikimedia Finland, Goan Konkani Wikipedia, Kabyle Wikipedia, Portuguese Wikibooks, Wikimedia Sweden) will soon be contacted. If you have questions about this process, please ping Trizek (WMF) at your wiki.
The latest quarterly Technical Community Newsletter is now available. This edition includes: updates about services from the Data Platform Engineering teams, information about Codex from the Design System team, and more.