If you look at the transclusion list for *geutan, you see that it transcludes itself. Other Proto-West Germanic pages seem to have this as well. The source seems to be in the descendants list. Any idea why this is, and is it a problem? —Rua (mew) 11:27, 1 January 2020 (UTC)
{{reconstructed}}
checks the headers on the page using Module:reconstruction. I'll add documentation for this. — Eru·tuon 18:49, 1 January 2020 (UTC)There's no doubt some elegant way to do this in the search box, but it involves integrating a literal search for very basic wiki syntax with a regex, so I haven't been able to figure this out yet. I need to look for:
followed by:
followed by:
followed by:
followed by:
followed by:
My best (very amateur) guess at a regex:
Could someone either show me how to do this, or provide a list of entries that I can use? Chuck Entz (talk) 06:56, 2 January 2020 (UTC)
''+ {2,}''
based on your description - if you put the regex into https://regexr.com/, you can see an explanation. --DannyS712 (talk) 07:05, 2 January 2020 (UTC){{trans-top}}
headers. Trying to exclude all such things might be hard. But perhaps a dump run including preceeding (and succeeding?) characters (12?, 20?, 40?) of the potential species name instances would help. DCDuring (talk) 18:35, 2 January 2020 (UTC)This template: {{nuqtaless form of}}
is very useful but it should be limited to (I think) Devanagari scripts, specifically Hindi. It should also add to a category, such as Category:Hindi terms spelled without a nuqta (language is the second parameter), which should be a subcategory of Category:Hindi terms with irregular pronunciations (based on hi) because the spelling doesn't give enough information to pronounce the term correctly. If this is based on {{ru-noun-alt-ё}}
, the idea is implemented there. Please also note that the Russian template transliterates as in the dictionary form - самолет->самолёт (samoljót)
To keep things simple, I think it should be just Hindi. No need to to reuse it for other scripts. Arabic relaxed spelling forms (lack of hamza, etc.) could use a similar approach.
Here's an example: बगीचा (bagīcā) nuqtaless form of -> बग़ीचा (baġīcā). The dictionary form is "बग़ीचा" (with a nuqta (dot)) and the expected dictionary transliteration is "baġīcā" (not "bagīcā"). If it's there are too many cases of "spelling transliteration", then transliteration should be unchanged.
Calling @DerekWinters, AryamanA, Benwing2. --Anatoli T. (обсудить/вклад) 11:03, 3 January 2020 (UTC)
I just tried to create 2 new dictionary entries. The first page was created without incident. Then whenever I try to add another dictionary entry, I get the error “Warning: This action has been automatically identified as harmful”....“A brief description of the abuse rule which your action matched is: various specific spammer habits.” This happens no matter what definition I write for the term. I even tried using the new-entry (NEC) wizard. (Eventually it stopped even offering me the NEC wizard as an option.) (As a test, I tried creating a third entry, and even though it did not meet Wiktionary policy, it allowed me to create it. I have since requested deletion of my test edit, but I still want to know why it won't let me create the entry that I was actually trying to create.) 66.82.144.143 23:47, 3 January 2020 (UTC)
Hi,
This is the api I'm talking about: https://en.wiktionary.org/api/rest_v1/#/Page%20content/get_page_definition__term_
Adding these would be very useful
1) pronunciations
2) urls to the related audio clips of said pronunciations
I probably should be posting this in phabricator but, if I'm being honest, I'd rather not learn to navigate that thing just for this, so I came to throw ball here instead.
88.193.149.0 14:15, 4 January 2020 (UTC)
I suspect editors more technically adept than me have already thought about this, but: is it possible, either via MediaWiki:UnsupportedTitles.js or via an in-entry template or a Lua module (perhaps of the sort which we use to verticalize Mongolian entries' titles), to put images (directly or via templates like {{biang}}
) into the displayed title of entries like ⿺辶⿳穴⿲月⿱⿲幺言幺⿲長馬長刂心⿺辶⿳穴⿲月⿱⿲幺言幺⿲長馬長刂心麵? I've played around and it doesn't seem to work, which leads to my next idea: could we use MediaWiki:UnsupportedTitles.js to replace long titles like ⿺辶⿳穴⿲月⿱⿲幺言幺⿲長馬長刂心⿺辶⿳穴⿲月⿱⿲幺言幺⿲長馬長刂心麵 with a blank space, and then float an image of the correct title over that space? Would that be too prone to displaying incorrectly on different browsers/skins/phones to be a good idea? - -sche (discuss) 20:02, 10 January 2020 (UTC)
The translation box generated by MediaWiki:Gadget-TranslationAdder.js is missing option for masculine personal, animate and inanimate genders used in Slavic languages. I don't have edit access to the file to add them. Can someone add them or give me access to that file? --Tweenk (talk) 00:05, 12 January 2020 (UTC)
. What should it be instead? — Eru·tuon 00:28, 12 January 2020 (UTC)
. The "m" option is ambiguous so it can be omitted. However, the gadget doesn't have support for these, they would have to be added there first. --Tweenk (talk) 09:14, 12 January 2020 (UTC)
Can somebody add the following code to Module:IPA/data?
data.phonemes = { "a", "b", "d", "d͡ʒ", "e", "f", "h", "i", "j", "k", "l", "m", "n", "o", "p", "r", "s", "t", "t͡s", "t͡ʃ", "u", "v", "w", "x", "z", "ɡ", "ʃ", "ʒ", "ˈ", ".", " ", }
Robin van der Vliet (talk) (contribs) 00:14, 13 January 2020 (UTC)
It’s time for these templates {{cite-meta}}
or {{quote-web}}
, or whatever one now is eager to deploy to catch quotes in markup, to somehow detect the writing direction. If the author names in addition to the title are Hebrew or Arabic or Persian, like perhaps everything else in the reference, the references are not readable at all if one does not add a left-to-right mark after the author parameters. And if one does so, one has to start reading somewhere in the middle because that’s where the author names start now, going to the left, and then on the right follows the title, when what they should so is start on the right side of the text block (even if containing left-to-right text, e.g. taxons). Example 1, Hebrew: at دَيْر (dayr) (does not look too bad because of the Right-to-Left parts being very short but still wrong). Example 2, Arabic: at عُشَر (ʕušar). The journal piece linked in the second example, as a model, by the way shows in its bibliography how Right-to-Left titles appear correctly.
A parameter to set direction might solve it perhaps, and perhaps its default value can be inferred from |lang=
or |worklang=
, parameters often required anyway. Fay Freak (talk) 21:37, 13 January 2020 (UTC)
Also concerns {{sic}}
, which looks unnecessarily bad without left-to-right-mark. Fay Freak (talk) 22:37, 28 January 2020 (UTC)
I would like to request for the protection for all individual subpages placed under the following modules:
as well as the following pages:
The values in these pages should not be changed or modified unless there are typos or graphical variants, e.g. conversion of 爲/为 to 為/为.
If someone were to add, modify or remove any of the values, it would cause changes that would affect the output of several templates, and this is not easy to fix if multiple readings are involved, e.g.
It would be better for users to request for changes to be done on a case-by-case basis, like this Nov 2019 Tea room discussion. KevinUp (talk) 09:42, 14 January 2020 (UTC)
Would it be possible to include a second, optional parameter for an attributive form in {{af-adj}}
, and to include a qualifier parameter for this second attributive-form parameter? Some adjectives have two attestable attributive forms, often with one being less common than the other. ←₰-→ Lingo Bingo Dingo (talk) 15:23, 14 January 2020 (UTC)
Look at e.g. Wiktionary:Word of the day/Recycled pages/January. Each word has a pair of "edit, refresh" links, but clicking on the "edit" link takes you to the editing-view of a nonexistent page with one too many slashes, e.g. Wiktionary:Word_of_the_day//January_1 instead of Wiktionary:Word_of_the_day/January_1. - -sche (discuss) 09:54, 16 January 2020 (UTC)
Could someone remove ', B, K ,W, and 🌶 out of Category:Chinese Han characters ? They are definitely not Han characters. --OctraBot (talk) 04:04, 17 January 2020 (UTC)
PS I am Octahedron80 in bot mission at the moment.
I changed clx from using {{mul-symbol}}
to {{head|mul|symbol}}
, because {{mul-symbol}}
was causing the headword to display at an awkwardly large font size, different from the size of the other headword on the page, and different from e.g. mlx (which already used {{head|mul|symbol}}
). We should probably either make {{mul-symbol}}
not display headwords so large if they consist only of (Latin-script?) letters (and punctuation?), or perhaps make it not display headwords so large at all (but that might cause issues for small symbols). Failing that, we could mass-edit entries like clx to consistently use {{head|mul|symbol}}
or, if we actually prefer the embiggened display, then edit them to consistently use {{mul-symbol}}
instead of the current inconsistent state of affairs. - -sche (discuss) 19:42, 17 January 2020 (UTC)
{{mul-symbol}}
manually sets the script code to Zsym
if it hasn't been specified in the |sc=
parameter; .Zsym
has font-size: 150%;
in MediaWiki:Common.css. So currently this can be fixed by setting |sc=Latn
; but it would be better if it were automatic. — Eru·tuon 20:14, 17 January 2020 (UTC)
mul
in Module:languages/data3/m (the same procedure used for other languages), and end up using Latn
for clx. I don't know how satisfactory this would be; the script codes Zsym
and Zmth
aren't Unicode things, so someone here on Wiktionary had to decide what characters they should include (in Module:scripts/data). — Eru·tuon 20:43, 17 January 2020 (UTC)@Chuck Entz: So, if {{mul-symbol}}
used automatic script detection, a script other than None
would be assigned in all but 180 of the 896 titles that use the template:
This would be fine for the ASCII titles, but some of the others might display better if they have script classes applied to them. — Eru·tuon 00:39, 18 January 2020 (UTC)
{{head}}
in order to avoid the latter unnecessarily repeating the script detection? Chuck Entz (talk) 00:57, 18 January 2020 (UTC)
|sc=
parameter is present, {{head}}
won't do script detection for the first headword. That provides a good way to solve this. Now the template uses normal script detection, but, by means of {{mul-symbol/script}}
, replaces None with Zsym. That should be an improvement. — Eru·tuon 03:00, 18 January 2020 (UTC)@Rua, Erutuon Maybe one of you two can help me figure out some weirdness I observed. I have been gradually adding more places to Module:place/shared-data, and I've been worrying that this will eventually cause some pages like Washington (which has > 30 invocations of {{place}}
, each of which loads the module with require()) to run out of memory. The main data table in Module:place/shared-data has handler functions mixed in with data, so I have experimented with moving the subtables that are pure data into Module:place/shared-data/tables. When I did this, however, and replaced the inline subtables with references to Module:place/shared-data/tables, the memory of Washington *increased* from 25MB to 29.some MB. To do this I had to add a call to mw.loadData("Module:place/shared-data/tables")
, an extra require("Module:table")
and some calls to m_table.shallowcopy()
on some data that would originate from Module:place/shared-data/tables, because it gets in-place modified by subpolity_value_handler()
and similar functions. However, these changes don't materially affect the memory usage. I've verified that making the single-line change of replacing data = export.US_states,
on line 1528 of Module:place/shared-data with data = m_shared_tables.us_states,
(i.e. changing the reference to the subtable listing US states from elsewhere in Module:place/shared-data to Module:place/shared-data/tables) increases the memory from 25MB to around 27MB. I can't understand why this would happen; in both the before and after scenarios, Module:place/shared-data/tables is loaded with mw.loadData()
; all that's changing is a single pointer. The only thing I can think of is that the metatable functions used to access the table are somehow using up memory, but I can't see why that would be the case. Any thoughts?
BTW what prompted this was I'm planning on adding a bunch of data on major cities to Module:place/shared-data (see Module:User:Benwing2/place/cities, containing the data as I have it so far). I assumed this would significantly increase the memory usage of pages that call {{place}}
a lot; but when I tried adding the data to Module:place/shared-data, it only increased the memory usage of Washington by about 100K. This leads me to conclude I don't understand Lua memory usage very well. Benwing2 (talk) 20:54, 19 January 2020 (UTC)
mw.loadData
has a cost. data = m_shared_tables.us_states,
is different from data = export.US_states,
, if export.US_states
is a plain table defined elsewhere in the same module, and m_shared_tables
has been loaded with mw.loadData
and has not been indexed by "us_states"
before: it calls an __index
metamethod of the table returned by mw.loadData
, which in turn wraps the underlying data with the dataWrapper
function, which generates new tables and functions each time. And if another function accesses fields of m_shared_tables.us_states
, or iterates over it with pairs
or ipairs
, the __index
metamethod is called there as well. The wrapped tables are unique per call to mw.loadData
; that is, if you save the output of mw.loadData
and index it twice, it returns the same value; but if you load a module with mw.loadData
and index it twice, it generates two different values. (See this demonstration.) So the more subtables accessed, the more memory used. The maximum memory would probably be used if the table were iterated over recursively.{{place}}
in Washington so that's at least 34 * 3 calls to dataWrapper
: for the top-level table of Module:place/shared-data/tables, the US states table, and the "Washington" subtable, in each invocation. I don't know if that fully explains the approximately 4 MB jump (dividing 4 MB by 34 * 3 equals almost 40 KB per call to dataWrapper
, which seems like a lot), but it makes it somewhat plausible. mw.loadData
avoids re-executing a module, but it only saves memory if the memory cost of wrapping each table accessed through dataWrapper
is less than the memory cost of re-generating the whole module table – apparently not in this case.mw.loadData
is called on a new module, probably only for the first invocation of a module on a page. — Eru·tuon 22:18, 19 January 2020 (UTC)
{{place}}
and only uses 10MB. Benwing2 (talk) 23:12, 19 January 2020 (UTC)Instead of hardcoding some ordinal numbers in Module:eo-headword/exceptions, can somebody edit the function "getPOS" in Module:eo-headword, so that it recognizes all ordinal numbers as adjectives? Here are two regular expressions to recognize an Esperanto ordinal number: ^{1,}-?a$
(return "adjectives"
) and ^{1,}-?aj?n?$
(return "adjective forms"
). Robin van der Vliet (talk) (contribs) 18:24, 22 January 2020 (UTC)
-- deal with ordinals
if mw.ustring.match(word,"^{1,}-?a$") then
return "adjectives"
elseif mw.ustring.match(word,"^{1,}-?aj?n?$") then
return "adjective forms"
end
Would adding a detailed Sinitic family tree to Module:etymology languages/data be allowed and or useful?
For example:
Mandarin (cmn)
or:
Similar data is already at Module:zh/data/dial, which belongs to {{zh-dial}}
.
I previously did something similar specifically for Category:Philippine Hokkien; see the family tree at Category:Min Nan language.
—Suzukaze-c◇◇ 10:04, 25 January 2020 (UTC)
See MW page on new tool in beta. It is a FF and Chrome browser extension for now and works for some pedias. I have tried it and it looks handy for patrolling and entry review. But it seems to require the WhoColor API extension. I don't know whether WhoColor API is specific to WP vs. other wikis. DCDuring (talk) 17:55, 25 January 2020 (UTC)
When browsing through the 一 lemma - a very common character in East Asian languages with many possible languages - I noticed that the bottom was full of messages saying: "Lua error: not enough memory." Since the error appears consistently at the end of the page, and only there, it was clear to my layman's mind that there are simple more modules than some part of the system (Lua itself? The Wikimedia software? My browser?) can cope with; something I verified successfully by previewing only a section containing otherwise broken modules. I see the issue has already been raised above. Clearly this problem must be fixed one way or another, so I will be more than happy to report this problem here.
Should the problem be too fundamental to solve, then we must seriously consider simplifying some templates and using plain wiki syntax rather than Lua. Steinbach (talk) 19:28, 28 January 2020 (UTC)
@Erutuon Recently, a bunch of bad categories like Category:Chinese data modulescmn-hom, Category:Chinese data modulesglosses, Category:Chinese data moduleshak-pron/00, etc. have appeared in Special:WantedCategories. Each one is populated by a single module, e.g. Module:zh/data/cmn-hom for Category:Chinese data modulescmn-hom. I don't see any code in these modules or module documentation pages that adds these categories, so I'm baffled as to how these categories are getting added. Is this somehow a bug in the MediaWiki software itself? Benwing2 (talk) 06:26, 29 January 2020 (UTC)
{{documentation}}
on MediaWiki:Scribunto-doc-page-show, and automatically adds categories or documentation or both to certain modules using the module_regex
table. I'd messed up the regex
and cat
fields for some of the Chinese data modules. I've fixed the problem and added documentation about the cause. Thanks for catching this. — Eru·tuon 08:19, 29 January 2020 (UTC)
I've made a search engine for {{IPA}}
at Templatehoard/IPA, written in Rust. This has been my goal for a while. Searching is very fast and User:Surjection and I have already used the tool to find some entries that needed cleanup. For instance, I searched for instances of y in English transcriptions and corrected the ones in which j was intended.
The search engine uses CBOR stream template dumps generated from the latest dump, which comes out after the 1st and 20th of each month. The template dumps might be useful for bot owners; searching them is many times faster than using Pywikibot or another tool to download pages from the MediaWiki API and parse them, or even than using the XML dumps. If anybody wants it, I can generate dump files in JSONL format as well, because that is more widely supported.
User:Jberkel's wanted entry lists are currently generated from these template dumps.
I'm thinking of making a link template search engine, and perhaps a general template search engine. Link templates are fairly straightforward because there are definite "slots" that are used by many templates ("term", "alt", "id", "tr", "g"), but I'm not really sure how to make a search engine that would work for all templates because our templates have so many different configurations of parameters. — Eru·tuon 09:16, 29 January 2020 (UTC)
{{it-IPA}}
etc.) are not indexed. But it's very helpful as a starting point to find manual transcriptions which can be converted into automatic ones. – Jberkel 09:54, 29 January 2020 (UTC)Haven't checked, but I received this message: "there seems to be something wrong with the above-mentioned template. The first parameter for non-English quotations is not working. (see Vorabdruck) Could you please take a look at it and maybe fix it or, if not, ping someone else who can fix it. — Thanks in advance, Caligari ƆɐƀïиϠႵ 18:53, 30 January 2020 (UTC)" Equinox ◑ 21:16, 30 January 2020 (UTC)
The template does not work correctly for liters/litres. Here's what it returns for deciliter/decilitre:
Should return "dl" if I'm not completely ignorant. Seems to work fine for other units, meter for example:
--Hekaheka (talk) 08:53, 31 January 2020 (UTC)
I feel like the backend module for {{mnc-IPA}}
has to be fixed because of what happens if the template were added to the Manchu entry ᠵᠠᡵᡭᡡ (jarh'ū). --Apisite (talk) 10:40, 31 January 2020 (UTC)
{{mnc-IPA}}
ultimately (via the toIPA
function in Module:mnc-IPA) works off the transliteration generated by Module:mnc-translit. I notice that some of the Manchu letters transliterated with a letter and apostrophe are said to be used in foreign words; maybe there's some phonetic difference between the apostrophed and un-apostrophed letters or maybe the apostrophe should just be removed by Module:mnc-IPA. — Eru·tuon 08:14, 1 February 2020 (UTC)
I'm trying to test a new script and a potential future gadget that converts some pre-definition (according to EL) headings into collapsible boxes to make the UI less cluttered and give the main focus to definitions. The script is available at User:Surjection/streamline.js, and also supports converting post-EL headings if explicitly enabled. Any feedback is welcome, particularly considerations on whether this should be converted into a fully functional gadget that would be available under the preferences. — surjection ⟨?⟩ 16:14, 31 January 2020 (UTC)
Is there a reason we have this categorize module errors in Category:Pages with module errors instead of Category:Pages with module errors/hidden? Chuck Entz (talk) 23:44, 31 January 2020 (UTC)