Somehow Uyghur transliteration is skewed on Happy_New_Year#Translations (note the position of brackets) but no problem here: Uyghur: يېڭى يىل مۇبارەك (yë'ngi yil mubarek). --Anatoli T. (обсудить/вклад) 07:19, 1 January 2015 (UTC)
I do a lot of category creation, and, though it's less arcane and complex now that User:CodeCat has luafied most of the infrastructure, there's still a lot of typing involved.
This is mostly unnecessary: we already have very strictly-enforced rigid constraints on the format of category names, and they generally contain most or all of the information needed in the name itself- so the modules that power the templates should be able to parse it from the page name using the modules that CodeCat has put in place (see Category:Pagename-based auto-fill-in templates for some templates that provide a substitutable front end for existing templates using such techniques).
Here are my ideas with regards to specific templates:
{{topic cat}}
: For language-specific cats, the first part of the name is the language code, followed by a colon, followed by the topic name. For the non-language-specific parent categories, the page name is the topic name. See {{tcez}}
, which was developed for me by User:Kc kennylau and User:Wyang. I found Wyang's version more useful and robust, so I modified it slightly to get {{tcez1}}
{{prefixcat}}
: the first part of the page name is the canonical name of the language, followed by " words prefixed with ", followed by the suffix, followed by "-".
{{suffixcat}}
: the first part of the page name is the canonical name of the language, followed by " words suffixed with -", followed by the suffix.
{{charactercat}}
: the page name is always the canonical language name, followed by " terms spelled with " followed by the character.
These methods can be applied to just about every template that uses Module:category tree, with one important exception (below), and quite a few others.
{{poscatboiler}}
: the language-specific categories all consist of the canonical language name, followed by a space, followed by what currently goes in the template's second parameter.
This one is trickier to implement, because there's no unique delimiting text, and because of the potential for overlap between parts of language names and parts of the second parameters.
I came up with a kludgy workaround: require a single parameter consisting of the first few characters of the current second parameter. Everything before the first instance of a space + this string in the page name is the canonical language name, and the string + everything after the first instance of a space + the string in the page name is the current second parameter.
This workaround is potentially defeatable by new canonical language names that would contain a match for the string as originally entered, so it's probably best not implemented in {{poscatboiler}}
itself, but in a substitutable fill-in template. I have a working proof-of-concept at {{pcbez}}
, but I don't understand substitution and/or templates in general well enough to make it substitutable without a lot of clueless trial and error. Can someone do that for me?
Thanks! Chuck Entz (talk) 21:13, 1 January 2015 (UTC)
{{suffixcat}}
and company into {{poscatboiler}}
. I believe that it's beneficial to have less templates, so that users don't have to remember which one does which. —CodeCat 21:18, 1 January 2015 (UTC)
{{poscatboiler}}
itself, and may someday cause problems with converting other templates to {{poscatboiler}}
, but I'm talking about the real short term here: I suspect that all the specific examples I gave here could be implemented in an hour by someone who really knows what they're doing (troubleshooting could drag that out much longer, of course). Since no one currently uses these templates without parameters, there's no problem with backwards-compatibility: you can either ignore any positional parameters, or you can use them instead of the pagename-based ones if they're present (the latter is probably better, just to be safe- see the problem with "sms:" in the {{topic cat}}
section, above). Chuck Entz (talk) 22:05, 1 January 2015 (UTC){{poscatboiler}}
now (see the edits I made to Module:category tree and Module:category tree/poscatboiler). If you leave out the label, it will try to extract it from the page name. If the category doesn't begin with the specified language, or if the autodetected label doesn't exist, it shows a somewhat nondescript error message, but at least the basic idea works. —CodeCat 14:12, 2 January 2015 (UTC)
What kind of software solutions exist and which are we using to ensure we know which of the most common words are missing? I was very surprised to discover that, for example, news stream is completely missing in this and all other dictionaries. So Wiktionary has the chance to be the first dictionary to record one of the most important and common and descriptive words of our time. I found several lists like User:Brian0918/Hotlist and User:Robert Ullmann/Missing and User:Visviva/Tracking, but these don't seem to make any kind of frequency analysis. I don't understand what to do with the red links at the beginning of the last list. --Espoo (talk) 11:32, 7 January 2015 (UTC)
How do we change the template {{hu-conjugation of}}
so that it isn't in Category:Hungarian verb forms but puts verb forms there? --Lo Ximiendo (talk) 15:02, 7 January 2015 (UTC)
{{hu-conjugation of}}
got rewritten into Lua. --Lo Ximiendo (talk) 15:16, 7 January 2015 (UTC)
{{hu-conjugation of}}
add entries to the category. The category is already added by the headword template. —CodeCat 15:24, 7 January 2015 (UTC)
{{hu-conjugation of}}
was created several years ago before the current categorization direction. There is no need to recreate the template in Lua. It could be replaced by {{inflection of}}
applying the parameters needed for Hungarian. For example: vadászok
{{hu-conjugation of|vadászik|1|s|indic|pres|indef}}
would become{{inflection of|vadászik||1|s|indicative|pres|indefinite}}
. --Panda10 (talk) 17:39, 7 January 2015 (UTC)
{{hu-grammar tag}}
to Module:form of/data. But there was a conflict in one case: sub
is already used to mean subjunctive, so it can't also mean sublative. Furthermore, the following tags shouldn't be added to the module, so some solution for them should be found: ban
, ben
, 1s
, 2s
, 3s
, 4s
, 5s
, 6s
, 1p
, 2p
, 3p
, 4p
, 5p
, 6p
. I'm not sure what to do with pos
and nonattr
. It also needs to be checked if any pages use {{hu-grammar tag}}
with a tag that it doesn't recognise (in which case it's shown as-is) but which {{inflection of}}
does recognise. —CodeCat 18:52, 7 January 2015 (UTC)
{{hu-inflection of}}
. This would be more complicated to replace (it is used in about 19,000 entries). In my above note I meant to replace only {{hu-conjugation of}}
with {{inflection of}}
because it is used in a little over 200 entries. --Panda10 (talk) 19:41, 7 January 2015 (UTC)
{{hu-conjugation of}}
now just calls {{inflection of}}
. You can replace it if you want. —CodeCat 22:34, 7 January 2015 (UTC)
Why do the Navajo, pinyin and romaji words need to be written in those ugly and different fonts? --Biolongvistul (talk) 13:11, 8 January 2015 (UTC)
{{l|zh|Běijīng}}
it shows up as Běijīng because it assumes that everything labeled "zh" is in Hanzi, so it uses a font that's better suited to Hanzi. I would have expected {{l|zh|Běijīng|sc=Latn}}
to force it to show up using the default Latin font, but it doesn't; it still shows up as Běijīng, which is annoying. (Interestingly, if I specify the language as "cmn" instead of "zh", Pinyin shows up using the default Latin font, even without being explicitly labeled "sc=Latn", so {{l|cmn|Běijīng}}
shows up as Běijīng.) For Navajo, I have no idea since Navajo is only written in the Latin alphabet, so the software shouldn't be assuming anything else. —Aɴɢʀ (talk) 20:59, 9 January 2015 (UTC)
There is a problem with this template where listings for the suffix don't appear in the correct alphabetical order, when there's a root word as well as a prefix. For example: repristination. Donnanz (talk) 12:36, 9 January 2015 (UTC)
{{affix|en|re-|prestine|-ation}}
? —CodeCat 21:36, 9 January 2015 (UTC)
See discussion here: http://sourceforge.net/p/kiwix/discussion/604122/thread/1eacb6d8/
Thank you.— This unsigned comment was added by 198.23.103.67 (talk) at 13:59 9 January 2015.
No comments? :( — This unsigned comment was added by 198.23.103.67 (talk) at 10:11 15 January 2015.
Would someone with a bot please be so kind as to perform null edits on all the entries in Category:Pages with module errors? With over 2100 entries from a problem that was quickly fixed a day or two ago, the real module errors are very hard to spot (see błyskać for the one I know about). Thanks! Chuck Entz (talk) 19:47, 9 January 2015 (UTC)
I would like to create a template that creates a collapsable table, replaces input strings according to a specific key and logs every single replacement. An example: The input of the template is "a, d, e". Each of these letters has two results according to the key, the alternatives being "ä, ð, 0". I would want to log the variants "ade", "aðe", "äde", "äðe", "ad", "äd", "að", "äð" in an annotated table. Basically an automated form-table like the one that can be seen at enmity.
I tried to figure it out myself through the help pages, but they all seem to be for people who already know to some extent how it all works. For example I could find no help pages on Wiktionary on how to create a table that is collapsable.
So if anyone could give me a link to a help page with the relevant formatting data and explain to me how to include a string-replacement into a template, I'd be grateful. Korn (talk) 10:08, 10 January 2015 (UTC)
ps.: I'm aware the WT-templates page has a section giving the code to make a table collapsable. But what I meant is that if you've no prior Wiki-experience, making your baby steps here is at least a bit confusing. Korn (talk) 10:31, 10 January 2015 (UTC)
I just tried to edit my userpage with a short introduction saying who I am and that I'm an admin and CU over at en.wikibooks. It seemed to have triggered a spam filter, I think. Can someone add this to my userpage?
Thanks. --Xania (talk) 23:14, 12 January 2015 (UTC)
In e.g. indogermanisch, the name of the "dialect" to which the alt forms belong (in this case the qualifier is not a dialect but an explanation that the alt forms are abbreviations) should be in parentheses, like it would be if the old method of formatting were used. - -sche (discuss) 01:46, 13 January 2015 (UTC)
Hello,
I'm new to the Mediawiki markup language. I've already read several help pages about templates.
According to m:Help:Parameter_default, {{{p|q}}} outputs "p", if "p" is defined. Otherwise "q".
I still don't get a particular syntax:
{{{a|{{{b|c}}}}}} gives c {{{{{{a|b}}}|c}}} gives c - parameter b is undefined
Can someone explain to me, why the result is "c"? In both examples, "a" might be defined?
{{{a|{{{b|c}}}}}}is interpreted as
{{{{{{a|b}}}|c}}}is interpreted as
I'm also struggeling with an expression like this one:
{{#if: {{{A|{{{B|{{{C|}}}}}}}}} | XXX | YYY }}
If I understand the syntax correctly, the output will be "XXX" if any one the three variables "A", "B" or "C" is defined. Otherwise (if none of these variables are defined) "YYY" will be the output. Is my assumption correct?
A=
.) So you can force it to output a negative result, even if B and C are defined.{{{A|{{{B|{{{C|}}}}}}}}}with no parameters defined, the result will be {{{C}}}, which is, C is not defined, so the result is
{{{C}}}
, which is a valid string, which is what is passed back as the result. This is not an empty string, so it evaluates as true, and the result of {{#if: {{{A|{{{B|{{{C}}}}}}}}} | XXX | YYY }}(that is, with no default for the C parameter) will be "XXX"
{{{a|{{{b|c}}}}}}works the same way
{{{{{{a|b}}}|c}}}does?
{{#if: {{{A|{{{B|{{{C|}}}}}}}}} | XXX | YYY }}
{{{A|{{{B|{{{C|}}}}}}}}}will be evaluated to
{{{C}}}and that is not an empty string, independent of whether C is defined or not. Thus "XXX" will follow.
{{{x|y}}}
means if x is defined, return "{{{x}}}
", otherwise, return "y
". So look at the evaluation step-by-step:
{{{A|{{{B|{{{C|}}}}}}}}}
" is what we start with. Is parameter A defined?
{{{A}}}
".{{{B|{{{C|}}}}}}
". Is parameter B defined?
{{{B}}}
".{{{C|}}}
". Is parameter C defined?
{{{C}}}
".
" (empty string).{{{C}}}
evaluates to the contents of the parameter C, if it is defined. If it is not defined, then it evaluates to the string "{{{C}}}". The {{{C|a}}}
says that if C is not defined, then return the string "a". {{{C|}}}
says that if C is not defined, then return the empty string. I was contrasting the behaviour of {{{C|}}}
and {{{C}}}
. With the alternate value of the empty string, the whole construction will return the empty string if none of the parameters are defined, and the empty string has the truth value "false". Without that pipe character, if none of the parameters are defined, it will return the string "{{{C}}}", which has the truth value "true".{{{a|{{{b|c}}}}}}
and {{{{{{a|b}}}|c}}}
do not work the same. If a is defined, then the first will return the value of a. Else if b is defined it will return the value of b, else it will return the string "c". The second will first evaluate whether a is defined, and return either that value or the string "b", and then use that string (either a or "b") and see whether that is a defined parameter. If it is, it will return the contents of that parameter, otherwise it will return "c". So if the contents of a is "z", it will evaluate {{{a|b}}}
, which will return "z", which then becomes {{{z|c}}}
. If parameter z is defined, then that value will be returned.{{{C|}}}
and {{{C}}}
. I should have asked here earlier, instead of guessing for 2 weeks straight =) Citronas (talk) 10:10, 14 January 2015 (UTC)Template:eo-head seems to be displaying noun and adjective inflections slightly differently: for adjectives, it presents them in the order "plural, accusative singular, accusative plural" (e.g., aĝa), whereas for nouns, it presents them in the order "accusative singular, plural, accusative plural" (e.g., ŝnuro). I think it is desirable to have the inflections presented in the same order for both adjectives and nouns, so I would appreciate it if someone could change Template:eo-head so that it uses the order "plural, accusative singular, accusative plural" for both parts of speech. This is the order that Template:eo-noun and Template:eo-adj both use. Thank you! —Mr. Granger (talk • contribs) 00:02, 16 January 2015 (UTC)
The parameter id=, used in {{m}}
and {{l}}
for linking to {{senseid}}
-generated targets, has stopped working. See for example in the etymology of भाति (bhāti). Please fix it. --Vahag (talk) 11:25, 18 January 2015 (UTC)
id=
wasn't broken, it was just being ignored for Appendix pages because we don't need to link to language sections. But we do need to be able to link to ids, so I changed that now. —CodeCat 12:37, 18 January 2015 (UTC)
The above category is suddenly collecting suffixes, proper nouns, pronouns and noun forms. Any idea what may be the cause? Thanks. --Panda10 (talk) 19:28, 19 January 2015 (UTC)
In அஃகம், you can see that a couple letters (namely ஃக) aren't transliterated. I presume this should be addressed. - -sche (discuss) 20:26, 19 January 2015 (UTC)
Is it possible to acquire a list of words wanted in a given language? That is, pages with a redlink encased in a template such as {{m|xyz|word}} leading to them?
As far as I can tell this is not possible within MediaWiki software, but it sounds like information extractable from a database dump perhaps. --Tropylium (talk) 14:38, 20 January 2015 (UTC)
{{l}}
, {{m}}
, {{term}}
as all of these have a language parameter, position 1 for {{l}}
and {{m}}
, lang= for {{term}}
. How are your skills with regular expressions? DCDuring TALK 15:50, 20 January 2015 (UTC)
r"{{(?:l|m)(?:\|.*?=.*?)*(?:\|(LANGCODE))(?:\|.*?=.*?)*(?:\|(WORD))(?:\|.*?=.*?)*}}" #gives two groups: langcode and word.
from r"{{term(.*?)(\|lang=(LANGCODE))(.*?)}}" into r"l|\3\1\2\4"
{{taxlink}}
(on 11K pages). It runs in less than 30 seconds, but virtually all instances are red links, so it doesn't have to compare the list of all terms enclosed in {{l}}
(on 362K pages) and {{m}}
(on 42K pages) with a list of all headwords, let alone a list of all entries in a given language. In addition all terms enclosed in {{taxlink}}
are Translingual lemmas.{{l/es}}
(223K pages) (Compare {{l|es}}
(10K pages).) that enclose words from only a single language. IOW, it would be easy to generate, for example, l|es-, m|es-, and l/es- linked words in Spanish. I think they are supposed to all be lemmas. Subtracting members of Category:Spanish lemmas shouldn't be too hard. DCDuring TALK 19:47, 20 January 2015 (UTC)
{{l}}
and {{m}}
(and l/XX templates) are lemmas. There are all sorts of times when nonlemma forms might find themselves inside those templates. —Aɴɢʀ (talk) 20:39, 20 January 2015 (UTC)
This module error wasn't here a couple of days ago, but the entry's edit history doesn't show anything since January 11, and when I look at "Templates used in this section:" for the section that has the error, none of the templates listed has any edits in the past week:
Can anyone explain where this module error came from? It looks like it materialized out of thin air. Has there been a system change that might explain this? Chuck Entz (talk) 03:47, 21 January 2015 (UTC)
find_kana
in Module:ja-headword tries to find from the arguments a pure kana parameter, and it fails to detect one if the fullstop "。" is included. Wyang (talk) 05:59, 21 January 2015 (UTC)
Is this and this, i.e. the use of a language code as the lang= parameter of {{borrowing}}
or as the second parameter of {{etyl}}
that doesn't correspond to the L2 header, something a bot could check for periodically? It doesn't always need to be cleaned up to the language code that corresponds to the L2; sometimes it needs to be switched to use "-", as here. - -sche (discuss) 17:58, 21 January 2015 (UTC)
{{compound}}
and related templates with nocat=. Those should really be excluded. —CodeCat 01:38, 22 January 2015 (UTC)
{{borrowing}}
works. I think it should work the same way as {{etyl}}
. — Ungoliant (falai) 02:36, 22 January 2015 (UTC)
{{unk.}}
to work the same way as {{etyl}}
, too. - -sche (discuss) 02:38, 22 January 2015 (UTC){{rfe}}
should have multiple parameters, for the probable language of the etymology (if you knew it was Latin) as well as the requesting entry language. DTLHS (talk) 02:45, 22 January 2015 (UTC){{rfelite}}
should also be included, though it has only about 80 transclusions so far. DCDuring TALK 02:53, 22 January 2015 (UTC)Since I don't expect that page is monitored that well, I'm posting here requesting that someone take a look at my request on Wiktionary talk:AutoWikiBrowser/CheckPage#Technical 13. Thank you. Technical 13 (talk) 20:59, 21 January 2015 (UTC)
Is it possible to search for partial string with regular expressions or something else? Currently, I need to find all Russian words with Cyrillic "-вств-" in them (e.g. чу́вство (čúvstvo) to fix a pronunciation rule in Module:ru-pron. I have mistakenly defined the rule with a silent first "в", as in чу́вство (čúvstvo), здра́вствуйте (zdrávstvujte) but there are cases when it's pronounced, I forgot what those words are! One example is де́вственница (dévstvennica).
I think the advanced search functionality would be useful in various case, e.g. when looking for words having the same stem or suffix, etc. --Anatoli T. (обсудить/вклад) 22:56, 21 January 2015 (UTC)
Why does {{alter}}
and/or Module:Alternative forms display forms in a font different from the default font? How do we fix that? —Aɴɢʀ (talk) 12:56, 25 January 2015 (UTC)
local sc = args or "polytonic"
? — Keφr 18:06, 25 January 2015 (UTC)
{{alter}}
anyway? {{l}}
paired with {{qualifier}}
works well enough for me. — Keφr 18:42, 25 January 2015 (UTC)
I've never really dabbled in the automation side of Wiktionary, so I thought I'd ask here: is it possible, using AWB or a bot, to go through the German parts-of-speech categories and remove categories like Category:German nouns/Category:German adjectives etc from pages that already has template:head or one of its descendants? The problem is that template:head automatically parses German words with special characters in order to correctly alphabetise them in dictionary order (so it puts gären between garen and garnieren). However, putting a lemma category on the page then overrides this and causes the default sort to take precendence, which puts non-ASCII characters after ASCII (which means gären gets sorted after gustieren, between gähnen and gönnen). Simply removing the category where it's unnecessary would ensure that terms including special characters get correctly sorted. I've corrected a few entries by hand (eg. , ) but it's hard to find these improperly categorised pages manually when they don't start with an umlaut.
Presumably other languages have this problem too. Category:Spanish adjectives has ñango (which is only categorised through template:es-adj) next to namibio, but ñoño (which is explicitly categorized) is sorted next to zurdo. I'm using German as an example solely because that's a language with collation rules I know fairly well. Smurrayinchester (talk) 09:39, 26 January 2015 (UTC)
{{catlangcode|de|Blah}}
instead of bare ]
for the same reason: {{catlangcode}}
uses smart sorting, and the bare Category: code doesn't. —Aɴɢʀ (talk) 20:19, 26 January 2015 (UTC)