Old discussions have been archived to Wiktionary talk:Todo/archive.
A great many misspellings occur in our entries, even in headers. See Wiktionary:Todo/Misspellings, and add to it. Then, we can periodically search for and eliminate instances of the listed misspellings. - -sche (discuss) 05:40, 2 March 2014 (UTC)
Stray spaces appear in a number of predictable/easily-findable circumstances, such as this. - -sche (discuss) 06:55, 13 July 2014 (UTC)
According to TemplateTiger, this was the only example of someone using {{en-proper noun}}
and starting with pl2= rather than with the unnamed first parameter. It might be fruitful to check if other templates have been used in the same way, i.e. with a second plural form ("pl2=") declared prior to any first plural was declared, particularly if (as here) no first plural is automatically displayed, and/or pl2= is set to whatever the automatically displayed plural would be. - -sche (discuss) 19:23, 1 August 2014 (UTC)
TemplateTiger is a good tool for finding which entries use certain parameters of a given template, regardless of whether or not the template supports those parameters. Among other things, this can allow one to find misspelt or mistaken parameters, like the "compound=" or "current=" parameters formerly used in save-all and barrel roll, or the following misspellings of "head=" : "head]", "haed", "hwad", "heead". - -sche (discuss) 20:35, 1 August 2014 (UTC)
Doremítzwr seems to have used middle dots as decimal points(??) in some entries, e.g. the depth measurement here. These should be located and cleaned up. - -sche (discuss) 03:59, 20 August 2014 (UTC)
If someone could examine the displayed text of pages (as opposed to the wikitext) and look for instances of {{, }, }], or ]], that would probably be informative. I imagine most occurrences of such strings are the result of mismatched brackets or bot-errors breaking templates across lines. - -sche (discuss) 20:11, 27 August 2014 (UTC)
{{circa}}
A bot could check for an remove commas after {{circa}}
(which itself adds a comma, making an additional comma superfluous), like so. - -sche (discuss) 19:43, 10 June 2015 (UTC)
Like this. - -sche (discuss) 21:47, 20 June 2015 (UTC)
I've noticed several entries like this one, where the infinitive (not the first-person form) of a Latin word is given, but it is glossed as a first-person form. This is obviously incorrect regardless of whether one prefers to lemmatize infinitives or first-person forms. - -sche (discuss) 02:50, 29 June 2015 (UTC)
Should be found and templatized like . I will try to do this myself. - -sche (discuss) 02:19, 7 July 2015 (UTC)
For example, . Some are valid (Æsir) but most are not. - -sche (discuss) 05:37, 30 July 2015 (UTC)
insource:"lb en informal proscribed"
, insource:"lb en proscribed informal"
etc catch them. - -sche (discuss) 20:42, 4 February 2018 (UTC)Maybe this is already covered here, e.g. Special:Diff/48477724. – Jberkel 21:39, 4 February 2018 (UTC)
Possible task: find pages in Category:English terms with multiple etymologies that don't have multiple etymology sections (checking for pages that don't contain "=Etymology 2=" seems like one obvious way of doing that), which should probably be removed. In the other direction, look for pages that do have "=Etymology 2=" within an English section and aren't in this category yet. - -sche (discuss) 06:42, 30 May 2018 (UTC)
...should be combined like this. I will try to search for instances of this myself later. - -sche (discuss) 21:56, 15 June 2018 (UTC)
...like this, should be cleaned up. - -sche (discuss) 00:14, 27 January 2020 (UTC)
As here. (Should try to catch these systematically.) - -sche (discuss) 18:33, 29 September 2020 (UTC)
Many entries with the {{lb|en|religion}}
label are specific to Christianity (or rarely to another religion such as Buddhism) and should use the more specific label instead, for example "use". (Several other entries should not use the label at all, like Jew or Calvinist.) - -sche (discuss) 10:55, 26 November 2020 (UTC)
The following discussion has been moved from Wiktionary:Requests for cleanup (permalink).
This discussion is no longer live and is left here as an archive. Please do not modify this conversation, but feel free to discuss its conclusions.
I just listed this as a WT:TODO task because I expect it'll keep being an issue even after we fix the existing cases, but: numerous entries in "Terms borrowed from Proto-Foo" categories (like Category:Terms borrowed from Proto-Slavic) were not actually "borrowed" by the L2 language in the way we use the word; see e.g. here. (Surprisingly, one English word apparently was borrowed from Proto-Indo-European, ghrelin.) - -sche (discuss) 08:37, 11 June 2018 (UTC)
https://en.wiktionary.orghttps://en.wiktionary.org/w/index.php?title=hatchet_man&diff=66526741&oldid=66526489 —Fish bowl (talk) 09:58, 28 April 2022 (UTC)
There is a longstanding issue of how to handle group names like "the Abenaki", "the Venda", etc. Many are listed as plural-only using the template above, but there are in fact cites of the singulars ("a Venda") and of the plurals ("Vendas"), so my impression is that these are supposed to be recast as singulars which can have either regular plurals (Vendas) or invariant plurals (Venda). I spy quite a few of these at Special:WhatLinksHere/Template:en-plural_noun. - -sche (discuss) 21:37, 13 August 2022 (UTC)
I came across one entry, and did a database dump search and found three more entries, which 'manually' wrote quotation marks inside T:qfliteral, which itself adds quotation marks, resulting in ""double""; this might be worth checking for once a year or something. είναι κινέζικα για μένα, αυτά μου φαίνονται κινέζικα, εντελώς αβέβαιο, durante beneplacito. - -sche (discuss) 06:48, 3 January 2024 (UTC)
Spitballing: a recurring issue is that when an entry has multiple etymology sections, people only look at the first one and, if not seeing the sense they seek, add it there or add it as a new etymology section (without noticing it is already present in another etymology section). Examples that I can find offhand are the cases de-duplicated in diff of e and diff of linn. I wonder if we could make a list of entries (in a given language: say we start with English) that have multiple etymology sections, and then winnow it to only cases where the definitions in ety 1 and 2 have "important" words in common, for example by 1. retaining cases where definitions had any words in common other than words on a list of "unimportant" words like "the", "of", "and", "for", "from", etc (and also excluding where one of the definitions was a non-gloss like "past tense of foo"), and 2. looking at the results and expanding the list of "unimportant" words, thus progressively winnowing the list of entries that have "important" words in common until it's a manageable size to put the "sets of definitions with words in common" on a page and let a human look them over and spot duplicates. Not saying this is a priority, and not sure if it's feasible, but I'm mentioning the idea. - -sche (discuss) 17:53, 2 May 2024 (UTC)