Wiktionary:Beer parlour/2024/September


Citations talk namespace

The citations talk namespace has only two pages: Citations talk:Frühgeburtskernikteri and Citations talk:Tai Ji Men. Maybe the namespace can be deleted with any discussions going to the entry's main talk page. Ioaxxere (talk) 05:56, 1 September 2024 (UTC)Reply

I don't think this is possible, is it? Doesn't every namespace need an equivalent talk namespace? Theknightwho (talk) 13:02, 1 September 2024 (UTC)Reply
@Theknightwho: If it isn't possible, we can discourage it by creating an edit filter and hiding the portlet link when you're at a Citations page (this is already the case). A bot could also be created to move all existing content from Citations talk to Citations on a regular basis as well. If we decide not to do this then the link should not be getting hidden in my opinion. Ioaxxere (talk) 15:21, 1 September 2024 (UTC)Reply
Whether or not it's possible, the RFD discussions mentioned on those pages should definitely go to the talk pages, as per our normal practice. They shouldn't be on a Citations talk page that most users won't know about, let alone think to look at. Andrew Sheedy (talk) 21:44, 1 September 2024 (UTC)Reply
It looks like we had an edit filter that stopped Citations talk pages from being created, but I don't know why it was turned off. Theknightwho (talk) 00:32, 2 September 2024 (UTC)Reply
Interesting: Erutuon turned it off on 17 August 2020, saying "Added entry in MediaWiki:Titleblacklist, so this filter is unnecessary." But the two pages mentioned above were both created after that, so either something went wrong with the title blacklist entry, or the issue is that both Citations-talk pages were created by sysops, who may have the rights to defy the blacklist (and because they were making the edit via a gadget, may not even have realized they were doing so). I see a few possible solutions, including editing aWa to automatically change "Citations talk:" to "Talk:", or re-enabling the edit filter (which would mean attempts to archive a discussion with aWa fail and require the user to manually go back and fetch the content and manually archive it, which is a faff...). - -sche (discuss) 06:04, 2 September 2024 (UTC)Reply
@-sche: Yes, it looks like those entries were created by admins using aWa, so that gadget should probably be set to archive things under the main talk page. Courtesy pinging @Erutuon, who also might be interested to work on this. Ioaxxere (talk) 06:57, 2 September 2024 (UTC)Reply
Tangentially related — today I rewrote MediaWiki:Gadget-DocTabs.js and in the process encountered two more edge cases that are currently not handled correctly: template documentation talk (e.g. Template talk:pi-alt/documentation) and module documentation talk (e.g. Module talk:inc-pra-Brah-translit/documentation). I'm mentioning this here because the issue is caused by the same things that @-sche brought up above. Ioaxxere (talk) 06:57, 2 September 2024 (UTC)Reply
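As a rough illustration of the aWa change suggested above (archiving under the main talk page instead of "Citations talk:"), the title rewrite could look something like the following. This is only a sketch in Python, not the gadget's actual JavaScript; the function name `archive_talk_title` is hypothetical.

```python
def archive_talk_title(title: str) -> str:
    """Map a "Citations talk:" title to the main "Talk:" title.

    Hypothetical helper: it only models the prefix swap discussed above
    and leaves every other title untouched.
    """
    prefix = "Citations talk:"
    if title.startswith(prefix):
        return "Talk:" + title[len(prefix):]
    return title


print(archive_talk_title("Citations talk:Tai Ji Men"))
```

Titles in other namespaces pass through unchanged, so the same call could be applied to every archive target.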

Superstition category

I propose we add a Superstition related-to category placed under Folklore (as suggested by @Qwertygiy and @Theknightwho on Discord). Vininn126 (talk) 12:26, 1 September 2024 (UTC)

Interwiki

See also: #What should I do

May I create such interwiki links? ПростаРечь (talk) 12:26, 1 September 2024 (UTC)

I have no problem with this. Theknightwho (talk) 22:07, 5 September 2024 (UTC)Reply

@PUC May I hear your opinion? ПростаРечь (talk) 06:24, 3 September 2024 (UTC)Reply

Unnamed parameters in etymology templates

In etymology templates such as {{bor}}, {{inh}} and all the others, I think it would be convenient if the unnamed parameters (except the first, which is the lang code) were used for additional terms. This would avoid typing out, for example, {{inh|FOO|BAR|TERM1}}, {{m|BAR|TERM2}}, {{m|BAR|TERM3}}, which would be replaced by {{inh|FOO|BAR|TERM1|TERM2|TERM3}} (and similarly for {{cog}}, {{der}}, etc.).

  • This is similar to how the templates: {{alt}} and {{desc}} function.
  • In this proposed pattern, the meaning of the word(s) would strictly be entered using |tN= (t1, t2, t3, ... correspond to TERM1, TERM2, TERM3; for convenience, |t= will be the same as |t1=).
  • Similarly, the term(s) to be displayed, if different from the term linked, would strictly be entered using |altN=.
  • Example: {{cog|FOO|TERM1||MEANING1}}, {{m|FOO|TERM2||MEANING2}} would become {{cog|FOO|TERM1|t1=MEANING1|TERM2|t2=MEANING2}}.
  • This would be especially helpful when using typing aids, since subst:chars would only have to be typed once.
  • For more convenience, we can possibly change |altN= to |aN=, |dN= (display), or |sN= (show) thus minimizing the slight extra effort that will be needed if this proposal is to be implemented.
  • This proposal can also be extended to other linking templates like {{m}} and {{l}} if there is consensus.

This was raised some time ago on WT:Discord as well, but the discussion died down without follow-up. Svartava (talk) 17:52, 1 September 2024 (UTC)
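To make the proposal concrete, here is a minimal sketch of how the unnamed and |tN=/|altN= parameters might be grouped into per-term records. It is written in Python for illustration only (the real templates are backed by Lua modules), and the names `group_terms` and `lang_codes` are hypothetical.

```python
import re

# A named parameter looks like "t1=meaning" or "alt2=display"; a bare "t=" means "t1=".
NAMED = re.compile(r"^([A-Za-z]+?)(\d*)=(.*)$", re.DOTALL)


def group_terms(args, lang_codes=1):
    """Group template arguments into one record per linked term.

    `args` is the list of parameters after the template name;
    `lang_codes` is how many leading unnamed parameters are language
    codes (1 for a cog-style call, 2 for an inh-style call).
    Assumption: terms themselves never contain "=".
    """
    positional, named = [], {}
    for arg in args:
        match = NAMED.match(arg)
        if match:
            key, index, value = match.groups()
            named[(key, int(index) if index else 1)] = value
        else:
            positional.append(arg)
    return [
        {"term": term, "alt": named.get(("alt", i)), "t": named.get(("t", i))}
        for i, term in enumerate(positional[lang_codes:], start=1)
    ]


print(group_terms(["FOO", "TERM1", "t1=MEANING1", "TERM2", "t2=MEANING2"]))
```

Named parameters other than |tN= and |altN= are collected but ignored here, so the order in which they are interleaved with the terms does not matter.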

I don't recall often having to list a word as being inherited from multiple lemmas per ancestor, or having multiple cognates in a single related language. What are some example entries where this would be useful?--Urszag (talk) 18:45, 1 September 2024 (UTC)Reply
I get that this varies with languages/families or editors, but I very often have to do this, e.g. 𑀮𑁄𑀡𑀺𑀬, 𑘄𑘡𑘿𑘮𑘰𑘯𑘰, इंदूर etc. When the ancestor or cognate has alternative forms, I always like to mention (as well as read) them whenever relevant. Svartava (talk) 18:58, 1 September 2024 (UTC)Reply
@Svartava It would be better to use // syntax (e.g. {{m|en|foo//bar}} gives foobar), but we still need to work out how to handle transliterations with that format, since some languages (e.g. Chinese) use slashes but only have one translit. Theknightwho (talk) 00:01, 2 September 2024 (UTC)
@Theknightwho: So in that case, would t1 and alt1 correspond to foo and t2 and alt2 to bar? Alternatively, this might also be workable: {{m|LANG|foo<t:meaning><tr:xlit>}} along with the original proposal, similar to {{desc}}. Svartava (talk) 03:43, 2 September 2024 (UTC)Reply
@Svartava I initiated the change to {{desc}} to the syntax you're proposing, but I'm not currently convinced this is needed for {{bor}}, {{inh}}, etc. since as User:Urszag noted it doesn't seem so common to have multiple ancestral lemmas of a given term in the same language. If we add support for this I'd propose allowing for comma-separated lemmas in a single param with inline modifiers, e.g. your example of इंदूर (indūr) would use something like {{inh+|mr|pra|*𑀇𑀁𑀤𑁅𑀭,*𑀇𑀁𑀤𑀯𑀼𑀭,𑀇𑀁𑀤𑀧𑀼𑀭}}. This syntax is already supported for all form-of templates; to allow for embedded commas in a lemma, the comma is only recognized as a separator if not followed by a space and not preceded by a backslash. Benwing2 (talk) 03:45, 5 September 2024 (UTC)Reply
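The separator rule described above (a comma splits lemmas only if it is not followed by a space and not preceded by a backslash) can be sketched with a single regular expression. This is an illustrative Python version, not the module code behind the form-of templates; the name `split_lemmas` is hypothetical, and removing the escaping backslash afterwards is an assumption.

```python
import re

# A comma is a separator only if not preceded by a backslash
# and not followed by a space.
SEPARATOR = re.compile(r"(?<!\\),(?! )")


def split_lemmas(param: str) -> list[str]:
    """Split a comma-separated lemma parameter using the rule above."""
    parts = SEPARATOR.split(param)
    # Assumption: an escaped comma "\," should end up as a plain comma.
    return [part.replace("\\,", ",") for part in parts]


print(split_lemmas("foo,bar, baz,qux"))
```

With this rule a lemma such as "bar, baz" survives intact, because its comma is followed by a space.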
@Benwing2 That looks like a nice idea. How do we provide the meanings and alternate display in the form-of templates? I don't see it by |t=T1,T2 or |t1=T1 and |t2=T2? Svartava (talk) 03:53, 5 September 2024 (UTC)Reply
@Svartava Use inline modifiers. Benwing2 (talk) 03:54, 5 September 2024 (UTC)Reply
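For readers unfamiliar with inline modifiers, the form mentioned earlier in this thread, foo<t:meaning><tr:xlit>, attaches the gloss and transliteration to the term itself. A simplified Python sketch of how such a string could be taken apart follows; it assumes modifiers are never nested and always follow the term, which the real implementation may not require, and `parse_inline_modifiers` is a hypothetical name.

```python
import re

# One modifier looks like "<key:value>"; nesting is not handled in this sketch.
MODIFIER = re.compile(r"<([a-z]+):([^<>]*)>")


def parse_inline_modifiers(text: str):
    """Return (term, modifiers) for a string like "foo<t:meaning><tr:xlit>"."""
    first = text.find("<")
    if first == -1:
        return text, {}
    return text[:first], dict(MODIFIER.findall(text[first:]))


print(parse_inline_modifiers("foo<t:meaning><tr:xlit>"))
```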
Personally I would support the original proposal. To me it seems tidiest and given how {{desc}} was made to work I think it'd be nice if all templates would work the same for the sake of consistency. I don't think this is rare by any means, I've had to write this a number of times and a quick search insource:/\{(der|inh|bor|cog)+\}\}, \{\{m\|/ reveals 35k hits throughout the project. However I'd also be happy with Benwing's solution. It does feel like extra syntax to be aware of, but it seems a good compromise with editors who'd prefer to continue using unnamed parameters for |alt= and |t=. As a separate note, I also liked the idea of shortening |alt= to something like |d=, all things aside. Catonif (talk) 10:46, 10 September 2024 (UTC)Reply
While the possibility of specifying multiple parents in one template is entirely agreeable, I don’t think that unnamed parameters are the way to go. Upending the elegant, laconic and ubiquitous link-display-translation syntax in order to optimise this relatively uncommon situation is not worth it, and any other solution, whether parameter- or comma-based, is preferable. ―⁠Biolongvistul (talk) 12:43, 10 September 2024 (UTC)Reply
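For anyone wanting to reproduce the kind of count Catonif mentions on a local dump, the pattern "etymology template immediately followed by a comma and {{m|" can be approximated as below. The regular expression here is my own approximation of that idea, not the exact search query quoted above, and `count_chained_links` is a hypothetical name.

```python
import re

# An etymology template (der/inh/bor/cog, optionally with "+"),
# closed and directly followed by ", {{m|".
CHAINED = re.compile(r"\{\{(?:der|inh|bor|cog)\+?\|[^{}]*\}\}, \{\{m\|")


def count_chained_links(wikitext: str) -> int:
    """Count etymology templates that are followed by a chained {{m|...}} link."""
    return len(CHAINED.findall(wikitext))


sample = "From {{inh|mr|pra|TERM1}}, {{m|pra|TERM2}}, {{m|pra|TERM3}}."
print(count_chained_links(sample))
```

Each chain is counted once, at the etymology template, regardless of how many {{m}} links follow it.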

Creating Wiktionary Pages for Generation Z Slang

Hello,

Today I read a w:List of Generation Z slang on Wikipedia, and I created audio for a couple of slang terms. I did create audio on Lingua Libre for two slang terms that don't have English Wiktionary pages yet. Is Wikipedia's list of Generation Z slang sufficient evidence these terms exist, or do I need to further attest their existence by digging up quotes on the Internet?

bouncing on it:(file)
big yikes:(file)

Thank you Flame, not lame (talk) 00:14, 2 September 2024 (UTC)Reply

We do have a page about that actually: Appendix:Gen Z slang and there is a relevant discussion about deleting it which discusses some of the issues you mention. —Justin (koavf)TCM 00:29, 2 September 2024 (UTC)Reply
I'll keep this question in Beer Parlour for one 'tis popularer, and the page you linked is not likely to get noticed. Flame, not lame (talk) 00:44, 2 September 2024 (UTC)
Okay, so attestation requirements are the same for Gen Z slang as they are for any other words, so citing Wikipedia itself is not sufficient. Unfortunately, the nature of youth slang makes it so that you're less liable to find durable citations, but the good news is that requirements for what can be in the appendix namespace are much more lax (and not really codified), so if you wanted to add content there, it would be appreciated. —Justin (koavf)TCM 00:56, 2 September 2024 (UTC)Reply

(ab)use and other unusual portmanteau-like terms

In my recent research into various potential sources/concordances, I have stumbled across a class of terms that, as best I can tell, has no linguistic name (or at least those I have brought this up with have also failed to recall one), and for which, to my understanding, we currently have only one example, albeit a partial one in my opinion: (s)he. The terms are as follows:

What is unusual about these is that, unlike a traditional portmanteau, they don't blend meanings to form a new meaning, and basically rely on the sum of their parts. None of these terms can be "synonymized" the way (s)he can be (with the singular they); of terms that can, I found two other examples:

These are also notable because, unlike the above, they are not a simple bracketing of a prefix.


I will also note two other examples of technically attestable terms of the same class that I found. I am unsure whether their attestations would actually be accepted, but that is outside the scope of the topic I'm broaching here:

  • (g)old - gold + old (I understand this as coming from the phrase "old but gold", probably "synonymizable" with adjective sense 2 of vintage)
  • (nick)name - nickname + name (even with quote context the exact nature of the use of this eludes me)

Currently I haven't been able to find any examples, but I suspect that this kind of term isn't exclusive to prefixing; take the following hypothetical term as an example of what it could look like:

Basically, I bring all this up because I'm unsure whether these are something we "can" actually document and, more so, how exactly we would go about documenting and defining them, since describing them as just another blend/portmanteau seems inapt. Akaibu (talk) 03:54, 2 September 2024 (UTC)

I'm not aware of a specific term for this either. A juxtaposition of perspectives, like the one that excarnateSojourner describes, occurs when the bracketed element is a negative or pejorative suffix. The result is often a bit cheeky; cf. (in)famous, (mis)adventures. Nicodene (talk) 04:53, 2 September 2024 (UTC)Reply
A juxtaposition of perspectives, again, isn't a requirement, as (wo)man shows, unless you go by the binary definition of gender which, yea lmao. It certainly seems like the most common though. Akaibu (talk) 05:51, 2 September 2024 (UTC)Reply
As I said: “when the bracketed element is a negative or pejorative suffix”. Nicodene (talk) 06:05, 2 September 2024 (UTC)Reply
I have also seen slashes used, as in dis/like, mis/fortune, and we do also have s/he and Latino/a. My initial reaction is that I'm not sure it makes sense to include just any term in this class, though, like (mis)fortune, mis/fortune, (wo)man, or e.g. person(s), which all seem relatively SOP—maybe we want to only include ones where the meaning is not obvious (maybe (g)old?) or where there are THUB translations? I don't know... I'm aware we include unspaced, unpunctuated single words regardless of whether they're SOP, but when punctuation clearly tells the reader what the 'parts' are that they need to look up, it seems like we take that into account, since we don't have e.g. North/South: we seem to rely on someone who sees North/South Dakota to know to fill in the missing part and look up North Dakota and South Dakota separately, and on the face of it, it seems like we could similarly rely on someone who sees mis/fortune or (mis)fortune or misfortune(s) to look up misfortune and fortune and misfortunes separately...? - -sche (discuss) 06:56, 2 September 2024 (UTC)Reply
@-sche I would say they aren't SOP because their use implies simultaneous meanings for a single object. Your example of North/South Dakota is different because that would be a single term referring to multiple things; you wouldn't expect someone to say "New York (City)", because that's the city and the state, whereas with (mis)fortune and such, you're only talking about one thing (someone's fortune or misfortune). Akaibu (talk) 16:40, 4 September 2024 (UTC)
Re slashes: we also have wo/man, a form of the mentioned (wo)man. J3133 (talk) 16:55, 4 September 2024 (UTC)Reply
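The "look up both parts" reading that -sche describes for bracketed terms can be made mechanical. The Python sketch below expands a term like (mis)fortune into its two readings; it deliberately handles only a single parenthesised element, since the slash forms (s/he, Latino/a) do not follow one uniform rule, and `expand_bracketed` is a hypothetical name.

```python
import re

# Matches one parenthesised element, e.g. "(mis)" in "(mis)fortune".
BRACKET = re.compile(r"\(([^()]+)\)")


def expand_bracketed(term: str):
    """Return (reading with the bracketed part, reading without it)."""
    with_part = BRACKET.sub(r"\1", term, count=1)
    without_part = BRACKET.sub("", term, count=1)
    return with_part, without_part


for example in ["(mis)fortune", "(wo)man", "person(s)", "(s)he"]:
    print(example, expand_bracketed(example))
```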

Request for Old Parthian/Proto-Parthian

I would like to request the creation of templates, modules and codes to support Old Parthian/Proto-Parthian on Wiktionary. Is this feasible? (Also, pinging @Victar here because they might have useful insights regarding this) Antiquistik (talk) 08:37, 2 September 2024 (UTC)

To what ever end? --{{victar|talk}} 05:09, 3 September 2024 (UTC)Reply
@Victar To add the earlier (Old Iranian) forms of certain attested Parthian (Middle Iranian) lemmas. For example, if I were to create a page for Friyapat in Parthian, I would like to accompany it with an entry on the reconstructed older form, *Friyapatiš in Old/Proto-Parthian, where I can further explain its etymology.
Besides, even on present Parthian entries, the only option now available is to have the terms' further etymologies be labelled as "Old Iranian," which is very flawed when the pre-Middle Iranian forms of these terms can be reconstructed.
I did leave a message regarding this on your talk page, but it seems you might have missed it. Antiquistik (talk) 06:09, 3 September 2024 (UTC)Reply
I've been away for a few weeks.
I would write the etymology of Parthian 𐭐𐭓𐭉𐭐𐭕 (prypt /⁠Friyapāt⁠/) as, "From earlier *Friyapatiš, whence Achaemenid Elamite (pír-ri-ia-bat-ti-iš), Ancient Greek Φριαπίτης (Phriapítēs), from Proto-Iranian *FriHyápatiš, from Proto-Indo-Iranian *PriHyápatiš, from *priHyás (beloved) +‎ *pátiš (master). Cognate with Sanskrit प्रियपति (priyápati)." --{{victar|talk}} 19:09, 3 September 2024 (UTC)Reply
@Victar Isn't this alternative a tad cumbersome though? And should I use the code for Parthian itself for the earlier form? Antiquistik (talk) 02:48, 4 September 2024 (UTC)Reply
Honestly, no more cumbersome than the next. --{{victar|talk}} 07:55, 4 September 2024 (UTC)Reply
@Victar And do I use the code for Parthian itself? Or does Wiktionary have a code for unlabelled languages similar to Wikipedia's mis? Antiquistik (talk) 08:38, 4 September 2024 (UTC)Reply
The only reason for giving an earlier form to Parthian terms is if borrowings point to a more archaic form, like with 𐭐𐭓𐭉𐭐𐭕 (prypt /⁠Friyapāt⁠/). Otherwise, the inherited form should be Proto-Iranian. --{{victar|talk}} 07:12, 12 September 2024 (UTC)

Announcing the Universal Code of Conduct Coordinating Committee

Original message at wikimedia-l. You can find this message translated into additional languages on Meta-wiki. Please help translate to your language

Hello all,

The scrutineers have finished reviewing the vote and the Elections Committee have certified the results for the Universal Code of Conduct Coordinating Committee (U4C) special election.

I am pleased to announce the following individual as a regional member of the U4C, who will fulfill a term until 15 June 2026:

  • North America (USA and Canada)
    • Ajraddatz

The following seats were not filled during this special election:

  • Latin America and Caribbean
  • Central and East Europe (CEE)
  • Sub-Saharan Africa
  • South Asia
  • The four remaining Community-At-Large seats

Thank you again to everyone who participated in this process and much appreciation to the candidates for your leadership and dedication to the Wikimedia movement and community.

Over the next few weeks, the U4C will begin meeting and planning the 2024-25 year in supporting the implementation and review of the UCoC and Enforcement Guidelines. You can follow their work on Meta-Wiki.

On behalf of the U4C and the Elections Committee,

RamzyM (WMF) 14:07, 2 September 2024 (UTC)Reply

Have your say: Vote for the 2024 Board of Trustees!

Hello all,

The voting period for the 2024 Board of Trustees election is now open. There are twelve (12) candidates running for four (4) seats on the Board.

Learn more about the candidates by reading their statements and their answers to community questions.

When you are ready, go to the SecurePoll voting page to vote. The vote is open from September 3rd at 00:00 UTC to September 17th at 23:59 UTC.

To check your voter eligibility, please visit the voter eligibility page.

Best regards,

The Elections Committee and Board Selection Working Group

MediaWiki message delivery (talk) 12:15, 3 September 2024 (UTC)Reply

tautological definitions

The following subscript letters are "defined" simply as subscript letters, with no other content. For example, U+2093 LATIN SUBSCRIPT SMALL LETTER X (ₓ) is defined only as "subscript x".

Graphical descriptions are not definitions. I added an actual definition to , but the rest IMO should be deleted, unless someone wants to go through and add some content. They are:

, , , , , , , , , , , .

kwami (talk) 13:55, 3 September 2024 (UTC)Reply

Should we remove automated "more X" and "most X" from headers of English adjectives?

I'm here per the suggestion of user:Theknightwho.

There are many adjectives where we use the header {{en-adj|-}} because we haven't attested to comparative forms, and that creates the note "not comparable". But almost all of these are comparable, especially in poetry. This came up for Iapetian, where it's easy to see how it might be comparable even if there are no hits at GBooks. That took me to Japhetic, which we also claimed was not comparable, but where I found several instances of "more Japhetic" on GBooks ("some Japhetides are clearly more Japhetic than others"; "beyond the more Japhetic order of minstrelsy"), and possibly one of "most" (no preview on that one).

As Theknightwho pointed out, there are probably very few cases where an English adjective is truly not comparable, because combining an adjective with "more" and "most" is extremely productive -- it's not like countability, where some nouns truly aren't countable. I agree with Theknightwho that we probably shouldn't automatically list these forms: they don't add any information that "adjective" doesn't already. We wouldn't want to claim they exist if we can't attest to them (something like "most Iapetian" may have never been used in the history of English), yet we shouldn't make a positive claim that they can't exist either, unless we have a source to back up that claim.

For inflected comparatives (nicer, nicest), we'd want to continue to provide them in the header of course. But those are generally easy to attest.

In the few cases where that's inadequate, we can always expand on the header template, like we do at fun#Adjective, or add usage notes. kwami (talk) 01:07, 4 September 2024 (UTC)Reply

It sounds like the problem is not the more/most forms, but the text generated when someone suppresses them via {{en-adj|-}}. It sounds like what would solve the issue is to either add a ! parameter (like {{en-noun|!}} has) to {{en-adj}}, or just change the wording of {{en-adj|-}} itself to say the forms are not attested rather than that they don't exist. It doesn't seem like removing the mention of more/most forms, when they're attested, would improve anything; indeed, IMO it would make things a bit worse, because if some entries list comparative/superlative forms (as entries like nice and ugly will need to, as you say), then not listing comparatives on other entries suggests they don't grade/inflect. (Granted, we could retain some mention like "inflects the usual way", but what's the usual way? Might as well go ahead and spell out "inflects using more/most"... and then we might as well just keep listing the forms, we're not short on ink.)
We have the same problem with {{en-noun}}: most or all entries that currently use {{en-noun|-}} should properly use {{en-noun|!}}, because just like with the adjectives you mention, the issue is that the inflected forms aren't thrice-attested, not that they positively cannot exist. Comparing today's X to last century's, or this universe's X to a hypothetical mirror universe's, I can discuss how the Xs differ even if X is one of the nouns we claim are uncountable. Our whack-a-mole / cite-a-mole approach means we list e.g. "engagingness" as uncountable even though there are two cites of it. - -sche (discuss) 06:32, 4 September 2024 (UTC)Reply
There is a distinct grammatical class (POS) of mass nouns, such as water and bread, that are uncountable with their default definition. (You can say at a restaurant "we'll have 2 waters", or "all the waters of the Earth", but those are distinct definitions and can be listed separately.)
There is nothing comparable with adjectives. Whether they inflect is a matter of word length and lexicalization, not being a distinct POS. They're more like strong vs weak verbs.
We can already suppress the comment with {{en-adj|?}} or {{head|en|adj}}; the problem is that those get converted to {{en-adj|-}} (as happened to Iapetian) and then make the claim that a comparative form does not exist.
Unlike nouns and verbs in their inflections, all adjectives can form comparative phrases with 'more' and 'most', unless those would be redundant with existing inflected forms that we already mention. We don't mention which nouns can be pluralized with "some" (some water, some bread), or which verbs can be made past with did, so I don't see how not mentioning comparative forms with more, most is a problem. And do we even want to bother attesting them? But I agree that it would be better to say that such phrasing is not attested than to claim it cannot be. kwami (talk) 18:17, 4 September 2024 (UTC)Reply
At the risk of discussion creep, I feel like this is a persistent problem with virtually any instance where no information is displayed or stored: it's not clear whether there is no such value or whether the value is just kind of obvious and what you'd intuit. To use a somewhat similar example, on d:, if someone has no death date listed, that could be because he's not dead or it could be because we simply haven't added it to the database yet. Putting a death date of "no value" at least explicitly says "no death date applies here: he's still alive". I feel the same way here: I as a native English speaker would probably not think that "Japhetianer" or "Japhetianest" are words, and it's probably more likely that "more/most Japhetian" is the correct way to say this, so having a few extra words in the entry that say "the more/most form is the correct one" really causes no harm and clarifies what could be confusing or an omission. So I think we should retain "more/most" in headings. —Justin (koavf)TCM 18:49, 8 September 2024 (UTC) Actually, let me think this thru more... I'll retain this if anyone finds the direction I was headed meaningful. —Justin (koavf)TCM 18:50, 8 September 2024 (UTC)
For me, redundant info is not the problem, or at least not important enough to worry about. It's our positive claim that the more/most forms don't exist that's the issue. I've also used DonnanZ's {{en-adj|?}} solution, or else {{head|en|adj}}, but people go through and change them all to {{en-adj|-}} even after I explain to them what I've done, because a search of GBooks does not come up with any tokens of the more/most forms. If GBooks doesn't have them, that's taken as proof that they do not exist and, more importantly, cannot exist. In their opinion, the solution is to change the wording produced by {{en-adj|-}}, and in the meantime they'll continue to make false claims on Wk. kwami (talk) 20:25, 8 September 2024 (UTC)

Ambonym Attestation

Hello,

I recently created a Wiktionary page for the neologism "ambonym." The term was coined in Human1011's YouTube video. Etymology Nerd also covered this topic. How do I attest this new word? Perhaps I can use better word choices in the etymology and definition.

Thank you Flame, not lame (talk) 01:18, 4 September 2024 (UTC)Reply

I would say look it up in Google Books and Internet Archive, see if you find any work that uses the term. CitationsFreak (talk) 05:53, 4 September 2024 (UTC)Reply
I don't think the term meets our CFI. The only durably archived mention I find is a coinage of the same term but with a different meaning. And it doesn't appear widespread in online sources. Einstein2 (talk) 15:57, 4 September 2024 (UTC)Reply
I haven't thought too hard about it yet, but my initial thought is that any contranym can meet the definition of Human1011's proposed word. Am I wrong? Quercus solaris (talk) 21:33, 4 September 2024 (UTC)Reply
Not all contranyms are ambonyms, and Etymology Nerd did a pedantic essay on ambonyms, so let's go with Human1011's new word because he is a smart young man. Flame, not lame (talk) 21:35, 4 September 2024 (UTC)Reply
I just realized that yes, the ambonymy concept is about the relation of the contranym to others, not just about the contranymy alone. Quercus solaris (talk) 21:41, 4 September 2024 (UTC)Reply
If other people use it, then we can list it. We don't have a "smart young man" clause. CitationsFreak (talk) 23:32, 4 September 2024 (UTC)Reply

Arabic voiceless velar fricative notation

The Arabic voiceless velar fricative (خ) is currently rendered as ḵ on Wiktionary. I would however argue that, since this diacritic form is already used for begadkefat notation in Hebrew and Aramaic, the organic and non-begadkefat Arabic خ should instead be rendered using ḫ, just like how this phoneme is rendered for other Semitic languages like Old North Arabian, Old South Arabian and Akkadian, and for the non-Semitic but still Afroasiatic Ancient Egyptian language. Antiquistik (talk) 10:28, 4 September 2024 (UTC)Reply

@Antiquistik I oppose this. ḫ is too confusable with ḥ, the pharyngeal fricative. Benwing2 (talk) 03:37, 5 September 2024 (UTC)
@Benwing2 We already use both ḫ and ḥ for, respectively, the velar and pharyngeal fricatives, for Old North Arabian, Old South Arabian and Ancient Egyptian without this causing issues so far, don't we? Antiquistik (talk) 06:52, 5 September 2024 (UTC)
Oppose, consistently using an underline for fricatives is much more understandable. ḵ is the fricative equivalent of k, ḡ of g, ṯ of t and ḏ of d. Using diacritics in a consistent manner is preferable IMO and randomly using ḫ would be much less clear. — BABRtalk 01:39, 6 September 2024 (UTC)Reply
Agree, and I'll add that @Antiquistik's concern with "organic" vs. "begadkefat" variants across different languages runs into the same problem with ḡ, ṯ and ḏ, yet we aren't proposing replacing every one of those with some other random symbol. Benwing2 (talk) 04:43, 6 September 2024 (UTC)Reply
@Babr @Benwing2 I am sorry, but saying that I am proposing replacing ḵ with a random symbol comes across as disingenuous when ḫ is already the notation for the velar fricative in the standard Romanisation schemes for several Semitic languages like Old North Arabian, Old South Arabian and Akkadian, and it is also used as such in some Romanisation schemes for Arabic.
In fact, in my opinion, the DIN 31635 scheme, which notes the velar fricative as ḫ, is among the better transliteration schemes for Arabic. It's a German transliteration scheme, so I doubt English Wiktionary would adopt it wholesale, but it is used widely enough in English literature on Arabic that I think we should consider it.
Though I must note that the argument with regards to organic vs begadkefat was proposed by @Fay Freak in a previous discussion, Wiktionary:Beer parlour/2024/May#Arabic and Hebrew transliteration.
I have no problem with disagreements with or opposition to my proposal per se, but it is not a random choice. Antiquistik (talk) 07:31, 6 September 2024 (UTC)Reply
@Antiquistik I did not say it was a random symbol, I meant its usage would be random since it wouldn't match our otherwise consistent pattern for marking fricatives. I think using diacritics in a simple and consistent manner is desirable as it makes transliterations more easily understood by readers. — BABRtalk 18:01, 6 September 2024 (UTC)
@Babr That's fair. I suppose I should have argued for switching to the DIN 31635 transliteration scheme altogether, especially given that I think its other letters provide better consistency with the Wiktionary notations of languages with heavy loaning from Arabic, like the various registers of Hindustani. Antiquistik (talk) 12:34, 10 September 2024 (UTC)Reply
Oppose. — Fenakhay (حيطي · مساهماتي) 07:42, 6 September 2024 (UTC)Reply
I have no strong feelings for either scheme, but there are lots of casual readers and in particular native speakers without experience in Semitist tradition that do not participate in discussions but might be snubbed by ḫ. For now we at least have the advantage of there only being one diacritic variant of k and h each, ḵ and ḥ, which is as Benwing2 implies optically though not comparatively advantageous, and last but not least status quo bias and complete correspondence with the English edition of the Hans Wehr transliteration, only its rings made larger. Your, albeit excellent, centre of gravity in language comparison begs us here to “cause issues”, I am sorry. Fay Freak (talk) 16:20, 6 September 2024 (UTC)

What should I do

What should I do if my edits are reverted?
I left a message on PUC's talk page.
I didn't get a response.
I left a message here.
I didn't get a response. ПростаРечь (talk) 18:30, 4 September 2024 (UTC)Reply

  • @ПростаРечь I don't know that I can give you a full answer as I don't work in the reconstruction namespace myself, but Wiktionary generally doesn't use manually-added interwiki links, because the title of an entry in the English Wiktionary will be the same as the title of the corresponding entry on any other Wiktionary. I do see that the Russian Wiktionary does not seem to have a dedicated namespace for reconstructed terms as we do here, but the way to solve this problem (if we see it as a problem at all) would probably be to change something at Wikidata, rather than manually adding interwiki links between all reconstructed terms in all Wiktionaries. — excarnateSojourner (ta·co) 21:31, 4 September 2024 (UTC)Reply

Recategorizing quotation navigation templates by bot

We have a collection of templates such as {{Douglas Adams quotation templates}} that are used on the documentation pages of quotation templates (such as Template:RQ:Adams Hitchhiker/documentation) to list other quotation templates for works by the same author. Currently these are categorized in cat:Navigation templates (and e.g. cat:English quotation templates), but they are shoved to the front by their sort keys rather than being listed under each letter.

I'm seeking consensus to create cat:Quotation template navigation templates (or cat:Quotation navigation templates if people prefer) as a subcategory of cat:Navigation templates, and use my bot account to recategorize these templates there. — excarnateSojourner (ta·co) 00:11, 5 September 2024 (UTC)Reply

Support This, that and the other (talk) 10:11, 5 September 2024 (UTC)Reply
No objection: perhaps the name should be "Category:English quotation navigation templates" which can then be a child category of both "Category:English quotation templates" and "Category:Navigation templates". Some of our quotation navigation templates are multilingual (for example, {{Bible quotation templates}} and {{Don Quixote quotation templates}}), in which case they should also be child categories of "Category:French quotation navigation templates", "Category:Spanish quotation navigation templates", and so on. — Sgconlaw (talk) 12:15, 5 September 2024 (UTC)Reply

Request for extended mover right

I plan on enforcing the changes to the Ottoman Turkish encoding guidelines which were formalised into WT:AOTA (see disc) but never really put into practice. To do so I would need to move and merge about 300 entries, and since I would rather not leave unwanted redirects behind, nor flood CAT:D for someone else to tediously go through, I request the extended mover right so as to save both myself and others the hassle. Catonif (talk) 15:53, 5 September 2024 (UTC)Reply

Granted. More than trusted user. Vininn126 (talk) 16:38, 5 September 2024 (UTC)Reply
@Vininn126 Thank you! Catonif (talk) 17:02, 5 September 2024 (UTC)Reply

Request for autopatroller rights

I wish to request autopatroller rights because, following my collaboration with GLAAD, I plan on editing certain pages that are fully protected. I already have autopatroller rights on Commons, and I edit in good faith and make good contributions, with only minor mistakes, as I am still not a total expert on everything. Juwan (talk) 22:04, 6 September 2024 (UTC)Reply

I'm inclined to say yes, but I have a few thoughts. You are a cooperative editor and fast to learn, and your experience on Wikipedia has definitely helped you in learning the ropes. I'm hesitant, as I think there are still some fairly common ins and outs of the site you're not familiar with yet, and your edit count is just on the edge for many users (not that edit count is everything, but I feel in this particular case it's a useful metric). If no other admins have any qualms, then I think it's probably in order. I'd like to see what others say. Vininn126 (talk) 22:07, 6 September 2024 (UTC)Reply
if you have anything specific that you think that I should learn, please go ahead and mention it on my talk page or on the Discord server, not even for this particular request but in general I want to know! Juwan (talk) 22:14, 6 September 2024 (UTC)Reply
@JnpoJuwan IMO these terms need to be approached with a lot of care. I agree with @Vininn126 that it might make sense to make a list of the changes you propose and post this in the Tea Room. FWIW I am not an expert in these sorts of terms; I think User:-sche knows more. Benwing2 (talk) 03:19, 7 September 2024 (UTC)Reply
thanks for the advice! I will go ahead and do that. Juwan (talk) 13:11, 7 September 2024 (UTC)Reply

Reference bibliographies

Is there any central or per-language bibliography in Wiktionary?

The references and particularly ref templates make a treasure of sources on etymology, usage and other linguistic matters. But one usually encounters them under a random heading in this or that article. Is there a section of Wiktionary where you can find them grouped? In the language top categories I saw some "Category:<lang xx> reference templates". But untranscluded, they are completely opaque, and there is no telling the grand etymological dictionary from the note about a single word, the old from the new.

What I'm looking for (or propose) is more like Appendix:Bulgarian bibliography, which lists the major recurring sources, and possibly more works, by topic. Danny lost (talk) 22:04, 6 September 2024 (UTC)Reply

The category you mentioned is usually better. As far as transparency goes, this depends on the template, and on whether someone has written documentation for it. It's also usually important to read the foreword of the given dictionary and any reviews, which are often not given. So the answer is no, there usually is not a grand explanation of each source, unfortunately. Sometimes more details are provided on pages such as WT:About Bulgarian but not always. Vininn126 (talk) 22:09, 6 September 2024 (UTC)Reply
Actually what I asked for is found at Wiktionary:Reference templates / Wiktionary:REFT. Though it is a bit of a random selection at the moment. Danny lost (talk) 20:41, 9 September 2024 (UTC)Reply

East Frisian

Would anybody know what code we could use for East Frisian dialects that are not Saterlandic? Like Upgant, Wangerooge, Harlingerland, and Wursten? That Northern Irish Historian (talk) 00:38, 7 September 2024 (UTC)Reply

It's been a while, but if memory serves, ISO messed things up by using East Frisian for a Low Saxon lect and leaving Saterland Frisian as the only linguistically Frisian eastern lect with a code. We made do by making all the Frisian East Frisian lects part of Saterland Frisian's code. Pinging @-sche who was more directly involved and @Theknightwho who might know more about the current state of the codes. I don't think anyone was happy with the way we left it back then, so it wouldn't hurt to revisit the whole mess. Chuck Entz (talk) 02:03, 7 September 2024 (UTC)Reply
So that means using stq for these dialects? That Northern Irish Historian (talk) 15:24, 8 September 2024 (UTC)Reply

"the act of being"

There are 91 hits (enwikt entries), mostly in definitions, for this grammatically correct but IMHO semantically odd expression. Be is the ultimate stative verb, contrasting with a dynamic verb, which describes an action. "The act of being" makes for a good phrase for a poetic, spiritual, or motivational bit of writing or speech, but it seems completely out of place for definitions.

One simple change that might work for many of these definitions is to drop "The act of" and leave "Being". Another is to replace "act" with "condition" or "state" or, imitating other dictionaries, replace "act" with "condition, state, or quality" or similar.

Am I missing something? If I am not, this seems to show that we need to up our game to improve Wiktionary's most basic product, its definitions. I don't know what means there might be to prevent or cure such inappropriate expressions in definitions. Prevention may be hopeless because almost all newbies and some not-so-new-bies are convinced that it is trivial to write a definition. ("Style guide? Style guide! We don't need no stinking style guide.") Perhaps the alternative is to record common phrases that are almost always wrong for a dictionary definition, scan the dictionary periodically for such expressions, and correct them. The correction could be simple (automatic or semiautomatic) replacement of the offending expression, but probably more extensive rewording would be better. DCDuring (talk) 02:57, 7 September 2024 (UTC)Reply
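
As a rough sketch of the periodic scan suggested above, the following assumes that entries are available as a mapping from title to wikitext (for example, extracted from a dump) and that definition lines start with "#". The watchlist, the function name and the sample entries are all hypothetical; this only reports candidates for a human to review and does not attempt any automatic rewording.

```python
import re

# Hypothetical watchlist: phrase -> hint for the reviewing editor.
FLAGGED_PHRASES = {
    "the act of being": "consider 'the state of being' or 'the fact of being'",
}


def find_flagged(entries):
    """Yield (title, line_number, phrase, hint) for each flagged phrase.

    `entries` maps an entry title to its wikitext. Only definition lines
    are checked: lines starting with one or more "#" that are not usage
    examples ("#:") or quotations ("#*").
    """
    for title, wikitext in entries.items():
        for number, line in enumerate(wikitext.splitlines(), start=1):
            if not re.match(r"#+(?![:*#])", line):
                continue
            lowered = line.lower()
            for phrase, hint in FLAGGED_PHRASES.items():
                if phrase in lowered:
                    yield title, number, phrase, hint


if __name__ == "__main__":
    sample = {
        "disenthronement": "===Noun===\n# The act of being dethroned.\n",
        "cat": "===Noun===\n# A feline.\n",
    }
    for hit in find_flagged(sample):
        print(hit)
```

Reporting rather than replacing fits the caveat in the comment above that more extensive rewording is usually better than mechanical substitution.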

In most cases, "being" in this context is being used not as a stative verb, but as an auxiliary. E.g. "being dethroned" is not necessarily stative: it can refer to an action.--Urszag (talk) 03:09, 7 September 2024 (UTC)Reply
The OED, based on my quick look at it (specifically, "dis-...-ment" words), uses the wording "The act of ___ing, the fact of being ____ed." I think leaving off "The act of" could work. I don't think it's sufficient to replace "act" with "condition", "state", or "quality", as Urszag points out. "His dismemberment was swiftly accomplished" refers to an action of dismembering, of which the subject is the passive recipient. It is not a state or condition of being dismembered (though the definitions worded with "act" fail to capture the possibility of the word being used that way). That being said, note the way we actually do handle this at dismemberment with two definitions, by contrast with, say, "disenthronement," which uses "The act of being". Andrew Sheedy (talk) 03:12, 7 September 2024 (UTC)Reply

Maybe a better way of looking at this is to divide the instances of "the act of being" into two categories:

  1. those of the form "act of being " in which be is a copula.
  2. those of the form "act of (=being )" in which be is an auxiliary.

I don't think there are many other cases to distinguish.

I believe that all of the instances with be as a copula are simply wrong, as the English copula is stative.

The other case is one of the awkward and unjustified use of act when the purported actor/agent is actually a patient. DCDuring (talk) 21:39, 7 September 2024 (UTC)Reply

Icelandic útúrsnúningur (the act of being intentionally obtuse) and Hungarian foglalkozás (the act of being occupied with something) are both fine, I think. The former despite seeming to be a copulative phrase, the latter despite seeming to be a passive progressive phrase. In actual fact they're both durative non-passive actions and the word act positively contributes to the definition. On the other hand the definition of Swedish umgänge (the act of being with people) is indeed better off rephrased. It's a case-by-case question. —Caoimhin ceallach (talk) 19:37, 8 September 2024 (UTC)Reply
Not only does the English seem to be a copulative phrase, it is a copulative phrase. Being intentionally obtuse is not a "durative non-passive action" because it is not an action. If the word were defined as "pretending or intending to be obtuse" or "acting obtuse" that would be different. I am not too surprised that the problem has to be remedied on a case by case basis. As I said above: "The correction could be simple (automatic or semiautomatic) replacement of the offending expression, but probably more extensive rewording would be better". DCDuring (talk) 20:05, 8 September 2024 (UTC)Reply
What I meant was that the words I quoted denote durative non-passive action. If they are most clearly glossed by "act of being " or "act of being ," I don't see that as a big issue. "Act of pretending/intending to be obtuse" would be inaccurate and "act of acting obtuse" is tautological. —Caoimhin ceallach (talk) 21:24, 8 September 2024 (UTC)Reply
What I am saying is that "act of being" never belongs in a definition because be is always either a copula or an auxiliary with an -ed-form making a passive verb. The predicate with a copula is never an act, in any normal sense of the word act (See act#Noun.). When being is a component of a passive, the subject of the passive is not an agent, but rather a patient. Patients do not act in any normal sense of the word act. Looking at occurrences of "the act of being" in Google Books should be sufficient indication that the expression is not normal English. It occurs principally in works of metaphysics. DCDuring (talk) 01:39, 9 September 2024 (UTC)Reply
My take is that it's a way to emphasize the verb POS, since an act has to involve a verb. A number of non-European languages use stative verbs where we use adjectives- we have a relatively small number (look, seem, sound, smell, to name a few), but those aren't what we think of when verbs are mentioned. Likewise, English has lost its passive morphology, and instead uses constructions made of forms already in use for other purposes. Saying "the act of being" makes it clear that action is referred to, even if it's being done by someone or something else. Of course, both are at the expense of not making literal sense, but their practitioners aren't paying attention to that aspect. Chuck Entz (talk) 03:44, 9 September 2024 (UTC)Reply
So we can use an expression commonly used in English only in metaphysics, theology, and spiritualism in our English definitions and glosses? DCDuring (talk) 11:34, 9 September 2024 (UTC)Reply
I get your argument. I get that being per se isn't an act, nor is being tired, or being eaten alive. But being intentionally obtuse is, and being occupied with something is too, despite them being grammatically analogous to the former examples. I think the semantics of the constituent parts is more important than the formal properties of the word being. If we're talking about formal logic you may have an argument, but this is a dictionary and clarity is all that matters. —Caoimhin ceallach (talk) 12:35, 9 September 2024 (UTC)Reply
If I thought that the "act of being" definitions were clear, I wouldn't have bothered introducing this topic. I think "act" allows for whatever modest active role a patient may have, but at the expense of muddying the basic definition.
I'm not in the slightest concerned about formal logic and didn't use it. I am concerned about not confusing ordinary folks who might see our definitions, not just on our site, but also via Google Search and entities like OneLook.com (which give us a very favored treatment), but don't have hyperlinks, hovertext, etc. to explain things and cover over some of the inadequacies of our definitions.
Andrew Sheedy noted that, given the need to define terms of the form dis-...-ment, the OED uses the wording "The act of ...ing, the fact of being ...ed." I don't think we should be too proud to learn from and follow their example. I don't recall ever seeing the wording "the act of being" used in any dictionary's definitions.
In the case of Icelandic útúrsnúningur (the act of being intentionally obtuse), I propose "intentional obtuseness", "pretending to be obtuse", "feigning obtuseness". "Intentional", "pretending", and "feigning" all convey an active role for the intender, pretender, or feigner.
For Hungarian foglalkozás (the act of being occupied with something), all the meanings of the verb Hungarian foglalkozik are active and are so worded, excepting "be engaged in", one of the synonym cloud to glosses of definition 4, which could easily be worded as "engage in". Hungarian -as seems to function much as English -ing. Normally we don't reword all the definitions of the verb at each form of the verb or each term derived from the verb, unless there are new meanings or, perhaps, restrictions of meanings. Leaving the definition of Hungarian foglalkozás as "verbal noun of foglalkozik" without the sense-obscuring gloss seems preferable to the current state. DCDuring (talk) 23:09, 9 September 2024 (UTC)Reply
I concede that some of your suggestions might work better than what we currently have. The problem however with English -ing (verbal noun) is that it is homonymous with the present participle. That's an ambiguity which I presume the admittedly not-so-beautiful "the act of"-formulation is intended to avoid. Alternatively we could choose a formulation like "the feigning of obtuseness" or "the engaging in an activity" for verbal nouns, but I'm not sure if this is better. Leaving out a gloss altogether seems even worse to me because not everyone has a perfect intuition for what a verbal noun is.
Perhaps you can make the case better for why ordinary folks might be confused by definitions like "act of being /" in cases where the phrase "being / clearly denotes an agentive activity, because I'm still not entirely convinced. —Caoimhin ceallach (talk) 00:06, 10 September 2024 (UTC)Reply

Korean determiner form of adjectives

For example, "진정한": I couldn't find any determiner forms of Korean adjectives in this dictionary. Should we create entries for them, should we make them redirect to, for example, 진정하다, or should we just not create pages for them? If we're going to create entries for them, what should we write? 列维劳德 (talk) 00:33, 9 September 2024 (UTC)Reply

I see 좋은 is a redirect although it used to be an entry. Justin the Just (talk) 12:19, 9 September 2024 (UTC)Reply
Honestly, I'd keep them at redirects, but I'll CC the other Korean editors: (Notifying TAKASUGI Shinji, Atitarev, HappyMidnight, Tibidibi, Quadmix77, Kaepoong, The Editor's Apprentice, Saranamd): , @Solarkoid. AG202 (talk) 22:34, 10 September 2024 (UTC)Reply
@AG202 I'm not keen on having hard redirects for what are basically non-lemmas. Theknightwho (talk) 22:41, 10 September 2024 (UTC)Reply
@列维劳德: At some stage I created a few determiner forms but they were frowned upon. There are just too many forms, even determiners. It's best to focus on lemmas. It's a similar situation with Japanese verbs and adjectives. Anatoli T. (обсудить/вклад) 04:20, 11 September 2024 (UTC)Reply

These don't link to Chinese — for example, the first link on jan4 goes to 人#Cantonese instead of 人#Chinese. Are there any objections to me changing this? -saph668 (usertalkcontribs) 17:55, 9 September 2024 (UTC)Reply

RQ for template editor

I've been working on adding support for dark mode with @Ioaxxere and there are many small templates that are locked that are unreadable in dark mode. I'm mainly creating a dark mode color palette using recommendations from MediaWiki, so far it's coming along great but some templates that need adjustments don't have any classes and can't be adjusted from my css page. — BABRtalk 22:16, 9 September 2024 (UTC)Reply

No objections from me. Benwing2 (talk) 04:08, 10 September 2024 (UTC)Reply
@Benwing2 No objections yet. Do you think ~29 hours is enough time or should we wait a bit longer? — BABRtalk 03:09, 11 September 2024 (UTC)Reply
Give it a day or two and if no one says anything I'll add you. Benwing2 (talk) 04:07, 11 September 2024 (UTC)Reply
@Benwing2, it's been about two days now, I think. — BABRtalk 20:05, 12 September 2024 (UTC)Reply
@Babr Thanks for reminding me, you are now a template editor! Benwing2 (talk) 20:08, 12 September 2024 (UTC)Reply

About a new translation parameter for quote-book

For multiple different historical circumstances, it was pretty common for many Albanian books and periodicals to be printed with an Italian translation next to the Albanian text. These translations are very valuable because they were written by the authors themselves, which let us know more or less exactly what the authors meant for each word they wrote. For this reason I think they should not be omitted whenever these works are quoted, as we would be losing important context. Take for example akull (§ ety 2) where we are deducing the otherwise unattested dialectal meaning "arrow" only thanks to the Italian translation of the quote, which is however not presented in the entry, same thing for kacadre. For this reason, on the entries tipos and gëluhë I temporarily used the parameter |origtext= which does exactly what I'm looking for, except for printing the text "original", which is not the case here as the original text is actually the Albanian one.

So I propose the creation of a parameter |transltext= (and all other |transl... etc.) to essentially work like |origtext= but saying "translation". It's true this parameter name could look confusing at first, given we also have |transl= and |text= respectively, but I can't think of a better one. Catonif (talk) 23:18, 9 September 2024 (UTC)Reply

@Catonif I'm thinking maybe of generalizing this slightly instead, so that there would be an |alttext= param (and maybe |alttext2=, etc.) along with a param |altprefix= or something to specify the prefix used for this text. Original text, normalized text, translations, etc. are all instances of "alternative" text that you might want to show, and supporting multiple types of alternative text would let you e.g. put renderings in multiple languages if it's important to do so. For a case where this might be useful, see the example in the {{quote-book}} documentation of Italian text translated from a French translation of the Zhuang-zi. There's no current way of inserting all of the original Old Chinese text, the first-level French translation, the second-level Italian translation and the modern English rendering. Similarly sometimes people have found it important to include multiple renderings in English, e.g. a "poetic" translation and a literal translation; the poetic one could use |alttext=. Pinging @Sgconlaw and @Vininn126 who may have thoughts about this. Benwing2 (talk) 03:15, 12 September 2024 (UTC)Reply
BTW these parameters would actually be added to Module:usex so they are also available to {{ux}}, {{quote}} and the like. Benwing2 (talk) 03:17, 12 September 2024 (UTC)Reply
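
To make the shape of the proposal concrete, here is a minimal Python model of a quotation carrying any number of labelled alternative texts. It is only an illustration of the data layout: the class, the field names and the bracketed output format are assumptions made for this sketch, and it is not how Module:usex (which is written in Lua) is or would be implemented.

```python
from dataclasses import dataclass, field


@dataclass
class Quotation:
    """Hypothetical model of a quotation with labelled alternative texts."""

    text: str                # the passage in the entry language
    translation: str = ""    # the English rendering
    # (prefix, text) pairs, standing in for |altprefix= / |alttext= and
    # their numbered variants as proposed above.
    alt_texts: list = field(default_factory=list)


def render(quotation):
    """Return the quotation as plain text, one rendering per line."""
    lines = [quotation.text]
    for prefix, alt in quotation.alt_texts:
        lines.append(f"[{prefix}: {alt}]")
    if quotation.translation:
        lines.append(quotation.translation)
    return "\n".join(lines)


if __name__ == "__main__":
    q = Quotation(
        text="(Albanian passage)",
        translation="(English rendering)",
        alt_texts=[("translation", "(author's Italian translation)")],
    )
    print(render(q))
```

Because each alternative text carries its own prefix, the same structure would cover an original text, a normalized text and several translations without a dedicated parameter for each.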
Hi @Benwing2, to me this sounds like a very sensible solution, you have my full support. Just to be clear, this wouldn't imply the removal of the |norm= parameter, right? Catonif (talk) 08:15, 12 September 2024 (UTC)Reply
@Catonif Right, that param would remain. Benwing2 (talk) 08:24, 12 September 2024 (UTC)Reply
This sounds fine to me. Vininn126 (talk) 09:20, 12 September 2024 (UTC)Reply
I like this. Especially |alttextN=. Thereby we do not need to conceive, or it is agnostic to, why an editor wants multiple texts. Otherwise it becomes intellectually challenging to distinguish the multiple formatting options, new version, old version, original version, translation text and so on. So stick with |text= / |passage= for the entry language’s quote and before- and after-texts the labelling of which the editor can choose, including for |t= in the case that something is to be said about the translation. Fay Freak (talk) 01:14, 13 September 2024 (UTC)Reply
It's useful to have an unambiguous and well defined interpretation for the standard parameters, such as |text=, |norm= or |t=. This makes quotations machine readable. At the same time, the potential great flexibility and freedom of |alttext= makes it great for humans, but possibly difficult to parse for machines. --Ssvb (talk) 04:32, 13 September 2024 (UTC)Reply
@Benwing2: Will this new |alttext= model allow differentiating professional English translations from published books and the English translations provided by Wiktionary contributors?
For example, the Belarusian quotation from бачыць right now lists the English translation done by Mary Mintz in 1989 in the |t= parameter together with the |newversion=English translation from parameter. But the |t= parameter is normally supposed to be used for the translations authored by Wiktionary editors, right? --Ssvb (talk) 04:18, 13 September 2024 (UTC)Reply
I like this, and think it would be useful to have in the language-specific usex templates, too, like {{ja-usex}} which sometimes include translations to multiple languages like on だはんで. Or, migrate the language-specific code from the language templates into Module:usex and use that everywhere. Pinging @Fish bowl who might have some good ideas about this. JeffDoozan (talk) 13:58, 14 September 2024 (UTC)Reply

zzxjoanw Definition

Hello,

How can we make a dictionary entry for zzxjoanw? I made a pronunciation audio yesterday, and it's said to be of Maori origin. It apparently means "drum" "file" or "conclusion". There is a Wikipedia page for it, and I put my voice on Wikipedia a moment ago. Can anybody attest to that?

Thank you Flame, not lame (talk) 00:34, 10 September 2024 (UTC)Reply

It's a known hoax. Besides which, Polynesian languages like Maori pretty much always end syllables with a vowel, nor do they have double consonants, and Maori doesn't use j, x, or z in native words. Chuck Entz (talk) 04:45, 10 September 2024 (UTC)Reply
@Chuck Entz: You don't generally get b, c, d, f, l, q, s, v and y in Maori either, though the place and lake Waihola (known for its black swans) springs to mind. I'm not sure if 'ng' and 'wh' (pronounced like an f) combined qualify as Maori letters. DonnanZ (talk) 11:20, 12 September 2024 (UTC)Reply
Maybe define "zzxjoanw" as an English word that is a fake Maori word. Flame, not lame (talk) 08:13, 10 September 2024 (UTC)Reply
@Flame, not lame: The pronunciation is at Appendix:English dictionary-only terms/zzxjoanw. J3133 (talk) 05:05, 10 September 2024 (UTC)Reply
Writing as an NZer, we don't need this. There shouldn't be an entry for Waikikamukau either, a fake Maori place name pronounced "why kick a moo cow". DonnanZ (talk) 10:54, 12 September 2024 (UTC)Reply
I can't believe it. "Don't talk to me." 💔 Flame, not lame (talk) 22:14, 12 September 2024 (UTC)Reply

Letter articles without definitions, Latin Extended Additional block

Hi. Per request, I'm consolidating requested deletions here rather than posting 'rfd' multiple times. The following articles have no content, apart from the Unicode name being used in place of a definition. There is no indication of which languages they may be used in, or if they're translingual as claimed by the header. Previous consensus has been that the Unicode name, or a paraphrase of it, does not qualify as a definition.

I'd be interested in seeing what these letters are used for, if anyone can document them, but I've failed to find anything myself.

Thanks. kwami (talk) 07:25, 10 September 2024 (UTC)Reply

Latin: words with suffixes (and prefixes?)

I couldn't find habitasne. I eventually decoded it as habitās + -ne. @Nicodene has advised that words like habitasne do not merit a separate entry, but I feel it's worth a broader discussion, expanding on the original conversation.


I wouldn't mind so much not having a separate entry for words like habitasne if there were an easier way to find them. For example, if the search results could be coerced into showing something more useful, and/or if it could be presented (perhaps not hyperlinked) in a table of conjugations or similar. Having the word appear somewhere within the main verb entry could presumably help the search page to list it too — it seems to work this way for bailémosles, which doesn't have its own entry, but is mentioned within the bailar entry.

Slightly off-topic: is it possible to force the search to only look within a specified language or languages? For example, if the user 'knows' that the word is Latin, then can they avoid being shown results from English and other irrelevant languages?

A table of conjugations can perhaps (?) be automatically generated (and manually edited where necessary), and minimised by default. I'm thinking of the Spanish verb conjugation tables, which include not just the 'simple' conjugations, but also every (?) combination with pronoun suffixes too, such as in the Selected combined forms of bailar table: many words in the table do have their own entries too (such as bailarme and bailándolas), and are correspondingly linked, but also many — presumably less common — words don't (such as bailémosles and báilenla), which are somehow redlinked without being coloured red in that table. The convention adopted for Spanish verbs must surely have expanded the number of entries, but was still deemed worthwhile.

For people familiar with Latin it may seem trivial. But suppose I encounter a hypothetical new Latin word *previviruminsmaste. I would rather not have to 'manually' try various combinations in order to decode it: for example, would it be defined in one single entry (previviruminsmaste), or across two entries (pre + viviruminsmaste or previ + viruminsmaste or previviruminsmas + te or previvirumins + maste etc.), or across three entries (pre + viviruminsmas + te or previ + virumins + maste etc.)?

Alternatively, if there is no desire to add conjugation tables or similar to each main verb entry, how about adding a link to a grammar page listing various Latin suffixes (and prefixes?) that can commonly be applied?

—DIV (2001:8004:44F0:5AD4:F864:E084:92A9:DBF4 00:51, 11 September 2024 (UTC))Reply

The problem is that -ne belongs to a class of "enclitic" particles that can in principle be suffixed to just about any Latin word, not just verbs. See the discussion at Talk:fasque#Deletion_debate. It would be unwieldy and not very useful to display all of these forms even in a collapsed table. Ideally we would have redirects, but one of the many structural flaws of Wiktionary as a MediaWiki-based multilingual dictionary is that it doesn't make it easy to set up automatic redirects for cases like this.--Urszag (talk) 01:01, 11 September 2024 (UTC)Reply
One can vaguely dream of a search feature which, in the event of a query finding no results, displays results matching that query minus various common clitics. Nicodene (talk) 01:24, 11 September 2024 (UTC)Reply
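
A minimal sketch of that fallback, assuming the set of existing entry titles is available and using a short, illustrative list of Latin enclitics. The function name and the clitic list are hypothetical, and nothing here corresponds to an existing search feature.

```python
# Illustrative only; a real list would be curated per language.
ENCLITICS = ("que", "ne", "ve")


def suggest_bases(query, entries):
    """Return (base_form, clitic) pairs for a query that has no entry.

    `entries` is any container of existing entry titles. A pair is
    returned only when stripping a clitic yields an existing title.
    """
    suggestions = []
    for clitic in ENCLITICS:
        if not query.endswith(clitic):
            continue
        base = query[: -len(clitic)]
        if base in entries:
            suggestions.append((base, "-" + clitic))
    return suggestions


if __name__ == "__main__":
    titles = {"habitas", "fas"}
    print(suggest_bases("habitasne", titles))
    print(suggest_bases("fasque", titles))
```

This checks only word-final clitics and would also offer a base for words that merely happen to end in one of these strings, so the results are better presented as suggestions than as redirects.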
Many languages have similar issues; e.g. we don't lemmatize every word with 's added. Arabic single-letter conjunctions and prepositions regularly cliticize onto the next word, which can momentarily trip up even native speakers, e.g. Fenakhay was looking for the rare verb اِسْتَسْأَلَ (istasʔala) and momentarily interpreted the form أستسألني as a form of this verb أَسْتَسْأَلُنِي (ʔastasʔalunī) (maybe "I ask myself", although this may be ungrammatical) when it's actually أَ (ʔa) (which marks a yes-no question) + سَ (sa) (future-tense prefix) + تَسْأَلُ (tasʔalu, you ask) + نِي (nī, me), i.e. "will you ask me?". The problem, as noted by User:Urszag, is that many types of clitics can be added onto pretty much any word, e.g. -que, -ne and -ve attach to the first word, whatever it is. Potentially we could write a JavaScript add-on that helps with this; we already have JavaScript gadgets that auto-redirect certain forms to certain other forms, and so it's not out of the question to write a gadget that tries to analyze a nonexistent word into its components, as User:Nicodene notes. But this would have to be quite complex and ideally would have machine learning attached to figure out the likely language and do the segmentation. Benwing2 (talk) 04:05, 11 September 2024 (UTC)Reply
There was a discussion a while ago about having our templates (modules), whenever they transliterate an Arabic word, output all the common methods of transliteration of Arabic, but then set them to "display: none" (or hide them in HTML comments), so that searching for them in our on-site search engine, or on Google, will find the relevant pages, without the extra content actually being displayed to readers of the entry itself. I'm not sure whether anyone got around to implementing that, but the same basic idea suggests itself to me here: have Latin-specific headword or inflection templates produce (but not display) text like "suffixed with -ne: foobarne, suffixed with -que: foobarque", etc, so that searches for those things find the relevant entry (and ideally the 'results snippet' even includes the "suffixed with..." text so the reader can work out what's going on). (Edited to add: the discussion was in December 2022.) - -sche (discuss) 19:16, 11 September 2024 (UTC)Reply
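
The hidden-text idea could look roughly like the following, written in Python purely for illustration (the real templates would be Lua modules). The function name and the clitic list are assumptions, and whether the on-site search or external search engines would actually index text hidden with "display: none" is untested here.

```python
from html import escape

# Illustrative only; not an exhaustive list of Latin enclitics.
ENCLITICS = ("ne", "que", "ve")


def hidden_clitic_forms(form):
    """Return an HTML span listing `form` with each enclitic attached.

    The span is styled with display:none so that it is not shown to
    readers of the entry.
    """
    parts = [f"suffixed with -{clitic}: {form}{clitic}" for clitic in ENCLITICS]
    return '<span style="display:none">' + escape(", ".join(parts)) + "</span>"


if __name__ == "__main__":
    print(hidden_clitic_forms("habitas"))
```

Generating the text from the inflected form keeps the clitic list in one place, at the cost of adding hidden markup to every inflection table that uses it.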
Other dictionaries of course do these automatic redirects since the user chooses his language in the mask. MediaWiki is not for languages, though we be better than the other dictionaries by virtue of the other things MediaWiki is designed for. One probably indeed needs some frequency data for any target language, if not so-called machine learning which sounds so undebuggable as to be out of scope of a Wiki. A list of relevant clitics and the mechanisms by which they are attached to be edited by trusted editors plus a toggle for the searcher to restrict for a specific language. I rather believe in someone just doing it with an external site … Fay Freak (talk) 19:46, 11 September 2024 (UTC)Reply

I’m not a TTS

Last month @Fytcha deleted my contributions to Wiktionary accusing my voice recordings of being “computer-generated”. They are not. I request them to be brought back. JapanYoshi (talk) 00:23, 13 September 2024 (UTC)Reply

Your audio files have weird singsong intonation and missing or mispronounced sounds. Whether they're generated by a computer or by a human, we would rather have audio generated by people who speak the language well. - -sche (discuss) 01:34, 13 September 2024 (UTC)Reply
@-sche: There's no missing sound. The /p/ at the end of poop in poop pipe is which is perfectly valid in GA (in fact, would sound unnatural). With that said this one sounds a little non-native to me. I don't think JapanYoshi is using TTS, unless it's some kind of super-advanced AI that sounds exactly like a human, in which case I wouldn't be opposed to it anyway. Ioaxxere (talk) 06:39, 14 September 2024 (UTC)Reply
I hope this thread is not used in the future to assert consensus for the use of TTS. I'd be strongly opposed. The takeaway is that JapanYoshi's audios may not be TTS but they are incredibly unnatural sounding. Vininn126 (talk) 09:13, 14 September 2024 (UTC)Reply
I don't get the sense that the audios were recorded by someone who doesn't speak the language well. I think any unnaturalness is simply due to over-enunciation. So the solution is very simple: we just explain to JapanYoshi that we want audio files to sound as natural as possible, not over-enunciated, and if we find their speech a little unusual, then it should be labelled with the appropriate region. Of course, if JapanYoshi does have an idiosyncratic accent, then we might consider letting them know that it's important for the audio to be fairly representative of a given accent. But let's not jump all over this person and their contributions without even welcoming them to Wiktionary and giving them a little guidance! Andrew Sheedy (talk) 16:30, 14 September 2024 (UTC)Reply
As I said in the other thread, there's a certain type of pronunciation that should be expected in a dictionary. You can look at the OED, Merriam-Webster, Dictionary.com, etc. to find it. Even if they may sound like TTS (or be TTS), they still have a very specific and clear intonation & enunciation that's been missing from your audios. Remember that audios are meant to illustrate a standard pronunciation and are especially used by non-natives for learning/illustration purposes. Honestly, I really do think that we should have an explicit policy as to what should be expected from audios reflecting standard speech, as a lot of recent ones have been problematic for various reasons. I've been getting frustrated with the less-than-par quality that we've been allowing recently, and I also really do not have the time to go through each one, removing the ones that sound noticeably non-standard. CC: @-sche, @Benwing2, @Theknightwho. AG202 (talk) 17:29, 14 September 2024 (UTC)Reply
@AG202 I agree with you about the need for good-quality audio files but would an explicit policy accomplish anything? I assume the people who are uploading bad audio files are unlikely to read the policy even if it's in their face. Maybe a more relevant issue IMO concerns User:DerbethBot, which auto-uploads audio files. I have encountered lots of problems due to this; some of them are old but I don't know if they are all fixed and I can't tell from looking over the github source code because it isn't well documented. As an example, all audio files for any Arabic-script and Hebrew-script terms should be blacklisted because the scripts are underspecified. Yet there are definitely lots of Arabic script audio files present that have been uploaded by DerbethBot, and I don't know whether this can still happen. Benwing2 (talk) 20:30, 14 September 2024 (UTC)Reply
@Benwing2: An explicit policy would at least give us something to point to if we added a filter or warning or something. It would also give less wiggle room to argue about what is and isn't "good audio", as we've seen happen time and time again. Re: bots like User:DerbethBot: as stated on their page, "Administrators: if this bot is malfunctioning or causing harm, please block it." If it's been seen time and time again that this bot is problematic in the way it adds audios, why hasn't it been blocked since 2008? Honestly, it baffles me a bit. AG202 (talk) 00:45, 15 September 2024 (UTC)Reply
@AG202 Because I don't know if the bot is still causing issues; I'm waiting for User:Derbeth to respond. I feel somewhat uncomfortable about the idea of a bot that auto-adds audio files without any way of verifying the correctness of the actual content, but not enough to insist on shutting it down outright. Benwing2 (talk) 01:15, 15 September 2024 (UTC)Reply
Doesn't that bot operate based on a whitelist? Vininn126 (talk) 06:26, 15 September 2024 (UTC)Reply
I don't think so, maybe it has whitelists for sites but generally it uses blacklists I think. Benwing2 (talk) 06:29, 15 September 2024 (UTC)Reply
I think that a workable solution would be to implement some sort of a voting system at https://lingualibre.org/ (the project under the Wikimedia Foundation's umbrella?) and grade audio recordings based on that. The downvoted audios or their submitters could be blacklisted. But the problem is that not all words even have audio recordings available, and beggars can't be choosers: https://lingualibre.org/wiki/List:Eng/Lemmas-without-audio-sorted-by-number-of-wiktionaries --Ssvb (talk) 18:06, 15 September 2024 (UTC)Reply
(Previous discussion: Wiktionary:Beer parlour/2024/August § Synthesised audio files (again))
@JapanYoshi: Hey. I'm willing to take you at your word that your audio files are not computer-generated and I therefore apologize for my mistake. When I read the BP thread I linked to above, I reviewed your audio files and found there to be multiple issues with them. On top of that, some files, especially File:En-fucking Nora.ogg, sounded "robotic" to me; to my ears, there's something weird going on in the higher frequencies of that file, exactly the same kind of artifact often encountered in synthesized audio files. However, that could just as well be your recording equipment or maybe I'm just mishearing things.
All that said, even if not the product of speech synthesis software, there's still issues to be addressed with your audio files as others in this discussion have also pointed out. I won't touch your audio files anymore and I wouldn't revert you if you revert my removal. — Fytcha T | L | C 19:58, 14 September 2024 (UTC)Reply
Having just noticed this, I'm going to block the user for unacceptable conduct (bigotry). For my part, I set the block length to indef because my feeling is that a user whose 31 edits are "queer people / theorists advocate pedophilia and bestiality" and unnatural audio and defense thereof (which, in light of the other thing, someone could even speculate could be trolling) is NOTHERE, but if the user makes an unblock request that persuades another admin that they should be unblocked, I (of course) won't stand in the way of that. - -sche (discuss) 21:28, 14 September 2024 (UTC)Reply
WTF, -sche? You blocked them for asking a question "why don't we have such an entry?" I'm disappointed. Denazz (talk) 21:39, 14 September 2024 (UTC)Reply
Yikes on their part. AG202 (talk) 00:46, 15 September 2024 (UTC)Reply
I oppose an indefinite block. I don't really understand what they were getting at with that question you blocked them for, but I really don't understand what happened to assuming good faith. It seems we assume bad faith now until a user can prove otherwise. Andrew Sheedy (talk) 04:00, 15 September 2024 (UTC)Reply
Well, it sure reads to me like that post is trolling. However, if they ask for an unblock and can provide a reasonable explanation why their post was not trolling, I would not be opposed to reducing the block to e.g. a month. Benwing2 (talk) 04:08, 15 September 2024 (UTC)Reply
Well, can I ask for a unblock on their behalf? I assume they were asking why we were missing a term, albeit a vulgar/offensive one. Sure, it wasn't very clear a request, but not blockworthy. Like if I ask "I heard 'you are a honky donkey' in a film used against a white person from Ottawa. Can we add it?" Denazz (talk) 07:21, 15 September 2024 (UTC)Reply
No, it's very obvious what type of place they're coming from. Judith Butler has never argued for those topics, nor is she their advocate, and anyone who's seriously taken a look at queer theory would know that those topics are not a part of it. Don't be obtuse, saying that they're just asking "why don't we have xyz entry". It's at best trolling and at worst bigotry to equate queer theory and queerness with those unrelated repulsive topics. I'm disappointed but not surprised. AG202 (talk) 15:40, 15 September 2024 (UTC)Reply
I know little of LGBT-studies, and their stance on the issue is not my concern. The user deserves another chance. BTW, if anything creative comes out the discussion, let it be the creation of q***r! Denazz (talk) 07:26, 18 September 2024 (UTC)Reply
@Andrew Sheedy What they asked is equivalent to a user asking why we don't list "criminal" as a sense of black, and that's being quite generous if I'm honest. Theknightwho (talk) 00:44, 19 September 2024 (UTC)Reply
Based on the explanation on their talk page, I'm not convinced that's the case. As weird as the request sounded. The user could certainly work on communicating more clearly, but I wouldn't be confident in saying it was in bad faith. Andrew Sheedy (talk) 04:00, 19 September 2024 (UTC)Reply

-Sche made the right call here. There is legitimate disagreement over the term queer within the LGBT community. Someone taking the position that queer is a slur isn't necessarily here in bad faith. But someone who has suggested that queer is synonymous with "pedophilia/bestiality advocate" is very obviously an anti-LGBT troll. The singling out of Judith Butler is a subtler tell. They are basically the only gender theorist that anti-LGBT folks can name. Few anti-LGBT people have actually engaged with Butler's work. That's why none of them can seem to clearly articulate anything Butler has actually written. These people are playing a game of telephone with like two garbled quotes. Butler did not argue in Undoing Gender (2004) that "some forms of parental child rape are only bad because of the social stigma" as JapanYoshi has claimed. They suggested that "there are probably forms of incest that are not necessarily traumatic or which gain their traumatic character by virtue of the consciousness of social shame that they produce." That's a nuanced argument referencing cases like this British woman who never had children because of her deep shame at learning as an adult that she was born of seemingly consensual sibling incest. I wouldn't trust contributions from a user who has either knowingly misrepresented an author's writing to promote an anti-LGBT conspiracy theory or didn't care enough to verify a supposed quote. WordyAndNerdy (talk) 08:13, 19 September 2024 (UTC)Reply

zh-pron order

Hey, I think Wade-Giles should appear above Tongyong Pinyin in Template:zh-pron, on the basis of wider usage of Wade in personal names, historically, etc. I don't know where to make that change in the code but I think it would be non-controversial. It seems too minor for beer parlor and grease pit. If anyone that sees this could do that, I would appreciate it, or tell me what to do and where could I take this. Thanks. --Geographyinitiative (talk) 22:58, 13 September 2024 (UTC)Reply

I'm not sure if this change should be done. Pinyin is, to my understanding, more widely used in transliterating Chinese than Wade-Giles presently, so it should go first. CitationsFreak (talk) 00:17, 14 September 2024 (UTC)Reply
Hanyu Pinyin, not Tongyong Pinyin. MuDavid 栘𩿠 (talk) 00:51, 14 September 2024 (UTC)Reply
Oh yeah, of course Hanyu Pinyin is on top, I'm just saying that it should be: Hanyu Pinyin, then Bopomofo (so-called "Zhuyin"), then Wade-Giles, then Tongyong Pinyin, then the rest. I think that's more in line with volume of usage of those systems. --Geographyinitiative (talk) 09:29, 14 September 2024 (UTC)Reply

Archiving failure at payback's a bitch

This deleted entry was just recreated, and I went to the talk page to see the details of the deletion. The only thing on the talk page was a discussion from January of 2013 where Equinox and I explained that the {{rfd}} template had to stay because its deletion was still being discussed (that put the entry on my watchlist, which is how I found out it was recreated). The page itself was deleted in April of 2013 as having failed RFD, but no mention of that made it onto the talk page. It was only after wading through the revision history of WT:RFD from 2013 that I was able to find that it had been added to the RFD for life's a bitch, and I found the discussion was archived at Talk:life's a bitch.

How should we handle this? The new definition is odd and probably wrong, but not SOP. I'm not really sure what to do with the talk page so the previous RFD is reflected there, and right now it seems a bit off to tell someone that their good-faith page creation is going to be deleted because of something not explained anywhere findable from the entry or its talk page. If someone wants to retag the new entry and send it to RFDE, that might help. Chuck Entz (talk) 00:51, 15 September 2024 (UTC)Reply

Should personal attacks be restored?

Let's say that Alice and Bob get into a heated personal argument. Insults lobbed, caps lock engaged, lawyers threatened, the whole nine yards. But then Caroline decides to hide the comments made by Alice and Bob. The question is, should the comments be restored?

My understanding of the policy is that they should be, since it's common etiquette to not edit others' comments. However, it does feel wrong to let such incivility lie out in the open. CitationsFreak (talk) 07:56, 15 September 2024 (UTC)Reply

I say they should be if only lest discords be continued by editing others’ comments. Any bidding to hide or keep hidden exaggerated argumentative behaviour specifically – as opposed to e.g. data protection violations where findability in machine searches has to be considered, which loses relevance in so far as pseudonymity is kept, and circular arguments of editors that have been hidden by a click-to-open box in ultimately inoffensive cases due to the otherwise unpalatable bulk of the discussion – sets a perverse incentive.
Empirically it is also shown that the cases occurring on Wiktionary – I won’t point at the cases – would have fared better if people would have gotten over the matter by just remembering they are not a main character and not reading it already, instead of heating up for edit summaries.
The probability of success in forgetting such microtraumata without even any cognitive reframing effort is great, while only complexity is engaged by hiding their agents in edit histories. Fay Freak (talk) 09:17, 15 September 2024 (UTC)Reply
I think hiding, but not deleting, is fine. I'd favor doing so whenever motives, values, attitudes, or beliefs are attributed to a user. DCDuring (talk) 17:34, 15 September 2024 (UTC)Reply

Request to change libelous block statement

Hello, I am User:Gapazoid. I was blocked by User:Surjection for an edit to MAP that promoted the idea that there are people who are attracted to minors and are against inappropriate adult-minor relationships (so-called "anti-contact pedophiles" or "non-offending pedophiles"). Furthermore, I identified myself as being both attracted to minors and fundamentally against any sexual contact between adults and minors.

The stated reason for my block is "Unnacceptable conduct: pedophilia advocacy". To the average person, "pedophilia advocacy" implies that I was promoting inappropriate adult-minor relationships, which is not the case, as I have repeatedly explained. It is very important to me that the public is not misinformed about my values. In my discussion with Surjection on their talk page, they stated that the exact reason for my block was "promoting the idea that there are non-offending pedophiles". This statement I have no issue with, as it is truthful and cannot be misconstrued.

Therefore, I am requesting that the stated reason for my block be changed from "Unacceptable conduct: pedophilia advocacy" to "Unacceptable conduct: Promoting the idea that there are non-offending pedophiles". 76.35.75.228 18:28, 15 September 2024 (UTC)Reply

Quotations asserting dubious, clickbaity, controversial, deceptive or sometimes even blatantly false statements or conjectures

Today an IP user made this edit, adding a quotation about Putin allegedly admitting that he is "hunting" his opponents. To put it mildly, I'm not a fan of Putin, but I personally think that the concept of "hunting" can be better illustrated using some other quotations, which don't mention Putin, Biden, Trump or any other modern politician. And also without trying to promote any political, religious, ideological or corporate narratives. I checked the WT:QUOTE#Choosing_quotations policy and it doesn't say anything on this topic. Does Wiktionary need some kind of a formal rule to prevent it from turning into an outlet of propaganda for various modern ideologies, hoaxes and conspiracy theories?

That said, really old books may contain quotations, which mention obsolete morally questionable barbaric traditions or some outdated unscientific information. Do they get a free pass on the basis of their age? --Ssvb (talk) 20:22, 15 September 2024 (UTC)Reply

Adding a controversial quote to an entry for a non-controversial term that doesn't need the controversial aspect to illustrate the meaning is definitely a violation of WT:NPOV. I'm not talking about politically charged or bigoted terms, which would have bigoted or politically charged usage that we would be wrong not to document. I'm talking about converting an entry for an ordinary term into a political statement by adding a quote that illustrates something political that doesn't need to be there to understand the term. Sure, there are lots of really outrageous things people have said that happen to contain the word "the", but that's not a reason to add them to the entry for "the" as quotes or usage examples.
I've reverted and hidden the edit that added that quote. Chuck Entz (talk) 20:50, 15 September 2024 (UTC)Reply
It is not a controversial term but its meaning range has to be described in a controversial context: “hunting” in the figurative sense always involves various kinds of, well, predatory behaviour designed to offend. In other words not the term, but the meanings being charged is enough for the quotes to be charged. Only that the Belarusian entry has not been that elaborate. It should get one gloss or at least quote that is normal and then the user might add propaganda, with some hyperbole.
I cannot agree with the general rule given that I normally browse political websites and find quotes there about specific people or brands that everyone relates to, even find them when I specifically search terms, because that’s what publications are about since humans are social animals, and if you aspire to be in a leading political office, the more so as an eternal leader, you have to bear being an example of everything. I would be more concerned if the shown example skewed the usage already, like the anti-abortion propaganda redefining infant the other day. Fay Freak (talk) 18:29, 16 September 2024 (UTC)Reply

templatizing raw category references more generally

We have a mechanism that automatically tracks raw category references. See subcategories of Category:Entries with language name categories using raw markup by language and Category:Entries with topic categories using raw markup by language. I have a script to templatize these into calls to {{cln}} and {{C}}, respectively, grouping multiple categories into a single call as much as possible. I have run the script on all the pages in various individual common languages (e.g. Spanish, French, German, Russian, Italian, several others) but not generally. As discussed in Wiktionary:Grease_pit/2024/September#user_sandboxes_showing_up_in_categories and elsewhere, there are two major disadvantages in using raw category references: (1) sort keys are not properly generated for languages that have custom sort orders, meaning that the pages are wrongly ordered in the categories in question; (2) transclusions of pages containing raw category references into userspace pages result in the userspace pages wrongly showing up in the categories in question. My script has been extensively tested through its use on the various common languages mentioned above. I propose running the script on all the pages in all languages under the above maintenance categories; or maybe, on all but certain languages (e.g. excluding English if people object for some reason to this). The only downside I can think of is a slight increase in Lua memory usage, which could theoretically push some pages over the limit if they're close to it; but AFAIK, with the increase in memory limits late last year, no pages are close to the current limit. Instead, we're hitting the template expansion limits before the memory limits, and I don't think the use of Lua-based templates here will increase the template expansion size (and in any case, the number of such categories on a given page is usually small). 
If for some reason any given page ends up hitting a limit as a result of this, we can back out the changes for that particular page, whitelist it in the code that generates the contents of the subpages of Category:Entries with language name categories using raw markup by language etc., and blacklist it in my templatize-categories script so a future run won't affect it. Benwing2 (talk) 23:12, 15 September 2024 (UTC)Reply
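
(Illustration of the conversion described above. This is a minimal sketch, not the actual script: the LANG_NAMES mapping and the function name templatize are assumptions, and a real run would use Wiktionary's full language data. Raw categories that carry an explicit sort key are deliberately left untouched here.)

```python
import re

# Assumption: a tiny hand-written mapping for illustration only.
LANG_NAMES = {"Spanish": "es", "French": "fr"}

# Matches [[Category:Name]] without a sort key, plus one trailing newline.
RAW_CAT = re.compile(r"\[\[Category:([^\]|]+)\]\]\n?")


def templatize(text, lang_code):
    """Replace raw category links for one language with grouped
    {{cln}} (language-name categories) and {{C}} (topic categories) calls."""
    lang_name = next(n for n, c in LANG_NAMES.items() if c == lang_code)
    cln, topics = [], []

    def collect(match):
        name = match.group(1).strip()
        if name.startswith(lang_name + " "):
            cln.append(name[len(lang_name) + 1:])
            return ""
        if name.startswith(lang_code + ":"):
            topics.append(name[len(lang_code) + 1:])
            return ""
        return match.group(0)  # leave unrecognized categories alone

    body = RAW_CAT.sub(collect, text).rstrip("\n")
    lines = [body] if body else []
    if cln:
        lines.append("{{cln|%s|%s}}" % (lang_code, "|".join(cln)))
    if topics:
        lines.append("{{C|%s|%s}}" % (lang_code, "|".join(topics)))
    return "\n".join(lines)


page = "==Spanish==\n[[Category:Spanish lemmas]]\n[[Category:es:Foods]]\n[[Category:es:Fruits]]"
print(templatize(page, "es"))
```

The two topic categories end up in a single {{C|es|Foods|Fruits}} call, which is the grouping behaviour described above.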

I personally am in favor of this; it would save me a lot of time and energy. -BRAINULATOR9 (TALK) 03:48, 17 September 2024 (UTC)Reply

{{senseno}} is a nasty hack created by IP 70.* (who seems to have disappeared). Per discussion with User:Vininn126, I am planning on renaming it to {{senselink}} and not having it display the sense number by default, because I don't believe this is the best way of referring to senses; instead, senses should be named using a summary of the meaning or by POS. Also, the code in Module:senseno to determine the number of the sense is really terrible. As part of the rename, I plan to clean it up and add a required param to specify either a meaning summary (arbitrary text), a request for the sense number (maybe +# or something) or a part of speech (maybe pos:POS or something). As an example of what I mean, under raz you have:

{{senseno|pl|counting|uc=1|pos=numeral}} is a generalization of {{senseno|pl|instance|pos=noun}}, for which see {{cog|zlw-opl|raz}}. For this use, compare {{cog|ru|раз}}. {{senseno|pl|case|uc=1|pos=noun}} is a semantic narrowing of {{senseno|pl|instance|pos=noun}}.

which displays as

Sense 1 is a generalization of sense 1, for which see Old Polish raz. For this use, compare Russian раз (raz). Sense 1 is a semantic narrowing of sense 1.

This is really awful. Much better would be "the sense one is a generalization of (one) time, for which see (etc.)", which would make a lot more sense. Benwing2 (talk) 06:13, 16 September 2024 (UTC)Reply

Anything for more specificity would be preferable. Vininn126 (talk) 06:19, 16 September 2024 (UTC)Reply
No objection to adding an option for a gloss (which I already add manually), but I'm not sure it's a good idea to remove the ability to display a sense number altogether. If an entry has many senses, having the sense number may be a quick way of determining which is the relevant sense referred to. Could you provide an example of how the template would display if the sense number is removed?
On a separate note, perhaps there should also be options for adding the etymology number (for example "etymology 1, sense 2") and the part of speech ("noun sense 1"), though I'm not sure how this would be displayed if the sense number is ultimately removed as proposed. — Sgconlaw (talk) 19:43, 16 September 2024 (UTC)Reply
@Sgconlaw I'm not suggesting removing the ability to display a sense number, but just requiring that if the user wants a sense number, they request it explicitly using e.g. {{senselink|pl|#}} or {{senselink|pl|+#}}, instead of having the template default to displaying a sense number. Basically the second param is required and specifies either a short gloss, a request for a sense number or a part of speech. Benwing2 (talk) 19:54, 16 September 2024 (UTC)Reply
@Benwing2: ah, I see. Could you provide some mock-ups of how the template will display if different parameters (sense number, gloss, part of speech, etc.) are used? — Sgconlaw (talk) 20:24, 16 September 2024 (UTC)Reply
@Sgconlaw
  • {{senselink|pl|#}} displays as "sense 1" or whatever, linked appropriately
  • {{senselink|pl|pos:noun}} displays as "the noun sense", linked appropriately
  • {{senselink|pl|(one) time}} displays as "(one) time", linked appropriately; alternatively it could display as "the sense (one) time", but then you'd need a way of suppressing the words "the sense"
If you think it would be useful, I can provide syntax to include the etymology number; maybe {{senselink|pl|##}} displays as "etymology 1, sense 2"
or whatever. Benwing2 (talk) 21:41, 16 September 2024 (UTC)Reply
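
(Illustration of how the required second parameter floated above could be told apart. This is a hedged sketch only: the syntax is merely a suggestion in this thread, nothing here is implemented in Module:senseno, and the function name classify_senselink_arg and the returned labels are hypothetical.)

```python
def classify_senselink_arg(arg):
    """Classify the proposed second parameter of {{senselink}}.

    Returns a (kind, value) pair. The kinds are made-up labels
    used only for this illustration.
    """
    if arg in ("#", "+#"):
        return ("sense_number", None)            # e.g. "sense 1"
    if arg == "##":
        return ("etymology_and_sense", None)     # e.g. "etymology 1, sense 2"
    if arg.startswith("pos:"):
        return ("pos", arg[len("pos:"):])        # e.g. "the noun sense"
    return ("gloss", arg)                        # arbitrary meaning summary


for example in ("#", "##", "pos:noun", "(one) time"):
    print(example, "->", classify_senselink_arg(example))
```

Anything that is not one of the reserved forms falls through to a gloss, so ordinary text needs no special marker; the cost is that a gloss beginning with "pos:" or consisting only of "#" could not be expressed without some escape.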
I think the template was designed for use in image captions, where brevity is valuable. I'm not hugely supportive of any changes, especially if it's going to introduce yet more of this bespoke syntax ("funny characters to memorise") that seems to be in vogue these days. This, that and the other (talk) 01:49, 17 September 2024 (UTC)Reply
@This, that and the other The problem is that this template is not only used for image captions. Would you rather have two different templates, one for image captions and another for etymology sections? That seems a worse solution. Benwing2 (talk) 02:02, 17 September 2024 (UTC)Reply
@Benwing2 surely it wouldn't be too much trouble to have {{senseno}} to generate a sense number, and {{senselink}} to generate a sense link? Obviously the situation at raz is silly, but it's hardly the fault of {{senseno}} that it's being forced to do bad things (I certainly would not have attempted to use it on an entry with more than one etymology section!). Anyway maybe I need to think about this some more. This, that and the other (talk) 02:09, 17 September 2024 (UTC)Reply
@Benwing2: One feature that I would like is a |t= parameter like our other templates. So you would be able to do something like Sense 2 ("something something") comes from without having to hack it together in wikitext. It could be generated automatically as well but in some cases you want to shorten the gloss a bit. Ioaxxere (talk) 03:44, 17 September 2024 (UTC)Reply

That pesky dot in Template:pedia

The dot at the end of {{pedia}} is an absolute nuisance. I suppose it was put there because most of our reference templates end with a dot, but (a) Wikipedia isn't a reference, and (b) the template is so frequently used in running text that the dot becomes a pain to deal with. If you want to write a sentence like "See {{pedia|Something}}", you have to remember to leave off the dot at the end, because the template adds it for you!

Would there be support for a bot run to add a dot after every instance of {{pedia}}, after which we can remove the dot itself from the template? The same would need to be done with the other bolded, logo-adorned Wikimedia project link templates like {{specieslite}}. This, that and the other (talk) 01:47, 17 September 2024 (UTC)Reply
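
(Illustration of the proposed bot pass. This is a minimal sketch under stated assumptions: it handles only {{pedia}} calls with no nested templates in their arguments, and the function name add_explicit_dot is hypothetical. It also skips calls already followed by a dot, which is a design choice of this sketch rather than part of the proposal.)

```python
import re

# Matches {{pedia}} or {{pedia|...}} whose arguments contain no braces
# (a simplifying assumption), when not already followed by a dot.
PEDIA = re.compile(r"(\{\{pedia(?:\|[^{}]*)?\}\})(?!\.)")


def add_explicit_dot(wikitext):
    """Append a literal dot after each {{pedia}} call so the rendered
    output stays the same once the template stops adding its own."""
    return PEDIA.sub(r"\1.", wikitext)


print(add_explicit_dot("* {{pedia}}"))
print(add_explicit_dot("See {{pedia|Something}}"))
```

Running the function twice gives the same result as running it once, so a bot could safely revisit pages.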

Yes definitely. Benwing2 (talk) 02:03, 17 September 2024 (UTC)Reply
I frequently use {{pedia}} for references, but I have no objection to the removal of the dot from the template. There are other templates it could be removed from too. DonnanZ (talk) 08:19, 18 September 2024 (UTC)Reply

Minitoc

{{minitoc}} has been added to 436 of our longest entries so far. I think now that we've had time to get used to it we can decide on a policy for when it can be added. I've personally found it very useful on mobile, especially with User:Ioaxxere/minitoc.js so I would support adding it to all entries with at least five language sections (probably via a continuously running bot job). Note that the template uses some complex rules to show or hide itself on different skins which you can see here. Ioaxxere (talk) 03:44, 17 September 2024 (UTC)Reply

I love it. And I also love the initiative that has been taken to address a real, longstanding problem.
A few minor comments about the look of the template (I know that's not the purpose of this section!):
  • Why is it collapsed by default? The regular TOC is not collapsed by default, so why should this one be?
  • The space before the bullet should be a non-breaking space.
  • It serves no purpose when Tabbed Languages is enabled, so something should be added to MediaWiki:Gadget-TabbedLanguages.css to hide the minitoc.
As for the thrust of your point, I feel that a better threshold would be - add {{minitoc}} to entries where the regular TOC exceeds, say, 25 lines (this could be easily calculated by a bot). This, that and the other (talk) 04:46, 17 September 2024 (UTC)Reply
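
(Illustration of how a bot might compute the two thresholds mentioned so far, five language sections or a 25-line TOC. This is a rough sketch: it assumes every heading produces one TOC line, the names toc_stats and wants_minitoc are hypothetical, and combining the two criteria with "or" is only one possible reading.)

```python
import re

# A wikitext heading: the same run of 2-6 "=" on both sides of the title.
HEADING = re.compile(r"^(={2,6})[ \t]*(.+?)[ \t]*\1[ \t]*$", re.MULTILINE)


def toc_stats(wikitext):
    """Count level-2 (language) headings and headings of any level."""
    levels = [len(m.group(1)) for m in HEADING.finditer(wikitext)]
    return {"language_sections": levels.count(2), "toc_lines": len(levels)}


def wants_minitoc(wikitext, min_languages=5, min_toc_lines=25):
    """Assumption: either threshold alone is enough to qualify."""
    stats = toc_stats(wikitext)
    return (stats["language_sections"] >= min_languages
            or stats["toc_lines"] >= min_toc_lines)


entry = "==English==\n===Noun===\ntext\n===Verb===\ntext\n"
print(toc_stats(entry))
print(wants_minitoc(entry))
```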
One thing I find confusing: why does it have to be implemented by a template in each page's source code (whether added manually or by a bot)? Is it impossible to add logic to whatever code generates and displays the normal TOC?--Urszag (talk) 07:31, 17 September 2024 (UTC)Reply
@This, that and the other: You can make it uncollapsed by default by clicking the link in the sidebar and clicking "Show table of contents" (see diff). @Urszag: Yes, we cannot control how the TOC is generated to my knowledge. Ioaxxere (talk) 02:57, 18 September 2024 (UTC)Reply
Thanks @Ioaxxere for addressing the issues. As for collapsing, I was talking about the default state for logged-out desktop users. The default state of the MediaWiki TOC is uncollapsed; the default state of {{minitoc}} is collapsed. What is the rationale for the inconsistency?
In any event, I Support wider use of this template. This, that and the other (talk) 04:57, 18 September 2024 (UTC)Reply
@This, that and the other: We could try that. I think we would have to change something in MediaWiki:Gadget-defaultVisibilityToggles.js or MediaWiki:Gadget-VisibilityToggles.js? Not sure (@Erutuon?). Ioaxxere (talk) 19:56, 18 September 2024 (UTC)Reply

ISO dab mislabeled; unicase Latin letters called "symbols"

If we enter 'ISO' in the {lb} tag, it's converted to the generic "international standards". However, ISO frequently contrasts with other international standards. For example, at , the ISO definition contrasts with the IAST definition. Both are international standards, but currently one is labeled "IAST" while the other is labeled just "international standards". Could we restore the ISO tag to its actual meaning?

Also, at that same article, the letters of IAST and ISO transliteration fall under the heading "letter", while the letters of NAPA and UPA transcription fall under the heading "symbol". They're all alphabetic letters; the only difference is that NAPA and UPA are unicase, while ISO and IAST are, sometimes, cased. ISO and IAST are also often unicase, because they transliterate unicase scripts, but there is an allowance for capital forms. That is, however, the only effective difference, and we don't call the letters of unicase scripts like Georgian "symbols". Some languages are written only in the IPA, NAPA etc. alphabet, and it's weird to say they're written in "symbols". Shouldn't we just have two "letter" headers, one with casing and one without?

To be clear, IMO actual alphabetic symbols -- e.g. those like e and pi used in math, chemistry and the like -- should remain under the "symbol" heading, because they're not used as letters of an alphabetic script. kwami (talk) 09:21, 17 September 2024 (UTC)Reply

split up WT:RFM

This page is over 1MB in length, which is too big. I propose a three-way split:

  1. Requests for language splits/mergers/additions go to WT:RFLM = Wiktionary:Requests for language moves, mergers and splits.
  2. Requests not related to individual terms (i.e. non-mainspace, non-Reconstruction and non-Appendix-only-language pages such as categories, modules, templates, etc.) go to WT:RFMO = Requests for moves, mergers and splits/Others (compare WT:RFDO).
  3. Requests related to individual terms remain on WT:RFM.

Benwing2 (talk) 04:02, 18 September 2024 (UTC)Reply

Support 2 and 3. I support 1 in principle too; I feel the name is a little wordy, and I would like something like WT:Lect workshop, but since it's a request page, not a discussion room, this might not be so great. Maybe WT:Requests to rename, merge or split languages (you don't "move" a language) or simply WT:Language treatment requests (a la WT:LT). In any event I don't want to hold this effort up with petty disagreements; it's exactly these language discussions that make the RFM page so large, so they are the priority for moving off the page.
While we're solving this issue, we should also deal with the issue that there is no natural place to archive language-oriented RFM discussions, yet archiving them for posterity is especially important. The current situation is that they are archived across WT:Language treatment/Discussions and Wiktionary talk:Language treatment/Discussions, but this is clearly unsustainable. I propose to resolve this issue by archiving completed discussions to yearly archive subpages of the new venue, such as WT:____/Archive/2024. We can't use yearly discussion subpages in the same manner as BP, because the request discussions need to be closed, handled and archived; it won't do if they silently disappear into oblivion at the end of each year. This, that and the other (talk) 05:10, 18 September 2024 (UTC)Reply
I can agree with not using "move" for the L2 discussion page. Vininn126 (talk) 20:21, 18 September 2024 (UTC)Reply
@Benwing2: I don't really care too much but I oppose doing this on the basis of page size alone, because that can easily be resolved by just archiving old discussions (why are there still conversations from 2015?). Also keep in mind that Wiktionary:Request pages will get really crowded with an additional column. Ioaxxere (talk) 19:53, 18 September 2024 (UTC)Reply
Some conversations from 2015 are still unresolved. I don't think we should archive unresolved discussions even if they're 10 years old. Benwing2 (talk) 19:56, 18 September 2024 (UTC)Reply

Reddit as a Source for WT:CFI

I've largely avoided bringing this up since I frankly didn't have the energy, but I've seen one too many entries now. I'd like to start a discussion about whether or not we should include Reddit as a source for WT:CFI. Right now, it's been included de facto mainly because CFI has essentially stopped being enforced for online quotes (the main people who did are either inactive or have been overwhelmed or don't care anymore), but looking at entries like j*et (etym 2, sense 2), I'm very averse to having any entry that's solely based upon Reddit quotes. Reddit is much more anonymous than, for example, Twitter, and there's little to actually verify if each account is an individual person. The content is also not durable. 3 Reddit users is simply not enough for us to be including a word; it starts to harm our credibility and comes off as honestly unserious, becoming the likes of Urban Dictionary with some of the recent terms.

The last time this was brought up was at Wiktionary:Beer parlour/2022/September § Reddit, where the vote was 9-9, which meant that it shouldn't have been included by default, but again enforcement hasn't really been a thing. As such, I want to open up the discussion again. AG202 (talk) 05:22, 18 September 2024 (UTC)Reply

That being said, I would be okay with Reddit quotes showing usage, providing that there's ample evidence that it's not a complete nonce word (ex: references, much more than 3 quotes, etc.). AG202 (talk) 05:26, 18 September 2024 (UTC)Reply
I'd be OK with preventing Reddit from being an exclusive source of quotes. But I don't like the idea of just outright banning it. I added a slang term at one point (slang senses of dead) which was pretty well unciteable apart from Reddit and Twitter (and very difficult to find cites for on Twitter). Yet I hear and see this sense all the time in real life (texting, conversations) among my peers. A lot of slang just slips through the cracks if you're not able to use internet sources, so until there's an alternative, I'd prefer that we be generous in what we include. Andrew Sheedy (talk) 05:35, 18 September 2024 (UTC)Reply
Well yes, I'm not saying that we should ban it. I will say though that that particular example has been around for quite some time (I've used it since at least 2017 after checking my texts), and shouldn't be hard to find at all in other media, let alone Twitter. It's mentioned in this 2021 CNN article, this 2023 USA Today article, this 2022 The Atlantic article, and used in this CharlotteWeekly article, for example. There's even a cite in the OED (no paywall) for this term. There's definitely no need for Reddit here, and I'm more so surprised that we didn't have it before considering how widespread it is, though maybe that's a sign. AG202 (talk) 05:52, 18 September 2024 (UTC)Reply
@AG202: I don't think Reddit should count for CFI in general. But CFI states that "a term should be included if it's likely that someone would run across it and want to know what it means", and this is clearly met for jeet. But if the issue is Reddit specifically, then we could find quotes from other websites as well. Generally I try to add a mix of Reddit and Twitter citations since both sites are easily searchable and archivable. Ioaxxere (talk) 19:46, 18 September 2024 (UTC)Reply

Reddit is an important tool for attesting fandom and video game slang. For all its flaws, Reddit seems to be the last mature, widely used, and openly accessible social-media platform. There's a decent built-in search function on the site that can be used to find quotes. Plus Reddit is indexed by Google and will likely remain so in the near term. The fact that Reddit is generally used for casual discussion also makes it an ideal source for quotes. I had a lot of success finding short illustrative quotes on Reddit. That's just not as easy to do on Tumblr. Tumblr's search functionality is basically non-existent, and the site's text content tends toward cryptic in-jokes and long essays. Reddit also potentially has some safeguards against linkrot. AFAIK you can't change account or subreddit names. Comments are also preserved after account deletion unless manually deleted beforehand. Whereas on Twitter and Tumblr, links can break with account name changes, as well as with account deletion or suspension. More broadly, I'd argue that Reddit is a modern-day Usenet. Wiktionary needs to keep pace with the evolving landscape of social media to document new and emerging language. The vast majority of fandom slang never makes it into print publications. I found it wasn't uncommon for fandom slang with a decade-plus history on Reddit to have zero hits on Google Books, Google Scholar, Issuu, etc. Sometimes the search needs to start where language is actually being used. WordyAndNerdy (talk) 08:01, 18 September 2024 (UTC)Reply

I agree with you all, but I also agree with the status quo for the reasons given: no particular formulation has found or will find agreement, precisely because we have already come close enough to including what we should include without being gameable by three Reddit usernames. The inclusion criteria have always been about what usage exists, not about which three quotes are shown on the front. Sometimes you don’t take a quote even from a book, if it is the heading of a table, or the context is too confusing or extensive for anyone to read the quote, or the isolated sentence is factually or politically misleading. When a word has three Reddit quotes, or three Twitter quotes, or three Telegram quotes, or whatever it is, just for example, after which the editor called it a day, it is understood that one site is not enough, just as a word from the universe of one video game is not enough. I don’t find it more ambiguous than is apt. Fay Freak (talk) 08:58, 18 September 2024 (UTC)Reply
I think Reddit should be a less broad analog to the "clear widespread use" clause. The main problem with Reddit is independence: as I said before when we first discussed using social media, you sometimes have the equivalent of a group of friends who hang out together in school and develop their own in-group vocabulary. The idea is to find a way to filter out the in-group stuff for narrow groups and look for the terms that people use not because it's part of participating in the specific group but because that's the way they talk online in general. I'm not sure exactly how to implement it, but that's the idea. Chuck Entz (talk) 13:58, 18 September 2024 (UTC)Reply
Exactly the above. Also @WordyAndNerdy: I get what you're saying and I do think Reddit is useful; I just think that we need more stringent criteria. 3 usages is not enough. Also, I would be against deleted accounts being counted for CFI (unless we're able to find the original account and cite that), as that completely goes against our criteria for independent usages from different people. AG202 (talk) 16:44, 18 September 2024 (UTC)Reply
These concerns have come up in discussions about Usenet. The "independent" clause in CFI applies to authors. It doesn't apply to publishers. Attesting an entry with cites from a single newsgroup or subreddit would be like attesting an entry with only New York Times articles. Sometimes terms are restricted to a specific community. Warhammer slang is easier to find on /r/warhammer40k than on cute animal subreddits. I usually gathered "extra" cites as a personal practice. But one needn't go out of their way as I did. Only derogatory terms and limited documentation languages are held to different standards under CFI. Three cites should be enough for attesting English slang off Reddit if it's enough to attest marketing buzzwords.
CFI's one-year requirement is effective at filtering out most nonsense. It's more difficult than one might expect to make fetch happen. Friend groups often lack the dedication and/or social cohesion necessary to push in-jokes/protologisms into wider use. This is not a problem encountered with any regularity in the wild. Bored kids will try dropping their coinages directly onto Wiktionary and move on when they're rejected (perhaps after a bit of feet-stomping). Few if any will go to the trouble of creating three Reddit sockpuppets and letting their comments age a year. And if anyone actually wants to play that kind of 4D chess it would be entirely possible for them to game protologisms onto Wiktionary with two friends and three letters to local newspapers.
As for deleted accounts, this seems no different from including quotes from anonymous authors, articles without bylines, etc. There can never be 100% forensic certainty that any three quotes are from three different people. However, it's often possible to infer authorship from style, timestamps, discussion flow, etc. on Reddit. "Deleted_account" replying to "ILoveGarfield82" multiple times in a single chain can reasonably be assumed to be the same user. There won't be many instances in which the only quotes available are from a deleted account anyway. Reddit's karma system disincentivizes account deletion and anything that meets the one-year provision of CFI is almost guaranteed to have been used by more than one person. WordyAndNerdy (talk) 20:24, 18 September 2024 (UTC)Reply
Agree (although mayyybe there should be a way to ensure that there are three people using the term.) CitationsFreak (talk) 03:10, 19 September 2024 (UTC)Reply

Remaking on drugs

Hello,

An audio for "on drugs" is on my Lingua Libre, and I'm aware this entry used to exist but was deleted. This term is defined in a few other online dictionaries such as Merriam-Webster and Oxford Languages. If I can find quotations of this prepositional phrase being used, then is it ok to put it back online?

Thank you. "Don't talk to me." 💔 Flame, not lame (talk) 23:58, 18 September 2024 (UTC)Reply

It was not deleted for lack of quotations, it was deleted because it’s sum of parts. If you know of an idiomatic sense, you can reopen the deletion discussion. MuDavid 栘𩿠 (talk) 01:18, 19 September 2024 (UTC)Reply
No thank you. "Don't talk to me." 💔 Flame, not lame (talk) 02:22, 19 September 2024 (UTC)Reply

Classical Nahuatl entries and vowel length

I am not fully sure how this works in mainspace since I have tended to mostly work on reconstructions so far, but I've noticed an issue with Classical Nahuatl entries in that their page names do not possess the diacritics to note vowel length, and templates such as {{m}}, {{l}} and {{cog}} do not detect any long vowel when used.

Given that this present situation causes issues, such as when noting that a term uses -tlan vs -tlān, can the page names, modules and templates for Classical Nahuatl be updated to better handle entries with long vowels? Antiquistik (talk) 07:22, 19 September 2024 (UTC)Reply
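
To illustrate the kind of handling being asked about, here is a minimal sketch, assuming the display form of a term keeps its macrons while the page name drops them. It is written in Python purely for illustration; it is not Wiktionary's actual module code, and the function name `strip_macrons` is hypothetical.

```python
import unicodedata

# U+0304 COMBINING MACRON, the mark that distinguishes ā from a after decomposition.
MACRON = "\u0304"

def strip_macrons(display_form: str) -> str:
    """Return a macron-free page name for a form marked for vowel length.

    Hypothetical helper: decompose to NFD so each long vowel becomes
    base letter + combining macron, drop the macron, then recompose.
    """
    decomposed = unicodedata.normalize("NFD", display_form)
    without_macrons = decomposed.replace(MACRON, "")
    return unicodedata.normalize("NFC", without_macrons)

if __name__ == "__main__":
    # Both suffixes resolve to the same page name, while the
    # display forms stay distinct.
    for form in ["-tlan", "-tlān"]:
        print(form, "->", strip_macrons(form))
```

Under this assumption, -tlan and -tlān would share one page name while a link could still display the long vowel, which is the distinction the question above is concerned with.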