The primary requirement in supporting dictionary entries is to verify usage. This is significantly different from verifying the contents of an encyclopedia article. For example, if a random blog makes a statement such as "the average American has 3.8 cousins and 7.4 nephews and nieces," there is no particular reason to believe that the average American actually has 3.8 cousins. However, there is very good reason to believe that the words "American," "cousin," and "average" are in use.
In this regard, the appearance of a term is much more trustworthy than a purported definition. I someone asserts "It is incorrect to use the word podium to refer to a lectern," there is no particular reason to believe this, regardless of whether the statement occurs on the internet or in print. As we are documenting usage, usage trumps opinion. In this particular case, it is not hard to find examples where podium is clearly used to mean a lectern or similar item.
In general, it should be assumed that people use language in order to convey meaning, and not to influence the editors of dictionaries. There are clear exceptions to this, in which a particular party tries to promote a favorite word or usage, often of that party's own invention. Such exceptions are generally easy to identify and in any case, they are the exception.
Part of the confusion over verifiability arises over the need to prove several different things. Here is an attempt to break this down. I will occasionally refer to currently controversial entries because the issues are close at hand. Naturally, I have an opinion about such terms, but my focus here is to highlight issues, not to push for any particualar term (I do this elsewhere :-).
First, we may wish to know whether a term is in use at all. This is where the infamous "This gets xxxx google hits" is appropriate. "This gets 1 million google hits." is an easily verifiable statement, and past some arbitrary limit, should be enough to establish that a term is worth studying. Exactly where that limit should be is open to debate and compromise. I would tend to put it in the range of 1,000 or 10,000. Others would prefer more like 1,000,000, and I can understand such a desire. By either measure, terms like webinar are clearly worth an entry, while a term like choad may or may not be, and one like bot herd would need separate justification.
However, mere google hits do not establish meaning, nor do they speak to idiomaticity ("it is" gets billions of hits). I believe that establishing meaning breaks down to two cases:
This second point, as far as I can tell, is the source of much of the disagreement over verifiability. In these cases, we are simply trying to establish that someone actually used a term to convey meaning in a given language. We don't care if this person is making a factually true statement, we don't greatly care if the person is anonymous or pseudonymous, except in rare cases, and we don't care whether the term in question is slang, vulgarity, recently coined or considerd distasteful for whatever reason.
There are legitimate reasons to prefer stable sources, including printed sources, in such cases. However, if I find a quotation on google groups such as "According to a most conservative estimate, if one pandava jubok will have draupodi-choda 3 times a week, 3x5 = 15 chodas/week = two chodas daily for the poor girl continuously for 14 years, RAIN or SHINE." (see ), it's very clearly verifiable that someone used the word "choda" here. It's even reasonably clear from context that in this case the word means "sex act". Would I prefer to see a well-proofread printed source using the term? Of course. I would also love to see feedback from native speakers. But none of this keeps a quotation such as this from being usable evidence. It shows the term in use, and thanks to Google's archiving, it can be expected to remain verifiable over time. And there are plenty more like it.
This example is particularly instructive because a search of relevant print sources failed to turn up the term. Restricting verifiability to print sources would cut out precisely the kind of terms that wiktionary is best suited to documenting. N.B., by this I don't mean strictly vulgarity, slang, netspeak and such, though those will likely be overrepresented. I mean anything which is not likely to have made it through the usual channels. It would also include developing senses of existing terms. -dmh 20:30, 21 September 2005 (UTC)
The following information has failed Wiktionary's deletion process.
It should not be re-entered without careful consideration.
This is a policy think tank that was pulled off of Wikipedia, and after an unsuccessful attempt at adapting it to Wiktionary, sat around for four years. This page is pointless, given that all of Wiktionary is original research, and verifiability is not a requirement at all, short of the attestation requirement than a word exists. --Yair rand 23:28, 26 April 2010 (UTC)