William Beutler on Wikipedia

Posts Tagged ‘Verifiability’

Verifiability and Truth: What John Siracusa Doesn’t Get About Wikipedia

Posted on February 2, 2012 at 6:50 pm

One of my favorite podcasts is Hypercritical, co-hosted by and principally featuring the thoughtful criticisms of John Siracusa, a sometime columnist for Ars Technica and Internet-famous Apple pundit. The show’s tagline calls it: “A weekly talk show ruminating on exactly what is wrong in the world of Apple and related technologies and businesses. Nothing is so perfect that it can’t be complained about.” Last week’s edition—“Marked for Deletion”—was about something far from perfect, but of great interest to this blog: Wikipedia.

If you want to listen for yourself, jump to about 1:11:55 (yes, more than an hour into the show) where Siracusa and co-host Dan Benjamin turn the discussion to Wikipedia. And a warning: this is going to be long. Consider it homage.

♦     ♦     ♦

Promisingly, Siracusa begins by asking his co-host to answer, if he can, “what Wikipedia is”. The answer is pretty good for an outsider: it’s a place for sharing information and collaboratively building a resource for (hopefully) accurate information on almost any topic. In general, this will do. But it’s not quite right, as Siracusa explains by recounting his personal experience of trying, in vain, to defend an article from deletion. With five years to reflect on it, Siracusa describes his efforts as a “prototypical example of someone who does not understand what Wikipedia is, proving that he does not understand what Wikipedia is.”

All of this is a way of getting to Siracusa’s fascination—one might say morbid fascination—with Wikipedia’s policy of “Verifiability”. The first paragraph of the policy says:

Verifiability on Wikipedia is the ability to cite reliable sources that directly support the information in an article. All information in Wikipedia must be verifiable, but because other policies and guidelines also influence content, verifiability does not guarantee inclusion. The threshold for inclusion in Wikipedia is verifiability, not truth—whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think unsourced material is true.

Or as Siracusa summarizes it: “Something can be as true as you want it to be, if it is not verifiable, it doesn’t go in.” Well said.

He also discusses the related policy of “No original research”. This includes a good explication of the different types of sources that may or may not be used on Wikipedia: primary sources (original documents and first-hand accounts), secondary sources (news articles interpreting primary sources) and tertiary sources (encyclopedias and academic articles summarizing the former). This is advanced stuff, and for a longtime Wikipedian, it’s no small thrill to hear a smart outsider explain why secondary sources are preferred, and work through the fundamental policies of Wikipedia. Siracusa correctly observes: “Wikipedia is not a place where you write down stuff that you know. … Wikipedia writes about other people writing about things.”

Except here’s the thing: Siracusa understands Wikipedia’s core content policies. He just doesn’t like them.

In his particular example, a former standalone article called FTFF (here’s what it used to look like) didn’t survive the process not because it wasn’t true, but (he says) because it contained material that wasn’t verifiable, and constituted original research. This is partly true, but it owes more to a guideline that got only passing mention on the show (and, frankly, in the deletion debate): “Notability”, and specifically the “General notability guideline”. It’s closely tied in with WP:VERIFY and WP:ORIGINAL, and basically says that a topic must have sufficient coverage in secondary sources to be given its own standalone page. FTFF did not, and the result of the debate was to merge the topic to Finder_(software)#Criticism.

Anyway, this pedantry about WP:NOTE and WP:GNG doesn’t affect Siracusa’s main point: If something is true but unverifiable, he would like to see it included in Wikipedia anyway. Nor does it affect his corollary argument, that Wikipedia’s complex rules discourage many would-be participants.

He’s undoubtedly right about the second point: many people try to get involved with Wikipedia who have no idea what it’s really about, and they tend to have a really bad experience. Wikipedia struggles to explain itself to outsiders, and it probably always will.

As to the former, the problem is that he fails to grapple with the implications of the Wikipedia he describes, and this is disappointing. By privileging “truth” above “verifiability”, one gets the impression he’s describing a Rashomon-like Wikipedia where all possible viewpoints are explored, and somehow eventually Wikipedia just makes the right call. This assumes a lot, not least that contentious topics wouldn’t simply devolve into edit wars of unchecked aggression. In a world where Wikipedia aims for truth but eschews verifiability, there are no footholds upon which to steady an argument. There is no way to know what should be considered credible or otherwise.

At times it actually sounds like he’s advocating something that already exists: reliance on “Consensus” for determining how Wikipedia will address the topics it covers. Wikipedia policies and guidelines don’t cover everything, and this is where consensus steps in, however imperfectly. If you’ve ever wondered why there is sometimes an observable discrepancy in the depth or quality of coverage between topics, consensus is the big reason why, and more so the self-selection that shapes consensus. The current, real-world Wikipedia refers to outside authorities as well as consensus among editors; Siracusa’s Bizarro World Wikipedia would jettison the former and rely solely on the latter.

Meanwhile, Siracusa ascribes Wikipedia’s Byzantine rule structure to Wikipedians’ desire for approval from educators and academics, which he thinks is holding back Wikipedia from what it could be. He repeatedly says “Wikipedia should be something different” and refers to “what’s different about online” but he never gets prescriptive and never actually says why the old methods are outmoded. He does say his Wikipedia would seek to “arrive at truth using every tool necessary” and would, for example, allow original research… but what then is the mechanism for (dare I say) verifying it?

At one point, Siracusa compares the popular, widely-viewed Ars Technica forums to a hypothetical low-circulation print magazine, and complains that the widely-read former site is an invalid source while the unpopular latter publication is acceptable. It’s true that Wikipedia does not necessarily take a populist approach to evaluating sources, but he’s far off the mark in his attempt to explain this: “They’re not cool with the old librarians, because they’re not paper.”

I hope that he was just being lazy and doesn’t actually think that Wikipedia editors prefer paper (if anything they actually prefer online sources, which are easier to check) but he completely misses a key dynamic that ties back to verifiability: the paper magazine with poor circulation at least will have editors who are presumed to care about fact-checking and accuracy. A web forum, however popular it may be, may have moderators, but that’s not the same thing as having an editor. A discussion group is not an editorial operation, period. The forum is a primary source, and so should only be used to support reliable sources.

There are, however, reliable web sources. One of them is the editorial side of Ars Technica; no less an authority than John Siracusa has been cited in approximately 150 different Wikipedia articles about the Macintosh and other technology subjects.

♦     ♦     ♦

I’m sorry to say this, but in the show’s last fifteen minutes, Siracusa pretty much descends into total incoherence. Here’s his summary statement, close to verbatim:

[There are] many flaws in verifiability and reliability of sources. It’s built on a foundation of sand. Notability, what’s a reliable source, those things become so key to making Wikipedia crappy or good, and those sands are constantly always shifting, you know? And so if Wikipedia was centered on truth and that was its final goal, yeah, it would have to include citations and verifiability and stuff like that, but there would never be any argument when the two are in conflict. You know, if you could prove that a series of events happened here, then you could say, well, it’s verifiable, it appeared in a reliable source, but it’s not the truth. And so therefore we should expunge that. Because the final goal of Wikipedia is truth. But the final goal of Wikipedia is not truth, it’s verifiability.

There would “never be any argument” about what is the truth? In the parlance of Wikipedia: [citation needed].

Look, this is an epistemological issue, one much larger than just Wikipedia. The reason Wikipedia’s goal is verifiability, not truth, is that verifiability is an achievable goal. In fact, verifiability is a necessary step toward establishing truth, as Siracusa at this point seems to acknowledge in his imagined alternate, truth-seeking Wikipedia.

It’s not that Wikipedia is actively hostile to the truth: it’s just agnostic as to what it might be. Wikipedia articles are like road signs; truth itself may be unknowable, and we may never arrive at our destination, but Wikipedia can point in the right direction. Wikipedia’s policies and guidelines are designed to make sure that its content does that, although it’s fair to acknowledge that it’s not guaranteed. But what is? And what is truth?

Anyway, there’s a user essay on Wikipedia called “Verifiability, not truth” that says this better than I am going to. Here’s the key point:

That we have rules for the inclusion of material does not mean Wikipedians have no respect for truth and accuracy, just as a court’s reliance on rules of evidence does not mean the court does not respect truth. Wikipedia values accuracy, but it requires verifiability. Unlike some encyclopedias, Wikipedia does not try to impose “the truth” on its readers, and does not ask that they trust something just because they read it in Wikipedia. We empower our readers. We don’t ask for their blind trust.

If you want to upset the old system and do something new, you actually do need to think through what should replace it. Siracusa never does.

If he thinks Wikipedia’s adherence to “old world” rules is driving away contributors, he should consider what the free-for-all alternative would look like. It isn’t a Wikipedia I would spend any time with, it’s not one that Google would be eager to rank so highly, and it wouldn’t be the most important reference site on the Internet.


Is Quora the Next Wikipedia? Part III: It’s the Little Differences

Posted on March 4, 2011 at 9:39 am

In two previous posts, I have explored a comparison between Wikipedia and the upstart platform Quora, the first setting the stage for discussion, and the second explaining the (acknowledged) debt one owes the other. In this post, I will discuss how they differ in ways you’ve surely noticed—and ways you might not.

Writing a detailed explanation of how Wikipedia and Quora differ is a foolhardy assignment (and an even more foolish self-assignment). Because one is descended from the paper encyclopedia and the other comes from the Q&A genre, it’s hard to know where to begin. But we can make some observations:

The most significant difference between Quora and Wikipedia is a philosophical one: they simply do not share the same definition of “knowledge”. As you might imagine, this matters quite a bit and, in fact, Jimmy Wales’ best-known quote is arguably the following:

“Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge. That’s what we’re doing.”

That is certainly what a Wikipedian might say he or she is doing. Your average Quoran (if that’s the preferred nomenclature) might not immediately find reason to disagree. But given further investigation they may find Wikipedia to be something less than that. Perhaps the best summary of these competing viewpoints comes from the Seb Paquet essay at The Quora Review linked in my first post. In it, he writes:

Wikipedia reflects consensus reality, or tries very hard to do so. In this respect, you could say that Wikipedia is past-bound: it offers knowledge of what has been known. However, there’s another segment of the world’s knowledge that is hazy and tentative. It is emphatically not validated. It is contentious. It is controversial. It’s messy. You could call it pre-knowledge.

On Wikipedia, the most concise definition of what Wikipedia considers useful knowledge is encapsulated in the “General notability guideline”, which states:

If a topic has received significant coverage in reliable sources that are independent of the subject, it is presumed to satisfy the inclusion criteria for a stand-alone article or stand-alone list.

Quora has yet to develop anything quite so pithy, although its About page contains numerous statements which altogether produce a clear vision. As “notability” is the primary basis for inclusion at Wikipedia, “reusability” seems to play the same role at Quora:

“Each question page on Quora is a reusable resource that should help everyone who has the question that the page is about. … There is only one version of each distinct question on the site, so everyone who is interested in or knows about that material is focused on that one place.”

We can leave aside a careful exploration of what constitutes “reusable”, in part because so has Quora: to date they have not placed too many limits on what readers can contribute, only on the format in which they may contribute it. Wikipedia, on the other hand, has already developed a lengthy list of things that it does not wish to do, helpfully titled “What Wikipedia is not”. Among these, Wikipedia is not a “publisher of original thought”, nor a “manual, guidebook” or “crystal ball”. Quora seems OK with all that.

One effect of Wikipedia’s “narrow” focus is that it serves as a handy guide for other websites (and their backers) to identify a niche that avoids competing directly with Wikipedia. While other electronic encyclopedias have fallen to Wikipedia, specialization has worked for other projects. A good example of how this works is Wikia, founded by none other than Jimbo Wales himself, which smartly capitalizes on “what Wikipedia is not” and finds opportunities on the other side; because Wikipedia policies imply a limited appetite and minimum standards for information about Star Wars, the Wikia-hosted Wookieepedia is there to take up the slack.

An example from outside the family might be the Internet Movie Database. Although IMDb’s original incarnation predates Wikipedia by more than 10 years, the point is that it has survived, and even thrived. For all kinds of information about motion pictures, IMDb is better because it wants more of that kind of information than Wikipedia does.

Quora too wants more information than Wikipedia, except it wants more of everything. In some respects this has its advantages; as Paquet goes on to say, Wikipedia is “past-bound” whereas Quora is “future-oriented”. I think that may be a little too rosy an assessment; one cannot overlook the possibility that Quora won’t necessarily be good at either. If you want to be everything to everybody, pretty soon you’ll be nothing to nobody. But I do think Quora recognizes this, and is watching to see how things develop, and will probably introduce more restrictions as time goes on.

And that brings us to another key difference: the organizations behind the websites and their relationship to users. I’ll get to those in the fourth (and final?) installment of this series. Look for that next week.

Why not follow me on Quora? Indeed, why not.


Watch Out, Laszlo Panaflex!

Posted on June 22, 2009 at 10:42 pm

In a 1996 episode of The Simpsons, washed-up movie star Troy McClure (you may remember him from such self-help videos as “Smoke Yourself Thin!” and “Get Confident, Stupid!”) enters a sham marriage with Aunt Selma to squash rumors about his sordid personal life and regain his former screen glory. As he is “romancing” Selma along a Simpsonized version of the Hollywood Walk of Fame, McClure declares:

One day, my lady Selma’s gonna have a star right next to mine, so watch out [camera pans right] Laszlo Panaflex!

Like most throwaway Simpsons lines, it has faded from mainstream recognition — the episode’s imagined musical version of “Planet of the Apes” is surely better known — but lives on in offhand references made by those of us who have been watching long enough to remember the controversy over Bart Simpson and those “Underachiever and Proud Of It” T-shirts.

I thought of it again while watching Ghostbusters on TV last night, noticing that the cinematographer was László Kovács. Was Kovács the name Simpsons writers were riffing on? Following a well-established routine, I plugged his name (Panaflex’s, of course) into Google, hoping for but not really expecting a Wikipedia article to pop up.

It turns out Wikipedia did show up first — but it wasn’t an article. Instead, it was a user page for someone using the fictional lenser’s moniker as a handle. It reads in full:

Nice. But this also got me wondering: is this a loophole in Wikipedia policy? Isn’t this a way to get an encyclopedic page on the site even if it would be otherwise deleted by Wikipedia’s relentless arbiters of significance? After all, articles appearing on what Wikipedians call the “mainspace” of Wikipedia are expected to satisfy a handful of core guidelines lest they be removed or radically altered.

First there is the general notability guideline requiring the subject to meet a certain threshold of importance (often determined by news coverage). Articles failing the requirement are deleted, and relevant content is sometimes relocated to existing articles about the same topic. Laszlo Panaflex, as one joke in one episode, would never pass Wikipedia’s notability requirement because it would obviously belong on the page about the episode (and as of this writing, it is not even there). An example of a Simpsons reference that does meet this requirement is Homer Simpson’s ubiquitous “D’oh!”

Other guidelines a user page can sidestep, and this one does: Verifiability and Reliable sources. Sure, it helps to confirm my suspicion that Laszlo Panaflex is inspired by the real cinematographer (the one with the accented name, which discourages me from Ctrl-C/V-ing it again). It wouldn’t surprise me if the character was named for him, but the page certainly doesn’t offer a citation for the claim. I need more proof, and articles in the Wikipedia mainspace do, too.* User pages have no such requirement.

On the other hand, I think it passes NPOV with flying colors.

But is it a loophole to treat a user page like an article? After all, Laszlo Panaflex ranked right at the top of Google; other articles on semi-obscure subjects could as well. I don’t believe there is a policy, guideline or essay that specifically addresses this, though I fully acknowledge I may be wrong. In the case that I am not, the possibility exists for unworthy (or even “unworthy”) articles to be given a second home on user pages.

I can say for certain — alas, without being able to summon a link (I’ll look) — that there are a number of editors whose user pages are written to resemble a Wikipedia article. Is that wrong? I don’t think so. However, I do think it could make the Wikipedia community uncomfortable if it became a widespread practice, and was seen as a gray hat SEO technique.

In that unlikely event, the first suggestion that comes to me would be requiring a banner on user pages that specifies that it is not an “article”. It would be phrased like the banner I keep atop my own page, included as a disclaimer in case the page is swiped by an unscrupulous mirror site. After all, this non-accusatory template puts even a flawed but useful article about one Laszlo Panaflex in the proper context:

This is a Wikipedia user page.

This is not an encyclopedia article. If you find this page on any site other than Wikipedia, you are viewing a mirror site. Be aware that the page may be outdated and that the user this page belongs to may have no personal affiliation with any site other than Wikipedia itself. The original page is located at http://en.wikipedia.org/wiki/User:WWB.

Wikimedia Foundation

*It may be out there. Many other Simpsons-related Wikipedia articles, including “A Fish Called Selma”, are buttressed by citations to the commentary tracks on the official DVD releases. If anybody knows for sure, I’d be happy to help add the citation.


The Wikipedia Haters Club

Posted on June 9, 2009 at 8:42 am

Count as one member Examiner.com personal finance columnist Steve Juetten, who writes in a review comparing Microsoft’s newly launched search engine, Bing, with old standby Google:

Before I started the search, I set two rules. First, I was looking for information from reliable sources. As a result, if a search placed information from Wikipedia high on the list, the search engine sank in my review. As with information from any source (human, web or book), trust but verify and Wikipedia is not trustworthy when it comes to your money.

Anyone who spends much time around Wikipedia is pretty familiar with complaints such as these, and to this end the Wikipedia community maintains a page called Replies to common objections. Juetten isn’t quite specific enough for me to highlight a particular section, but I’m pretty sure he will find some answers in the replies to “Wikipedia can never be high quality”.

Meanwhile, a few objections to his objection do occur to me. For one thing, who is to say that other sources will be more trustworthy? Juetten undoubtedly singles out Wikipedia for its high profile, but it’s difficult to see why it should be placed at a disadvantage to About.com, Answers.com or NNDB, all of which can rank well for certain terms.*

Are these other information resources likely to be more reliable? I know of no reason why they should be. And if About.com or NNDB does happen to be wrong, there’s not a thing you can do about it.

Lastly, I agree with Juetten that “trust but verify” is a good personal rule and a sound approach to research, but I don’t understand why he doesn’t extend it to Wikipedia when this is an area in which Wikipedia often shines. One of the site’s core content policies is in fact Verifiability, that articles need references. But Juetten’s objection becomes even more ironic when you consider that said references are required to meet another core policy: Reliable sources.

Juetten’s worldly cynicism is understandable but, in this case, selectively applied and ultimately misplaced. It is true that Wikipedia is not completely reliable, but it shouldn’t be penalized for being one of the few reference websites that actually admits the fact.

_____
*For example, try searching for Alan Greenspan on Google and Alan Greenspan on Bing. As of this morning, the top three results for each are: Wikipedia, Answers.com and NNDB.


The Fix is In

Posted on March 6, 2009 at 8:44 am

Today’s Featured article on the English Wikipedia covers an interesting subject, and one that is recently relevant as well:

[Image: the Saxbe fix as Today’s Featured article]

As you may remember, the fix was necessary for Senator Hillary Clinton to become Secretary of State Hillary Clinton, and this is well-covered in the section titled “21st century.” But here’s my favorite part:

These pay raises were by executive order in accordance with cost of living adjustment statutes, as noted by legal scholar Eugene Volokh on his blog, The Volokh Conspiracy.[54] Before the January 2009 pay increases, secretaries made $191,300 and senators and congressmen earned only $169,300.[59]

If you know anything about the Verifiability policy, one of the things you probably know is that blogs are nearly always disallowed as a “self-published source.” But the usage of Volokh’s writing on his widely-celebrated group blog falls well within the scope of this policy:

Self-published material may, in some circumstances, be acceptable when produced by an established expert on the topic of the article whose work in the relevant field has previously been published by reliable third-party publications.

Check, and check. As a longtime fan if intermittent reader of The Volokh Conspiracy, I think Eugene Volokh’s admittance as a source on this rigorously-evaluated article — and not just once but in fact five times — is pretty cool.
