William Beutler on Wikipedia

Archive for the ‘Biographies of Living Persons’ Category

The Top 10 Wikipedia Stories of 2012 (Part 1)

Posted on December 28, 2012 at 12:18 pm

In these waning days of 2012, let’s take this opportunity—for a third year in a row—to look back and come up with a list of the most important Wikipedia news and events in the last 12 months. Like our first installment in 2010 and our follow-up in 2011, the list will be arbitrary but hopefully also entertaining. There is no methodology to be found here, just my own opinion based on watching Wikipedia, its sister projects and parent organization, and also thumbing through the Wikipedia Signpost, Google News and other news sites this past week. So what are we waiting for?

Wait, wait, one more thing: this post ended up being much longer than I expected, and so I’ve decided to split this in two. Today we publish the first five items in the list, 10-6. On Monday 12/31 we’ll publish the final five. Enjoy!

♦     ♦     ♦

10. Wikipedia bans a prominent contributor — Let’s start with something that did not make the news outside of the Wikipedia / Wikimedia community at all, but which took up a great deal of oxygen within it. It’s the story of a prominent editor and administrator who goes by the handle Fæ. In April of this year, he was elected to lead a new organization within the community based on his leadership of the UK chapter. The move was not without controversy: Fæ’s actions both on Wikipedia and the sister site Wikimedia Commons (best known as a vast image repository) and interactions with editors became the subject of intense scrutiny, and even an ArbCom case (the Arbitration Committee is sort of like Wikipedia’s Supreme Court). Fæ ended up resigning his adminship—he basically jumped to avoid being pushed—and the end result had him banned from editing Wikipedia, which he still is. Not that he’s gone away—he’s still a contributor to Commons, and a very active one.

This might sound like a lot of insider nonsense, and I’m not about to dissuade you from this viewpoint. (Sayre’s law applies in spades.) But the key issue involved is about governance: is the Wikimedia community’s organizational structure and personnel capable of the kind of leadership necessary to maintain and build on this important project? The Fæ incident (along with other incidents in this list) suggests the answer may be no.

9. Confusing software development — Not all of Wikipedia’s contributors are focused on editing articles. Some are also developers, working on the open source software to keep Wikimedia sites running and, perhaps, improving. Some (but not all) are paid staff and contractors, and the hybrid part-volunteer, part-professional organizational structure can make it difficult to get projects off the ground.

One longtime project that has yet to see wide implementation is a “visual editor” for Wikipedia articles, to make editing much easier for users. Everyone knows that the editing interface for Wikipedia articles feels like software programming, and almost surely turns away some potential contributors (though it’s not the main reason people don’t contribute, as a 2011 Wikimedia survey showed). But the visual editor is a bigger technical challenge than one might think (as recently explained by The Next Web), and the outcome of a current trial run (also not the first) is anyone’s guess.

Another project, announced with a great deal of hype but which no one really seems to understand, is Wikidata. It calls itself a “common data repository”, which by itself sounds fairly reasonable, but no one really knows how it will work in practice, even those now developing it. Wikidata could be a terrifically innovative invention and the very future of Wikimedia… but first we need to find out what it does.

Other projects have been released, but have received thoughtful criticism for adding little value while diverting resources from more worthy projects. For example, a feature briefly existed asking you to choose whether a smiley face or frowny face best represented your Wikipedia experience. Uh, OK? Some projects have been better-received: the Wikipedia iPhone app, for example, is a definite improvement over the mobile site. But there are some odd decisions here, as well: does Wikipedia really need an app for the failed BlackBerry PlayBook?

8. Sum of human knowledge gets more human knowledge — If you’ve ever seen a [citation needed] tag on Wikipedia—and I know you have—then you know that, well, citations are needed. And while citations do actually kind of grow on trees (if by “trees” we mean “the Internet”) there is a lot of information out there which isn’t readily searchable on Google, and sometimes that information costs money. This year, some of those paid services cracked the door open just a bit.

The interesting story to the HighBeam Research partnership is that there really isn’t one. First of all, HighBeam is a news database which charges for reader access to its vast collection of articles. But in March, a volunteer Wikipedia editor who goes by the name Ocaasi reached out to HighBeam and asked if they would be willing to grant free access to Wikipedia editors. They said yes—and supplied one-year, renewable accounts to editors with at least one year’s experience and 1,000 edits. For Wikipedia, it meant greater access to information. For HighBeam, it meant a 600% increase in links to the site in the first few months of the project. Seems like a fair trade.

More recently, the Wikimedia Foundation announced an agreement with the academic paper storehouse JSTOR, making one-year accounts available to 100 of the most-active Wikipedia editors. With almost 240 editors petitioning for access, if you haven’t spoken up yet, chances are you’re a bit too late.

7. The first person to 1 million edits — OK, how about a fun one? In April, a Wikipedia editor named Justin Knapp, who uses the handle Koavf, became the first person to make 1 million edits to Wikipedia. To the surprise of everyone, perhaps none more than Knapp himself, this made him an overnight international celebrity of the Warhol variety. Jimmy Wales even declared April 20 “Justin Knapp Day” on Wikipedia.

It’s worth pointing out that most editors with many, many edits to their name typically are involved in janitorial-style editing activities, such as fighting vandals or re-organizing categories. And many very active editors spend a lot of time squabbling with others on the so-called “drama boards” such as Administrators’ noticeboard/Incidents. Not Knapp: his edits over time have overwhelmingly focused on creating new articles, plus researching and improving content in existing ones. In short: Wikipedia doesn’t need more editors—it needs more Justin Knapps.

Also, this is one I actually played a small role in, as verified by Knapp’s own timeline of events. I’d happened to see someone note the fact on Jimmy Wales’ Talk page that day, which I tweeted, and which was then picked up by Gawker’s Adrian Chen, and the rest is history. Actually, then Knapp kept right on editing Wikipedia. As of this writing, he’s closing in on 1.25 million edits.

6. Philip Roth’s Complaint — Wikipedia has been extraordinarily sensitive to complaints by living people who are the subject of articles ever since a 2005 incident where a veteran newspaper editor found his article maliciously vandalized to implicate him in the murder of the brothers Kennedy.

In what was arguably the biggest row since then, in September 2012 the celebrated, prickly author of Portnoy’s Complaint, American Pastoral and numerous other novels took to the pages of The New Yorker to issue “An Open Letter to Wikipedia” complaining that the site had the inspiration for his 2000 novel The Human Stain all wrong. And this wasn’t his first resort: Roth’s first attempt had been to authorize his biographer to change the article directly, which was rebuffed. His consternation here: not inexplicable.

But Roth’s complaint was not really with Wikipedia. Several book reviewers had speculated (apparently incorrectly) about the real-life basis for the novel’s central figure, and it was these speculations which had been introduced to Wikipedia. Roth’s publicity campaign brought the issue to much wider attention, which got his personal explanation of the novel’s inspiration into Wikipedia. However, in a twist on the Streisand effect, the controversy is now the subject of a longish and somewhat peevish section written by editors perhaps irked by Roth’s campaign. So he got what he wanted, plus more that he didn’t. Shall we call it the Roth effect?

♦     ♦     ♦

Look here on Monday for the thrilling conclusion to The Top 10 Wikipedia Stories of 2012!

Public Lives: Jim Hawkins and Wikipedia’s Privacy Dilemma

Posted on April 6, 2012 at 9:15 am

Editor’s note: The author of this blog post is Rhiannon Ruff (User:Grisette), a friend and colleague, in what I hope is a continuing series. The Wikipedian published a previous guest blog post in December 2011.

Introduction to Jim Hawkins Wikipedia article.

As an occasional Wikipedian, I like to check out Jimmy Wales’ user Talk page every now and again; while user Talk pages are generally where editors leave messages for each other, notes of support, or even warnings, Jimbo Wales’ page is a hot-bed of intrigue, gossip and debate. It’s Wikipedia’s water cooler. And it’s the perfect place to go if you’re looking to find an example of the confusion that can result from the occasional collision of hot-headed editors, complex guidelines and individuals who are themselves the subjects of articles. Just today I came across a discussion that mentioned Jim Hawkins, a radio-presenter in the UK who has been struggling to deal with Wikipedia editors, and Jimmy himself, over privacy issues raised by his biographical article.

Contrary to what many people believe, the Wikipedia community and Wikimedia Foundation are very keen to protect individuals’ privacy. There’s a common misunderstanding that if you edit Wikipedia, anyone can find out who you are—an idea proliferated by media coverage of incidents where editors’ IP addresses were traced and companies outed for editing their own articles (or, worse, those of competitors). But there’s actually a simple solution: creating an account on the site hides your IP address when you edit. And as long as you only edit while logged into that account, there’s no way for anyone to find out who or where you are through your IP. There are also very strong rules against “outing” the real life identities of editors by posting their personal information on the site.

But what if you’re the subject of a Wikipedia article? Getting back to Jim Hawkins, here’s the real dilemma that people in the public eye are faced with: anyone can create an article about them, but how do they go about preventing their personal details from being included in it? Hawkins certainly wasn’t happy about the creation of an article about him, and he was even less impressed that it included details such as the county where he lives and his exact birthdate. He’s been trying to get the article deleted for five years now. Over time, his frustration in dealing with the Wikipedia community has led to increasing antagonism on both sides.

After a recent “edit war” where his birthdate was repeatedly added and removed, the date was removed once and for all after an official request was made on behalf of Hawkins. The edit was made in line with a privacy policy that allows subjects of biographical articles to request the removal of their date of birth from the site. But the county remained, and Hawkins continued to rail against the system on the article’s Talk page:

Why should the people who’ve been stalking, bullying and harassing me – and have been doing so again today! – have any say in what happens to the article?
Hooray for policies. Does common human decency come into this anywhere? Or am I going to get the same response I’ve had for five years, the borderline-fundamentalist ‘that’s not how Wikipedia works’?

In a lively discussion on Jimmy Wales’ User Talk page beginning on April 1, editors were divided over two issues:

  1. Should an individual who is on the cusp of notability (i.e. just about eligible for a Wikipedia article, according to guidelines) be allowed to choose whether or not they have an article?
  2. If personal information about a subject has been published in public sources, does it contravene Wikipedia’s privacy rules to include it in the article?

There’s no simple answer to either of these. The first one in particular is really rather tricky. It’s true that if an article about someone hasn’t been created, there’s nothing that says that it has to exist. If an article has been created, though, it isn’t clear whether there should be the option to delete if the subject isn’t very strongly notable. Wikipedians seem to fall into roughly two camps on the issue: those with sympathy towards article subjects and those who are concerned with ensuring that information is available on Wikipedia, if sources exist to support it.

The main question that Hawkins raised was why there had to be an article about him, if he felt that it was unnecessary, inaccurate and infringed upon his privacy. At one point in discussion he asks:

Can I point out that the whole damn thing is an invasion of privacy?

And an experienced editor replies, summarising the crux of the issue here:

An invasion of privacy is, by definition, the release of private information. This information, however, is not private, but is stated by the subject in the very show he hosts.

So, the issue is: if information exists in the public sphere, why should it not be included in a Wikipedia article? The details are already out there, some editors argue, so adding them to a Wikipedia article can’t be infringing on the subject’s privacy as the information wasn’t private to begin with. The bright line that exists on Wikipedia is its governing principle of verifiability: information included in articles must always be verifiable, that is, it must be supported by reliable sources. So, if personal information about a subject isn’t supported by a reliable source—even if it’s true—it can’t be included. Unfortunately, as Hawkins has discovered, if the information does appear in a reliable source (in this case, in a local magazine and on the BBC website), whether it is included or not comes down largely to editors’ discretion.

In short, the lesson Jim Hawkins has learned the hard way is: if you don’t want something included in your Wikipedia article, make sure it isn’t published in the first place.

Rick Santorum’s Wikipedia Problem and its Discontents

Posted on August 10, 2011 at 9:16 am

When former U.S. Senator Rick Santorum started gearing up to launch his presidential campaign earlier this year, there was one question he could not avoid. It had to do with the matter of alt-weekly editor and advice columnist Dan Savage, who has for years positioned himself as Santorum’s most prominent critic. Many politicians have fierce opponents, but few did what Savage did in 2003, and that was hold a contest to give an alternate meaning to the word “santorum”. I hope you’ll forgive me for declining to quote the winning definition, but you can find it here, and suffice to say that it has stuck. So much so, in fact, that eight years later Savage’s term has come to dominate the web search results for Rick Santorum’s name.

In news stories this year it was mostly described—by ABC News, Roll Call, Slate, and Huffington Post, among others—as Santorum’s “Google problem”. Indeed, one of the top three results for Santorum’s name is Dan Savage’s website promoting the campaign. But Google and Wikipedia are often joined at the hip, and one of the top results has been a Wikipedia article, not about Rick Santorum per se, but in fact about the campaign against him… or about the word itself… it hasn’t always been clear. And by mid-summer 2011, the article—then called Santorum (neologism)—had grown to several thousand words, and had itself become the focus of controversy among Wikipedians.

This blog post traces the history of the article’s evolution in some detail—not exhaustive, but getting there—because it’s an interesting window into how Wikipedia deals with controversial topics. Wikipedians can’t always agree, and in fact the article in question still remains a matter of dispute. But after 200,000 words and numerous debates in various forums around Wikipedia, the community has arrived at something approaching a satisfactory conclusion. Below, I aim to show how things got out of control, and how the Wikipedia community worked it out.

·     ·     ·

August 2006—To start from the beginning, let’s start from the beginning. The first version of this article was created five years ago this week, simply as Santorum.

(I should take a moment here to point out that—spoiler alert—because the article today is called Campaign for “santorum” neologism, that is what appears at the top of all historical versions of the article; generally speaking, for each version I’ll link here, I will boldface the article’s name at the time upon each reference.)

At this point the article was just a few paragraphs, outlining the circumstances that led to Savage’s coinage and a few examples of the term’s usage in the U.S. media. Prior to becoming its own article, most of the relevant material had been contained in a sub-section of the article about Savage’s sex advice column: Savage Love#Santorum.

It didn’t take very long at all before editors questioned the article’s suitability for a standalone article—what Wikipedia calls “notability”. In fact, the same day the article was first created, it was nominated for deletion. The reason for the nomination is one that would be echoed many times over the next half-decade:

The neologism referred to, created by Savage Love, does not have any evidence of real currency as a neologism. It should be treated as a political act by Savage Love, and described under that article.

The nomination failed and the article remained, as it certainly had received some media attention, but it was decided a renaming was in order. The suggestion was made that it be called Santorum (neologism), or possibly Santorum (sexual slang). Recent followers of this controversy might assume that the former was selected, because that was the name of the article for a long while. However, it was the latter, with a large reason being that Wikipedia has an explicit policy against creating articles about neologisms.

But that hardly settled the matter; the next issue concerned which Wikipedia page readers should find when they search for the word “santorum”, which now was considered to have—and here you could say that Savage had already won—two legitimate meanings. So the question was taken to a “straw poll”. For now, the article was still called Santorum, but what would the average Internet user be looking for when they looked up that term? How should the ambiguity be handled—in Wikipedia terminology, “disambiguated”? And what exactly should they call the article about the coinage?

Related to the word “Santorum”, the options included, and I quote:

  • Santorum should be an article about Savage’s attempt to define the word “santorum”
  • Santorum should be a disambiguation page, with its “traditional” content
  • Santorum should be a disambiguation page, with some other content (explain)
  • Santorum should be a redirect to Rick Santorum, and Rick Santorum should have a dablink…
  • Santorum should be a redirect to Rick Santorum, with no reference to the Savage neologism in the Rick Santorum article

Related to the article about Savage’s coinage, the options included, and I quote:

  • The article on the Savage neologism should be titled Santorum (neologism)
  • The article on the Savage neologism should be titled Santorum (sexual slang)
  • The Savage neologism needs no article; sufficiently covered at Savage Love#Santorum

And the result was… inconclusive. Nevertheless, a proposal was made, and subsequently accepted, to keep Rick Santorum as it always was, to call the Savage Love-inspired article Santorum (neologism), and to make Santorum a disambiguation page with links to relevant pages, among other details. The best summary of the considerations involved was stated by User:Dpbsmith, a veteran and still-active editor, who wrote:

Frankly I’ll support anything meeting these criterion:
A user who types in “santorum” as the Go word intending to find information about the Senator can find it very easily.
A user who types in “santorum” as the Go word intending to find information about the neologism can find it easily.
A user who types in “santorum” as the Go word is not presented immediately with the details of the neologism, but must click on a link, and the link must have some kind of label that communicates that fact that they are about to read about a political attack on the the [sic] Senator.
There should be no implication that Wikipedia endorses the neologism as somehow being “the real meaning” of the word.

Oh, did I mention there was also then a page called Santorum controversy, which is now called Santorum controversy regarding homosexuality, that also came up in the discussion? Well, now I have. Just wanted to be clear about that.

·     ·     ·

Late 2006-Early 2007—Although the matter seemed to have been handled appropriately, that didn’t stop editors from raising objections—even the very same objections—in the months following. In fact, someone had changed the article’s title back to Santorum (sexual slang) by the time the article came up for a second deletion debate in December 2006. The nominator focused on the fact that the media hits for the article were trivial—sure, The Daily Show and The Economist had used it, but neither had focused on it as a topic—while several less well-known sources appeared to be joining Savage’s campaign to popularize the term. Meanwhile, the nominator’s first argument was that the primary information was already covered in the Santorum controversy article (now you see why I mentioned it). Following a week’s worth of debate involving approximately two dozen Wikipedians and several thousand words…

The result was hopeless, hopeless lack of consensus.

(Emphasis in the original.) Lack of consensus to delete an article always means that it stays, and so it did. Some editors had suggested moving the article’s content to Wiktionary, Wikipedia’s dictionary sister project, where in fact the term had registered its own entry (without controversy) several months ahead of Wikipedia.

Later in December, one of the editors involved in the previous debate suggested moving the article from Santorum (sexual slang) to the oddly-titled Santorum (sexual slang activism), though the article stayed put. In January, a suggestion was made to merge the article back into the Savage Love entry, but that didn’t happen either.

·     ·     ·

Late 2007—Debate continued. In September, someone renamed it to Santorum (fluid)—ugh—and it was returned to Santorum (neologism), as it was then called. By this point, the article had grown substantially, was attracting the efforts of serious Wikipedians, and was… well, it was actually getting pretty good. In September 2007, the article was nominated for “Good article” (GA) status, and it looked like this. Later that day, the reviewing editor failed the article for including unsourced and “poorly sourced” material—The Onion in particular was singled out, although it was really an interview with Savage in the sister publication, AV Club—and for being a “BLP liability”.

That is to say, the article skirted the line of Wikipedia’s Biographies of living persons (BLP) policy, which aims to keep out scurrilous and weakly-sourced material that could damage a living person’s reputation. As you might imagine, that had long been an issue; one couldn’t write about this topic without it being an issue. One could argue that Savage’s campaign was all about damaging Santorum’s reputation—I presume Dan Savage would agree to that—and yet it was nonetheless notable. Many editors then, and to this day, wished it would simply go away. And yet some wanted to make it as “good” as possible.

·     ·     ·

2008-2010—We can skip ahead, because after October 2007, fewer than 160 edits occurred in the three years intervening, and it was not changed substantially in that time. Santorum had lost his re-election bid in late 2006, re-entered private life in January 2007, and ceased to make headlines. In December 2007, the article looked like this. In January 2011, it looked like this. It was the same old back-and-forth, and not much happened.

·     ·     ·

Early 2011—As Santorum started making moves to run for president, activity picked up. In mid-February, Roll Call was first to write about Santorum’s “Google problem”, and this was dutifully added. The article continued to draw attention (including from vandals) through the end of February, until it was put under temporary “semi-protection”. When Stephen Colbert mentioned the controversy on his show, a not-so-brief summary was added, then removed, with the point made that “not everything Colbert says needs to be repeated in Wikipedia”. (Imagine that!) March and April were months of relative calm before the proverbial storm: nearly 1,000 direct edits, from May to this writing, lay just ahead.

·     ·     ·

May 2011—In early May, a very active and respected editor-administrator, User:Cirt, began a series of more than 300 edits to the article, starting with a long-overdue link to Wiktionary. By this point, the article contained some 1,600 words, excluding links and references. Cirt announced his intention to add “some research in additional secondary sources”, and four days later he had expanded the article to some 4,300 words. On the discussion page, one editor objected:

Expanding an article about a vile attack on a living person – it’s twice the size now and refs have gone from 33 to 95 – has got to be against the spirit of least of our BLP policy. My proposal, and my intention, stated right now, is to return this article to the content it had on May 9th.

This kicked off the first sustained debate in years—one that has arguably not yet come to a close. A proposal was made to “stub” the article, meaning to reduce the article’s length to a mere stub of an entry; the argument went, because the arguably unfair subject obviously met Wikipedia’s previously-determined standards for inclusion, a possible solution was to reduce it to the shortest possible version. This proposal quickly failed, with Cirt himself citing an earlier comment by veteran Wikipedian (and current Wikimedia Foundation fellow) Steven Walling:

The BLP policy is not a blank check for deleting anything negative related to a living individual. Criticism, commentary, and even base mockery of a public figure like a Senator is protected free speech in the United States. While it would be ridiculous for anyone to try and make Wikipedia a platform for creating the kind of meme Savage did, it is perfectly prudent for Wikipedia to neutrally report on the overwhelming amount of coverage given to the topic.

Remember that part about using Wikipedia as a platform—it will come up later. Meanwhile, Cirt continued to add significant information about media usage and analysis of the term and events surrounding Savage’s campaign, all backed up with acceptable references. In particular, he focused on adding uses of “santorum”, in slang dictionaries and even erotica, to support the article’s focus as legitimately about the neologism, and not Savage’s campaign per se.

For those who did not wish for Wikipedia to contribute to the so-called problem of making Savage’s campaign seem more important than it arguably was, it must have been more frustrating still to observe that the article was quite well-written and scrupulously followed Wikipedia’s style and sourcing guidelines. Cirt was nothing if not sophisticated. Many had the impression that the article itself was now an attack on Santorum, although that conclusion was only in the eye of the beholder. Cirt knew what he was doing and, for lack of a better phrase, Cirt knew exactly what he was doing. One editor objected:

I realize you will defend this bloated attack piece with all your skills (that is actually what I find most disturbing) but you have to realize or at least have noticed that many experienced editors disagree with your massive expansion of it and at some point it will require wider input and a community RFC.

By the end of May, the article had grown to more than five times the length of the article Santorum controversy regarding homosexuality and more than two-thirds the length of the primary Rick Santorum biographical article. Discrepancies of this sort have been well observed, most significantly on the Internet forum Something Awful, but no Wikipedia policy exists to require proportionality among articles.

At its greatest length, on May 31, the article surpassed 5,500 words, including headers but excluding photo captions, links and references—a total of over 77,000 bytes of data.

·     ·     ·

June 2011-Present—Were I to fully recount the debates and discussions that began in late May and have continued ever since—with most debate occurring in June—this blog post could be three times its already considerable length. Instead I will attempt to summarize, although “considerable length” is unavoidable still.

From early June, Cirt pretty much stopped editing the article. To a significant extent, he’d become part of the issue, not just regarding this article but others as well, as can be seen on the discussion page for Cirt’s user account.

Among the many solutions offered around this time, one focused not on the article content itself, but rather its visibility on search engine results pages (SERPs). The editor offered, even if just for the sake of argument:

While I don’t really like the precedent, there’s nothing to say that every article needs to be indexed by search engines. … The majority of the concerns here seem to be focused on how people are coming across this article (via Google bombing, etc.), not necessarily that the article exists. … Both sides have legitimate points in their favor, so a compromise might be best here.

Other editors agreed it would set a bad precedent, and the suggestion did not go any further.

By now the topic had come to involve some of Wikipedia’s most influential editors, and a lengthy debate opened on Jimmy Wales’ discussion page. Wales’ take was as follows:

My only thought about the whole thing is that WP:COATRACK applies in spades. There is zero reason for this page to exist. It is arguable whether this nonsense even belongs in his biography at all, but at a bare minimum, a merger to his main article seems appropriate.

The “Coatrack” argument—one of many analogies Wikipedians have created over the years to illustrate key concepts—is not a policy or a guideline, but an informal essay, yet one with much currency. It states:

A coatrack article is a Wikipedia article that ostensibly discusses the nominal subject, but in reality is a cover for a tangentially related biased subject. The nominal subject is used as an empty coat-rack, which ends up being mostly obscured by the “coats”. The existence of a “hook” in a given article is not a good reason to “hang” irrelevant and biased material there.

In retrospect, it’s a little surprising that the “Coatrack” issue hadn’t been raised in any significant way before—and Wales is neither considered infallible nor is he always that involved in day-to-day Wikipedia issues—but this may yet have been a turning point. The next day, the highly respected User:SlimVirgin opened an RfC (Request for Comment) called “Proposal to rename, redirect, and merge content”. This led to the article being renamed, for a time, Santorum Google problem. Later, it was pointed out that “Google is not the only search engine in the world”, and so the search (as it were) continued.

The argument that the “neologism” had not evolved organically, but was the result of an organized campaign by Savage and his allies, had begun to exert some influence. For one thing, it was now quite clear that the majority of sources focused on the political campaign to bring relevance to the term, as opposed to the term’s relevance itself. In this way, one might say that Savage’s campaign had become a little too successful. Yes, the term was notable, but the controversy itself had become even more so.

Prior to the renaming mentioned above, editors in an adjacent thread had discussed several alternative names for the article. These included:

  • Santorum neologism controversy
  • Dan Savage santorum neologism controversy
  • Dan Savage santorum neologism campaign
  • Santorum neologism campaign
  • Spreading santorum (the name of Savage’s website)

Here one can start to see where the article’s current title would eventually emerge. Meanwhile, the article faced two more AfD (Articles for deletion) nominations, the first under its old name and the second under its current one. These were the fourth and fifth nominations overall, and surely the most futile.

As part of the ongoing RfC discussion in June, it had been strongly suggested that the article needed to be condensed, especially as Cirt’s expansion had contributed so significantly to the controversy. Besides the article expansion, in mid-May Cirt had created a new “footer” template, Template:Sexual slang, which further linked Rick Santorum’s name to dozens of NSFW topics. That template still exists, but on June 11 the link to Santorum (neologism) was removed. Again, it’s hard to say if this was another turning point, but a discussion about this template on Wales’ discussion page supports the notion that a consensus was coming into view: the article in its present form had itself become part of the campaign—that Wikipedia was being used as a platform for the campaign in the manner Walling had suggested.

A day later, a request for arbitration (RfAr)—a petition to the Arbitration Committee, Wikipedia’s equivalent of the Supreme Court—was opened against Cirt on the basis that his concerted efforts on the subject constituted “political activism”. On June 18 the request was rejected, but not before several dozen editors had contributed more than 28,000 words of opinion. One committee member wrote:

Decline for now, I’m inclined to think that this is more of a content dispute, and the community is able to cope with it.

On June 17, the community finally hit on a name that stuck: Campaign for “santorum” neologism. Initially, this was only intended as an interim move while further discussion took place. Among the names considered at this time, not all were serious, but most were:

  • Dan Savage santorum campaign
  • Dan Savage campaign
  • Dan Savage’s verbal attack on Rick Santorum
  • Santorum (sexual slang)
  • Santorum neologism campaign
  • Santorum neologism controversy
  • Rick Santorum and homosexuality
  • Rick Santorum homosexuality controversy
  • Savage Santorum campaign
  • Dan Savage santorum neologism controversy
  • Dan Savage santorum neologism campaign
  • Spreading Santorum
  • Rick Santorum’s Google problem
  • Rick Santorum’s “Google problem”
  • Santorum Google problem
  • Rick Santorum Google problem
  • ‘Spreading santorum’ campaign
  • Campaign for “santorum” neologism
  • Dan Savage campaign for “santorum” neologism
  • Savage–Santorum affair (a reply: “Oh Please God No.”)
  • Savage–Santorum controversy
  • santorum (neologism)
  • The problem Rick Santorum is facing because every search engine in the world’s top search results says santorum is an anal sex by-product
  • Santorum (googlebomb)
  • SEO Campaign for “santorum” neologism
  • Santorum (cyberattack)
  • Santorum (cyberbullying)
  • Santorm (SEO attack)
  • Dan Savage’s “spreading santorum” campaign against Rick Santorum’s anti-gay stance
  • Santorum Google ranking problem
  • Dan Savage Google-bomb Attack on Rick Santorum
  • Campaign to attack Santorum’s name
  • Campaign to create ‘santorum’ neologism
  • Campaign to associate Santorum to neologism

In the end, inertia and the current title’s inherent virtues won out. Of the eventual “winner”—Campaign for “santorum” neologism—a veteran Wikipedian commented:

This one is growing on me – neutral, correct, to-the-point, and succinctly informative to readers both familiar and unfamiliar with the subject as to what the article will be about.

All that was left was to whittle the article down from its extreme length to a shape that covered the topic adequately, balancing relevance with discretion. While many edits were to follow, the key edit was made on June 21, when SlimVirgin replaced a 4,800-word version of the article (minus links and references) with a 1,400-word version. This is substantially the version of the article that remains in place today.

·     ·     ·

Comparing the late May version of the article, at its longest point, to the trimmed-down and refocused current version, here’s what we find:

  • The earlier version focused on the term in and of itself, with the opening sentence including a definition and describing its use. The current version focuses on the events, explaining the aim of Savage’s campaign—though the definition remains.
  • Excluding the lead section, references and external links, there are only three sections in the current version, compared with seven in the earlier (not including “See also” and “Further reading”, which were also removed).
  • The content of the “Background” section was almost entirely removed, leaving just the key facts about Rick Santorum’s statements in the 2003 Associated Press interview.
  • The section about the website “Spreading Santorum” was removed, with its details folded into the “Campaign by Dan Savage” section.
  • Almost all of the “Recognition and usage” section was removed.
  • “Media analysis” and “Political impact” were combined into one, shorter, summarized section, focusing on the reception of the campaign in the media and its political impact.
  • Santorum’s response to the controversy was kept in the current article, though condensed.

Up to the present day, in the Talk page discussions alone (including the RfC discussion), more than 200,000 words have been written about the article. That is probably well short of the true number.

Perhaps surprisingly, the impact on Rick Santorum’s Wikipedia article was not that great—the article had long summarized the events in a short final paragraph concluding a heading relating to his statements about homosexuality—83 words at this count.

Meanwhile, Santorum’s “Google” problem continues. Conduct a logged-out search today, and here are the top three results:

And let’s not imagine the argument is completely over on Campaign for “santorum” neologism. Visit today, and one will find at the very top:

Images courtesy Wikipedia and Wikimedia Commons, licensed under Creative Commons. Additional research and analysis provided by Rhiannon Ruff.

Osama bin Laden is No Longer a BLP

on May 2, 2011 at 7:35 am

That is to say, as the world knows by now, the Wikipedia article about Osama bin Laden no longer describes a living person, and he is no longer subject to Wikipedia’s policy for Biographies of living persons (BLP).*

Osama bin Laden, finally dead (on Wikipedia)

Quite something to see this template attached to this particular article. As I type this just before 9am Eastern Time, Wikipedia editors have been extremely active overnight; since early reports of President Obama’s announcement, there have been more than 430 edits (by my count) to the main bin Laden page and 999 edits to an all-new article: Death of Osama bin Laden. And, of course, there was the obligatory circumstance wherein someone accurately updated the article to reflect his death without providing a citation, leading another editor to revert the change pending verification. And within a few minutes, it was.

*Of course it’s still covered by BLP insofar as other individuals mentioned on the page are concerned, but can we set that aside and take some satisfaction in this moment already?

From the Mixed-Up Files…

on September 29, 2010 at 8:04 pm

WBEZ in Chicago is probably best known for being home to the long-running radio series This American Life. But one of their most innovative offerings is an online video series first aired in April 2009 called The Wikipedia Files.

The idea is simple: WBEZ hosts interview entertainment celebrities by reading them portions of the Wikipedia articles about them, simply to fact-check the articles’ contents. More often than not, the articles are accurate enough, but the interviews certainly have caught some interesting errors.

I think it’s an ingenious idea, and I hope that other media organizations follow, especially on other subjects. One of the biggest complaints about Wikipedia is that it’s difficult to tell what’s true and what is not. Although contributors are encouraged to add citations, the fact is many do not. In many cases, people add things they know, or think they know, and either cannot find a source or never bother to look one up. Some details may have originated on blogs, most of which Wikipedia generally does not consider to be reliable. This is all the more serious on articles about living persons, which Wikipedia treats more strictly than other kinds of articles. The Wikipedia Files offers editors the chance to verify certain facts at the source, and to establish facts that were not previously known.

In one recent example, WBEZ’s Justin Kaufmann sat down with Antwan “Big Boi” Patton, one half of acclaimed American hip hop duo OutKast, now promoting his also-acclaimed solo debut, “Sir Lucious Left Foot: The Son of Chico Dusty”. Here is Big Boi with Kaufmann:

Big Boi fact-checks his Wikipedia page from WBEZ on Vimeo.

And in fact, at least one fix did come of the interview. On July 20, the same day it was posted, an anonymous, to date one-time editor from Akron, Ohio made the following correction about how he started pursuing music and his early relationship with André “3000” Benjamin:

[Screenshot: the edit to the Big Boi article]

Alas, this editor did not add a citation to go along with it (so I just did). Otherwise, who’s to know where to go and verify the information contained? This points to the fact that adding citations to Wikipedia is harder than it should be—but you can’t hold that against WBEZ.

A Potential Supreme Court Nominee Probably Edited Her Own Wikipedia Article. Is It a Big Deal?

on April 13, 2010 at 9:30 am

New York-based media blog Gawker is reporting that Leah Ward Sears, former Chief Justice of the Georgia Supreme Court and potential nominee to the United States Supreme Court by President Obama, edited her own Wikipedia article in late 2008 and early 2009.

While the possibility exists that someone else used her initials, last name and year she became a state Supreme Court Justice as a username, it usually turns out that this type of account is exactly that person. Gawker is focusing primarily on an edit she made that was favorable toward herself:

On May 6th, a user named LWsears1992 edited Leah Ward Sears’ Wikipedia page, adding the clause “Based in large part on her highly regarded record” to a passage about how she defeated an opponent in the 2004 race for Georgia Supreme Court. (Georgia is one of eight states that have the sort of weird policy of electing Supreme Court justices.)

This is technically correct, but not exactly right. While Gawker does have a screen shot of an edit by Lwsears1992 “adding” this, all she did was restore a phrase that had existed on the page since June 2005, added in the first place by a technology consultant in Atlanta. The phrase was removed again a few days later for lacking a source, and Lwsears1992 did not press the case further. Not that Sears should necessarily be making direct edits on matters of disagreement, but these are considerations that few Wikipedia outsiders understand.

In total, Lwsears1992 made 36 edits to Wikipedia, all of them relating to this particular article. So how did she do? Did she make the page better or worse, overall? To find out, I went through each and every edit, starting with the article as it appeared before she started working on it, November 3, 2008 and concluding with the article after she completed her work, on November 13, 2008. Here is what I found:

Better:

  • The fact is that Sears is being called out because she attempted to be transparent about it. However, it’s probable that she made a single edit an hour before her first editing session from the IP address 167.192.61.254 in Atlanta, Georgia. Unfortunately, she screwed up a template, rendering the “Infobox” sidebar a mess of code. But I count this as a positive, because of what happened next. Once she had caused this error, she created an account and undertook the task of fixing it. Not only did she do so, but approximately a third of her edits were devoted to getting this one thing right.
  • She uploaded her own photo, taking the time to release it under two free licenses, the old GNU license Wikipedia used to use for everything, and the Creative Commons license it uses now. She experimented with the sizing of the photo she added, including trying it at full size before settling upon 155 pixels wide, which is the width still.
  • She added useful context, such as noting that her resignation from the Court would coincide with the end of her term; this is unambiguously more useful than simply ending the sentence on “she will resign from the State Supreme Court at the end of June 2009.”
  • Chances are good she made the article sturdier in the long run, changing the article to read that she was the “first” African-American female Chief Justice in a U.S. state instead of the “only” one. Assuming this is correct, the former will always be true though the latter assuredly will not be.
  • She tried to protect her own page from vandalism by experimenting with templates meant to indicate the page cannot be edited in some circumstances. But as she was not an administrator, she couldn’t do this anyway. Once she saw it wasn’t working, she took them down. One could almost file this as a negative, because trying to get a page locked from editing is a sure sign of not understanding Wikipedia. On the other hand, changing your own mistake is a sign that you do. I’ll give her the benefit of the doubt here.

Worse:

  • She didn’t cite any source for the claim she is the first African-American female Chief Justice in a U.S. state, making the claim difficult to verify. Anytime one makes a claim of superiority or “firstness,” it helps to source the claim to avoid the dreaded “[citation needed]” tag.
  • She didn’t provide any edit summaries for her work, making it tedious to click through each and find out exactly what she did.
  • She made some changes that didn’t make the page better. In one edit, she edited internal site links embedded in the phrase “Chief Justice of the Georgia Supreme Court” so that instead of directing people to articles about Chief Justices and the GA Supreme Court, it would go to a non-existent page that she probably assumed existed.
  • She also removed internal links to the names of her appointer (Zell Miller) and predecessor (Norman S. Fletcher) for no apparent reason; she also removed the link for “Chief Justice of the Georgia Supreme Court” — perhaps after noticing that it did not lead anywhere. Odder still, she did replace some of this information, including Miller’s name, but removed Fletcher’s name after having initially sought to add it. In any case, he is back in the full article today.

What is the value of adding her photograph vs. removing the name of her predecessor? What is the value of adding new details which are presumably correct, but not citing independent sources? How bad is it to edit your Wikipedia article without seeking consensus of other editors? How should one seek to change their articles on Wikipedia in any case?

These questions and more like them have been coming up more often in recent months. It’s a subject recently addressed by the Wikimedia Foundation’s Jay Walsh in an interview with PR Week. It’s a subject that others are discussing, from law firms in the UK to PR firms around the world. It’s a subject I weigh every day as a consultant on matters of Wikipedia, and in an article I just published in Politics Magazine.

My answer regarding Leah Ward Sears is that she made the article better, but not much. She did not go about it the right way, but the right way is non-obvious to most, and the burden is on Wikipedia to make its rules understood by outsiders. While some of her edits were self-serving, they were of a mild sort. At most this was a venial sin, not a cardinal one. Gawker is turning this into a “gotcha” story on the implied theory that interacting with one’s own Wikipedia article is never acceptable. This is a myth, one widely believed and one propagated by many at Wikipedia simply to keep people from meddling with their pages en masse. This is understandable, but it won’t work out in the long term.

If Sears is Obama’s nominee and is further confirmed to the Supreme Court, perhaps it will help put an end to this kind of “gotcha”. I doubt this is significant enough to come up at confirmation hearings if she is nominated, and it should not be. But I will concede that would be kind of entertaining.

Image via Sears via Wikipedia.

Flagged Revisions Come to the English Wikipedia

on August 26, 2009 at 6:39 am

Earlier this week, New York Times web reporter Noam Cohen, who does some of the best Wikipedia reporting this side of The Register, broke the news about a decision by Wikipedia’s parent organization to instate tighter controls on some articles. Wrote Cohen:

Officials at the Wikimedia Foundation, the nonprofit in San Francisco that governs Wikipedia, say that within weeks, the English-language Wikipedia will begin imposing a layer of editorial review on articles about living people.

The new feature, called “flagged revisions,” will require that an experienced volunteer editor for Wikipedia sign off on any change made by the public before it can go live. Until the change is approved — or in Wikispeak, flagged — it will sit invisibly on Wikipedia’s servers, and visitors will be directed to the earlier version.

The change is part of a growing realization on the part of Wikipedia’s leaders that as the site grows more influential, they must transform its embrace-the-chaos culture into something more mature and dependable.

It’s worth pointing out early on, as Cohen’s story unfortunately did not, that these changes will apply only to biographies of living persons. In Wikipedia, that is a proper noun: Biography of Living Persons (BLP) is one of Wikipedia’s most strenuously enforced policies; earlier this year, Wikipedia veteran Newyorkbrad explained this in a series of posts on Volokh Conspiracy, which The Wikipedian previously discussed.

Blogosphere reaction has been much more widespread than any Wikipedia story that comes to mind from this past year. I think this is because everybody who uses Wikipedia has some opinion about the website’s curious balance between openness and reliability — and now the balance has shifted. I’d say reaction is roughly divisible into four quadrants: those who mourn Wikipedia’s openness vs. those who will continue to question Wikipedia’s reliability, with those who are optimistic about the change vs. those who are not. Here is a walk-through:

Among those who feel that Wikipedia’s openness is key to the site’s success, count Judd Antin at TechnoTaste, who is studying Wikipedia as part of his PhD work:

As part of my dissertation research I’ve been interviewing less experienced Wikipedians about their perceptions of the site. One constant theme has been the perception of a class system in Wikipedia. Casual editors worry that their edits aren’t good enough, and that they’ll be rebuked by Wikipedia’s upper-classes. They perceive a mystical group of higher-order contributors who make Wikipedia work. … This latest move is troubling in that it seems to represent a lack of faith in crowdsourcing and the wisdom of crowds, in the model that made Wikipedia what it is today. This change will also remove another of the important social-psychological incentives that draw new people into the Wikipedia fold: the instant gratification that comes from seeing your work reflected on a Wikipedia page.

This is not always a good thing; Kate McMillan at Small Dead Animals is an example of someone who is the subject of a Wikipedia article, but is not exactly pleased about the fact. She also isn’t exactly optimistic that things will change:

My own Wiki page was instigated by an internet “stalker”, in fact, the same individual who once authored a blogspot site using my stolen identity. Requests to Wikipedia to delete the page went unheeded, and it’s remained a reliable source of misinformation, false attribution of quotes, and drive-by smears ever since. … It wasn’t until I threatened a Wiki editor personally with legal action for restoring defamatory material to the page, that they began to take tighter control of the content.

Another skeptic is Ann Bartow at Madisonian.net:

I have doubts about how effective this is going to be in improving the reliability of the content of Wikipedia entries, but it is a great PR move by Jimmy Wales, that’s for sure.

From the perspective of a frustrated editor, here is Andy Merrett at The Blog Herald:

As someone not in the Wikipedia “elite”, I’ve long since given up trying to edit entries on the site, having already wasted not insignificant time adding information only to have it reversed. I foresee that Wikipedia will increasingly become a place where only a minority of privileged and “trusted” editors have the keys to the kingdom.

That is a plus to others. Among the critics of Wikipedia’s reliability is Lisa Gold at Research Maven, who nonetheless is a skeptic herself:

I’m glad there is finally some acknowledgment among the powers that be at Wikipedia that accuracy is important. But that’s not enough. If accuracy is important, you have to make it a priority and do things on many different levels to try to achieve it. You have to apply your policies to the entire site, not just some articles. You have to bring in people with knowledge, experience, and qualifications to do real editing and fact-checking. (With all of the unemployed editors, fact-checkers, and journalists out there, why not hire a few and let them work their magic.) This new policy is not really about making Wikipedia more accurate, it’s just about trying to stop the embarrassing vandalism stories that hit the news with disturbing regularity.

A similar sentiment was expressed by Dr. Jim West, who appears to have some experience arguing with an intellectual opponent about Wikipedia content. His reaction to the change:

In a word, duh. Now if you’ll do the same for every entry then perhaps your resource might be worth visiting some day. Until then, I think I’ll continue to abstain. I’m not really interested in reading an article on the Dead Sea Scrolls that Raphael Golb has edited using one of his 200 fake names.

While I understand the concerns of both above, I also think they go too far. Striking a balance and offering a more optimistic view is Ben Parr at Mashable:

[W]e can’t help but feel a bit sad that this change had to happen. Wikipedia was egalitarian in the spread and use of information, and it treated everyone as equal contributors of knowledge. While that may not necessarily be true in the real world, it still was the driving force behind the creation of 3 million articles, more than any other encyclopedia could ever hope to boast.

The move was necessary, but it does mark a new chapter in the Wikipedia information age and the end of an old one.

And here’s another philosophical take from Joe Windish at The Moderate Voice:

There is little doubt the debate will be passionate, but that’s exactly as it should be. Eight years into the incredible success of Wikipedia, long one of the 10 most popular sites on the Web, many of us still don’t understand it. … The thousands of volunteer Wikipedian editors take their responsibility seriously. Flagged revisions may or may not work. What’s best about it is that the Wikipedia editorial community will watch and wonder about and debate it. And if it should not succeed, they will try and try again.

My own take on the situation? I don’t know yet. As Andrew Lih explains in his book, The Wikipedia Revolution, the German-language edition has had this feature for several years, and it seems to work there. On the other hand, the English Wikipedia is much larger, and the possibility certainly exists that some articles will be left unchecked and un-updated for extended periods of time. Will the site grow stagnant? Will the vast majority of people who read but do not edit even notice? These are just a few of the operative questions.

WikiProject Flagged Revisions, which will try to keep articles current, was only established on the 19th of August and as yet has just four listed participants. It’s also worth noting, once the details are hammered out — which they are not just yet — the plan will be implemented on a two-month trial basis. And after that? Well, I’m very interested to find out myself.

Newyorkbrad on “the BLP Problem”

on May 24, 2009 at 6:57 pm

Newyorkbrad is a longtime Wikipedian, known very well to the most active of editors as one of the most creative and thoughtful of their cohort. To the extent that he is known externally, it is for leaving the site under threat of having his real identity exposed by off-site critics. As it happened, Newyorkbrad returned about three-and-a-half months later, and has since resumed his former positions as an Administrator and member of the Arbitration Committee.

His identity is of little interest to most Wikipedians, but two weeks ago he name-checked himself in a fascinating series of blog posts about Wikipedia and how it works at the Volokh Conspiracy. For what it’s worth — and only because he volunteered it — Newyorkbrad is Ira Matetsky, a New York City lawyer whose middle name is Brad, and whose presence on the site owes something to a childhood friendship with UCLA law professor and chief Conspirator Eugene Volokh.

While he started off with the still kind-of obligatory explanation of “what Wikipedia is all about,” most of his writing was devoted to a subject of internal debate at Wikipedia, which is commonly referred to as “the BLP problem”:

That is the problem of how easy it is, in the era of near-universal Internet access and instantaneous search engines, to inflict devastating and nearly irreversible damage to people’s privacy.

BLP stands for Biography of Living Persons, which informally can refer to any article about a living person and formally to the policy developed in 2005 following a couple of incidents in which people objected to biographical articles about them. One is very famous as far as Wikipedia goes, while the other is very much not, but that may be a subject for another post.

Beyond just explaining the controversy to the uninitiated, Newyorkbrad also proposed one part of the solution:

[T]he suggestion [has been] made that when an issue arises concerning whether a biographical article should be kept on Wikipedia or deleted, there be a presumption in favor of deletion unless there is a collective decision to keep it, rather than the other way around. (In Wikiparlance: when a BLP is AfD’d [nominated for deletion], “no consensus” would default to delete. In an ordinary deletion discussion, by policy, “no consensus” defaults to keep.)

This suggestion has been advanced and discussed on-wiki, and has won wide endorsements, but not quite enough to be adopted. A main sticking point is that a BLP can be nominated for deletion for reasons having nothing to do with defamation, privacy violation, or undue weight — say, a dispute whether an athlete or a performer is quite notable enough to warrant coverage. In many of these instances, ironically, if the article subject were asked, he or she might prefer that the article remain. …

I advanced a compromise proposal suggesting that deletion discussions on BLPs default to delete where the notability of the subject is not clear-cut (that would presumably be the case anytime the tentative AfD [Articles for deletion] result is “no consensus”) and (1) the article taken as a whole is substantially negative with respect to the reputation of the subject, (2) the article subject is a minor, or (3) the article subject is known to have himself or herself requested the article’s deletion. It may be time to revive discussion on-wiki of this suggestion.

Although I have not personally been involved in much policy discussion in my time on the English Wikipedia, that sounds like a policy proposal I could get behind. To this I may add a fourth: Unreferenced articles about living persons should be removed as well. By definition, these articles have not yet passed the Notability requirement. In many cases when an article subject’s notability has yet to be verified, these articles may be saved (by Wikipedians of the “inclusionist” philosophy) from deletion. But given the particularly sensitive nature of BLPs, the unreferenced ones should simply go. If they are truly about notable subjects, they will be replaced sooner or later.

We don’t know just how big of a problem BLPs are but, in another post to come, I will discuss what we do know.