William Beutler on Wikipedia

Archive for the ‘Direction of Wikipedia’ Category

The Wikimedia Foundation is Losing its Chief. What Happens Next?

Tagged as , , , , , ,
on March 28, 2013 at 9:35 am

Big news in the world of Wikipedia, yesterday: Sue Gardner, the executive director of the Wikimedia Foundation (the non-profit behind Wikipedia and other wiki-based projects) announced she will be stepping down from the role, which she has held since June 2007. Gardner, in a post on the Wikimedia blog:

I feel that although [Wikipedia is] in good shape, with a promising future, the same is not true for the internet itself. (This is thing number two.) Increasingly, I’m finding myself uncomfortable about how the internet’s developing, who’s influencing its development, and who is not. Last year we at Wikimedia raised an alarm about SOPA/PIPA, and now CISPA is back. Wikipedia has experienced censorship at the hands of industry groups and governments, and we are –increasingly, I think– seeing important decisions made by unaccountable, non-transparent corporate players, a shift fromSue Gardner at Wikimania the open web to mobile walled gardens, and a shift from the production-based internet to one that’s consumption-based. There are many organizations and individuals advocating for the public interest online — what’s good for ordinary people — but other interests are more numerous and powerful than they are. I want that to change. And that’s what I want to do next.

In January 2012, you may remember that Wikipedia went into “blackout” mode for 24 hours in protest of legislation before the U.S. Congress (SOPA/PIPA), so this explains that much. The rest of the statement is a little harder to puzzle out; the “non-transparent corporate players” in those circumstances were opposed by other corporate players, and both were fighting over government regulations. The line about “mobile walled gardens” sounds like Facebook, and a “consumption-based” Internet sounds like a jab at tablets, of all things, but I suppose we’ll have to see. These are obviously broad statements, and Gardner hasn’t actually announced her next move.

The move won’t be happening too soon, yet: Gardner will be in the position for (at least) another six months, while she works with Wikipedia’s Board of Trustees to find a successor, she writes in the post.

Whether Wikipedia is really “in good shape” is a matter for debate, especially considering Gardner had made a personal cause of trying to fix Wikipedia’s absurd gender imbalance, not to mention the overall downward drift in editor retention and activity.

She also leaves with some organizational questions unresolved: just last October, the board approved her plan to shift and “narrow” the non-profit organization’s focus to primarily software development; whereas the foundation once had “fellows” focused on community-building, the Foundation has shifted to a grant-making process, which is still making a first go of it.

Speaking of development, the great white whale continues to be what’s called the VisualEditor, an editing interface intended to be much easier for users than the current system, which is fairly similar to coding HTML. (It’s not as difficult as real programming, but still too much effort for most.) It’s been nearly two years in the making, and has finally rolled out into testing just this year.

Speaking of whales, Sue was the first leader to follow the much better-known Jimmy Wales, who still sits on the Board of Trustees*. Gardner came from the CBC in Canada, and was not an original part of “the movement,” but she came to identify with it and become quite popular with the overall Wikimedia community. It’s not at all clear who should or will succeed her, but it is clear that a lot rides on the decision.

Photo licensed under Creative Commons by Ariel Kanterewicz, via Wikimedia Commons.

*This post originally stated that Wales rotates off the Board later this year; it’s since been pointed out to me that, while all members’ terms are limited, reappointments are allowed, which it is expected to do in Wales’ case again next time.

First Wikipedian (Officially Representing a Presidential Library)

Tagged as , , , ,
on January 24, 2013 at 7:03 pm

Via the NYT Arts Beat blog:

Gerald R. Ford may have governed during a time of economic stagnation, but his library has just laid claim to a cutting-edge distinction: becoming the first presidential depository to employ an official “Wikipedian in residence.”

Michael Barera, a master’s student at the University of Michigan’s School of Information who has been editing Wikipedia articles for five years, started the job last week, The Chronicle of Higher Education reported. He is charged with improving the Wikipedia presence of the Gerald R. Ford Presidential Library and Museum, which is housed at the university’s Ann Arbor campus.

He’s the first official representative to Wikipedia at a presidential library, and surely not the last. Since Liam Wyatt became the first Wikipedian-in-Residence (WiR) at the British Museum, in spring 2010, the concept of an in-house Wikipedian has spread far and wide. So far, these have all been at non-profits, but I won’t be surprised if that isn’t always the case.

(Hat tip: cultural-partners email list.)

The Top 10 Wikipedia Stories of 2012 (Part 2)

Tagged as , , , , , , , , , , ,
on December 31, 2012 at 9:02 am

For the past two years The Wikipedian has compiled a list of the top 10 news stories about Wikipedia (2010, 2011), focusing on topics that made mainstream news coverage and those which affected Wikipedia and the larger Wikimedia community more than any other. Part 1 ran on Friday; here’s the dramatic conclusion:

♦     ♦     ♦

5. The Gibraltarpedia controversy — Like the tenth item in our list, file this one under prominent members of the UK Wikimedia chapter behaving badly. In September, board member Roger Bamkin resigned following complaints that he had used Wikipedia resources for personal gain—at just about the worst possible time.

Bamkin was the creator of an actually pretty interesting project, Gibraltarpedia, an effort to integrate the semi-autonomous territory of Gibraltar with Wikipedia as closely as possible, writing every possible Wikipedia article about the territory, and posting QR codes around the peninsula connecting visitors to those articles. It was closely modeled on a smiliar project, with which Bamkin was also involved, called Monmouthpedia, which had won acclaim for doing the same for the Welsh town of Monmouth.

Problem is, the government of Gibraltar was a client of Bamkin’s, and Bamkin arranged for many of these improved articles to appear on the front page of Wikipedia (through a feature of Wikipedia called “Did you know”). Too many of them, enough that restrictions were imposed on his ability to nominate new ones. At a time when the community was already debating the propriety of consultant relationships involving Wikipedia (more about this below) Bamkin’s oversight offended many within the community, and was even the subject of external news coverage (now of course the subject of a “Controversy” section on Gibraltarpedia’s own Wikipedia page).

(Note: A previous version of this section erroneously implied that Bamkin was not involved with Monmouthpedia, and was then board chair as opposed to trustee. Likewise, it suggested that disclosure was the primary concern regarding DYK, however the controversy focused on issues of volume and process. These errors have been corrected.)

4. Wikipedia’s gender imbalance — This one is down one spot from last year, but the undeniable fact that Wikipedia is overwhelmingly male (like 6-1 overwhelmingly) seems to have replaced Wikipedia’s falling editor retention as the primary focus of concerns about the long-term viability of Wikipedia’s mission. The topic was given center stage during the opening plenary at the annual Wikimedia conference, Wikimania DC, and has been the subject of continuing news coverage and even the focus of interesting-if-hard-to-decipher infographics. Like Wikipedia’s difficulty keeping and attracting new editors, the Wikimedia Foundation is working on addressing this as well, and no one knows precisely how much it matters or what to do about it. For further reading: over the last several weeks, my colleague Rhiannon Ruff has been writing an ongoing series about Wikipedia and women (here and here).

3. Wikipedia’s relationship with PR — I’m reluctant to put this one so high up, because one could say that I have a conflict of interest with “conflict of interest” as a topic (more here). But considering how much space this took up at the Wikipedia Signpost and on Jimmy Wales’ Talk page over the past 12 months, it would be a mistake to move it back.

This one is a continuation from last year’s #8, when a British PR firm called Bell Pottinger got caught making a wide range of anonymous edits to their client’s articles. The discussion continued into early 2012, including a smart blog post by Edelman’s Phil Gomes that focused the discussion on how Wikipedia and PR might get along, a public relations organizations in the UK developing a set of guidelines for the first time, and a similar organization in the US releasing a survey purporting to demonstrate problems with Wikipedia articles about companies, though it wasn’t quite that.

For the first time since 2009, the topics of “paid editing” and “paid advocacy” drew significant focus. New projects sprung up, including WikiProject Cooperation (to help facilitate outside requests) and WikiProject Paid Advocacy Watch (to keep tabs on said activity). Jimmy Wales spelled out his views in as much detail as he had before, and the Wikipedia Signpost ran a series of interviews over several months (called “Does Wikipedia Pay?”), covering the differing views and roles editors play around the topic. But after all that, no new policies or guidelines were passed, and discussion has quieted a bit for now.

2. Britannica admits defeat — In the year of our lord 2012, Encyclopædia Britannica announced that it would stop publishing a print edition and go online-only. Which means that Britannica essentially has ceased to exist. The 244-year-old encyclopedia, the world’s most famous until about 2005 or so, has no real web presence to speak of: its website (which is littered with annoying ads) only makes previews of articles available, and plans to allow reader input have never gone anywhere. Wikipedia actually had nothing to do with Britannica’s decline, as I pointed out earlier this month (Microsoft’s late Encarta started that), but the media narrative is already set: Britannica loses, Wikipedia wins. Britannica’s future is uncertain and the end is always near, while Wikipedia’s time horizon is very, very long.

Wikipedia SOPA blackout announcement

1. Wikipedia’s non-neutral protest on U.S. Internet law — Without question, the most significant and widely-covered Wikipedia-related topic in the past year was the 24-hour voluntary blackout of Wikipedia and its sister sites on Wednesday, January 18. Together with a few other websites, notably Reddit, Wikipedia shut itself down temporarily to protest a set of laws under consideration in the U.S. House and Senate, called the Stop Online Piracy Act (SOPA) and PROTECT IP Act (PIPA), supported by southern California (the music and movie industry) and opposed by northern California (i.e. the Silicon Valley).

The topic basically hit everyone’s hot buttons, and very different ones at that: the content companies who believe that online piracy is harming their business, and the Internet companies who feared that if the bills became law it would lead to censorship. You can imagine which side Wikipedia took.

But here’s the problem: Wikipedia is not one entity; it’s kind of two (the Foundation and volunteer community), and it’s kind of thousands (everyone who considers themselves a Wikipedian). While there seemed to be a majority in favor of the protest, the decision was arrived at very quickly, and many felt that even though they agreed with the message, it was not Wikipedia’s place to insert itself into a matter of public controversy. And one of Wikipedia’s core content policies is that it treats its subject matter with a “neutral point of view”—so how could anyone trust Wikipedia would be neutral about SOPA or PIPA?

But the decision had been made, and the Foundation (which controls the servers) had made the call, and even if you didn’t like it, it was only for 24 hours. And it certainly seemed to be effective: the blackout received the abovementioned crazy news attention, and both bills failed to win wide support in Congress (at least, for now). And it was a moment where Wikipedia both recognized its own power and, perhaps, was a little frightened of itself. For that alone, it was the biggest Wikipedia story of 2013.

The Top 10 Wikipedia Stories of 2012 (Part 1)

Tagged as , , , , , , , , , , , , , , ,
on December 28, 2012 at 12:18 pm

In these waning days of 2012, let’s take this opportunity—for a third year in a row—to look back and come up with a list of the most important Wikipedia news and events in the last 12 months. Like our first installment in 2010 and our follow-up in 2011, the list will be arbitrary but hopefully also entertaining. There is no methodology to be found here, just my own opinion based on watching Wikipedia, its sister projects and parent organization, and also thumbing through the Wikipedia Signpost, Google News and other news sites this past week. So what are we waiting for?

Wait, wait, one more thing: this post ended up being much longer than I expected, and so I’ve decided to split this in two. Today we publish the first five items in the list, 10-6. On Monday 12/31 we’ll publish the final five. Enjoy!

♦     ♦     ♦

10. Wikipedia bans a prominent contributor — Let’s start with something that did not make the news outside of the Wikipedia / Wikimedia community at all, but which took up a great deal of oxygen within it. It’s the story of a prominent editor and administrator who goes by the handle Fæ. In April of this year, he was elected to lead a new organization within the community based on his leadership of the UK chapter. The move was not without controversy: Fæ’s actions both on Wikipedia and the sister site Wikimedia Commons (best known as a vast image repository) and interactions with editors became the subject of intense scrutiny, and even an ArbCom case (the Arbitration Committee is sort of like Wikipedia’s Supreme Court). Fæ ended up resigning his adminship—he basically jumped to avoid being pushed—and the end result had him banned from editing Wikipedia, which he still is. Not that he’s gone away—he’s still a contributor to Commons, and a very active one.

This might sound like a lot of insider nonsense, and I’m not about to dissuade you from this viewpoint. (Sayre’s law applies in spades.) But the key issue involved is about governance: is the Wikimedia community’s organizational structure and personnel capable of the kind of leadership necessary to maintain and build on this important project? The Fæ incident (along with other incidents in this list) suggests the answer may be no.

9. Confusing software development — Not all of Wikipedia’s contributors are focused on editing articles. Some are also developers, working on the open source software to keep Wikimedia sites running and, perhaps, improving. Some (but not all) are paid staff and contractors, and the hybrid part-volunteer, part-professional organizational structure can make it difficult to get projects off the ground.

One longtime project that has yet to see wide implementation is a “visual editor” for Wikipedia articles, to make editing much easier for users. Everyone knows that the editing interface for Wikipedia articles feels like software programming, and almost surely turns away some potential contributors (though it’s not the main reason people don’t contribute, as a 2011 Wikimedia survey showed). But the visual editor is a bigger technical challenge than one might think (as recently explained by The Next Web), and the outcome of a current trial run (also not the first) is anyone’s guess.

Another announced with a great deal of hype but which no one really seems to understand is Wikidata. It calls itself a “common data repository” which by itself sounds fairly reasonable, but no one really knows how it will work in practice, even those now developing it. Wikidata could be a terrifically innovative invention and the very future of Wikimedia… but first we need to find out what it does.

Other projects have been released, but have received thoughtful criticism for adding little value while diverting resources from more worthy projects. For example, a feature briefly existed asking you to choose whether a smiley face or frowny face best represented your Wikipedia experience. Uh, OK? Some projects have been better-received: the Wikipedia iPhone app, for example, is a definite improvement over the mobile site. But there are some odd decisions here, as well: does Wikipedia really need an app for the failed Blackberry Playbook?

8. Sum of human knowledge gets more human knowledge — If you’ve ever seen a [citation needed] tag on Wikipedia—and I know you have—then you know that, well, citations are needed. And while citations do actually kind of grow on trees (if by “trees” we mean “the Internet”) there is a lot of information out there which isn’t readily searchable on Google, and sometimes that information costs money. This year, some of those paid services cracked the door open just a bit.

The interesting story to the HighBeam Research partnership is that there really isn’t one. First of all, HighBeam is a news database which charges for reader access to its vast collection of articles. But in March, a volunteer Wikipedia editor who goes by the name Ocaasi reached out to HighBeam and asked if they would be willing to grant free access to Wikipedia editors. They said yes—and supplied one-year, renewable accounts to editors with at least one year’s experience and 1,000 edits. For Wikipedia, it meant greater access to information. For Highbeam, it meant a 600% increase in links to the site in the first few months of the project. Seems like a fair trade.

More recently, the Wikimedia Foundation announced an agreement with the academic paper storehouse JSTOR, making one-year accounts available to 100 of the most-active Wikipedia editors. With almost 240 editors petitioning for access, if you haven’t spoken up yet, chances are you’re a bit too late.

7. The first person to 1 million edits — OK, how about a fun one? In April, a Wikipedia editor named Justin Knapp, who uses the handle Koavf, became the first person to make 1 million edits to Wikipedia. To the surprise of everyone, perhaps none more than Knapp himself, this made him an overnight international celebrity of the Warhol variety. Jimmy Wales even declared April 20 “Justin Knapp Day” on Wikipedia.

It’s worth pointing out that most editors with many, many edits to their name typically are involved in janitorial-style editing activities, such as fighting vandals or re-organizing categories. And many very active editors spend a lot of time squabbling with others on the so-called “drama boards” such as Administrators’ noticeboard/Incidents. Not Knapp: his edits over time have overwhelmingly focused on creating new articles, plus researching and improving content in existing ones. In short: Wikipedia doesn’t need more editors—it needs more Justin Knapps.

Also, this is one I actually played a small role in, as verified by Knapp’s own timeline of events. I’d happened to see someone note the fact on Jimmy Wales’ Talk page that day, which I tweeted, and was then picked up by Gawker’s Adrian Chen, and the rest is history. Actually, then Knapp kept right on editing Wikipedia. As of this writing, he’s closing in on 1.25 million edits.

6. Philip Roth’s Complaint — Wikipedia has been extraordinarily sensitive to complaints by living people the subject of articles ever since a 2005 incident where a veteran newspaper editor found his article maliciously vandalized to implicate him in the murder of the brothers Kennedy.

In what was arguably the biggest row since then, in September 2007 the celebrated, prickly author of Portnoy’s Complaint, American Pastoral and numerous other novels took to the pages of The New Yorker to issue “An Open Letter to Wikipedia” complaining that the site had the inspiration for his 2000 novel The Human Stain all wrong. And this wasn’t his first resort: Roth’s first attempt had been to authorize his biographer to change the article directly, which was rebuffed. His consternation here: not inexplicable.

But Roth’s complaint was not really with Wikipedia. Several book reviewers had speculated (apparently incorrectly) about the real-life basis for the novel’s central figure, and it was these speculations which had been introduced to Wikipedia. Roth’s publicity campaign brought the issue to much wider attention, which got his personal explanation of the novel’s inspiration into Wikipedia. However, in a twist on the Streisand effect, the controversy is now the subject of a longish and somewhat peevish section written by editors perhaps irked by Roth’s campaign. So he got what he wanted, plus more that he didn’t. Shall we call it the Roth effect?

♦     ♦     ♦

Look here on Monday for the thrilling conclusion to The Top 10 Wikipedia Stories of 2012!

Wikipedia is Not Finished, But Its Needs are Changing

Tagged as , , , ,
on December 18, 2012 at 9:14 am

Earlier this fall, a very interesting and not too-academicky paper on how Wikipedia’s article about the War of 1812 (by historian and Wikipedian Richard Jensen) somehow begat an Atlantic web story with the wishy-washy subheading “Wikipedia is Nearing Completion, in a Sense” which begat this less subtle, more alarming headline in the UK Independent: “Is Wikipedia Complete?

Wikipedia doomsaying is a popular pastime among technology writers (one can’t exclusively rely on Apple doomsaying, after all) and this isn’t even the first go around for this particular variant. But this one is more annoying than the usual complaint that Wikipedia is losing editors, because proclaiming Wikipedia complete is more likely to suggest that one shouldn’t consider get involved. Why bother? Wikipedia’s finished.

Of course, it’s not. The Atlantic’s Rebecca J. Rosen acknowledges this briefly, quoting Jensen as follows:

Wikipedia is now a mature reference work with a stable organizational structure and a well-established reputation. The problem is that it is not mature in a scholarly sense.

Just so. Yes, Wikipedia already has more than 4 million articles in the English language. The problem is that a great many of them just aren’t very good. An article may exist, but it might not contain much information. It may contain some decent information, but some of it may be wrong. It may have been correct at one time, but has since become outdated. Or an article may have lots of information, but it may not be well-organized. Just because an article exists does not mean the job is done. What it really means is the job of cultivating that specific slice of human knowledge—whether about the War of 1812 or the 18½ minute gap or —has only just begun.

The problem Wikipedia faces is that it has many, many more readers than editors (only 6% of readers have ever tried, according to a 2011 survey) even if the line between them is supposedly no thicker than choosing to click the “Edit” button at the top of a page.

For almost any topic you can thing of, it can seem like there is already an article. What’s more, the topics which are most well-known, especially those related to current events, tend to be extremely well-developed and already saturated with editors. An edit on a page like President of the United States is likely not to last long before someone else comes along and changes it. The uncomfortable truth is that the veteran editor is probably right, insofar as Wikipedia’s standards are concerned. But that doesn’t make it any less discouraging to new editors.

♦     ♦     ♦

So, where can new Wikipedians gain confidence, knowledge of Wikipedia’s editing style, and make edits that really make a difference? The answer lies with Wikipedia’s vast collection of underdeveloped articles—those far outside of the daily news cycle, focused on topics dating to the pre-Wikipedia age, and which could be much better, but have lacked for sustained interest from foregoing editors.

As someone who reads Wikipedia daily, I come across these all the time. I also decided to ask some colleagues about what kind of article categories might be particularly neglected. Here are just a few topics that we see (and please note that we are all native English speakers from the U.S. and UK in our late 20s and early 30s, so YMMV) where new editors can dive in and start adding information and sources:

  1. 1990s rock albums: A surprisingly large number of rock albums from the ’90s have just a stub article—one that has very little information other than a basic description of the album. Follow the link, start by clicking on titles that you’re familiar with, and it won’t take long to find one that needs some help. The wider Internet has no shortage of reviews from music publications, which should be just what you need to add new details.
  2. 1990s comedy films: There’s a theme here, and one that speaks to the demographics of Wikipedia: the missing age group of 29- to 40-year-olds has left the encyclopedia with a gap in its collective knowledge: the 1990s! Once again, you can follow the link, pick any film and help improve it. Just remember: you can’t use IMDb (not a reliable source!) but you probably can use articles IMDb links to.
  3. Historical novels: If you’re not into reminiscing about the 1990s, perhaps you’d like to look back a bit further in time. In which case, the historical novel stubs listed here might be right up your alley—or galley, since there are a few of C.S. Forester’s nautical-themed Hornblower novels listed here…
  4. Fairy tales: Still on a literary note, a surprising number of articles on well-known fairy tales are lacking references or still in stub form. See if any of your childhood favorites need some work.
  5. Cartoonists: Biographies are a good topic area for any beginner on Wikipedia and there are no shortage of sub-topics to choose from that need development. There’s a whole list of cartoonists here whose articles are currently just stubs, why not dive in and see if there’s one you’re familiar with?

If you’re thinking about starting to edit Wikipedia and the thought of trying to improve a whole article seems overwhelming, here’s a few ideas for small fixes that you can make in any article of your choosing:

  1. Read through an article and fix any typos or formatting errors.
  2. Remove any obvious vandalism or pure nonsense you come across.
  3. Look at information in infoboxes (the sidebars that appear at the top right of articles) and check that it is correct and up-to-date.
  4. Rewrite sentences that don’t make sense or are obtusely worded.
  5. Fact-check: choose a claim from an article with no citation, then find a book or another quality source to verify the statement.

I fully acknowledge that all of the above is easier said than done. Even though Wikipedia is the encyclopedia anyone can edit, that doesn’t mean everyone does. But it is possible for anyone to learn, given the right inspiration. With this post—and who knows, maybe more like it to come?—I’d like to help others find it.

Thanks to Rhiannon Ruff, Morgan Wehling and Pete Hunt for help with this post.

Linux distributions vs. wedding dresses: the gender gap impact

Tagged as , ,
on November 19, 2012 at 3:10 pm

Editor’s note: The author of this post is Rhiannon Ruff (User:Grisette) and is part of a series on female editors of Wikipedia. Her most recent post—the first in the series—was “All The Women Who Edit Wiki, Throw Your Hands Up At Me” on November 8, 2012.

Continuing this series on women and Wikipedia, this week I’d like to give a quick overview of the gender gap and its impact. Let’s start with what we already know: female Wikipedia editors are in the minority of those making edits to the site’s articles and Talk pages on a regular basis. Earlier this year, a research project by Santiago Ortiz found that on average there are 12.9 male editors to each female editor editing a given article. This is an issue that Wikipedians are very familiar with. For many, the real concern is not just that women aren’t participating, but that their relative absence may have led to gaps in Wikipedia’s collective knowledge.

In early 2011, Noam Cohen wrote an oft-cited article for the New York Times which made the point that Wikipedia’s coverage of topics more likely to be of interest to women tended to be much less well developed than for corresponding topics of interest to men. Indeed, anecdotal evidence exists for a gendered take on notability: in some cases, articles on female-oriented topics have been nominated for deletion, not considered “notable” by (mostly) male editors. In particular, Torie Bosch wrote on Slate.com about the deletion debate around the Wikipedia article Wedding dress of Kate Middleton, which survived after editors including Jimbo Wales fought for it to remain. Bosch also described how several new articles on female historical figures created during a Smithsonian archives “edit-a-thon” were later nominated for deletion—one more than once.

(As an aside: I personally find it offputting how this gender gap topic is often addressed. For instance, Cohen’s article specifically mentions the poor state of the articles on the TV series Sex and the City and fashion designer Jimmy Choo as indicators of missing female editors. Examples like these are more than a little patronizing and hard to take seriously. I’m not the only one who feels this way.)

The gender gap doesn’t just affect what articles get created (and don’t get deleted): the quality of certain articles may be affected by the dearth of female editors, too. In January 2011, Wikipedia’s newsletter, The Signpost, included a piece in which Wikipedia article quality was compared between the most famous male and female scientists from Science magazine’s Science Hall of Fame. The author of the Signpost article found that the top ten male scientists’ articles are mostly rated a “B” on Wikipedia’s article quality grading scheme, and include one Good Article and one Featured Article, while the top ten female scientists’ articles are all rated Stub or Start class (with the exception of Marie Curie). Worth noting: the author explained the conclusion isn’t a clear cut case of gender imbalance, since the female scientists were generally less well-known than the men, which could have an impact on both number of editors interested in the articles and availability of material to improve them.

An interesting question in light of all the above: what exactly are women editing on Wikipedia? If we look at one of Wikipedia’s most well-known female editors, SlimVirgin, who’s had a key role in 10 Featured Articles—no mean feat—we can get an idea of what a prolific female editor works on. Her Featured Articles span a range of topics, from the biographical article for Palestinian political leader Abu Nidal to the article on the Brown Dog Affair, an Edwardian-era political controversy about vivisection. No obvious gender bias here. Nor is there any big difference between male and female editors in terms of types of edit according to a 2011 study titled Gender Differences in Wikipedia Editing. The study’s authors found there was no evidence that men and women tend to make different sized edits or that one gender prefers fixing text to adding new text. In short, it seems the gender gap issue isn’t as simple as “get female editors, solve knowledge gaps”; it may have a lot to do with the types of article or information that people drawn to Wikipedia editing are most interested in. (Yes, I’m saying that Wikipedia editors are likely to be more interested in Linux than dresses, sorry Jimmy Wales!)

While writing this post I was intrigued to see if picking 10 editors at random from the Female Wikipedians category and looking at their most recent edits would provide any insight. Disappointingly, seven out of the ten hadn’t edited in over two years, and of the remaining three only one had made an edit in article space in the last year. This result is certainly indicative of Wikipedia’s broader problem of editor retention, but it also speaks to the particular issues Wikipedia has had retaining female editors. Which leads nicely to the topic of my next post… the issues involved in recruitment and retention of female editors. Look for that here soon, meanwhile (for U.S. readers) have a wonderful Thanksgiving!

All The Women Who Edit Wiki, Throw Your Hands Up At Me

Tagged as , , , ,
on November 8, 2012 at 2:16 pm

Editor’s note: The author of this post is Rhiannon Ruff (User:Grisette) who last wrote “Public Lives: Jim Hawkins and Wikipedia’s Privacy Dilemma” for The Wikipedian in April 2012.

It’s no secret that the majority of those editing Wikipedia on a regular basis are men. It’s one of the best-known facts about the Wikipedia community and a situation that doesn’t appear to be changing over time. In fact, from 2010 to 2011, the proportion of women editors actually dropped, from 13% to just 9%, according to an independent survey by Wikipedian Sarah Stierch. And it does seem, at least from the media coverage, that this contributes to some bias in content. This issue not taken lightly by the Wikimedia Foundation, which has set a goal of “doubling the percentage of female editors to 25 percent” by 2015, as part of its Strategic Plan.

Over the next few weeks, I’ll be writing here about content bias and what women are actually editing on Wikipedia, and the issues involved in encouraging more women into such a male-dominated space. First, though, let’s round up recent efforts to get more women involved with Wikipedia.

  1. The Wikipedia gender gap mailing list: Founded back in January 2011, subscribers to the list offer up ideas, share experiences, discuss issues and help to develop events and programs. Among recent updates, the list shared news of the latest Wikipedia Editor Survey and the launch of the new WikiProject Women scientists. 295 people are subscribed to the list.
  2. WikiWomen Camp: The inaugural camp was held in Argentina in May 2012. While not focusing on the gender gap, the conference was for female Wikipedia editors to network and discuss projects. A total of twenty women from around the world attended.
  3. WikiWomen’s History Month: March 2012 was the first WikiWomen’s History Month, where editors were encouraged to improve articles related to women in history. During the month 119 new women’s history articles were created and 58 existing articles were expanded.
  4. Workshop for Women in Wikipedia: This project to create in-person workshops encouraging women to edit Wikipedia was started in 2011 and is ongoing. So far, workshops sharing technical tips and discussing women’s participation have been held as part of the WikiConferences in Mumbai (2011) and Washington, D.C. (2012), as well as individual workshops held in D.C., Pune and Mumbai.
  5. The WikiWomens Collaborative: Launched at the end of September 2012, the Collaborative is a Wikimedia community project with its own Facebook page and Twitter account, designed to create a collaborative (hence the name) and supportive working space for women. Participants share ideas for projects, knowledge about Wikipedia and particularly support efforts to improve content related to women. Projects promoted by the Collaborative include Ada Lovelace Day, when participants were encouraged to improve articles related to women in math and science, including via an edit-a-thon organized by Wikimedia UK and hosted by The Royal Society in London. So far, the Collaborative has over 500 Twitter followers and 414 Likes on Facebook.

With all this activity, it’ll be interesting to see the results of the 2012 Wikipedia Editor Survey to see whether there has been any positive shift in the numbers of female editors. Look for those results early next year. Meanwhile, stay tuned here for my next post discussing gendered patterns of editing and Wikipedia’s knowledge gaps.

My Wikitinerary: Day 3 at Wikimania DC

Tagged as on July 14, 2012 at 6:22 am

Wikimania logoWe have arrived at the last day (of official events) at Wikimania, which begins shortly with an opening plenary by the Wikimedia Foundation’s executive director, Sue Gardner. As expected, my Wikimania attendance yesterday was limited on account of other obligations; today I’ll be around for most of the events. Here are a few of the panels and presentations I’m interested in today:

♦     ♦     ♦

10:30 – 11:50

Title: Getting elected thanks to Wikipedia. Social network influence on politics.
Speaker: Damian Finol
Category: Wikis and the Public Sector
Description: Wikipedia and politicians is a contentious topic—one I wrote about for Campaigns & Elections in April 2010. This seems to be a bit different: it will be focused on Venezuelan politics, but the question: does having a good Wikipedia page help win elections? is one I’d like to hear how others would answer.

Title: Iterate your cross-pollinated strategic synergy, just not on my Wikipedia!
Speaker: Tom Morris
Category: WikiCulture and Community
Description: Like any small community focused on a unique project, Wikipedia and its Wikimedia sister projects have developed a kind of jargon all its own. This talk will focus on the language used on WMF and how it can be simplified for clarity, especially to encourage participation of new editors and non-native English speakers.

Title: Wikimedia on social media
Speaker: Jeromy-Yu Chan, Tango Chan, Slobodan Jakoski, Kiril Simeonovski, Guillaume Paumier, Naveen Francis, Christophe Henner
Category: WikiCulture and Community
Description: As I tweeted the other day, English-speaking Wikipedians are often disdainful of Facebook, for reasons that would take some time to unpack. Twitter too was disfavored for the similar service Identi.ca—the latter is open source, a plus for many—although I think the Twitter has gained a share of acceptance by now. Indeed, the proceedings of Wikimania have been heavily tweeted, just like any conference. So: “The goal of this panel is to share experience on the use of social media throughout the Wikimedia movement, and to share best practices to collectively improve our use of these communication channels.” What are best practices now?

12:10 -13:30

Title: What does THAT mean? Engineering jargon and procedures explained
Speaker: Sumana Harihareswara and possibly Rob Lanphier or additional members of the engineering staff of the Wikimedia Foundation
Category: Technology and Infrastructure
Description: Speaking of jargon, this is supposed to be a non-techie explanation of the technical aspects of Wikimedia. As a non-techie, I could stand for someone to explain how Wikipedia uses squids to me again.

Title: The bad assumptions of the copyright discussion; Blacking out Wikipedia
Speaker: James Alexander; panel
Category: Wikis and the Public Sector
Description: January’s Wikipedia blackout in protest of proposed U.S. legislation tightening copyright and intellectual property enforcement on the web (SOPA and PIPA) was very controversial, and remains so. Jimmy Wales, in his opening plenary, addressed the issue, suggesting blackouts would be considered only for similar issues. The first talk is shorter and appears to be on the issue of copyright. The panel is longer and will discuss the decision to blackout, and how the blackout worked, how the blackout page was designed and the media’s response.

14:30 – 15:50

Title: 11 years of Wikipedia, or the Wikimedia history crash course you can edit
Speaker: Guillaume Paumier
Category: WikiCulture and Community
Description: Exactly what it sounds like, a history lesson on the last 11 year years of Wikimedia/pedia history. This is a 70 minute talk. Having read Andrew Lih’s “The Wikipedia Revolution” and Andrew Dalby’s “The World and Wikipedia” there is probably not much here I won’t know about already, but I still find it interesting nonetheless.

Title: The end of notability
Speaker: David Goodman
Category: WikiCulture and Community
Description: Notability, on Wikipedia, refers to a widely-discussed guideline which recommends whether a given subject deserves a standalone Wikipedia article or not. It is very contentious, it is the inspiration for the ideological split between inclusionists and deletionists, and was a key focus of John Siracusa in the “Hypercritical” podcast episode I wrote about earlier this year. This talk will focus on the topic of notability guidelines and how we can’t always find two reliable sources providing substantial coverage for some topics that probably should have articles. Goodman seems to be suggesting that we have articles on topics people want information about regardless of standard notability, but with a twist: should there be a “Wikipedia Two” to satisfy the many non-notable college athletes and politicians whose fans and supporters would like to create articles about them. Plus, Goodman (DGG on Wikipedia) is a bit of a character, so that should be interesting, too.

♦     ♦     ♦

OK, I’ve got to race down to the GWU campus now if I’m to catch Gardner’s talk. Look for me on Twitter as @thewikipedian, and I’ll write more here soon!

My Wikitinerary: Day 2 at Wikimania DC

Tagged as on July 13, 2012 at 2:19 am

Wikimania logoWikimania Day 1 is on the books, and it was a busy one. Mary Gardiner’s keynote delivered on the mostly-male Wikimedia community’s promise that they care about female participation (and as many noted, the female presence at Wikimania is very strong) while Jimmy Wales fulfilled his role as the conference touchstone, while adding a dose of levity, or two.

Although, did anyone else notice he was credited as “Founder” of Wikipedia and not “Co-founder”? Well, I did.

My coverage of the first day of the conference was doled out in 140-characters-or-fewer bursts on Twitter as @thewikipedian, and so it will be on subsequent days.

As to the first subsequent day ahead: as much as I’d like to give my full day over to Wikimania, regular readers will know that I live here, and Friday I’m still basically on the clock. So I may not get to all the sessions I would like. But here is what I’m hoping to attend:

♦     ♦     ♦

9:00 – 10:20

Time will tell if I make it to the first of the breakout sessions. If I do, it will probably be:

Title: Ask the Operators
Speaker: Leslie Carr, Ben Hartshorne, Jeff Green, Ryan Lane, Rob Halsell
Category: Technology and Infrastructure
Description: Just what it sounds like, a chance to ask the people who keep Wikipedia up and running about how it works, their jobs, and apparently… unicorns? I doubt this session will actually be dominated by bronies, but if it is, then I concede I have been sufficiently warned.

I may also attend:

Title: Giving readers a voice: Lessons from article feedback v5
Speaker: Fabrice Florin
Category: WikiCulture and Community
Description: I missed his presentation on new tools yesterday, and I’m intrigued by this as well. Good feedback is hard to come by, as a Wikipedia editor, and I’m curious to find out how those most involved think the current feedback tool is working. When I wrote about it last year, I was skeptically optimistic.

10:50 – 12:10

If you’re keeping score at home, it seems that I am most interested in the “WikiCulture and Community” sessions, and why shouldn’t I be? The Wikipedian tries to be about making Wikipedia’s goings-on understandable to the non-editor, so this track is a natural fit.

Title: Wikipedia in the Twitter age
Speaker: Panel moderated by Andrew Lih
Category: WikiCulture and Community
Description: How does Wikipedia handle the fast pace of information in the Twitter age? Can Twitter be a reliable source? (I think the correct answer is: generally, no.) The role Twitter played with Wikipedia in the 2011 Egyptian revolution and other breaking news events will be discussed here. And I’m always a fan of Andrew Lih’s take on Wikipedia.

13:10 – 14:30

One of the panels I wanted to see yesterday was rescheduled last-minute for this time period, and I very well may still try to check that out. But I’m also fascinated by this one:

Title: Eternal December: How awful arguments are killing the Wiki, and why not to make them
Speaker: Oliver Keyes
Category: WikiCulture and Community
Description: For good or ill, Wikipedia is a place that many people go to argue about all kinds of things—some very important, and others not so much. This talk will cover the resistance and curmudgeonliness of “Power Editors” and how they prevent the implementation of new developments on Wikipedia and discourage newbies from contributing.

There are other good panels in this time slot, so room-hopping again is a thing I would like to try, although on day one I found it a challenge. If I manage, I like:

Title: Hey, its trending! Let’s update that Wikipedia article!
Speaker: Arkaitz Zubiaga, Taylor Cassidy, Heng Ji
Category: Research, Analysis and Education
Description: This one is a discussion of a possible system that suggests revisions for Wikipedia based on Twitter activity; much Wikipedia editing activity is driven by the news, and Twitter often breaks news before the media has had a chance to write a full story. The panelists will outline goals, details of the system and progress of this research project.

Title: Bots and Wikipedia: It’s OK to be lazy!
Speaker: Gaëtan Landry
Category: Technology and Infrastructure
Description: Although I lack the technical skills to write a real software program myself, I love me some bots. I.e. automated programs that wander around Wikipedia making changes based on an algorithm—fixing common misspellings, reverting obvious vandalism, and the like. The submission says it won’t be highly technical, which is probably good for yours truly.

15:10 – 16:30

I said above that Friday will have to be a working day for me, and it’s very possible that I’ll cut out in the afternoon to wrap some things up for the week. But if I’m still around, I think I may visit:

Title: Refighting the War of 1812 on Wikipedia
Speaker: Richard Jensen
Category: WikiCulture and Community
Description: From the description: “This year is the bicentennial of the War of 1812, and my presentation will examine how Canadian and American editors have handled the war in the main article. Sometimes they re-fought the war, as they balanced scholarship/RS and patriotism in a quest to tell the world what really happened.” I can go in for that.

♦     ♦     ♦

One last shameless plug: and if you’re not following me as @thewikipedian on Twitter, then you’re missing out on a lot of interesting tweets, including some very smart people that I am dedicating, and some things that I hope other people are smart.

I’ll see you there in a few hours!

My Wikitinerary: Day 1 at Wikimania DC

Tagged as on July 12, 2012 at 5:15 am

Wikimania logoIn a few hours, the first day of general activities at Wikimania—the official annual conference of Wikimedia Foundation—begins right here in Washington, DC. It is a global conference, in fact this is the first time Wikimania is being held in the United States since 2006, when it was hosted on the Harvard University campus in Cambridge, Massachusetts.

This year, it just happens to be outside my front door. What’s more, it is being held on the campus of the George Washington University, precisely where I launched this very blog at a (much smaller) conference in March 2009.

So: it’s a big day ahead—big weekend, but I have to focus for now. A review of the official schedule reveals an almost overwhelming number of events. After reading through the various panels and presentations, I think I have a pretty good idea of my day ahead, which I’d like to share here now:

♦     ♦     ♦

09:00 – 11:10

The only place to be, indeed the only official event at this time, is the opening ceremony, keynote and plenary. Most of the wider media attention that Wikimania generates will be probably be focused on Wikipedia co-founder and unofficial mascot Jimmy Wales on “The State of the Wiki”, but I’ll be interested to see the opening keynote by Mary Gardiner, an Australian computer programmer who is also a leader in “increasing participation of women in open technology and culture”. Wikipedia editors have long skewed heavily toward men, but in recent years more attention has been focused on how to change that. I am a skeptic—Wikipedia is hardly alone in this fact, particularly among technology tactics—but I am also interested in hearing what she says, on this very high-profile stage for such a topic.

11:40 – 13:00

Here the breakout sessions begin, and it is truly a poverty of riches from a Wikipedian perspective; there is too much to possibly take all in. What follows is an estimation of the panels I am likely to check out:

Title: “This is my voice”: the motivations of highly active Wikipedians
Speaker: Maryana Pinchuk, Steven Walling
Category: WikiCulture and Community
Description:One of the most common questions I am asked about Wikipedia, and also one of the hardest to answer with anything but anecdotal evidence, is why Wikipedians do what they do. Pinchuk and Walling have interviewed some of the most active Wikipedia editors to study the motives behind why they participate. Intriguingly, their submission includes the following teaser: “Note: after this talk, we will be making a special piece of conference swag available to any interested Wikipedians which will let them show off their own motivation for editing.”

Title: Engaging editors on Wikipedia: A roadmap of new features
Speaker: Fabrice Florin
Category: Technology and Infrastructure
Description: This talk will discuss new features on Wikipedia that make it easier to edit and the impact this will have on attracting new editors and retaining current ones. This follows a 20-minute talk so if I leave right after the Pinchuk / Walling’s talk and sneak in quietly I can probably catch most of this. At least, I presume this will work. You can never really tell how a conference will work until one arrives.

14:00 – 15:20

Title: A talk page is a broken message wall: Building a more efficient communication
Speaker: Danny Horn, Tomasz Odrobny
Category: WikiCulture and Community
Description: These days, I spend more time on Wikipedia’s discussion pages than I do editing the encyclopedia itself, so I am extremely familiar with how these pages work—and how they don’t. This presentation will demonstrate a new talk feature that will make it easier to track conversations you are interested in without receiving watchlist notifications about topics you don’t actually care about. Interesting! Although Wikipedia has put much more public attention on a forthcoming WYSIWYG editor, I think this could actually be a bigger deal. If it works, of course.

The above talk is followed by another one that I find fascinating for exploring the insider-outsider dynamic around Wikipedia, featuring the presenters from the first breakout session:

Title: Welcome to Wikipedia, now please go away? improving how we communicate with new editors
Speaker: Steven Walling, Maryana Pinchuk
Category: WikiCulture and Community
Description: On Wikipedia, veteran editors run across the same kind of activity by new editors so often that they have developed a deep reserve of templated messages—some friendly, many unfriendly. According to the session’s topic page, “On English Wikipedia and many other projects, automated warnings and welcomes currently make up about 80% of first messages to new editors.” Wow. I had not thought about it before, but it makes complete sense. I’ll be curious to see where the state-of-the-art thinking is on this topic.

15:40 – 17:00

For the final breakout session, there is one long sustained discussion of Wikidata that I am awfully tempted to spend my time at, but there is another talk that I find interesting within this period:

Title: How Wikidata fits into the global web of data; Wikidata implementation and integration; Wikidata as a platform
Speaker: Denny Vrandečić; Daniel Kinzler; Jeroen De Dauw
Category: Technology and Infrastructure
Description: What is Wikidata? Indeed, what is it precisely. It is only the most ambitious new Wikimedia Foundation project to launch in recent years. As the first panel description says: “Wikidata’s goal is to move the rich structured data currently encoded in Wikipedia templates into a central repository, which will be available for re-use on all Wikimedia projects, but also to 3rd party services. We will introduce what Wikidata aims to do and how: centralizing language links, centralizing data for the infoboxes, and all of that in the first new Wikimedia project since 2006.” Yeah, that’s not too ambitious. The first talk appears to be more of an overview and the two following it seem to be more technical.
Location: Grand Ballroom
Length: Each talk is 25 minutes

Title: Wikimedia relations with government, lobbying and public relations
Speaker: James Forrester, Philippe Beaudette
Category: Wikis and the Public Sector
Description: If Wikidata gets too technical for me, I’ll be heading over to this panel. In my professional life public relations is one of my primary activities, often involving Wikipedia—as I have written about before—and so I will be very interested to see where this discussion goes. If there is any presentation where I am likely to participate, this may be it, depending on where the discussion goes. Why not come find out?

♦     ♦     ♦

And that is the end of the official activities for the day. More events stretch into the evening, but I won’t be at them. Tonight, Roger Waters brings The Wall to the Verizon Center, which I will be seeing with a friend from high school and college in town just for this event. Actually, we’ll be seeing this if StubHub and FedEx combine to deliver these tickets during the day today, which they have so far been rather slow about.

I know… this has nothing to do with Wikipedia. But it’s highly relevant to my day ahead at Wikimania. Fingers crossed everything works out! Meantime, I will be tweeting the day’s activities from my @thewikipedian Twitter account, so please follow! And if all goes well I will post tomorrow’s wikitinerary here soon.

Two Wikipedia Co-Founders, Two Very Different Causes

Tagged as , , , , ,
on June 29, 2012 at 3:58 pm

The Wikipedian has been occupied with other projects, and fairly quiet as of late. The good news is that, with the Wikimania global conference just around the corner, I’ll be writing more here in the near future. And I really do mean just around the corner: Wikimania 2012 will be held in the city I call home, Washington, DC.

Meanwhile, here’s something I’ve noticed that I don’t think other Wikipedia commentators have remarked upon: the divergent activism of its two co-founders, its still closely involved spiritual leader and unofficial mascot Jimmy Wales, and estranged, erstwhile rival Larry Sanger. Although both men might be broadly described as libertarian—as legend has it, they first met on an Internet discussion forum for Objectivists—and yet their causes today are all but diametrically opposed.

In the last week, Wales has publicly opposed U.S. Department of Justice plans to extradite a British student, Richard O’Dwyer, for (allegedly) knowingly enabling copyright violations by users of a website he once operated (since shuttered). Although based in the UK, O’Dwyer’s domain was registered in the U.S.—hence the federal government’s interest. Wales’ point, made in a Guardian op-ed:

One of the important moral principles that has made everything we relish about the Internet possible, from Wikipedia to YouTube, is that Internet service providers need to have a safe harbour from what their users do.

A fair point? Sure. Self-serving? Most certainly! Wikipedia is always making someone mad because anonymous individuals use the site to spread malicious, sometimes defamatory, occasionally offensive material, true or false. In fact, someones like… none other than Larry Sanger.

In recent months, Larry Sanger has has taken up a more conservative cause, focused on some of Wikipedia’s more controversial content. Sanger is critical of Wikipedia for allowing the inclusion of sexually explicit photos on articles about sexually explicit topics, and moreso Wikipedia’s sister site Wikimedia Commons, for allowing users to upload even more graphic photos, many of which serve no purpose except to titillate the uploader, and disgust most others. Here’s an exhaustive report by Internet buzz beacon BuzzFeed, on one such example (highly NSFW, even with blurring).

Wales remains squarely within the camp of Internet libertarians, lending support to those who do things we may not like, but whom we may defend on principles of freedom. It is also consistent with his previous activism against U.S.-based SOPA and PIPA legislation, which I wrote about in January.

From a Wikipedia perspective, the key difference is this: in this case, Wales is seeking to use only his celebrity (which is considerable, in Internet terms) to draw attention to his cause, rather than enlisting the power of Wikipedia’s community as a force multiplier. The matter has been the subject of much discussion on Wales’ Talk page (basically a water cooler for Wikipedians) this week, led by the following comment:

As someone who strenuously opposed the political advocacy pursued by the Wikimedia Foundation early this year … I commend your decision to take action on the O’Dwyer case as Wikipedia founder and respected opinion leader as opposed to (additionally) trying to light a fire under the editing community.

Sanger has far less celebrity to wield (even in Internet cricles). Earlier in June, Sanger was interviewed by TechCrunch to discuss these topics, and as he said in a tweet aimed partially at yours truly:

Wikipedia, choose two: (1) call yourself kid-friendly; (2) host lots of porn; (3) be filter-free.

Not a bad point there, either.

I don’t mean to wade into this controversy myself. I find myself largely in agreement with both men on some broad points, contradictory as that may seem, although I think the long-run implications of both issues are more difficult to assess.

As for reservations about Wales’ petition: are we to be ISP freedom absolutists? Is there no “fire in a crowded theater” moment? As for reservations about Sanger’s cause: how are we to determine what serves a genuine informational purpose, and how do we balance this against Wikipedia’s longstanding and admirable policy that it is “not censored”?

I don’t know the answer, but if you think you do, I welcome your response in the comments.

Regarding the Uncertain Future of Encyclopædia Britannica

Tagged as , , , , , , ,
on March 14, 2012 at 5:01 pm

Yesterday, Encyclopædia Britannica made the startling announcement that they would discontinue their print edition after 244 years. Once the current edition has sold out, they’ll become a collector’s item. Which is essentially what they are now, if it’s not too uncharitable to point out. Britannica is not finished as an operation, however: it will continue to publish on the web. It’s a startling announcement, sure, but it makes more sense than if it went on as if nothing had changed. Britannica’s editors acknowledged as much in a post on their blog:

A momentous event? In some ways, yes; the set is, after all, nearly a quarter of a millennium old. But in a larger sense this is just another historical data point in the evolution of human knowledge.

But Britannica’s grip on the evolution of human knowledge isn’t what it used to be—you can see where I’m going, right? As a well-known quote from Jimbo Wales goes:

Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge. That’s what we’re doing.

Since its launch in 2001, and especially since a (much-debated) 2005 Nature article comparing the two, Wikipedia has been a thorn in Britannica’s side. And its influence has long since surpassed its much older rival. A Quantcast comparison suggests that Wikipedia’s traffic is 30x that of Britannica’s. And as I tweeted last night, news organizations have been quick to note the competition.

Under the title “Death By Wikipedia: Encyclopedia Britannica Stops Printing”, ReadWriteWeb observes:

The usefulness of such reference materials has been on the decline for years, especially since the advent of Wikipedia. Whatever flaws its open, crowd-sourced editorial model may invite, Wikipedia is generally regarded as a comprehensive and mostly-accurate source of information, which can be accessed for free.

And in a Venture Beat article titled “Encyclopaedia Britannica wiped out by Wikipedia, selling final print edition” we find:

The extremely thorough Wikipedia article on Encyclopaedia Britannica … serves as the perfect example of why Wikipedia is coming out on top.

It’s true—Wikipedia’s article about Encyclopædia Britannica is very thorough. Britannica’s article about Wikipedia is not bad, but it is far more limited than Wikipedia’s article about itself, and Britannica has those annoying pop-up advertisements that do nothing for readers.

Yet Britannica president Jorge Cauz tells the The Washington Post:

This has nothing to do with Wikipedia or Google. … This has to do with the fact that now Britannica sells its digital products to a large number of people.

This is a little bit like Microsoft saying Windows 8 has nothing to do with the the iPad, merely the shift in consumer purchasing habits toward the tablet and mobile markets. That’s not to say the statement isn’t necessarily untrue, just that it’s complete. I don’t know a great deal about Britannica’s current business model, but it’s safe to say that non-print revenues have become far more important, as Britannica’s print sales have fallen. Whether they will succeed is another question; PC World and doesn’t think so, pointing out the closure of—speak of the devil—Microsoft’s online encyclopedia Encarta in 2009 (which I wrote about at the time):

Microsoft shuttered its digital multimedia encyclopedia, Encarta, in 2009, and the last trace of it, the online dictionary, closed last year. Encarta, though a digital product, was also made obsolete by Wikipedia’s free availability, constantly updated content and thousands of editors, contributors and volunteers from around the world.

At The Atlantic, expert on evolution and Bloggingheads impresario Robert Wright offers this (small) consolation:

Maybe, long after even the electronic edition of Britannica is gone, the idea of Britannica can remain for us what it once was for me–a kind of Platonic ideal that we aspire to evolve toward even if we can never reach it, something that has a kind of reality even if we can never touch it.

As someone who devoured Britannica in my school library when growing up, not to mention someone who relied on Britannica as a college student in the late 1990s (before Britannica added a pay wall)—much the same way as students today (notoriously) rely on Wikipedia —I’m sorry to see it go. But we no longer live in a world where a 30,000 page, 15-volume encyclopedia can be printed on an annual basis for profit. In fact, even Britannica sees itself as a collector’s item now; as Cauz tells the News Observer:

This is going to be as rare as the first edition, because the last print run of our last copyright was one of the smallest print runs.”

I’d love to own one myself, but at $1,395.00 for the “Final Print Edition”, I’m afraid I’ll have to pass. And perhaps Cauz is wrong; maybe the death of Britannica will be more like the Death of Superman.

Verifiability and Truth: What John Siracusa Doesn’t Get About Wikipedia

Tagged as , , , , , , , , ,
on February 2, 2012 at 6:50 pm

One of my favorite podcasts is Hypercritical, co-hosted by and principally featuring the thoughtful criticisms of John Siracusa, a sometime columnist for Ars Technica and Internet-famous Apple pundit. The show’s tagline calls it: “A weekly talk show ruminating on exactly what is wrong in the world of Apple and related technologies and businesses. Nothing is so perfect that it can’t be complained about.” Last week’s edition—“Marked for Deletion”—was about something far from perfect, but of great interest to this blog: Wikipedia.

If you want to listen for yourself, jump to about 1:11:55 (yes, more than an hour into the show) where Siracusa and co-host Dan Benjamin turn the discussion to Wikipedia. And a warning: this is going to be long. Consider it homage.

♦     ♦     ♦

Promisingly, Siracusa begins by asking his co-host to answer, if he can, “what Wikipedia is”. The answer is pretty good for an outsider: it’s a place for sharing information and collaboratively building a resource for (hopefully) accurate information on almost any topic. In general, this will do. But it’s not quite right, as Siracusa explains by recounting his personal experience of trying, in vain, to defend an article from deletion. With five years to reflect on it, Siracusa describes his efforts as a “prototypical example of someone who does not understand what Wikipedia is, proving that he does not understand what Wikipedia is.”

All of this is a way of getting to Siracusa’s fascination—one might say morbid fascination—with Wikipedia’s policy of “Verifiability”. The first paragraph of the policy says:

Verifiability on Wikipedia is the ability to cite reliable sources that directly support the information in an article. All information in Wikipedia must be verifiable, but because other policies and guidelines also influence content, verifiability does not guarantee inclusion. The threshold for inclusion in Wikipedia is verifiability, not truth—whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think unsourced material is true.

Or as Siracusa summarizes it: “Something can be as true as you want it to be, if it is not verifiable, it doesn’t go in.” Well said.

He also discusses the related policy of “No original research”. This includes a good explication of the different types of sources that may or may not be used on Wikipedia: primary sources (original documents and first-hand accounts), secondary sources (news articles interpreting primary sources) and tertiary sources (encyclopedias and academic articles summarizing the former). This is advanced stuff, and for a longtime Wikipedian, it’s no small thrill to hear a smart outsider explain why secondary sources are preferred, and work through the fundamental policies of Wikipedia. Siracusa correctly observes: “Wikipedia is not a place where you write down stuff that you know. … Wikipedia writes about other people writing about things.”

Except here’s the thing: Siracusa understands Wikipedia’s core content policies. He just doesn’t like them.

In his particular example, a former standalone article called FTFF (here’s what it used to look like) didn’t survive the process not because it wasn’t true, but (he says) because it contained material that wasn’t verifiable, and constituted original research. This is partly true, but it owes more to a guideline that got only passing mention on the show (and, frankly, in the deletion debate): “Notability”, and specifically the “General notability guideline”. It’s closely tied in with WP:VERIFY and WP:ORIGINAL, and basically says that a topic must have sufficient coverage in secondary sources to be given its own standalone page. FTFF was not, and the result of the debate was to merge the topic to Finder_(software)#Criticism.

Anyway, this pedantry about WP:NOTE and WP:GNG doesn’t affect Siracusa’s main point: If something is true but unverifiable, he would like to see it included in Wikipedia anyway. Nor does it affect his corollary argument, that Wikipedia’s complex rules discourage many would-be participants.

He’s undoubtedly right about the second point: many people try to get involved with Wikipedia who have no idea what it’s really about, and they tend to have a really bad experience. Wikipedia struggles to explain itself to outsiders, and it probably always will.

As to the former, the problem is that he fails to grapple with the implications of the Wikipedia he describes, and this is disappointing. By privileging “truth” above “verifiability”, one gets the impression he’s describing a Rashomon-like Wikipedia where all possible viewpoints are explored, and somehow eventually Wikipedia just makes the right call. This assumes a lot, not least that contentious topics wouldn’t simply devolve into edit wars of unchecked aggression. In a world where Wikipedia aims for truth but eschews verifiability, there are no footholds upon which to steady an argument. There is no way to know what should be considered credible or otherwise.

At times it actually sounds like he’s advocating something that already exists: reliance on “Consensus” for determining how Wikipedia will address the topics it covers. Wikipedia policies and guidelines don’t cover everything, and this is where consensus steps in, however imperfectly. If you’ve ever wondered why there is sometimes an observable discrepancy in the depth or quality of coverage between topics, consensus is the big reason why, and moreso the self-selection that shapes consensus. The current, real-world Wikipedia refers to outside authorities as well as consensus among editors; Siracusa’s Bizarro World Wikipedia would jettison the former and rely solely on the latter.

Meanwhile, Siracusa ascribes Wikipedia’s Byzantine rule structure to Wikipedians’ desire for approval from educators and academics, which he thinks is holding back Wikipedia from what it could become. He repeatedly says “Wikipedia should be something different” and refers to “what’s different about online” but he never gets prescriptive and never actually says why the old methods are outmoded. He does say his Wikipedia would seek to “arrive at truth using every tool necessary” and would, for example, allow original research… but what then is the mechanism for (dare I say) verifying it?

At one point, Siracusa compares the popular, widely-viewed Ars Technica forums to a hypothetical low-circulation print magazine, and complains that the widely-read former site is an invalid source while the unpopular latter publication is acceptable. It’s true that Wikipedia does not necessarily take a populist approach to evaluating sources, but he’s far off the mark in his attempt to explain this: “They’re not cool with the old librarians, because they’re not paper.”

I hope that he was just being lazy and doesn’t actually think that Wikipedia editors prefer paper (if anything they actually prefer online sources, which are easier to check) but he completely misses a key dynamic that ties back to verifiability: the paper magazine with poor circulation at least will have editors who are presumed to care about fact-checking and accuracy. A web forum, however popular it may be, may have moderators, but that’s not the same thing as having an editor. A discussion group is not an editorial operation, period. The forum is a primary source, and so should only be used to support reliable sources.

There are, however, reliable web sources. One of them is the editorial side of Ars Technica; no less an authority than John Siracusa has been cited in approximately 150 different Wikipedia articles about the Macintosh and other technology subjects.

♦     ♦     ♦

I’m sorry to say this, but in the show’s last fifteen minutes, Siracusa pretty much descends into total incoherence. Here’s his summary statement, close to verbatim:

[There are] many flaws in verifiability and reliability of sources. It’s built on a foundation of sand. Notability, what’s a reliable source, those things become so key to making Wikipedia crappy or good, and those sands are constantly always shifting, you know? And so if Wikipedia was centered on truth and that was its final goal, yeah, it would have to include citations and verifiability and stuff like that, but there would never be any argument when the two are in conflict. You know, if you could prove that a series of events happened here, then you could say, well, it’s verifiable, it appeared in a reliable source, but it’s not the truth. And so therefore we should expunge that. Because the final goal of Wikipedia is truth. But the final goal of Wikipedia is not truth, it’s verifiability.

There would “never be any argument” about what is the truth? In the parlance of Wikipedia: [citation needed].

Look, this is an epistemological issue, one much larger than just Wikipedia. The reason Wikipedia’s goal is verifiability, not truth, is because verifiability is an achievable goal. In fact, verifiability is a necessary step toward establishing truth, as Siracusa at this point seems to acknowledge in his imagined alternate, truth-seeking Wikipedia.

It’s not that Wikipedia is actively hostile to the truth: it’s just agnostic as to what it might be. Wikipedia articles are like road signs; truth itself may be unknowable, and we may never arrive at our destination, but Wikipedia can point in the right direction. Wikipedia’s policies and guidelines are designed to make sure that its content does that, although it’s fair to acknowledge that it’s not guaranteed. But what is? And what is truth?

Anyway, there’s a user essay on Wikipedia called “Verifiability, not truth” that says this better than I am going to. Here’s the key point:

That we have rules for the inclusion of material does not mean Wikipedians have no respect for truth and accuracy, just as a court’s reliance on rules of evidence does not mean the court does not respect truth. Wikipedia values accuracy, but it requires verifiability. Unlike some encyclopedias, Wikipedia does not try to impose “the truth” on its readers, and does not ask that they trust something just because they read it in Wikipedia. We empower our readers. We don’t ask for their blind trust.

If you want to upset the old system and do something new, you actually do need to think through what should replace it. Siracusa never does.

If he thinks Wikipedia’s adherence to “old world” rules is driving away contributors, he should consider what the free-for-all alternative would look like. It isn’t a Wikipedia I would spend any time with, it’s not one that Google would be eager to rank so highly, and it wouldn’t be the most important reference site on the Internet.

The Top 10 Wikipedia Stories of 2011

Tagged as , , , , , , , , , , , , , ,
on December 31, 2011 at 10:07 pm

A year ago, I wrote a blog post called “The Top 10 Wikipedia Stories of 2010”. Perhaps, then, I should write a follow-up this year? For some reason, I’m having a harder time of it. Was 2011 less of a newsworthy year for Wikipedia? Not if this Google Insights for Search analysis of Wikipedia-related news stories is to be believed: if anything, Wikipedia was a more prominent news generator this year than last. Make what you will of the proprietary, nontransparent methodology of Google’s news judgment, but at least it seems Wikipedia has been plenty newsworthy.

It’s my personal judgment that Wikipedia was somehow less newsworthy than it was last year. Maybe that speaks to the absence of WikiLeaks / Wikipedia confusion in the public discussion, or maybe it speaks to the fact that I think some of the big topics simply repeat.

Whichever is the case, I say let’s do what we did last year, and count down through the most important and / or impactful news stories about the year in Wikipedia, using my own proprietary, nontransparent methodology, which is to say these are my personal judgments:

10. Superinjunctions — In May, Wikipedia was one of several websites (notably also Twitter) that came into conflict with UK court orders—”superinjunctions”—seeking to suppress scandalous gossip about sports and film celebrities (I know, right?). Wikipedia servers, like Twitter’s, are based in the U.S. and so are protected by the First Amendment. But that doesn’t mean some won’t try.

9. Wikipedia and education — This was on the list last year, and even though there was no singular event to point to, I’m going to include it again. Wikipedia remains a major subject of controversy at both the university and secondary levels, and while teacher attitudes are changing, and Wikipedia is making efforts to work with them, much confusion remains and resistance continues to exist. (But is probably futile.)

8. Wikipedia meddling — Politicians don’t fare well when they try to edit Wikipedia. Nor do some famous newspaper columnists. You know who seems to an even worse job of this? PR firms. As I’ve written about more than once, it’s not impossible to contribute to Wikipedia on a topic you are close to without getting burned, but those who are determined to subvert Wikipedia will keep getting burned.

7. Drawbacks of Wikipedia’s openness — It’s not just politicians who sometimes run afoul of Wikipedia… their supporters do, too. This summer, Sarah Palin said something about Paul Revere that was factually inaccurate, and anonymous someones presumed to be in her corner tried to change relevant Wikipedia articles… and then a few days later, Michele Bachmann said something about John Wayne’s hometown that was incorrect and John Quincy Adams’ status as a founding father that basically is too, and unhelpful Wikipedia edits commenced. Oh, and of course Stephen Colbert was there to fan the flames. To paraphrase a real founding father, if eternal vigilance is the price of liberty, so too is it the price of an online encyclopedia anyone can edit.

6. But how open is it, really? — This will come up again later, but many Wikipedians have become concerned that Wikipedia is too difficult to use, both for reasons related to the community and the once-revolutionary but now-creaky collaborative tools (i.e. the MediaWiki software that powers Wikipedia and its sister sites) and the often-insular community that defines it. Over Thanksgiving weekend, search engine-focused blogger Danny Sullivan published a blog post blasting Wikipedia for being “closed” and “unfriendly” and, even though he wasn’t very friendly (read: a total jerk) in his brief on-site activity, his point that Wikipedia is difficult to use is not incorrect. Wikipedia volunteer developers have created multiple versions of an Article Feedback Tool, something called “WikiLove”, a rather condescending smiley face / frowny face tool still in testing, and there are more user interface (UI) changes in store. But if the community itself is the issue, that’s a much trickier question.

5. Integration with museums and archives — One of the most interesting things happening on Wikipedia these days is the GLAM (Galleries, Libraries, Archives and Museums) project, in which researchers collaborate with the aforementioned institutions to make their material more easily accessed by Wikipedians for use on Wikipedia. Started by Liam Wyatt, who received considerable attention in 2010 for a stint as “Wikipedian in residence” at the British Museum, the project has grown far beyond him. In the U.S., the Smithsonian and National Archives are now participants, with attention paid by The Atlantic, among other news organizations. If Wikipedia’s reputation for accuracy and depth improves in the years ahead, the GLAM project will play a big part.

4. Wikipedia’s gender imbalance — As I asked in February: “Could it really be that just 13% of Wikipedia editors are women?” Well, nobody knows for sure, but this is the percentage of women who participated in the Wikimedia Foundation’s most recent editors survey, and in 2011 the issue attracted renewed attention. A story in the New York Times by the publication’s lead wiki-watcher, Noam Cohen, led to new internal discussion over the site’s gender balance, a renewed outreach effort by Wikimedia executive director Sue Gardener, and and a Wikipedia “fork” of the Change the Ratio campaign spearheaded by my friend Amy Senger. Has it worked? Well… who’s to say just yet? It seems unlikely that Wikipedia participation will reflect the actual gender balance of the wider world—and I would say it needn’t actually do that—but all parties would probably be happy to see a measurable uptick when the next survey rolls around.

3. Wikipedia occupies itself — In early October, the Italian-language Wikipedia edition turned off the lights temporarily in protest against a proposed law that would require websites to issue corrections, or face penalties. The protest received worldwide coverage; the proposed law has not become law. According to Google Insights, this was in fact the most-searched Wikipedia-related news story of the year, but I’m exercising my own editorial discretion here. Meanwhile on the (much more widely read) English-language Wikipedia, similar measures have been considered in response to the U.S. Stop Online Privacy Act (SOPA) however nothing has come of it (yet).

2. Falling editor retention — I begin with the caveat that this should probably be number one; this might seem a bit esoteric to the outsider, but in fact this is a proxy for questions about the long-term survivability of Wikipedia as a project, and is such a huge topic that I can’t properly wrap my head around it.

In August, I wrote a response to a Gawker post titled “Wikipedia is Slowly Dying”, arguing that Wikipedia had lost its mojo, and the “cognitive surplus” that helped build it had now moved on to places like Facebook and Twitter. This is wrong for reasons I only partly articulated at the time, but there’s no question that Wikipedia has fewer editors than it did last year, and the year before, and the year before.

The Wikimedia Foundation’s own research shows that new editors face longer articles offering fewer clear opportunities to get involved (which shouldn’t be a surprise, given the site’s impressive growth) and have a harder time making their edits stick.

The above chart, also prepared by the Wikimedia Foundation, shows it is clearly in flux: the explosive growth of participation crested several years ago, has been in slow decline since. No one really knows what’s going on with the direction of Wikipedia’s participation rate—regardless of gender—but it has been a major topic of discussion and will continue to be.

1. Wikipedia’s 10th anniversary — My choice for the top story last year was also about Wikipedia—the controversy over its ubiquitous fundraising banners—and so it is again. As much as Wikipedia strives to avoid self-referentiality in its own encyclopedia pages, the one thing Wikipedians have in common (and they often do not have much) is a fascination with Wikipedia. And this year was a big milestone: the 10th anniversary since Jimmy Wales (and, oh yeah, Larry Sanger) started up a “wiki” encyclopedia, very much as an afterthought.

To celebrate the milestone, Wikipedia held events around the world, and it happened to be a good time to be a Wikipedia commentator: I was interviewed for Ukrainian TV, and I collaborated with the creative agency JESS3 to produce a web video called “The State of Wikipedia”, narrated by Jimbo himself. As of this writing, it has more than 135,000 views on YouTube, making it one of the bigger things I did this year. Here’s looking forward to an interesting 2012.

This Wikipedia Article Is Not Yet Rated

Tagged as , ,
on September 7, 2011 at 1:18 pm

Even if you’re a very casual Wikipedia reader (which I assume is not the case, or you wouldn’t be here right now) you might have noticed a few new features* at Wikipedia in recent weeks and months. Most noticeably, the Article Feedback Tool, pictured below.

And it takes a single click to see the ratings on a given article. In the following example, a number of readers have already expressed their opinion of the (very short and currently unreferenced) article about the new Clap Your Hands Say Yeah album, which isn’t supposed to be released until later this month (thanks, Spotify / BitTorrent!).

It’s not entirely clear what the long-range prospects for the tool may be. Unlike flagged revisions, it isn’t slated for a vote and approval or removal; indeed, it’s now listed on every Wikipedia article that you visit, and it will continue to be for the indefinite future.

But that doesn’t mean it will necessarily remain static. An invitation to “please take a moment to rate this page” has already been changed. More questions are surely in store, especially as some very good questions have been raised, such as who’s to say what it means to be “highly knowledgable” in a given subject area?

Certain aspects of its implementation, though, are quite clever. For example, any rating assigned to an article that itself may change often cannot be considered good for long, right? This has been anticipated: ratings expire after 30 edits have been made on a given page, and if you’ve rated a page before, you can re-rate it then.

Some Wikipedians have also asked for a statistical tool charting the data over time, which would be very cool to see. Like most Wikipedia projects, all information captured is available through its API, so anyone could build one if they wanted. A good example of this kind of ad hoc service is User:Henrik’s Wikipedia article traffic statistics tool.

Meanwhile, it also opens a new Pandora’s box for Wikipedia (as if it didn’t already have plenty). Perhaps the biggest concern ahead is that the ratings can be gamed; as Liam “Wittylama” Wyatt (known particularly for his work with the British Museum) has pointed out, the top-rated article (4.9 out of 5 stars) is something called the VAD 43 MRC Klang Chapter. About which, well, have a look for yourself.

I think the concept of article ratings is an idea whose time is coming, if that time is not yet now. These ratings have a long way to go before they should be considered a barometer of anything. It’s a good start, but still just that.

*The other is one asking how you feel about editing Wikipedia, complete with a choice of smiley and frowny faces, but I haven’t seen it lately.

Is Wikipedia “Slowly Dying”?

Tagged as , , , , , , , ,
on August 5, 2011 at 11:27 am

Here’s a provocative blog post from Gawker’s Adrian Chen yesterday: “Is Wikipedia Slowly Dying?”. It’s based on a provocative comment by none other than Wikipedia’s Jimmy Wales at Wikimania, the annual conference for Wikipedia and its sister wiki sites. Of course, that’s not quite what Wales said, but the Associated Press story Chen’s post is based on is not so far off:

“We are not replenishing our ranks,” said Wales. “It is not a crisis, but I consider it to be important.”

Administrators of the Internet’s fifth most visited website are working to simplify the way users can contribute and edit material. “A lot of it is convoluted,” Wales said. “A lot of editorial guidelines … are impenetrable to new users.”

It’s also not a new concern. In March the Wikimedia Foundation published its latest study of editor participation, showing a decline in editor participation compared with a couple years ago, although it certainly still has more contributors than a couple years before that. In my post on the subject, “Trendy Thinking: Contemplating Wikipedia Contributorship”, I included a Wikimedia-generated chart that shows what Wales is talking about:

From 2001 through 2006, participation grew exponentially, slowed at its peak in 2007, and has decreased at a steady rate in the years since. A number of theories have been floated to explain the decline. Via the AP, Wales offers a very common one: with almost 3.7 million articles in the English-language edition, the project of buiding Wikipedia has mostly already been done. But he also offers one that I hadn’t really considered before:

Wales said the typical profile of a contributor is “a 26-year-old geeky male” who moves on to other ventures, gets married and leaves the website.

There is some evidence for this in the survey results. Turn to page five of an earlier survey report (PDF) and you’ll see that more than 75% of editors (technically, survey respondents who called themselves editors) are younger than 30, and of the remaining quarter, half again are in their thirties. It may be that only 12.5% of Wikipedia editors are older than 40.

This situation points toward a perhaps unlikely but perhaps untapped editor group: retired persons. In fact, it was my expectation to find a higher percentage of older editors—something like a reverse bell curve—showing greater participation by the young and old, with those in the middle with careers and young children contributing less frequently. In my personal experience on the site, some dedicated editors—some of the best, in my estimation—are middle aged or older. Yet the survey plausibly explains why they are statistically less common:

The last group is characterised by the fact that its members started to use / contribute to Wikipedia at a comparably old age. However, since the age range of this group is very broad, it covers persons that grew up with the Internet as well as persons that had to learn to use new media past their school and university time.

Someone who was 39 when Wikipedia was created is now 49 or 50, and actuarial realities will continue to produce a general population that is ever-more Internet-savvy, and therefore ever-more inclined to edit Wikipedia. That is to say, those who were once young editors may return as old editors.

Back at Gawker, the comment section offers another complaint to which Wales only alludes. The pseudonymous SoCalMalaise writes:

I used to write and edit Wikipedia a lot. Some long articles are almost entirely written by me. It was a way to fine tune both my research and writing skills and enjoy the novelty of writing something that thousands (millions?) of people read. But soon I found that your work is frequently stifled by so-called “administrators” who are usually high school or college students with sub-par research and writing skills. These trolls have created a Kafka-esque labyrinth of self-contradictory “policies” and “guidelines” that they used to remove sentences, paragraphs, sections or even entire articles that skilled writers have volunteered to put down. They cherry-pick various parts of their rules as an excuse to act out their God complexes and strike out content. … And I’m not talking about a few bad apples. These people are everywhere! The whole writing-for-Wikipedia thing became very frustrating and just not worth my time.

It’s difficult to generalize from any one person’s experience, and who knows what common-but-non-obvious mistakes SoCalMalaise might have made, but the sentiment is certainly not unheard-of.

Thing is, for every complaint about overzealous editors and sticklers for arcane rules, there’s a complaint about uninformed editors who show little respect for common-sense rules. I have to admit, I’m more of the latter complaint—it is sticklers for policies and guidelines who enforce a minimum level of quality required for new additions, and therefore maintain a semblance of article quality. Myself, I spent a lot of time learning how Wikipedia works. It took several years before I was able to contribute at a high level, creating new entries or significantly improving existing ones. I am polite when I find someone is doing it wrong, although I know also that some are not.

Meanwhile, the organized core of the community has spent a lot of time, especially recently, trying to figure out how to retain those who give Wikipedia a try. There is the WikiLove campaign, which has received some media attention, but I’ll have to explain my skepticism another time. I’ve also heard that new account registrants are sometimes asked to identify areas of interest, which sounds like an interesting idea, but as far as I can tell it hasn’t been widely deployed.

Ultimately, whether Wikipedia’s declining user base represents a problem is not a question that exists in a vacuum. The question is really whether Wikipedia has enough editors to keep getting better or, at the very least, maintain its current level of quality. There are multiple answers here. As I’ve pointed out before, the Wikipedia community’s rapid response to breaking news is impressive: if you want a good primer on the United States debt ceiling crisis, Wikipedia has a very strong and evolving summary. But Wikipedia sometimes fares poorly with articles on many pre-Internet topics, especially in the social sciences: if you want to know about Money market funds, I’m not sure I can recommend Wikipedia.

It’s worth taking stock of the fact that Wikipedia’s decline among editors is a bit more than gradual, but does not now appear to be accelerating. The next two years will be telling, but I suspect that Wikipedia’s contributor base will find its floor, and my guess—though it is only that—is that we’re probably somewhere near it. Wikipedia is no longer the new hotness, and let’s face it, it’s an encyclopedia. To most it is far less thrilling and far more challenging than YouTube or Facebook, and we shouldn’t expect that Wikipedia’s participation will look anything like it. It’s no less popular as a destination for readers, and it would take a very significant drop in article quality for that to happen. (Like, say, if Wikipedia’s vandal patrol disappeared tomorrow… if anyone, send your WikiLove to them.)

I think the current situation also raises a question that many Wikipedians are loathe to consider, but that is the professionalization of some aspects of Wikipedia. This doesn’t necessarily mean hiring editors, but it could mean working out partnerships to share in the responsibility of maintenance and development of software and perhaps even some content. It’s an article of faith that much of Wikipedia’s early growth and unique characteristics derive from its volunteer force, but as any business professor can tell you, the skill set that launches a viable company is not the same skill set that brings that company to maturity. There is precedent for this; Wikipedia needs the Wikimedia Foundation, which does have a paid staff, although they avoid organized involvement in matters of content, except as individuals. Ultimately, Wikipedia must remain in the hands of its volunteer editors—to change that would be too fundamental a shift. But as Wikipedia grows more complex, it’s not hard to think they could use greater support.

Trendy Thinking: Contemplating Wikipedia Contributorship

Tagged as , , ,
on March 17, 2011 at 8:49 am

Last week, the Wikimedia Foundation published some early results in an ongoing study of trends in editor participation, both in a detailed analysis by the survey’s leaders and a general summary by Wikimedia executive director Sue Gardner. I’d actually started writing a summary of my own before I read Gardner’s letter… only to find that Gardner had already made the exact same “Eternal September” comparison as I had planned. (Which makes sense, since I first learned of the term from Wikipedia.) Anyhow, both are worth reading if you are so inclined, but here’s a key excerpt from Gardner’s summary:

Between 2005 and 2007, newbies started having real trouble successfully joining the Wikimedia community. Before 2005 in the English Wikipedia, nearly 40% of new editors would still be active a year after their first edit. After 2007, only about 12-15% of new editors were still active a year after their first edit. Post-2007, lots of people were still trying to become Wikipedia editors. What had changed, though, is that they were increasingly failing to integrate into the Wikipedia community, and failing increasingly quickly. The Wikimedia community had become too hard to penetrate.

In the first half of Wikipedia’s first ten years, it experienced exponential growth in the absolute number of editors, from barely 100 active participants in 2001 to about 44,000 in 2006. The community continued to grow in 2007, cresting at nearly 52,000 active editors. Interestingly, though, 2007 brought fewer new editors: the peak owed to a one-year spike in retention. Thereafter, the number of total editors (and new editors) has dropped each year, with about 33,000 active contributors in 2010. Granted, that’s a pretty big drop. While it hasn’t bottomed out, it does seem to be stabilizing.

At the moment, Wikipedia has somewhat fewer editors than it had in 2006 and more than double the editors it had in 2005. But it only has slightly more new editors than it did that year: about 13,000 in 2010 compared to 12,200 new editors in 2005. For a better understanding of these trends, see this chart prepared for the survey:

Gardner continues:

Our new study shows that our communities are aging, probably as a direct result of these trends. I don’t mean that the average age of editors is increasing: I’m talking about tenure. Newbies are making up a smaller percentage of editors overall than ever before, and the absolute number of newbies is dropping as well. That’s a problem for everyone, because it means that experienced editors are needing to shoulder an ever-increasing workload, and bureaucrat and administrator positions are growing ever-harder to fill.

My initial reaction is to say this is not necessarily a problem. Yes, over time the proportion of new editors is shrinking, but this is the flipside of editor retention. The community “growing older” as a proportion of all editors does not necessarily mean number of editors is getting smaller, but that longtime editors are sticking around. Except that the community actually is getting smaller.

How many Wikipedians does the community really need to sustain itself? This is another open question. Some editors may point to the rapid development of impressive new articles such as Fukushima I nuclear accidents whereas interesting but less timely articles (let’s pick on the Assassination Records Review Board) languish.

If you joined Wikipedia in 2004, there is about a 40% chance you were still editing Wikipedia after one year. All things considered, that’s a pretty solid number, but that’s about as good as it got: by mid-2005 those retention rates started plummeting. If you joined in early 2007, there was about a 15% chance you were still editing after one year. Interestingly, the drop in retention more or less coincides with the explosion in new contributors: new editorship grew most between early 2005 and early 2007; the drop in retention begins about the same time and continued falling into the middle of 2007.

This makes some sense: those were the years with the greatest number of new editors, so it makes sense that a larger number would wash out. On the other hand, even as trends have stabilized, only about 10% of editors who joined in 2009 are still editing today. That’s a pretty remarkable drop-off in retention, and so the class of 2004 and 2009 today have about the same number of editors currently active.

Why the drop-off? Hard to say, but as the study’s authors put it: “[W]e do know something drastically changed during this time period, which corresponds to the period of massive influx of New Wikipedians.” This almost sounds like the influx of new editors drove the old ones out, although there’s no way to know that. So this raises an interesting question: were all those new editors necessarily good for the community?

For a snapshot of editor participation trends based on which year one joined, see this chart:

Wikipedia is an incredible resource and, like natural resources, it needs to be both developed and preserved. That means more editors are needed, and this study is just one step in a long process of figuring out how best to do that. Fortunately, there is time.

Wikipedia’s Endless Pool Party (Not Quite What it Sounds Like)

Tagged as , , , ,
on February 16, 2011 at 11:26 am

There’s no longer a question of whether the English-language Wikipedia will hit the four million article mark: only when. While new topics may become increasingly difficult to come by, five, six million or more articles is not out of the question. And when Wikipedians are not busy working on making that happen, sometimes they like to place guesses on when those things will happen. If you visit Wikipedia’s vast backstage, you can find several current and past betting pools these milestones and others through the years.

One of the first was the Half-million pool, in June 2004, in which several dozen editors took part. When Wikipedia passed 500,000 articles on March 17, 2005 the winner (an active Wikipedian to this day) had guessed March 18, narrowly beating another who had guessed March 15. Since then, more recent pools have focused on landmarks including the Million pool (passed March 1, 2006) and the 300-million edits pool (a matter of dispute, but certainly in 2009). Though there are just more than 3.5 million articles today, if you’d like to guess when Wikipedia’s four-millionth article will be created… I’m afraid you’re out of luck. No further guesses were taken after February 2010.

Among pools still open, one of two versions of the Five-million pool is still open, as is the Ten-million pool and the Twenty-million pool. In the latter category, one unlucky soul guessed 2007, several picks would have this achievement within the next decade, but more have placed their bets in the 2015-2025 range, and more still in the 2026-2100 range. A few have placed their bets on “Never”; time will tell… or not.

There are some more outlandish pools as well, including something like a dead pool: the Last topic pool. What will be the last article created on Wikipedia? There are some swell guesses; among my favorites are: “2100 Wikimedia server room fire” and “Why the zombies won”.

Want in on the fun? You can test your powers of prediction at Wikipedia:Pools. And if you do win, what exactly do you win? Is there any money involved here? Alas, no. Each page makes sure to note: “The person who comes closest to the actual date is the winner (of eternal fame).”

Photograph by Finlay McWalter, via Wikipedia.

The Wikipedian Mystique: Do Women Participate Enough in Wikipedia?

Tagged as , , , ,
on February 7, 2011 at 4:57 pm

Could it really be that just 13% of Wikipedia editors are women? That statistic comes from a survey of Wikipedia users (whether contributing or just reading) sponsored by the Wikimedia Foundation, first previewed in fall 2009 and eventually published in full in March 2010. Last week, Wikimedia executive director Susan Gardner announced plans to try raising this number to 25% by 2015. Thanks to coverage by Noam Cohen in The New York Times, the topic has dominated Interweb discussion of Wikipedia since then.

This participatory imbalance is not a new phenomenon, and hardly unique to Wikipedia. Cohen points to op-ed pages, and the same is considered to be true in their virtual equivalent, the political blogosphere. While there are some very prominent female contributors to all of the above, most surveys tend to show that men nevertheless lead these sectors.

On the other hand, as a female colleague pointed out to me, if you were to look at online forums about health care, animals, or the environment, the gender balance is likely to flip. The same is true with regard to professions; some are predominantly male or female, and many fall somewhere in between. Some combination of biological programming and social reinforcement produces a society with masculine and feminine traits. However, just because many stereotypes have a basis in reality does not mean they should be taken for granted or used as an excuse. Just because something is natural doesn’t make it right.

Among the many words expended on the topic, probably the best is by veteran Wikipedia contributor Kat Walsh; the entirety of it is worth reading, but here is the conclusion:

The big problem is that the current Wikipedia community is what came about by letting things develop naturally–trying to influence it in another direction is no longer the easiest path, and requires conscious effort to change. How do you become more inclusive without breaking the qualities that make the project happen to begin with? (Any easy, obvious answer to this question is probably wrong.) That Wikipedia works at all is an improbable thing; that it works, for the most part, well, nearly miraculous. Wikipedia’s culture doesn’t have to be hostile or unfriendly to a group for it to be underrepresented–it merely has to be not one of the most attractive options.

It so happens that “unfriendliness” has been identified as one possible reason. And it’s not that Wikipedia doesn’t have policies designed to address this issue: Wikipedia:Civility and Wikipedia:No personal attacks are core, non-negotiable site policies, augmented by further guidelines such as Wikipedia:Please do not bite the newcomers. The message is simple: Be polite to other editors, or you can be blocked. However, any experienced editor also knows that enforcement is uneven. Wikipedia is a very big place, where many editors are used to working in isolation. If someone comes along and starts behaving abusively, it can often feel like there is nowhere to turn. Even if you do know where to go for help, one actually must petition for a resolution, and this can be an unpleasant process. It’s also probably worth pointing out that this is an already issue on the presumably male-dominated website, so it is far from just women who feel this way.

Another issue worth considering is that no one actually knows for sure how many women are on the site. Anonymity on Wikipedia is guaranteed; hence the survey. But it’s trickier than that still, as I found out personally.

An early draft of the script for The State of Wikipedia video included the same detail from the survey Cohen cites. To make sure I had the details right, I sought the input of Erik Zachte, a data analyst for the Wikimedia Foundation and curator of information at the great Infodisiac website.

What he pointed out is that the survey had a significant problem with self-selection bias; more than a quarter of survey respondents came from Russia, for example. Among survey respondents, it is true somewhat less than 13% were female contributors. Slice it another way, and among contributors to the website, slightly more than 16% were female. Meanwhile, just 25% of survey-takers identified themselves as female. Therefore, the information concerning women on WIkipedia is considerably less likely to be accurate compared with men, but it still seems probable the percentage of female contributors is somewhere south of the 25% Gardner would like it to be.

The question then is what exactly she plans to do about it, and that discussion is underway now. If you want to be part of it, the Wikimedia Foundation has set up a mailing list to address the topic that is open to the public, and the Wikipedians you will find there are likely to be among the most thoughtful and welcoming. I certainly have my doubts that much will come of it, or that we’ll be able to reliably measure it. Wikipedia is a challenge to most people, from all walks of life, and any effort to artificially boost participation from any one group over the other is likely bound to meet with failure. If any solutions do arise, my guess is that will not necessarily be gender-specific.

As a final note, I find some irony in the fact that one reason put forth to explain why women don’t participate in Wikipedia is that they may not feel confident in their contributions, because on this particular topic, I don’t feel confident in my observations. Just for the record, on one hand I find that I am writing something because it’s a big topic and I don’t want to let it pass me by entirely; on the other hand, I think there is far more to be said about the subject than even a lengthy blog post can address. So I publish this now, unsure whether I’ve actually said anything worthwhile. Or maybe I’m overthinking it.

The State of The State of Wikipedia

Tagged as , , , , , , , ,
on January 25, 2011 at 4:35 pm

Chances are good that if you follow Wikipedia closely, then you have probably seen the following video:

The State of Wikipedia from JESS3 on Vimeo.

Last week, it was featured on both TechCrunch and Mashable and, on YouTube alone, it’s climbing toward 100,000 views as of this writing. And you might have missed the following infographic that went along with it, although I hope you didn’t:


Right-click to view at full size in another tab.

Meanwhile, if you happened to see Jay Walsh’s post on the Wikimedia blog last week—or you watched carefully through to the very end—you may have noticed that among those involved was yours truly.

The story of this video’s development began early in 2010 with the launching of the “State of” video series by my friends at the DC-based creative agency JESS3. The first in the series was “The State of the Internet“; more recently, they produced “The State of Cloud Computing” in association with Salesforce.com.

Seeking new topics, JESS3 invited me to develop a story concept for the video you see above. I talked with some influential wiki-thinkers, some of whose names appear in “Special Thanks” at the video’s end, to write a script for the eventual narrator. Not unlike Dan Aykroyd’s first draft of “The Blues Brothers”—and like it in only this regard—it was much longer than what you see above. Left out were asides on the cause (and effects) of the Spanish Fork, the German-language Wikipedia’s different way of doing things, the development of chapters, the invention of bots, the most-visited Wikipedia articles, the most-visited-in-a-single day Wikipedia article, and more.

In the end, it was a good thing they asked me to scale it back, especially once Jimmy Wales agreed to provide the voice as narrator. And the shorter version perhaps better accomplishes the goal of giving viewers a bit of an answer to the questions of where Wikipedia came from, and why it works the way it does. At the very least, I hope it sparks a deeper curiosity among viewers and, perhaps, sufficient interest to get involved themselves.

Who knows if it will have that effect, but it was a great experience to be part of. The effort put into this by the JESS3 team—on art direction, animation and sound—was tremendous, and took it far beyond any concept I had of what it could become. And maybe we’ll do it again in ten years.

The Top 10 Wikipedia Stories of 2010

Tagged as , , , , , , , , , , , ,
on December 30, 2010 at 6:50 pm

The year 2010 will be over and out in another day’s time, which means there is no time like the present to look back on the year that was at Wikipedia. Instead of some kind of highfalutin’ think piece on what the past year, like, meant, let’s make this an easy-to-write, easier-to-read listicle outlining the biggest stories of the year involving Wikipedia—at least from an English-speaking, North American perspective. (For it is this perspective from which I am most qualified to write.)

For better or worse, here are the stories that defined Wikipedia, on-site and off, in 2010:

10. Wikipedia backups discovered — This occurred just in the past few weeks, and has not received a great deal of attention outside of Wikipedia circles, but to Wikipedia enthusiasts, it’s a big one. In mid-December, Wikimedia Foundation developer Tim Starling found several files dating back to Wikipedia’s first three months of existence. These had long been presumed to be gone for good, but now Wikipedia’s earliest days are much easier to reconstruct. Joseph Reagle of Harvard’s Berkman Center extracted the first 10,000 edits and has placed them on his own website for viewing, and in the future a more accessible reconstruction may be created, similar to the one at nostalgia.wikipedia.org.

9. Cuba’s Wikipedia copycatEcuRed is the Castro regime’s attempt to emulate Wikipedia. At least, in terms of look and feel: EcuRed may well be built using wiki software, but content updates are strictly reserved for unknown pre-approved editors. The entry for Estados Unidos is amusing. Surprisingly, there is no entry for Capitalismo, only Imperialismo, fase superior del capitalismo. Translated from Spanish, the website’s front page proclaims it was “born from the desire to create and disseminate knowledge with everyone and for everyone from Cuba and the world.” It would probably more more correct to say that it was born of a desire to create and disseminate propaganda for Fidel and Raúl Castro and their cronies.

8. Mike Godwin vs. the FBIThis was just weird. During the summer, the FBI sent a cease-and-desist letter to Wikipedia demanding that they remove occurrences of the FBI seal from Wikipedia articles about the agency. According to the FBI, use of the logo conflicted with the law. According to Wikimedia Foundation general counsel Mike Godwin, the law cited was about preventing people from impersonating FBI officials. Godwin’s sardonic reply—”While we appreciate your desire to revise the statute to reflect your expansive vision of it, the fact is that we must work with the actual language of the statute, not the aspirational version”—amused many. Two months later, Godwin resigned his position at Wikimedia. Were the two incidents connected? That was the whisper, but neither Mike nor the Foundation have clarified the reasons for his departure. It’s entirely possible that the two are not connected, but the whispering hasn’t been refuted. The FBI seal’s presence on Wikipedia, and Mike Godwin’s famed wit elsewhere, live on.

7. Wikimedia expansion to India — Wikipedians are all too aware of the fact that most of their contributions come from the rich, Western nations in the Anglosphere and Western Europe, but they yearn for participation to grow much beyond. As in the global economy, much growth may be found in the BRICs. Among industrializing countries, interest in Wikipedia has been especially strong in India, which is being rewarded with the first non-U.S. office of the Wikimedia Foundation. (For what it’s worth, I myself attended a Wikipedia-oriented conference in Bangalore this past January.)

6. Wikipedia gets a new look — Bet you didn’t notice this until months after it happened, but in the first half of 2010, Wikipedia received its first major redesign in several years. Gone was the “Monobook” skin and in was the “Vector” look. Why change? Wikipedia is always looking for ways to make the site easier to read—and easier to edit—and there had been concern for some time that the site design was becoming outdated, even in some ways confusing. Perhaps the biggest change involved moving the search field from the lefthand sidebar to the top right corner, a placement more common among popular websites. And the result? The number of individuals contributing during the second half of 2010 has been mostly flat, and even down slightly. Whatever drives people to contribute to Wikipedia, or stay away, is a force more powerful than web design.

5. Flagged revisions, er, pending changes — For years, the German-language Wikipedia has maintained a unique system for improving the reliability of its pages: contributions by new and infrequent users are held for review by more trusted editors. The result has been an encyclopedia taken far more seriously by academics in that country, so Wikipedians on the larger (and looser) English Wikipedia decided to give it a try. First called “flagged revisions” and later changed to the arguably more intuitive “pending changes” (yes, there was a debate about this), a number of articles were protected in this manner. The result was inconclusive: while a clear majority of participants voted to continue employing some form of pending changes, there was no consensus on just how to do it. For now, the project lies dormant.

4. Wikipedia in education — This is not one story, and it’s not unique to the past calendar year: encyclopedias have been staples of term paper bibliographies for decades (at least) but the rise of Wikipedia has turned this on its head. Where teachers were once content to let students cite Britannica on any number of subjects, many (if not most) now ban students from using Wikipedia in assignments. But 2010 may be the year in which educators learned to stop worrying and accommodate (if not love) Wikipedia. Time and debate have allowed more professional educators to see that Wikipedia is a legitimate starting point for research, and Wikipedia’s own imperfections provide numerous teachable moments. ZDNet education writer Christopher Dawson’s well-argued “Teachers: Please stop prohibiting the use of Wikipedia” is a good example of the former, while classroom projects at UC Berkeley and the University of Rhode Island show there is great promise for the latter.

3. Larry Sanger reports Wikimedia to the FBI — The Federal Bureau of Investigation and Wikimedia Foundation sure got to know each other this year. In April, estranged Wikipedia co-founder Larry Sanger sent a missive to the FBI reporting the Wikimedia Foundation for hosting “child pornography” and other obscene images on Wikipedia sister site Wikimedia Commons. Among the contested images were nude artistic works depicting the underaged and sexually explicit images featuring adults. Wikipedia’s commitment to the free availability of information can be controversial; name a body part or disease and you are going to see a picture of it on that Wikipedia page. There is even a specific policy related to this question, called “Wikipedia is not censored“. But does this mean that anything goes? Even after Sanger clarified that he understood no actual prurient images photographs of child sexual molestation* were in the site’s collection, some images were deleted, and the FBI pursued no action in any case. Although resolved for now, you can bet the controversy over the line between “censorship” and “editorial policy” will come up again.

2. Wikileaks and Wikipedia confusion — You may protest that Wikileaks has nothing to do with Wikpedia. In fact, I wrote “Wikileaks: No Wiki, Just Leaks” over the summer, when the mysterious online outfit published its Afghan War Diary. But the mere presence of the word “wiki” in the the not-a-wiki site’s name has become a potential PR problem for Wikipedia. When Wikileaks re-entered the news with the publication of leaked U.S. diplomatic cables in the fall, Jimmy Wales openly criticized Wikileaks, telling Charlie Rose: “If I had some information, the last thing I would ever do with it is send it to Wiikileaks.” Even Larry Sanger published a critical commentary about Wikileaks on his own site; although Sanger only tangentially referenced Wikipedia in his comment, the press took up that angle regardless. As long as Wikileaks remains a well-known and much-criticized public entity, Wikipedia will have to keep repeating the message that the two organizations have nothing to do with one another. Which leads us to #1…

1. The face of Wikipedia fundraising — It was perhaps fortuitous that the latest round of Wikileaks debate occurred at the same time the Wikimedia Foundation was undertaking the most sustained and visible PR push in its history. Since late November, Wikimedia sites have featured large banners across the top, asking readers to donate money toward its goal of raising $16 million—the largest amount yet requested, though still not quite enough to cover 2011′s expected operating budget. Most banners featured Wales’ face prominently, asking readers to consider his “personal appeal” to contribute. While effective, they’ve also been a source of annoyance and subject of derision. The New York Observer headline, “Staring Contest with Jimmy Wales To Go On Indefinitely”, was among the politer expressions of this viewpoint. On the other hand, they are working: at the campaign’s outset, Wikimedia collected in one week what they took in over a month last year. As of this writing, the organization had just about a million dollars left to go. Not too shabby. And Henry Blodget will get a chance to recycle his call for Wikipedia to deploy advertising next year.

That was the year that was, at Wikipedia and the Wikimedia Foundation. Next year will be another. If you think I’ve missed or messed up anything important, please share in the comments. See you in 2011!

All images via Wikimedia Commons.

*Updated, per comments.

What’s With All Those Banners

Tagged as , , , , ,
on November 20, 2010 at 5:36 pm

If you’ve visited Wikipedia during the second week of November 2010 (and I’ll wager you have) you’ve no doubt seen the bearded mug of one Jimmy Wales staring back at you from one of several banners placed across the top of the article you wanted to read.

Not everyone is happy to see them:

Do you feel violated but can’t quite figure out why? Perhaps it’s the gargantuan banner atop all Wikipedia articles these past few days that feature the mug of none other than Wikipedia co-founder Jimmy Wales pleading for some money.

Yes, they are a little annoying and, if you really hate them, there is the little [X] box in the corner you can click to make them go away. But in this fourth year of fundraising by the Wikimedia Foundation (which oversees Wikipedia and its sister projects) this kind of reaction is nothing new. Even in 2008, Gawker covered that year’s campaign with a characteristically unfriendly tone. This year, some of the complaints are more amusing. Here are two of the family-friendlier screen shots of actual Wikipedia articles going around Facebook and other parts of the Internets:

wiki-fundraiser-scopophobia-600

wiki-fundraiser-begging-600

On the other hand, if you absolutely love seeing Jimmy Wales at the top of every Wikipedia page, well, now you can see him on every page on the entire Internet.

While the fundraiser formally launched on November 15, the banners started running on some pages since the 12th, and even before that, for reasons of testing. You might find being stared at by Jimmy Wales a little disconcerting, but there’s a reason Wikipedia is using them—they tested better than the other options.

Billed as “the fundraiser you can edit”, for this year’s campaign the Wikipedia community was invited to come up with banner ideas, and these were tested alongside the “Jimmy” banner. Volunteers were challenged to “Beat Jimmy” and produce a banner that would have a higher clickthrough rate. Almost 900 people got involved in the process.

In the banner message testing itself, four contenders rolled out onto Wikipedia for limited testing:

  • The Jimmy banner which had 1537 individual donations
  • Thanks for the brain massage which received just 19 donations
  • You depend on Wikipedia for information. Now it depends on you which received 99 donations
  • Admit it: without Wikipedia you never could have finished that report which had 140 donations

As you can see, it wasn’t much of a contest. That negative reaction some people have when they see Jimmy Wales? Well, at least it’s a reaction. For better or worse, Jimmy Wales is the unofficial mascot of Wikipedia, and that means he’s its biggest fundraising mascot.

For further details on how the banner featuring Wales stacked up against other tested options, check out the Banner testing project page. For a visual representation, see this David “Information is Beautiful” McCandless infographic (which seems to be better than his last one (update: per the comments, apparently not)).

This year the goal is to raise $16 million, the Foundation’s biggest target to date. That’s roughly the same amount of money the Foundation spent last year, of which $1 million alone went to web hosting. It’s also far less than the budget of the other top 10 global websites, as Wikipedians have pointed out. In the coming year, the Wikimedia Foundation plans to expand operations—including a new office in India—and hire 44 new staffers (there are 40 now). That’s a pretty incredible growth rate, one more like that of the other top 10 global websites. Whether that is a good idea at all has been the subject of debate on the blog of a Wikipedia contributor.

So, you should expect those fundraising banners to last through December, at least. But once the fundraising goal has been met, they won’t necessarily go away—they’ll just refocus. Once that happens, the banner space will start asking readers to contribute to Wikipedia with their knowledge, i.e. to start editing themselves. While the money is important, it’s the time and effort of volunteers that really makes Wikipedia work. Yes, you can click that [X] box anytime you want, and Jimmy will go away. But it’s probably worth leaving them up for now, to see what comes next.

The Earliest Known Record of Wikipedia Journalism

Tagged as , , , , , , ,
on October 12, 2010 at 6:56 am

I’d gotten to wondering, recently, what was the first time Wikipedia was mentioned by a media source? The project began in January 2001, but I’m sure I wasn’t aware of it until sometime in 2003 at the earliest. I have no memory of first learning about it — only a recollection that sometime in the middle of the last decade, I was spending hours and hours, and entire days on some weekends, reading Wikipedia. I wasn’t too curious about where it came from then, but over the last few years, I clearly have been.

So I did what anyone with access to an online news database would do: I looked it up. And the winner appears to be a July 1, 2001 article in the Australian edition of PC World, by one Aldis Ozols. Here it is, in its entirety:

Roll-your-own fount of knowledge: www.wikipedia.com.; editor’s choice.

“A wiki is a collection of interlinked Web pages which can be visited and edited by anyone” goes the definition by Wikipedia. Rising to the challenge, I edited the page on which this statement was made, and behold, my contribution (all two words of it) became part of the Wikipedia.

This is a collaborative project intended to produce a usable encyclopedia through the efforts of many volunteers who surf in from the Net. While this makes it superficially similar to Everything2 (see June issue), there are differences. For instance, Everything2 seeks to be a live, interactive community as well as a reference, whereas Wildpedia [sic] has a more modest goal: to create a freely distributable 100,000 page encyclopedia online. In addition, where Everything2 has a complex system of user ranking and moderation which attempts to grade contributions and their authors, Wikipedia is wide open. Anyone can rock up and modify existing entries, or create new ones as I did.

Astonishingly, the result is not a pile of chaotic nonsense, as one might expect. Perhaps that’s because the project is still small, with only 6000 pages of text and a few dozen contributors, but something more seems to be at work here. Evidently, articles that start off with a one-sided viewpoint are edited and re-edited until they settle into a kind of consensus with which most people are satisfied. In anycase, this is an interesting experiment containing some surprisingly accurate articles.

Surprisingly prescient, if you ask me. Or perhaps just lucky — many a website that garners positive reviews in its early going nonetheless still folds, or descends into chaos. In any case, I’m surprised to find this article is not online — if I’d been first to report on Wikipedia, I’d want to take credit for the fact.

Looking a little further, it seems that most of the Anglosphere reported on Wikipedia before anyone in the U.S. had anything to say about it: England (London Free Press), Canada (Edmonton Sun), Wales (Wales on Sunday) and Northern Ireland (Irish News) all got there first.

Stateside, the first press mention of Wikipedia was in the Gray Lady herself, the New York Times, by someone named Peter Meyers. This story is online, so I will simply quote the lede (sorry, non-journos) and call it good:

Fact-Driven? Collegial? This Site Wants You

FOR all the human traffic that the Web attracts, most sites remain fairly solitary destinations. People shop by themselves, retrieve information alone and post messages that they hope others will eventually notice. But some sites are looking for ways to enable visitors not only to interact but even to collaborate to change the sites themselves.

Wikipedia (www.wikipedia.com) is one such site, a place where 100 or so volunteers have been working since January to compile a free encyclopedia. Using a relatively unknown and simple software tool called Wiki, they are involved in a kind of virtual barn-raising.

Their work, which so far consists of some 10,000 entries ranging from Abba to zygote, in some ways resembles the ad hoc effort that went into building the Linux operating system. What they have accomplished suggests that the Web can be a fertile environment in which people work side by side and get along with one another. And getting along, in the end, may ultimately be more remarkable than developing a full-fledged encyclopedia.

For the curious, here is what the ABBA entry looked like on the day the story ran, and here it is today. And here is something close to what the zygote article looked like then, and what it looks like now. One wonders what it will look like in another ten years.

Update: In the comments, Graham87 locates the exact zygote entry, from the so-called Nostalgia Wikipedia (a topic worthy of its own post, at some point).

From the Mixed-Up Files…

Tagged as , , , , , , , , , ,
on September 29, 2010 at 8:04 pm

WBEZ in Chicago is probably best known for being home to the long-running radio series This American Life. But one of their most innovative offerings is an online video series first aired in April 2009 called The Wikipedia Files.

The idea is simple: WBEZ hosts interview entertainment celebrities by reading portions of the Wikipedia articles about them, simply to fact-check the articles within. More often than not, the articles are accurate enough, but they certainly have caught some interesting errors.

I think it’s an ingenious idea, and I hope that other media organizations follow, especially on other subjects. One of the biggest complaints about Wikipedia is that it’s difficult to tell what’s true and what is not. Although contributors are encouraged to add citations, the fact is many do not. In many cases, people add things they know, or think they know, and either cannot find a source or never bother to look one up. Some details may have originated on blogs, most of which Wikipedia generally does not consider to be reliable. This is all the more serious on articles about living persons, which Wikipedia takes more seriously than in other genres. The Wikipedia Files offers editors the chance to verify certain facts at the source, and to establish facts that were not previously known.

In one recent example, WBEZ’s Justin Kaufmann sat down with Antwan “Big Boi” Patton, one half of acclaimed American hip hop duo OutKast, now promoting his also-acclaimed solo debut, “Sir Lucious Left Foot: The Son of Chico Dusty”. Here is Big Boi with Kaufmann:

Big Boi fact-checks his Wikipedia page from WBEZ on Vimeo.

And in fact, at least one fix did come of the interview. On July 20, the same day it was posted, an anonymous, to date one-time editor from Akron, Ohio made the following correction about how he started pursuing music and his early relationship with André “3000″ Benjamin:

wikipedia-files-big-boi-edit

Alas, this editor did not add a citation to go along with it (so I just did). Otherwise, who’s to know where to go and verify the information contained? This points to the fact that adding citations to Wikipedia is harder than it should be—but you can’t hold that against WBEZ.

GLAM Rock: The Wikipedian in Residence and the Race for the Prize

Tagged as , , , , ,
on June 18, 2010 at 11:47 am

british_museum_cc_temporalataStarting in March, a longtime Wikipedian and co-host of the Wikipedia Weekly podcast, Liam Wyatt, began an unusual experiment: he has become, for a short while at least, a volunteer “Wikipedian in Residence” at the British Museum in London (which I visited in high school and where I touched the Rosetta Stone, when no one was looking, not that you care). It’s the first time such an institution has created such a position (voluntary though this arrangement is) and it points toward a future where organizations with significant cultural material (GLAMs, as this project calls them) may appoint or hire individuals to be representatives or ambassadors to Wikipedia.

Along the way, Wyatt and the British Museum are doing something very interesting: they are offering cash prizes for raising articles to Featured-level status on topics related to the British Museum. From the project page:

The British Museum is offering five prizes of £100 (≈$140USD/€120) at their shop/bookshop for new Featured Articles on topics related to the British Museum in any Wikipedia language edition. Ideally, the topics will be articles about collection items.

This is the first time an organisation in the UK has put out a prize that recognises the value of fine articles on Wikipedia. This is a recognition that Wikipedia work is not only good quality but is consistent with the outreach aspect of the Museum’s mission to engage the public.

It’s an inventive idea, even if some of the rules are a little unclear: it almost sounds like it requires the creation of a brand new article, though that doesn’t seem to be the case. Meanwhile, there are already a dozen or so articles on the English-language Wikipedia currently judged to be Good, B, or C-quality, according to Wikipedia’s internal rating system. Though the prize is pointedly offered in any language edition, most will surely be won in the English, German or French language versions, and at least a few of the aforementioned English articles will be the five ones improved by the winners.

And in keeping with Wikipedia’s “There is no deadline” ethos (related to the concept of “eventualism“), the competition runs until all prizes are claimed. I wouldn’t be surprised if they went fast, and I wouldn’t be surprised if that leads to another interesting situation: most quality articles have several major contributors, as was pointed out on a Wikipedia mailing list this week.

the_great_court_mchohanAs Wyatt points out, getting an outside organization to care about “the value of good quality articles on Wikipedia in their own right” is a significant achievement, and the first of a kind. Now that the English-language Wikipedia has grown to include far more articles (3 million) than its veteran editors (a few thousand editing on a daily basis) can possibly handle, more ideas will be needed to generate new content for Wikipedia. Perhaps this represents the next step in the development of the human-powered “content management system” for Wikipedia. Wyatt hopes that other museums will follow in the British Museum’s lead; as someone who works with companies, associations and other organizations that are frequently concerned about how they are represented on Wikipedia, I think outposts for representatives to the Wikipedia community from many organizations can be a good idea, though sorting out the conflict of interest issues is likely to be different for each.

If you’re interested in joining the British Museum contest, you might start with one of the articles discussed above, or find your own in the Collection of the British Museum category. And if you’re looking for a curator at the British Museum to work with, here is the page to do that.

And for more information about Wyatt’s residency, see his personal blog posts here: Part 1: Making Wikipedia “GLAM-friendly”* and Part 2: Making Wikipedia “GLAM-friendly”.

Exterior of British Museum by temporalata on Flickr; Great Hall by M.Chohan.

*GLAM stands for “Gallery, Library, Archive and Museum”; I had to look it up, too.