Planeta Wikimedia

15. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

Commons Picture of the Day: Kiril Lazarov, Macedonian handballer

 

Kiril Lazarov, Macedonia national handball team captain.

German Wikipedia contributor and handball aficionado Armin Kübelbeck routinely takes pictures of sporting events, where he captures what he calls “unrepeatable moments,” or candid expressions that capture a sliver of time that will not come again. “I can’t tell a player, ‘Can you do it one more time, but now with a smile on your face?’” he said. “I have to take the available light and the moments as they come.”

In the unrepeatable moment above, which was the Wikimedia Commons Picture of the Day on 10 May, 2012, Kübelbeck captured Kiril Lazarov, the captain of the Macedonia national team and one of the most notable Macedonians in the history of the sport, right before he took a seven-meter throw. Unlike many of his images of players in action, this shot shows a much more interesting human element and very little movement.

“For me this smile shows resoluteness and a good portion of slyness and self-confidence. I’m not sure to whom he was looking: his coach, the other team’s coach, a teammate or an opponent,” said Kübelbeck, who was sure Lazarov was not looking at the goalkeeper, who stood in front of him at the time. “But the message is clear: I’ll throw that ball behind the line.”

Adult European Robin.

Kübelbeck began taking photos in the 1980s with a Canon A-1 with a 50mm 1:1.4 lens. He developed the black-and-white images he took in the basement of his parents’ home, where he set up a studio and experimented with many kinds of chemical processing effects, such as sepia toning and solarization.

On December 31st, 2006, he uploaded his first photo to Commons, which he refers to as an archive for his pictures. Kübelbeck said he is not active in the Commons community nor does he submit his own photos for consideration as Featured Pictures.

He is committed, however, to improving the encyclopedic content of Commons and Wikipedia. He also contributes self-made illustrations when his photos don’t successfully capture the meaning of the article he is editing.

He did also admit to enjoying the attention his photos get when they are featured: “Where else would my images and writing have such an audience? A dilettante’s work!”

Matthew Roth, Global Communications Manager

Matthew Roth — 15. May 2012, 17:53

Wikimedia Foundation Report, April 2012

Information You are more than welcome to edit the wiki version of this report for the purposes of usefulness, presentation, etc., and to add translations of the “Highlights” excerpts.
Monthly Metrics Meeting May 3, 2012.ogv

Video of the monthly Wikimedia Foundation metrics and activities meeting covering the month of April (May 3, 2012)

Global unique visitors for March:

489 million (+2.7% compared with February; +22.3% compared with the previous year)
(comScore data for all Wikimedia Foundation projects; comScore will release April data later in May)

Page requests for April:

17.3 billion (+0.4% compared with March; +18.2% compared with the previous year)
(Server log data, all Wikimedia Foundation projects including mobile access)

Active Registered Editors for March 2012 (>= 5 edits/month):

85.09K (+0.2% compared with February / -4.5% compared with the previous year)
(Database data, all Wikimedia Foundation projects except for Wikimedia Commons)

Report Card for March 2012: http://reportcard.wmflabs.org/

Financials

(Financial information is only available for March 2012 at the time of this report.)

All financial information presented is for the period of July 1, 2011 – March 31, 2012.

Revenue $32,054,861
Expenses:
 Technology Group $7,788,192
 Community/Fundraiser Group $3,212,763
 Global Development Group $2,984,100
 Governance Group $718,116
 Finance/Legal/HR/Admin Group $4,607,656
Total Expenses $19,310,826
Total surplus/(loss) $12,744,035
  • Revenue for the month is $1.9MM vs plan of $3.9MM, approximately $2MM or 53% under plan.
  • Year-to-date is $32.1MM vs plan of $28.6MM, approximately $3.5MM or 12% over plan.
  • Expenses for the month is $2.3MM vs plan of $2.2MM, approximately $112K or 5% higher than plan.
  • Year-to-date is $19.3MM vs plan of $21.1MM, approximately $1.8MM or 8% lower than plan.
  • Cash position is $30.6MM as of March 31, 2012 – approximately 13 months of expenses.

Highlights

Expanding fundraising and affiliation models

After the resolutions of the Wikimedia Board of Trustees at its meeting in Berlin, work is ongoing to implement a new model for distributing the money raised via Wikimedia project sites. Except the costs for the core operations and operating reserves of the WMF, all of it (including funds for chapters and non-core operations of the WMF) will be distributed based on the recommendations of the new volunteer-driven Funds Dissemination Committee (FDC). Another resolution of the Board recognizes new models of affiliation with the Wikimedia movement: “Movement Partners” (like-minded organizations that actively support the movement’s work), “National or Sub-national Chapters” (which includes the existing chapter model), “Thematic Organizations” (non-profits representing the movement and using the Wikimedia trademarks, which are supporting work focused on a specific topic), and “User Groups” (open membership groups which may or may not choose to incorporate).

Indic language outreach

Chief Global Development Officer Barry Newstead visited India meeting Wikimedians in Bangalore and attending Wikisangamotsavam, the Malayalam community conference, as part of work to support Indic language projects. The India team is working actively with seven Indic language communities on outreach, social media strategy and initiatives to build community momentum.

Page views to the Wikipedia mobile site (red: non-English versions) compared to the 2 billion target from the annual plan

Mobile pageviews target reached

At the end of April, the Wikipedia mobile site reached the milestone of 2 billion monthly page views – one of the goals for the 2011/12 WMF annual plan.

Towards a rapid software deployment cycle

Wikimedia engineers have begun switching to a more rapid deployment cycle, starting to deploy the latest MediaWiki software to Wikipedia and other Wikimedia sites every two weeks.

Technology

A detailed report of the Tech Department’s activities for April 2012 can be found at:

https://www.mediawiki.org/wiki/Wikimedia_engineering_report/2012/April
Department Highlights

Major news in April include:

Operations

  • Search — After months of preparation and refactoring work with our dated Lucene implementation at the Tampa data center, we are glad to report that Peter Youngmeister (with help from Asher Feldman, Robert Stojnic and Jeff Green) successfully built and deployed the new Search infrastructure at our EQIAD data center. The performance improvement is quite amazing; at the 99th percentile level, search latency dropped from a high of 9 seconds to 1 second, and the average search is only 100ms, down from 700ms. In addition, the new infrastructure addresses some of the previous single point of failures and capacity limitations.
  • Wikimedia Labs — Ryan Lane released a new version of OpenStackManager, adding project filters for all interfaces, usability fixes and a number of bug fixes. OpenStackManager and LdapAuthentication were switched to Git, allowing a few more changes to be pushed thanks to being able to keep a stable master branch. Notable changes were per-project sudo management, allowing sysadmins in a project to manage who gets which sudo permissions in a fine grained manner for their projects, and a change in how groups are added to LDAP for projects. Sara Smollett added Per-project ganglia monitoring, displaying resource graphs for instances in projects. Andrew Bogott finished work on a plugin framework for OpenStack Nova, and has added an example plugin for a SharedFS driver, which would allow us to manage gluster volumes via an API.
  • Data Dumps — The gluster share with the last 5 or so good dumps for all projects is ready for use by Wikimedia Labs projects. A first copy of uploaded media, accessible via rsync, was announced, and some work was done on the infrastructure to generate downloadable bundles of media per project. We’re working with the Internet Archive to produce media bundles that they can host for download as well. A new version of the dump scripts was deployed with some minor bug fixes. Christian Aistleitner wrapped up work on the PHPUnit tests for the dump maintenance scripts, and discovered a problem with the database schema, which we will need to discuss with the user community in order to find a resolution that works for everyone.

Features Engineering

  • Visual editorRoan Kattouw and Trevor Parscal are rewriting the underlying data model to achieve feature compatibility with the parser and correct a variety of problems that have been previously deferred. Inez Korczynski and Christian Williams have been continuing their work to stabilize and integrate the content editable layer and have been working with Rob Moen, who has focused on getting the user interface elements working with the content editable layer. Gabriel Wicke has been working on improving the parser‘s ability to parse pages more quickly as well as increasing compatibility with existing features such as thumbnails. A template-heavy page like Barack Obama can now be expanded in similar time as the production parser.

Internationalization and Editor Engagement Experiments

  • Internationalization and localization tools — The team has completed the first round of UI designs for a Universal language selector (ULS) for desktop and mobile. UI/UX team members (Pau Giner and Arun Ganesh) are now implementing a prototype to showcase the first version of ULS. The team also added keymaps for language support to Narayam, added notification support to Translate, fixed bugs, reviewed code for localization support in MediaWiki 1.19, and discussed language support metrics.

Mobile engineering

  • Mobile design — The final selection of section styles was supported by user experience testing. We also added switches between Mobile/Desktop view and Images on/off to the footer. These changes have now been deployed as the default view of mobile Wikipedia. The first working prototype of the new navigation UI has been completed in rough form and sent out for feedback.
  • Wikimedia AppsYuvaraj Pandian released new versions of our Android app and our first ever PhoneGap version of the iOS app. Issues were identified with iOS 4.x and we released a update to fix them. Yuvaraj also continued work on the API move branch. Brion Vibber pushed out a final build of the Wikipedia App to the BlackBerry market, and started experimenting with a Windows mobile version.

Platform Engineering

  • MediaWiki 1.20 — As of April 2012, core software deployments to Wikimedia sites are done from git (instead of Subversion) through incremental “wmf”-branches. The first deployment of the 1.20 release cycle, labeled 1.20wmf1, was deployed to all Wikimedia sites this month; it notably brought new diff colors for improve readability. The 1.20wmf2 deployment cycle began on April 30. The 1.20.0 stable tarball is expected to be released in fall of 2012.
  • Summer of Code 2012 — Wikimedia engineers have chosen nine students for this year’s program. For the next few weeks, until May 21st, the students and their mentors are working together to train the students in MediaWiki development, so that they’ll have all the basic domain knowledge they’ll need to succeed during the summer.

Fundraising

Major Gifts and Foundations

  • Secured a sponsorship from the Richard Lounsbery Foundation for Wikimania.
  • We began our push to renew Benefactors who gave in FY2010-11, but not yet in FY2011-12 by June 30.

Fundraiser

  • Posted a full report from the 2011 fundraiser
  • Researched improvements to make for the fundraiser in Spain, Italy and Belgium. Held focus groups with donors to optimize messaging, payment methods offered, donation forms and translations.
  • Heavy research on the new payment methods to be added in 2012. Roadmap and timeline to be released soon.
  • Held a one hour systems test in the US.
  • Continually iterating on forms and landing pages as well as the A/B testing infrastructure.
  • Implemented GlobalCollect recurring payments, bringing our recurring monthly income up to approximately $40K/month, nearly a half-million dollars a year.

Global Development

New fellowship launched to focus on EN:WP help pages, good progress on Teahouse projects and welcome Siko and fellows to global development team!
New grants to the community in support of activities in seven countries
Barry attends the Malayalam community conference in Kollam, India and visits with chapter and community in Bangalore (see also general “Highlights” section)

Grants Awarded and Executed

Fellowships

Updates

Gender Gap – Fellow Sarah Stierch completed wrap-up documentation of outcomes and lessons learned from WikiWomen’s History Month.

New Fellowships

  • Peter Coombe was announced and started work this month as our newest Wikimedia Community Fellow. Pete’s fellowship project is piloting a data-driven approach to reorganize and rewrite key help pages on English Wikipedia in order to make them more usable, particularly for new editors. His work can be followed on the Help redesign project page.
  • Two new 2012 Fellowships will be announced in May.

Teahouse Project

The Teahouse has been live on English Wikipedia for two months and we’re beginning to see evidence of the project’s impact for participating new editors. Some relevant metrics from April’s report include:

  • In April, the Teahouse had an average of 50 questions posted in the Q&A forum per week and served about 20-30 new editors visiting for the first time each week, in addition to repeat visitors (the average guest asks 1.5 questions, 22% of guests asked more than one question, and many guests return to the Teahouse more than once). The median response time for questions is 30 minutes. The project’s greatest challenge continues to be making Teahouse known to all new editors in need of help, as our hosts have capacity to assist more new editors than are making their way to the space via personal invitation.
  • Comparing a sample of 75 new editors who participate in the Teahouse with a control group (of equivalent size and similar first-day editing activity) points to Teahouse having a positive impact on new editor engagement: New editors who participate in Teahouse edit 10x the number of articles than the uninvited control group and make on average 6x more global edits. The average participant adds 26x more bytes of content that survive on Wikipedia (i.e. content that isn’t reverted or deleted) than the uninvited control group.
  • Among the 224 editors in our three experimental groups, 28% of new editors who participate in the Teahouse were still active on Wikipedia at least 10 days later, compared with 12% of new editors who receive an invitation but don’t actively participate in the Teahouse, and only 5% from a similar uninvited control group.

Editor Growth and Contribution Program

  • The Global Development Department launched the Editor Growth and Contribution Program in mid-April, and announced Haitham Shammaa as program consultant. This program will focus on designing, testing, and implementing online programs to attract and retain new editors in small-to-medium sized Wikimedia projects.

Arabic Language Initiative

  • Haitham Shammaa visited Algeria to meet Wikipedians and other supporters who can sustain Wikipedia activities in the future. The visit included a lecture and Wikipedia workshop for students of Médéa University in northern Algeria.
  • Currently, we are working with a number of associates and NGOs in Egypt, Jordan, Lebanon, Morocco and Tunisia to explore the possibility of supporting Wikimedia program activities with grants.

Brazil Catalyst

Summary: A rich month for outreach and institutional relations, as well as advances in the institutional of scenario establishing the representative office in Brazil: a relatively good start in community engagement process

Brazil background notes

  1. Brazil is a huge country with 27 states: São Paulo, Minas Gerais and Rio de Janeiro in the Southeast have the highest populations, the strongest economies, and stronger infrastructure.
  2. São Paulo also has the greatest number of Wikipedia-PT editors, followed by Rio de Janeiro. Minas Gerais curiously doesn’t play a significant role in editorship now, but should be explored.
  3. Until now major work has been done with the Wikimedia Brasil community, but plan to increase focus in relationship building with the Wikipedia-PT community.

Brazil outreach

  • Trip to São Paulo to talk with community members, investigate locations for the office, meet with lawyers, and meet with potential partners
    • Learnings: need to generate more compelling agendas for community meetings to generate more interest and also need to work with the community to find times/locations that would make it more convenient to increase attendance
    • Recommendations: set up a programatic agenda to organize meetings objectively to highlight their value and in advance and focus on mapping real resources based on what the community will actually do.
  • Trip to Uberlândia to participate in event at the Computing College of the Federal University of Uberlândia. Jonas from Recife joined the activity and contacted local editors, but no local editors showed up.
    • Learnings: students don’t know much about free licensing or Wikipedia, bringing volunteers to events like this fosters trustful relationships within the community.
    • Recommendations:follow up to encourage future participation and gather contact details of participants.
  • Trip to Goiânia to bring community members together and participate in an event at the Federal University of Goiânia, no community members showed up but the event was successful (popular with students and professors).
    • Learnings: There’s a possibility to develop the education program in the communications college of the Federal University of Goiania. We can’t miss the chance to get local editors’ contacts. Alexandre Guiote (another lecturer in the event), living in Spain, has done very interesting research on Wikipedia.
    • Recommendations: consider developing an education program here and maintain contacts.
  • Second trip to São Paulo to attend a community workshop at Casa Fora do Eixo and met with community members, as well as meet with Banco do Brazil Foundation.
    • Learnings: community members have a lot of knowledge (editing, licensing, etc.), but outreach methodologies might be improved to improve results.
    • Recommendations: work with the community to build and share methodologies and build outreach materials.

Brazil program updates

  • Possible coworking places have been explored: the Hub in São Paulo seems to be the most neutral, but no decision yet
  • Partnerships update: Discussion of partnerships with Fiocruz (institution related to the Ministry of Health for research and development) to develop a validation process based on social participation, the National Library is excited about doing things together on access to books/reading programs and digital archives

India Programs

Indic Languages

  • Kannada: Support for translated articles enhancement project and enabling transwiki export
  • Assamese: Medicine project outreach support at Jorhat medical college. Ideas for 10th anniversary, Community translated outreach ppt to Assamese, Ideas and inputs for a potential Assamese Wikipedia CD project
  • Odia: Helping Odia community for a medical project including outreach at SCB Medical College. Supported 3 outreach events in Odisha
  • Hindi: conducted Hindi Wikipedia outreach at Delhi University
  • Bengali: support to enable sub pages in bn wikisource, support to enable proof read extension in bn wikisource
  • Malayalam: ad hoc support for Malayalam community conference

India Outreach

  • Commons outreach handbook that community members can adapt/adopt
  • Worked on train-the-trainer program design
  • Documented outreach correspondence with all institutes
  • Supported outreach in four communities – AS, GU, MR, ML
  • Translation work on outreach documents being done by community members. 4 Indic languages have successfully finished translating.

Communications, Wikipatrika

  • Work on the Wikipatrika newsletter is in progress
  • First set of mails to contact previous issue coordinators
  • Received response, initiated GU,NE,OR,AS pages on wiki
  • Tech news done, AS,MR done
  • Hope to publish it by May first week
Storytelling for community building
Internal communications
  • Announcement for Communications + Outreach IRC meeting: India Program#IRC_Meetings
  • Supported Malayalam community with press release draft and journalist contacts
  • Supported Kannada Translation project with messages for social media
  • Started journalist database
  • Contacted Indic journalists
  • Made brief digest to post on all village pumps
  • Supported Ahmedabad meetup with local press contacts
  • Worked on train-the-trainer program design for outreach capacity building

Wikimania Scholarships

Scholarship recipients for Wikimania were announced! We have 130 scholars from 57 countries around the world. Chapter scholarships are also being organized now.

For more information, see the blog post: http://blog.wikimedia.org/2012/04/25/wikimania-2012-scholarships/

US Cultural Partnerships

  • Final preparations for the American Association of Museums conference, which took place April 29 – May 2. Wikimedia will be represented throughout the conference, including in a traditional and a virtual Wikipedian in Residence session, a Wikipedia basics table, a nomination for QRpedia at the MUSE tech awards, and highlights in the Association of Children’s Museum’s session.
  • Featured on the Library of Congress blog in the post “Galleries, Libraries, Archives, and Museums with Wikipedia (GLAM-Wiki): Insights Interview with Lori Phillips
  • Coordination and updates to the GLAM/US portal in preparation for the American Association of Museums conference
  • Ongoing coordination with US cultural organizations, including support and advisement for early stages of planning processes. (Partnerships not yet publicly announced.)
  • Ongoing coordination on recommendations for technical tools from cultural professionals
  • Preparations for the Wikipedia Lounge at the MuseumNext conference in Barcelona in May and session proposal writing and coordination for the Museum Computer Network conference

Mobile and Business Development

This month has been primarily dedicated to testing and implementation of our new Free Access to Mobile Wikipedia programs with Orange and Telenor. This has been a very complicated process as we work through bugs and other technical problems but we have made great progress with our first territories. We’ve spent most of this month on working through browser support issues, landing pages, translation, banners, caching issues, etc.

  • Orange status update: Tunisia and Uganda are both live, although features still need to be implemented
  • Telenor status update: Digi Malaysia is ready from a technical perspective. Currently preparing for a market launch.

Global development research

Our gender ratio held steady with only 9% of editors being women. We also found that compared to other countries US fares better on the gender divide, 14% of editors from US being women compared to other countries for which we had a significant sample.

Outreach results

  • First report from outreach events was delivered to the India team.

Education program research

A quantitative analysis undertaken by Ayush Khanna and Mani Pande from the Global Development Research and Analytics team shows that Wikipedia Education Program participants from the United States added more than three times as much quality content as regular new users to the English Wikipedia. The data also shows that students who are introduced to editing Wikipedia through the U.S. Education Program are just as likely to continue editing as any other newcomer. Read more on the Wikimedia Foundation blog: http://blog.wikimedia.org/2012/04/19/wikipedia-education-program-stats-fall-2011/

Wikipedia Education Program

  • In order to determine the future of U.S. and Canada education programs, we invited Wikipedia Ambassadors, class instructors, and the Wikipedia community to contribute to an on-wiki application process for joining a “Working Group” that will meet in July 2012 for a kick-off meeting of the planning process. This is the first step in a year-long open and collaborative process to make the U.S. and Canada program more volunteer-driven and to discuss the creation of the Education Program Structure that will be in charge of the day-to-day operations of the program. More information can be found at http://en.wikipedia.org/wiki/Wikipedia:Education_Working_Group
  • Students at universities in the United States and Canada found that contributing to Wikipedia as a class assignment through the Wikipedia Education Program improved their media literacy and technology skills, according to survey results from the Fall 2011 term. About two-thirds of the respondents agreed that doing a Wikipedia assignment was a beneficial experience, with almost 20 percent of them strongly in favor of a Wikipedia assignment in place of a traditional term paper. See more of the survey results: http://blog.wikimedia.org/2012/04/23/students-see-benefits-from-wikipedia-assignment/
  • University of Massachusetts at Amherst professor Edward Erikson wrote a post for the Wikimedia Foundation blog explaining why he is glad he is asking students to contribute to Wikipedia. He argues that Wikipedia is part of the classroom whether the professor likes it or not, and by making Wikipedia the destination rather than the route, students have a better learning experience. http://blog.wikimedia.org/2012/04/02/wikipedia-in-my-classroom/
  • Canadian class adds two Good Articles: University of Alberta – Augustana Psychology Professor Paula Marentette asked her students to expand two articles on course-related topics this year for her Language Acquisition class. The result? The seven students in her class worked together to get two articles, “Vocabulary development” and “Joint attention,” to Good Article status on the English Wikipedia. In a blog post published on the Wikimedia Foundation blog, the students describe their reaction to the assignment, and Dr. Marentette describes the learning outcomes for her students: http://blog.wikimedia.org/2012/04/30/psychology-class-collaborates-on-two-good-articles/
  • A lot has happened in the Cairo Pilot, with students contributing on-wiki at an increasing pace. Campus Ambassadors conducted several workshops this month, and some outstanding student articles are now live on the Arabic Wikipedia. Updates with the program are documented in detail on the Arabic Wikipedia. Annie is also putting together a document with more updates about the Cairo Pilot as a whole and each class individually (coming soon, in next month’s GD report).
  • More than 200 students and faculty members at Ain Shams University in Cairo showed up to an in-person outreach event organized by Wikipedia Ambassadors from the Cairo Pilot and a local student group. Attendees learned how the Arabic Wikipedia works and how they could contribute to it. Check out the group’s photo album to see photos from this successful event

Communications

No major new projects, announcements, or media issues unfolded in April. Tech press and culture blogs, as well as some main stream media, continued to focus on the Wikipedia blackout and post-SOPA musings.

The communications team, along with the Wikimedia blogging and social media teams have been putting considerable work into an effort to increase the number and quality of Wikimedia blog postings. This month 37 posts hit the blog, including deeper profiles of active Wikipedians and media creators from Wikimedia Commons.

Over the next few months we hope to bring a basic metrics/traffic measurement tool back to the blog, bring in new volunteer contributors, and revise the structure and design of the blog to create a more engaging front page.

Major announcements

No major press releases or announcements in April.

Major stories through March

PRSA on Wikipedia accuracy (April 17, 2012)

In April, The Public Relations Society of America published a study by Marcia W. DiStaso, Assistant Professor of PR at Penn State University, surveying the wide range of views of PR practitioners and their experiences with Wikipedia. The original summary incorrectly asserted that ’60% of Wikipedia articles are wrong’ – an error that was repeated across dozens of global main stream media press (the actual claim was “60% of respondents found errors with their company’s articles”). PRSA, at the urging of Wikimedia community members, revised the study headline and press release (thank you!).

(updated PRSA survey data: http://media.prsa.org/article_display.cfm?article_id=2582)
http://www.theatlanticwire.com/technology/2012/04/60-percent-wikipedia-entries-about-companies-contain-errors/51236/
http://abcnews.go.com/blogs/technology/2012/04/wikipedia-survey-shows-60-percent-of-entries-have-errors-and-public-relations-people-cant-correct-them/

Wikipedia Zero gaining attention in Africa (April 5, 2012)

February’s announcement from Orange and the Wikimedia Foundation about providing free access to Wikipedia on mobile devices in specific markets is beginning to get positive attention in the region. Regional programs advertising the service are appearing as Orange affiliates expand the program.

http://bikyamasr.com/65057/ugandans-access-wikipedia-for-free-through-orange-uganda/
http://allafrica.com/stories/201204220118.html

Wikipedia mobile switches to OpenStreetMap (April 5, 2012)

A large number of tech press picked up on the fact that a revised Wikipedia mobile app chose Openstreetmaps over Google maps in a recent update.

(original blog post https://blog.wikimedia.org/2012/04/05/new-wikipedia-app-for-ios-and-an-update-for-our-android-app/ )
http://9to5google.com/2012/04/05/wikipedia-dumps-google-maps-for-openstreetmap-marks-industry-trend-to-alternative-service/
http://news.cnet.com/8301-1023_3-57410234-93/wikipedia-dumps-google-maps/
http://www.theverge.com/2012/4/7/2931320/wikipedia-updates-mobile-apps-drops-google-maps-for-openstreetmap

Other worthwhile reads

Clips from the Malayalam Wikipedia gathering in April

http://ibnlive.in.com/news/malayalam-wikipedia-could-be-emulated/253196-60-116.html
http://ibnlive.in.com/news/youths-come-forward-to-fill-up-odia-wikipedia/247114-60-117.html

Download all of Wikipedia (from https://blog.wikimedia.org/2012/04/09/download-the-text-of-the-entire-english-wikipedia/)

http://www.webpronews.com/download-wikipedia-in-english-all-9-7gb-of-it-2012-04

Sarah Stierch on bringing women to Wikipedia in the Smithsonian

http://blogs.smithsonianmag.com/aroundthemall/2012/04/how-many-women-does-it-take-to-change-wikipedia/

Guardian on the ‘mapping Wikipedia’ project

http://www.guardian.co.uk/news/datablog/interactive/2012/apr/04/wikipedia-world-language-map?newsfeed=true

AllAfrica.com on the recent visit of Jimmy Wales

http://allafrica.com/stories/201204090023.html

The UCLA ‘Daily Bruin’ on the Foundation’s Wikipedia Education Program

http://www.dailybruin.com/index.php/article/2012/04/professors_students_worldwide_work_to_improve_wikipedias_credibility_by_editing_articles

FastCompany also reported on the Wikipedia Education Program

http://www.fastcompany.com/1830315/wikipedia-education-program-college-university

Wikipedia Signpost

WMF Blog posts

http://blog.wikimedia.org/2012/04/

Media Contact

https://wikimediafoundation.org/wiki/Press_room/Media_Contact#April_2012

Human Resources

Wikimedia Foundation extended HR video.ogv

“Work with Wikimedia” outreach video produced for the HR department, featuring WMF employees

This month, HR is experimenting with new metrics and presentation styles, depicting better views of Foundation staff and contractor composition as well as beginning to report on recruiting metrics.

On the jobs.wikimedia.org site, we premiered a new video created with the support of the Communications team. Victor Grigas and Matthew Roth did a fabulous job. See http://wikimediafoundation.org/wiki/Work_with_us

We are still in the midst of implementing our Human Resources Information System (HRIS) system. We are set to complete that by end of June. We completed our bi-annual assessment of exempt/non-exempt employees. In terms of new policy, we initiated out a comprehensive Paid Time Off Policy, with an Immigration policy set to roll out next week. These policies will all be codified in an updated employee handbook. In the arena of benefits delivery, we kicked off a new 401K committee comprised of employees interested in managing and diversifying the retirement options for the organization.

HR is sponsoring the work of a qualitative, anthropological analysis of culture and also leadership profiling. This will support later work in leadership development. We have also initiated a coaching program for WMF managers.

Staff Changes

New Hire
  • Renee Bracey Sherman, Development Associate (Fundraising)
  • Andrew Otto, Software Developer – Analytics (Engineering)
  • Chris Steipp, Security Engineer (Engineering)
New Other Position Hires
  • Mathias Mullie, Contractor, Software Developer, Features (Engineering)
  • Haitham Shammaa, Contractor, program consultant for the Editor Growth and Contribution Program (Global Development)
Conversions
  • Rob Moen, Software Developer Front-end (Engineering)
Promotions
  • Sumana Harihareswara, Engineering Community Manager (Engineering)
New Contractors
  • Daisy Chen (Legal and Community Advocacy)
  • Arun Ganesh (Engineering)
  • Faidon Liampotis (Engineering)
  • Tauhida Parveen (Engineering)
  • Ricardo Saavedra (Fundraiser)
  • Sandra Senderovich (Fundraiser)
Contract Extended
  • Rayne MacGeorge (IT)
Exit
  • Nimish Gautam
  • Dana Isokawa
Contract Ended
  • Farhan Choudary
  • Emmanuel Engelhart

Statistics

Total Requisitions Filled:

Actual: 106
April Plan: 115 April Filled: 6, April Attrition: 2
YTD Filled: 48, YTD Attrition: 16

Remaining open requisitions to fiscal year end: 16

Department Updates

Department Changes, effective April 15, 2012
  • Siko Bouterse joins Global Development
  • Karyn Gladstone, Ryan Faulkner, Maryana Pinchuk and Steven Walling join Engineering

Real-time feed for HR updates: http://identi.ca/wikimediaatwork or http://twitter.com/wikimediaatwork

Finance and Administration

Independent contractors traveling on business for the Wikimedia Foundation, outside their home country now have limited medical and travel coverage.

Our search for a Director of Administration is continuing with final interviews in process.

We are beginning to look at the option of doing online expense reimbursements for employee exepenses.

Based on feedback received on her IRC office hours, the Chief Talent and Culture Officer is looking at socially responsible options for investing some of the reserve for the Foundation.

  • Updated terms of use becomes effective May 25, 2012.
  • Proactive trademark actions (e.g., successful challenge to third-party “Wikimedia UK” use & and winning globe logo in Mexico) and trademark review for Wikidata
  • Finalizing decision on appropriate license for Wikidata (probably CC0)
  • Kelly Kay, Deputy General Counsel, will represent WMF at the Open Source Initiative.
  • Discovered and worked through backlog of trademark requests sent to wrong email address
  • Two strong candidates identified for the junior legal counsel, including an active Wikimedian. Final decision: likely by May 15.
  • Welcome to the newest Arbitration Committee, English Wikinews
  • Working on electronic contract storage and approval process
  • Agreed to support the Free Culture Conservacy through endorsement
  • After winning signature issue as expected, we decided not to pursue appeal on German Loriot case regarding the stamps and public domain issue.
  • Reappointed the ombudsman commission
  • Ongoing legal and community work on a wide variety of issues and topics, including AFT5, CC 4.0, litigation, trademark portfolio, new fundraising agreement, template agreements, FDC, Wikipedia town, budget, privacy, internal policies, board issues and governance, etc.
  • Daisy Chen joined as a paralegal (temporary contract) to help handle workload.
  • New interns expected to start end of May. Our last semester interns have left (except for Stephen), and we wish them well. They did a great job. New full-time summer interns will be from Harvard, Stanford, and University of Minnesota.
  • This month’s posted discussions on topics of community interest:
  • Metrics:
    • Number of contracts in April – 14 (159 contracts to date in FY 2011/12)
    • Number of trademark issues in April – 61
      • Number of backlogged trademark issues that were sent through to the team in April (included in above total count) – 50
      • Approved – 7
      • Denied – 8
      • Request Withdrawn – 7
      • Pending – 34
      • Approval not needed – 2
      • No known response – 1
      • Closed due to lack of response – 2

Visitors and Guests

  1. Tammy Davidson (Chartis)
  2. Jennifer Hills (Chartis)
  3. Yanina Budkin (World Bank Senior Communications Officer for Argentina, Uruguay, and Paraguay)
  4. Gabriele Niola (Italian tech journalist for Punto Informatico and Wired Italy)
  5. Elisa Manheim (Institute for International Education)
  6. Matjaz Panjan (Fulbright scholar, Lawrence Berkeley National Laboratory)
  7. Cristiano Boccolini (Fulbright scholar, UC Berkeley)
  8. Alona Sekan (Fulbright scholar, US Department of Agriculture, Western Research Regional Center, Agricultural Research Service)
  9. Phoebe Ayers (board member)
  10. Tom Simonite (Computing Editor, MIT Technology Review)
  11. Frieder Bronner (writing dissertation on parts of Wikipedia)
  12. Matt Zimmerman (Technical Leader of Ubuntu)
  13. Craig Newmark (Craigslist founder, visitor for Ushahidi brownbag)
  14. Dan Perkel (visitor for Ushahidi brownbag)
  15. Megan Finn (visitor for Ushahidi brownbag)
  16. Nick Arnett (visitor for Ushahidi brownbag)
  17. Aakash Desai (Product Manager, Mozilla)
  18. Raj Ramabadran (Microsoft)
  19. John P. Alioto (Microsoft)
  20. Randall Benson (Benson Consulting)
  21. Aaron Halfaker (WMF Research Analyst)
  22. Ryan Merkeley (COO of Mozilla Foundation)
  23. Faidon Laimbotis (visiting contractor)
  24. Laura Lanzerotti (Bridgespan Group)
  25. Libbie Landles-Dowling (Bridgespan Group)
  26. Daniel Stid (Bridgespan Group)
  27. Meera Chary (Bridgespan Group)
  28. Divya Narayanan (Bridgespan Group)
  29. Deborah Bezona (D. Bezona & Company)
  30. Kelley Cope (Sitzmann, Morris and Lavis)
  31. Alice Komarnicki (Sitzmann, Morris and Lavis)
  32. Sandy Urgel (Sitzmann, Morris and Lavis)
  33. Kate Antonini (First Data)
  34. Patricia Brizio (First Data)
  35. Thomas Tucker (First Data)
  36. Anne Hiaring Hocking (Hiaring Smith)
  37. Vijay Toke (Hiaring Smith)
  38. David Evan Harris (Global Lives Project and Institute for the Future)
  39. Gavin McConnon (BoxPay)
  40. Kyle Hitchcox (BoxPay)
  41. Aaron Nobles (BoxPay)
  42. Tim Otten (CiviCRM)
  43. Anand Gupta (Thiel 20Under20 Finalists)
  44. Benedikt Lotter (Thiel 20Under20 Finalists)
  45. Brandon Paton (Thiel 20Under20 Finalists)
  46. Brianna Smrke (Thiel 20Under20 Finalists)
  47. Charlie Javice (Thiel 20Under20 Finalists)
  48. Charlie Stigler (Thiel 20Under20 Finalists)
  49. Chris Olah (Thiel 20Under20 Finalists)
  50. Clay Allsopp (Thiel 20Under20 Finalists)
  51. Connor Zwick (Thiel 20Under20 Finalists)
  52. Dylan Field (Thiel 20Under20 Finalists)
  53. Eric Chang (Thiel 20Under20 Finalists)
  54. Henry Lui (Thiel 20Under20 Finalists)
  55. Ilya Vakhutinsky (Thiel 20Under20 Finalists)
  56. Isaac Dietrich (Thiel 20Under20 Finalists)
  57. Jimmy Koppel (Thiel 20Under20 Finalists)
  58. Jon Lim (Thiel 20Under20 Finalists)
  59. Kettner Griswold (Thiel 20Under20 Finalists)
  60. Kevin Ma (Thiel 20Under20 Finalists)
  61. Lindsay Haskell (Thiel 20Under20 Finalists)
  62. Michael Moore-Jones (Thiel 20Under20 Finalists)
  63. Noor Siddiqui (Thiel 20Under20 Finalists)
  64. Omar Rizwan (Thiel 20Under20 Finalists)
  65. Oskar Niburski (Thiel 20Under20 Finalists)
  66. Param Jaggi (Thiel 20Under20 Finalists)
  67. Paul Sebexen (Thiel 20Under20 Finalists)
  68. Rebekah Austin (Thiel 20Under20 Finalists)
  69. Rijul Gupta (Thiel 20Under20 Finalists)
  70. Ritik Malhotra (Thiel 20Under20 Finalists)
  71. Ryan Lelek (Thiel 20Under20 Finalists)
  72. Saku Panditharantne (Thiel 20Under20 Finalists)
  73. Samir Devalaraja (Thiel 20Under20 Finalists)
  74. Semon Rezchikov (Thiel 20Under20 Finalists)
  75. Shai Kiriati (Thiel 20Under20 Finalists)
  76. Spencer Hewett (Thiel 20Under20 Finalists)
  77. Tara Seshan (Thiel 20Under20 Finalists)
  78. Taylor Wilson (Thiel 20Under20 Finalists)
  79. Tony Ho (Thiel 20Under20 Finalists)
  80. Vaibhav Kumar (Thiel 20Under20 Finalists)
  81. Vijay Viswanathan (Thiel 20Under20 Finalists)
  82. Wole Idowu (Thiel 20Under20 Finalists)
  83. Yoonseo Kang (Thiel 20Under20 Finalists)
  84. Maggie Dennis (visiting staff)
  85. Sumana Harihareswara (remote staff)
  86. Daniel Phifer and Kevin McCracken (Social Imprints)
  87. Christina Dragwidge (Arthur J. Gallagher)
  88. Henrik Bennetsen
  89. Majd Abbar, Qatar Foundation
  90. Ginny Jarrett (AJLI)
  91. Alice Gardner-Boreta (AJLI)
  92. Jodi Penn (AJLI)
  93. Eileen Goodwin (AJLI)
  94. Cynthia Foster (AJLI)
  95. Olivia Thomas (AJLI)
  96. Becker Holland (AJLI)
  97. Delly Beekman (AJLI)
  98. Sandra Thomas (AJLI)
  99. Sarah Berthelot (AJLI)
  100. Laurel Lee-Alexander (AJLI)
  101. Liz Murley (AJLI)
  102. Julie Siebel (AJLI)
  103. Karla Wallace (AJLI)
  104. Toni Freeman (AJLI)
  105. Mary Jo Hunt (AJLI)
  106. Kathy Rabon (AJLI)
  107. Deann Cook (AJLI)
  108. Liz Davis (AJLI)
  109. Subha Lembach (AJLI)
  110. Gwin Londrigan (AJLI)
  111. Karen Miller (AJLI)
  112. Terri Nass Reeder (AJLI)
  113. Dona Ponepinto (AJLI)
  114. Diann Rohde (AJLI)
  115. Evelyn Zabo (AJLI)
  116. Susan Danish (AJLI)
  117. Anne Dalton (AJLI)
  118. Maureen Mackey (AJLI)
  119. Janine le Sueur (AJLI)
  120. Carrie Holmes (AJLI)
  121. Heather Mcleod-Grant (AJLI)
  122. Rebecca Petzel (AJLI)
  123. Kristin Cobble (AJLI)
  124. Kat Walsh (Board Member)
  125. Arthur Richards (Remote staff)
  126. Martin Kalfatovic (Smithsonian)
  127. Chris Freeland (Missouri Botanical Garden)
  128. Diane Peters (Creative Commons)
  129. Marion Strecker (Brazilian journalist)
  130. Michiel Minderhoud (Mobile Code Challenge Winner)

2012/05/14: Edited to correct an error in the “Financials” section (changed “5% lower than plan” to “5% higher than plan”)

Tilman Bayer — 15. May 2012, 01:45

Wikimedia Highlights, April 2012

Information You are more than welcome to edit the wiki version of this report for the purposes of usefulness, presentation, etc., and to add translations.

Highlights from the Wikimedia Foundation Report and the Wikimedia engineering report for April 2012, with a selection of other important events from the Wikimedia movement

Wikimedia Foundation highlights

Expanding fundraising and affiliation models

After the resolutions of the Wikimedia Board of Trustees at its meeting in Berlin, work is ongoing to implement a new model for distributing the money raised via Wikimedia project sites. Except the costs for the core operations and operating reserves of the WMF, all of it (including funds for chapters and non-core operations of the WMF) will be distributed based on the recommendations of the new volunteer-driven Funds Dissemination Committee (FDC). Another resolution of the Board recognizes new models of affiliation with the Wikimedia movement: “Movement Partners” (like-minded organizations that actively support the movement’s work), “National or Sub-national Chapters” (which includes the existing chapter model), “Thematic Organizations” (non-profits representing the movement and using the Wikimedia trademarks, which are supporting work focused on a specific topic), and “User Groups” (open membership groups which may or may not choose to incorporate).

Indic language outreach

Chief Global Development Officer Barry Newstead visited India meeting Wikimedians in Bangalore and attending Wikisangamotsavam, the Malayalam community conference, as part of work to support Indic language projects. The India team is working actively with seven Indic language communities on outreach, social media strategy and initiatives to build community momentum.

Page views to the Wikipedia mobile site (red: non-English versions) compared to the 2 billion target from the annual plan

Mobile pageviews target reached

At the end of April, the Wikipedia mobile site reached the milestone of 2 billion monthly page views – one of the goals for the 2011/12 WMF annual plan.

Towards a rapid software deployment cycle

Wikimedia engineers have begun switching to a more rapid deployment cycle, starting to deploy the latest MediaWiki software to Wikipedia and other Wikimedia sites every two weeks.

Global unique visitors for March:

489 million (+2.7% compared with February; +22.3% compared with the previous year)
(comScore data for all Wikimedia Foundation projects; comScore will release April data later in May)

Page requests for April:

17.3 billion (+0.4% compared with March; +18.2% compared with the previous year)
(Server log data, all Wikimedia Foundation projects including mobile access)

Active Registered Editors for March 2012 (>= 5 edits/month):

85.09K (+0.2% compared with February / -4.5% compared with the previous year)
(Database data, all Wikimedia Foundation projects except for Wikimedia Commons)

Report Card for March 2012: http://reportcard.wmflabs.org/

Financials

(Financial information is only available for March 2012 at the time of this report.)

All financial information presented is for the period of July 1, 2011 – March 31, 2012.

Revenue $32,054,861
Expenses:
 Technology Group $7,788,192
 Community/Fundraiser Group $3,212,763
 Global Development Group $2,984,100
 Governance Group $718,116
 Finance/Legal/HR/Admin Group $4,607,656
Total Expenses $19,310,826
Total surplus/(loss) $12,744,035
  • Revenue for the month is $1.9MM vs plan of $3.9MM, approximately $2MM or 53% under plan.
  • Year-to-date is $32.1MM vs plan of $28.6MM, approximately $3.5MM or 12% over plan.
  • Expenses for the month is $2.3MM vs plan of $2.2MM, approximately $112K or 5% higher than plan.
  • Year-to-date is $19.3MM vs plan of $21.1MM, approximately $1.8MM or 8% lower than plan.
  • Cash position is $30.6MM as of March 31, 2012 – approximately 13 months of expenses.
Monthly Metrics Meeting May 3, 2012.ogv

Video of the monthly Wikimedia Foundation metrics and activities meeting covering the month of April (May 3, 2012)

Other movement highlights

“Wikipedian in Residence” model comes of age

The British Library recently became the latest cultural institution to announce (together with Wikimedia UK) a Wikipedian in Residence, supporting connections between the British Library’s staff and the Wikimedia community. At the prestigious 2012 American Association of Museums conference, five Wikipedians in Residence from around the world presented this collaboration model. The April issue of the “This Month in GLAM” newsletter reports open Wikipedian in Residence positions at state institutions in Israel, Germany and Sweden.

The very first screenshot of Wikidata

During its first month, the Wikidata team has started coding for the first phase of the project, which will allow the central storage of interwiki links. Two MediaWiki extensions are being developed as the base for Wikidata: Wikibase Client and Wikibase core. A demo version will be available soon.

Multilanguage contest from Monmouthpedia project

As part of Monmouthpedia – “the first Wikipedia project to cover a whole town” – the Charles Rolls Challenge awarded prizes to Wikimedians who had written, improved or uploaded content about Monmouth in multiple languages. Also, the town saw the first ever Welsh meetup in April.

One of the two first prize winning entries from the Tamil contest: A Rekla race (Ox cart race) in Tamil Nadu, India

Tamil media contest

The Tamil Wikimedia community completed its “Tamil Wiki Media Contest“, which led to 15,000 media contributions from 307 individuals.

Database company donates free access to Wikimedians

Over 600 Wikimedians have received free access to the HighBeam Research database for one year, to support their work on Wikipedia and its sister projects. Active editors can still apply at en:Wikipedia:HighBeam/Applications.

2012/05/14: Edited to correct an error in the “Financials” section (changed “5% lower than plan” to “5% higher than plan”)

Tilman Bayer — 15. May 2012, 01:44

14. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

If you’re seeing ads on Wikipedia, your computer is probably infected with malware

We never run ads on Wikipedia. Wikipedia is funded by more than a million donors, who give an average donation of less than 30 dollars. We run fundraising appeals, usually at the end of the year. If you’re seeing advertisements for a for-profit industry (see screenshot below for an example) or anything but our fundraiser, then your web browser has likely been infected with malware.

Screenshot of the Wikipedia article on John Slattery, with an advertisement for Inkfruit injected by malware on the user's computer

Malware installed on your computer may inject advertising into a page on popular websites, such as this Wikipedia article. This is an example that we've seen in the wild. Note the tiny text "ads not by this site" immediately below the ad, which may or may not appear next to these types of injected advertisements.

One example that we have seen installs itself as a browser extension. The extension is called “I want this” and installs itself in Google Chrome. To remove it:

  • Open the options menu via the “pipe-wrench” icon on the top right, and choose Settings.
  • Open the Extensions panel and there is the list of extensions installed.
  • Remove an Extension by clicking the Remove button next to an item.

There is likely other similar malware that injects ads into Chrome, Firefox, Internet Explorer and other popular browsers. If you see examples that you can document, please point them out in the comments.

Ads injected in this manner may be confined to some sites, even just to Wikipedia, or they may show up on all sites you visit. Browsing through a secure (HTTPS) connection (which you can automate using the HTTPS everywhere extension) may cause the ads to disappear, but will not fix the underlying problem.

Disabling browser add-ins is a good starting point to determine the source of these types of ads. This does not necessarily fix the source of the problem either, as malware may make deep changes to your operating system. If you’re comfortable attempting a malware scan and removal yourself, there are various spyware/malware removal tools. Popular and well-reviewed solutions include Ad-Aware and Malwarebytes. But be aware that these types of tools may also bundle software, or leave your computer in an unusable state.

If in doubt, have your computer evaluated for malware by a competent and qualified computer repair center.

There is one other reason you might be seeing advertisements: Your Internet provider may be injecting them into web pages. This is most likely the case with Internet cafes or “free” wireless connections. This New York Times blog post by Brian Chen gives an example.

But rest assured: you won’t be seeing legitimate advertisements on Wikipedia.

We’re here to distribute the sum of human knowledge to everyone on the planet — ad-free, forever.

Philippe Beaudette, Director of Community Advocacy
Erik Moeller, Vice President of Engineering and Product Development

Philippe Beaudette — 14. May 2012, 19:49

11. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

New book dives into the architecture of MediaWiki, git, puppet and other open-source applications

The cover of the book, based on the photo of a building from a low-angle shot

The Architecture of Open-Source Applications is a collection of technical essays detailing the architecture of twenty-four major open-source applications.

The second volume of the Architecture of Open-Source Applications book, which includes a chapter on MediaWiki, is now available online and on lulu.com.

The Architecture of Open-Source Applications is a collection of technical essays detailing the architecture of twenty-four major open-source applications. This is the second volume of a series that aims to help developers understand how great and large programs are constructed, and the decisions (or accidents) that led to the way they now work. The series draws inspiration from books used by architects that feature case studies of the great buildings of history.

This volume contains a chapter detailing the inner workings of MediaWiki, the wiki software that powers all Wikimedia sites, including Wikipedia.

The writing of the chapter was coordinated by myself and Sumana Harihareswara. While I put together the majority of the content, it wouldn’t have been possible without the initial knowledge-sharing effort made by many Wikimedia engineers and volunteer MediaWiki developers, who also reviewed and improved the several revisions the text underwent.

The chapter on MediaWiki is available on the book’s website, along with the other chapters from both volumes. Its content was integrated into the documentation on mediawiki.org (at MediaWiki history and Manual:MediaWiki architecture) when it was completed in November 2011.

Greg Wilson and Amy Brown, the book’s editors, contacted the Wikimedia Foundation in August 2011 to offer to feature MediaWiki in the second volume. We chose a very collaborative approach to writing the chapter to ensure that the content was accurate and thorough, and also to split the workload among subject matter experts.

This volume dives into the inner workings of other tools familiar to the Wikimedia community, like Git, GNU Mailman, nginx and Puppet.

All of the book’s content is released under the Creative Commons Attribution license, similar to the license used on Wikimedia sites. It is freely available for reading online at http://www.aosabook.org, and you can also order a print from lulu.com. E-book and PDF versions will be available for purchase shortly. All royalties from purchases are donated to Amnesty International.

This is the second book published this year that contains a chapter written by Wikimedia staff, after the publication of Open Advice, a collection of essays, stories and lessons learned by members of the Free Software community.

I hope the chapter on MediaWiki, and also the rest of the book, will prove useful and interesting to the Wikimedia community and other developers. If you enjoyed it, learned from it, or would like to see more publications of this type, let us know!

Guillaume Paumier
Technical communications manager

Guillaume Paumier — 11. May 2012, 12:24

10. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

59 percent of logged-in Wikipedians started as anonymous editors

As we work on new product features to improve various aspects of editing Wikipedia, we asked our editors to share more about their editing experiences. Here are some highlights from the Editor Survey that we found to be valuable:

a. 59 percent of editors edited Wikipedia anonymously before creating an account

b. Decline in edit activity is more pronounced for experienced editors

c. Edit history influences editors’ views on problems with Wikimedia culture as well as desired solutions

a. 59 percent of editors edited Wikipedia anonymously before creating an account

59 percent of respondents pointed out that they had edited Wikipedia anonymously before they set up a login account on Wikipedia. (The survey was only announced to logged-in editors.) Portuguese (70 percent) and Spanish (66 percent) Wikipedia editors were more likely to have edited anonymously in the beginning compared to English (60 percent), Russian (58 percent) and German (59 percent) editors. Among editors, the three biggest motivators for setting up a login account are: tracking their edit history (54 percent), creating new articles (54 percent) and having a watchlist of articles to follow (49 percent). Interestingly, while the English Wikipedia makes it mandatory to obtain an account before one can start a new article, respondents there cited this reason less often (39 percent). Among Spanish (67 percent) and Portuguese (68 percent) language editors this percentage was much higher, even though these Wikipedias allow creation of new articles without being logged in.
76 percent of those respondents who had started anonymously said that they had made between one and 50 anonymous edits before they registered a user account. The majority of these respondents (54 percent) saw the benefit of having a user account only after editing Wikipedia anonymously for more than 10 times.

QD3b. What prompted you to set up a user account? n=6378

b. Decline in edit activity is more pronounced for experienced editors

When asked about their level of activity in 2011 compared to the previous year (2010), 30 percent of respondents said that they were less active, another 30 percent stated that there was no change in their activity and 41 percent said that they were more active. However, more experienced editors (with 100+ edits) were more likely to point out that they were contributing less often. The most common reasons reported for becoming less active on Wikipedia are: Having less time (59 percent), spending more time on other offline activities like reading (44 percent), spending more time on school or academic work (34 percent), spending more time on other online activities like Facebook or Twitter (23 percent), and rules and guidelines for editing becoming too complicated (17 percent).

QD7a: Thinking about this year (2011) how active were you on Wikipedia compared to the previous year? (ratio of 'less active' answers) n=3890

c. Edit history influences editors’ views on problems with Wikimedia culture as well as desired solutions

We asked editors to choose the top three problems with Wikimedia culture that have affected them personally, making it harder for them to edit. The most commonly picked responses were: Other editors who feel that they own specific articles and don’t want others to collaborate (46 percent), too many rules and policies (41 percent), editors who are not fun to work with (39 percent) and lack of access to research materials like scholarly articles or books (39 percent).
When we sliced data through edit counts, we found that experienced editors are more likely to identify issues with other editors as the biggest problem that plague Wikipedia culture, while newer editors are more likely to identify complicated policies and software. For example, 45 percent of emerging editors (1-9 edits), 48 percent of aspiring editors (10-50 edits) and 44 percent of new Wikipedians (51-100 edits) said that too many rules and policies were the main problems that they faced in Wikimedia culture. But these numbers were significantly smaller for more experienced editors: 39 percent for active Wikipedians (100+ edits), 36 percent of very active Wikipedians (1000+ edits) and 34 percent of highly prolific Wikipedians (5000+ edits). Correspondingly, 59 percent of highly experienced Wikipedians, 53 percent of very active Wikipedians and 47 percent of active Wikipedians pointed out that editors who are not fun to work with formed one of the main problems that they were facing. But only 22 percent of all newer editors (emerging, aspiring and new Wikipedians) reported this as an issue.

Q23: Please pick the three most important problems that have affected you personally, making it harder to edit. n=5962

Similarly, when asked about changes that might make it easier to contribute to Wikipedia, newer editors were more interested in simpler policies and rules, and a more user friendly editing interface. On the other hand, more seasoned editors are looking for improvements in editor behavior. All editors, irrespective of their edit history, agree that they need access to better research materials for writing articles.

Q24. Please pick three changes that you believe will make it easier for you to contribute. n=6176

 

If you are interested in more information about Wikipedia editors, please check out the topline findings from the survey.

Mani Pande, Head of Global Development Research
Ayush Khanna, Data Analyst, Global Development

In December 2011, we conducted an online survey of Wikipedia editors in 17 languages. This is the third in a series of blog posts summarizing our findings. If you are interested, you can find out more about the methodology of the survey here.

 

 

Mani Pande — 10. May 2012, 20:40

08. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

DigiCert partnership enhances SSL security on Wikimedia sites

The Wikimedia Foundation today announced a partnership with DigiCert, Inc. based in Linden, Utah, to secure its web and mobile properties, using the company’s Enterprise SSL Managed PKI. The agreement supports online authentication and encryption on Wikimedia’s web and mobile properties, while enabling Foundation staff to streamline digital certificate management.

“The Wikimedia Foundation is grateful for this partnership with DigiCert, which will enhance our ability to secure the millions of online exchanges that occur with our websites each day,” said CT Woo, Director of Technical Operations for the Wikimedia Foundation. “It’s important for Wikimedia to identify like-minded partners that value transparency and the privacy of our users.”

DigiCert is an online security provider for many of the most recognized companies and web sites in the world, including four of the top 10 comScore-ranked sites. With 489 million unique visitors to the 285 language Wikipedias and sister sites each month, the Wikimedia Foundation seeks partners who share its mission to ensure transparency and privacy for its users.

DigiCert has seen consistent growth over time and is currently the world’s third-largest provider of enterprise authentication services and digital certificates, with numerous government, educational and business clients around the world.

“DigiCert is pleased to partner with the Wikimedia Foundation in recognizing the importance of the free and secure flow of information across the Internet and to support the Foundation’s mission,” said DigiCert CEO Nicholas Hales in a press release. “We’re excited to have another opportunity to demonstrate the quality, scalability and flexibility of DigiCert’s products for a continually expanding roster of globally leading organizations of all sizes and industries.”

CT Woo, Director of Technical Operations

Matthew Roth — 08. May 2012, 06:55

Walters Museum uploads 19,000 photos to Wikimedia Commons

The Tulip Folly, Jean-Léon Gérôme, from the Walters Museum collection

‪The Walters Art Museum in Baltimore, Maryland, has donated more than 19,000 freely-licensed images of artworks to Wikimedia Commons. The Walters’ collection includes ancient art, medieval art and manuscripts, decorative objects, Asian art and Old Master and 19th-century paintings. The images and their associated information will join our collection of more than 12 million freely usable media files, which serves as the repository for the 285 language editions of Wikipedia. ‬

‪The project began taking shape in February 2012, as part of the GLAM-Wiki initiative (Galleries, Libraries, Archives, and Museums). During GLAMcamp DC, a three-day conference hosted by the National Archives and Records Administration in Washington, D.C., the Walters Museum worked with several Wikimedians to develop a documented process for uploading images to Commons. The basic details of the upload procedure were established during the conference, and during the weeks that followed, the uploads were conducted, monitored and tested, while collaboration continued online. ‬

‪”The Walters has gone above and beyond throughout this collaboration with the GLAM-Wiki community, working alongside Wikipedians to serve as a model for our mass image upload process,” said Lori Byrd Phillips, U.S. Cultural Partnerships Coordinator for the Wikimedia Foundation. “The release of these images will not only improve articles in Wikipedia, but will also have the potential to be used freely throughout the web.”

‪The image donation is part of the Walters Museum’s larger initative to provide free public access to its collection, both online and offline, beginning with the removal of admission fees in 2006. In 2011, the Walters launched a redesigned works of art website with 10,000 online artwork images freely licensed under a Creative Commons license. ‬

Sarasvati image from Walters Museum

‪”By uploading our information in this way, we can share items of cultural heritage from around the globe, directly with people in those parts of the world. Already our images have been used in 48 different languages. The Walters’ collection is well-suited for this project because of its size and its breadth of topic areas,” said Dylan Kinnett, Manager of Web and Social Media for the Walters Art Museum. “By developing documentation and tools for this type of work, we hope that our upload project can serve as a prototype for other cultural institutions.”

‪Already, the museum’s images have had an impact in improving content on Wikipedia, such when they are used as illustrations in entries whose topic is not the artwork itself, but a related idea, such as a mythological figure, or a time or place. The Walters’ painting of the Hindu goddess Saraswati, for instance, has been added to five different language Wikipedia entries about the goddess.‬

We would like to thank to the Walters Museum for their donation and their commitment to promoting free knowledge on Wikimedia Commons, and to the GLAM volunteers who helped make this endeavor possible.

Matthew Roth, Global Communications Manager

Matthew Roth — 08. May 2012, 03:53

07. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

GLAM-Wiki at the American Association of Museums

Over the past two years, the GLAM-Wiki initiative (Galleries, Libraries, Archives, and Museums) has grown from strength to strength, gaining the attention of cultural institutions and organizations from around the world. Due to this ever-increasing interest, a group of Wikipedians in Residence were invited to participate in the 2012 American Association of Museums annual meeting (AAM) in Minneapolis, Minnesota at the end of April. While volunteers in the GLAM-Wiki movement frequently present at professional conferences, at 4,500 participants AAM is the largest and most prestigious conference that we’ve had the opportunity to attend.

Wikipedians in Residence

Wikipedians in Residence prepare to present at the American Association of Museums. cc by-sa 3.0 Sarah Stierch.

Because of the importance of this conference, much preparation went into bringing together five Wikipedians in Residence from around the world to represent the work of the GLAM-Wiki initiative. Liam Wyatt, Sarah Stierch, Àlex Hinojo, and I participated in both a virtual and an on-site panel titled “Wikipedia and the Museum: Lessons from Wikipedians in Residence.” Dominic McDevitt-Parks facilitated a table at the “Marketplace of Ideas” event, which focused on how museums can best share their resources with Wikipedia. Throughout the conference, the GLAM-Wiki US portal was promoted as a new tool for American museums to more easily connect with the Wikimedia community.

Highlights of our outreach included:

  • For the in-person Wikipedians in Residence panel, over fifty museum professionals gathered to hear about the GLAM-Wiki initiative, the types of outreach events, the methods for connecting with the Wikimedia community and the resources for helping museums get started with a project.
  • The virtual session brought together over fifty museum professionals and GLAM-Wikimedians from around the world to discuss best practices. Event organizers allowed Wikipedians free access to the event and the recorded session is now publicly accessible.
  • Due to the high level of interest, all of the Wikipedians in Residence jumped in to assist with the Marketplace of Ideas table. Over a three-hour period we answered questions, shared resources and left with a number of potential new museum cooperations.

As the conference went on, it was abundantly clear that museum professionals were ready to more fully engage with the Wikimedia community. A handful of sessions independently discussed the GLAM-Wiki initiative as a model project within broader topic areas, including global partnerships within children’s museums, transparency in the future of museum ethics and “going beyond digitization.”

The AAM conference was a watershed moment for GLAM-Wikimedia collaboration. We were surprised that many people no longer needed to be convinced of Wikimedia’s relevance within their institution.  Instead, many were eager and ready to take the next step toward connecting with the Wikimedia community. As a museum professional myself, it was inspiring to directly witness the museum field wholly embracing Wikipedia as a serious tool for furthering their missions.

It has been a long time coming.

Lori Byrd Phillips, US Cultural Partnerships Coordinator

(Participation by Wikipedians in Residence in the American Association of Museums conference was made possible through the Wikimedia Participation Grants program.)

Lori Phillips — 07. May 2012, 20:12

Community service requirement fulfilled through Wikipedia contributions

Seven students in Mexico from ITESM-CCM’s International Baccalaureate Program participated in a pilot project to satisfy all or part of their community service requirement by working with Wikipedia.

The International Baccalaureate (IB) Diploma is an interdisciplinary high school level program which was developed in collaboration with UNESCO mostly based on European and U.S. models. It is considered to be rigorous, with graduation depending on testing, the writing of essays, and the completion of a community service requirement. This makes IB program ideal for Wikipedia. Students have been prepared to do expository writing in their native language (and often in English as well), and the community service requirement is eighty hours, which allows for the development of various kinds of projects.

Four students who edited Wikipedia as part of their community service requirement of their International Baccalaureate Diploma.

Four students who edited Wikipedia as part of their community service requirement of their International Baccalaureate Diploma.

In the pilot program at ITESM-CCM, I trained students on how to edit Wikipedia and helped them come up with article ideas, working at the school’s self access language laboratory, which I run. We began with English-to-Spanish translation on Wikipedia since I have had success with this kind of assignment and the Teylers Museum was doing a translation challenge. Soon after doing several articles related to Teylers, students began to find their own interests over the course the semester. Students worked on translating or creating fifteen articles, as wells as donating and organizing photos in Wikimedia Commons. See a list of articles created or translated.

Opportunities arose during the semester to allow students to explore options other simply working in the lab. The main one was an edit-a-thon held at the Museo de Arte Popular in Mexico City. This allowed the one IB student who could attend to not only work with the museum, some university students, and members of Wikimedia México, it also got her picture in the school’s newspaper. Various students have worked with me to create formal proposals in Spanish to be sent to other organizations with interest in working with Wikipedia.

The pilot had only seven students so that time and other considerations could be tested. It turns out that these students adapted to working with Wikipedia rapidly, mostly because they already know the basics of doing expository writing and citation. Since this is a community service requirement, the only assessment is hours served, no grades. This makes working with students in International Baccalaureate programs, which exist worldwide, an attractive option for outreach programs to explore!

Leigh Thelmadatter, Volunteer

Leigh Thelmadatter — 07. May 2012, 17:39

Commons Picture of the Day: Red deer stag in autumn

It was an early morning in autumn and Luc Viatour (User:Lviatour) was on a nature path with a naturalist friend well-versed in the habits of deer, when he chanced upon this red deer (Cervus elaphus) stag running through a field. “The hardest part is the approach,” said Viatour, emphasizing the importance of keeping a safe distance.

This image – which was selected as today’s “picture of the day” on Wikimedia Commons – is part of a series which Viatour especially loves, since he feels it was “a moment with nature.” He hopes it will inspire others to “want to go early in the morning and observe nature preserves.” In addition to nature photography, Viatour is also interested in astronomy, architecture, color and black-and-white photography…“in fact all [subjects and styles],” he said.

He first developed a passion for film photography at the age of 14, and “digital photography accentuated this passion.” Viatour is now a professional photographer, after he decided that photography was an activity he wanted to devote himself to full-time.

Viatour considers himself a computer and free software enthusiast, which is how he first discovered Wikipedia and the other Wikimedia Projects. “I like the idea of free software that I use every day,” he said. “It seemed logical to contribute to something that I use. As I am not very good at writing, I helped with what I think I do best.”

He started contributing to Commons in 2005, when he uploaded one of his best pictures, from a series of photos he took of the solar eclipse in 2009. “I realized then, [after I] saw the success, that some images were better in the public domain [than forgotten] in one of my drawers,” he said.

(View more of Viatour’s featured photos or visit his personal website)

Elaine Mao, Communications Intern

Elaine Mao — 07. May 2012, 15:15

* Wikimedia Česká republika *

* Wikimedia Česká republika *

Výstava „Příroda v obrazech“ se blíží

Ve středu 16. května 2012 se bude konat vernisáž k výstavě fotografií „Příroda v obrazech.“

Vernisáž se uskuteční ve třetím patře Vyšší odborné školy informačních služeb (Pacovská 350/4) v Praze nedaleko stanice metra Budějovická. Zahájení proběhne v 17 hodin, kdy studenti pronesou několik slov na úvod, a následovat bude malý koncert, na kterém vystoupí australský kytarista Rowen Slocombe. Připraven je také doprovodný program s názvem „Staň se součástí fotky.“

Na výstavě bude k vidění třicet snímků, z nichž dvacet zachycuje přírodu a přírodní živly, které jen tak nespatříte. Nejdůležitější částí výstavy je desítka fotografií, které patří mezi nejlepší v soutěži o „Obrázek roku 2010.“ Všechny tyto fotografie pochází z internetové galerie Wikimedia Commons, kam byly nahrány a nafoceny dobrovolníky. Většina z nich pochází od zahraničních fotografů, na výstavě ale můžete vidět snímky od českého autora, který se také zúčastní vernisáže.

Výstavu pořádá skupina studentů za finanční podpory neziskové organizace Wikimedia Česká republika, jejíž zástupci budou představeni na vernisáži. Tato organizace se podílí se na rozvoji kultury a vzdělanosti v České republice podporou svobodné tvorby, součástí je mimo jiné i podpora a rozvoj Wikipedie a dalších projektů na českém území.

Příroda v obrazech

Příroda v obrazech


The opening of exhibition „Nature in images“ will be held on Wednesday 16th May 2012.

It takes place in Prague college building of Information services, on the third floor (Pacovská 350/4). The opening is scheduled at 5pm. Students are going to make a short opening speech. Following programmes are live music performance by australian guitar player Rowen Slocombe and so called „Become part of the picture“ program.

There are going to be 30 photographs from which 20 images are capturing nature and natural elements, that are very rare to see. The most important part of the exhibition is 10 best photographs from competition „Picture of the Year 2010“. All of them come from internet gallery Wikimedia Commons. Those photographs have been taken and uploaded to Wikimedia by volunteers. People who created the images are mostly foreign photographers but there are also photos from czech artist, who is also attending the opening.

The exhibition is held by a group of students with financial support from non-profit organisation Wikimedia Czech republic, whose representatives will be present at the opening. This organisation participates in developing culture and education in the Czech republic by supporting free artistic creation. A part of this initiative is also support and development of Wikipedia and other projects in czech region.

Jiří Sedláček — 07. May 2012, 08:38

06. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

Commons Picture of the Day: Volcanic rock strata on Tenerife

Volcanic rock strata on Tenerife

Wladyslaw Sojka (User:Taxiarchos228) captured this breathtaking picture of volcanic rock strata while vacationing on Tenerife, Spain, in December 2007. This particular shot required some precarious positioning of his Nikon D80 to capture the layers of rock strata as well as the road adjacent to it. The position in question happened to be in the middle of the road. While paying attention to any cars passing by for fear of being run over, Sojka took about 30 shots of the hairpin turn as well as the surrounding area.

While shooting the featured picture Sojka tried to capture a scene that would enable viewers to appreciate the aesthetic nature of the shot as well as the informative character of the fascinating rock layers. He chose this shot, at the juncture where the curve of the road was reducing, because it highlighted the volcanic strata in its best scale.

“The different coloured layers derive from the different eruptions. Centuries or even millenniums may have elapsed between the deposition of each layer. The white porous layer is made up of pumice fragments. The black layers are composed of basalt, emerging during eruption with low gas content. The reddish layers are also of basalt, however they were oxidized by ground water,” said Sojka.

Sojka describes his passion for photography as an “ambitious hobby,” having spent a lot of time teaching himself how to take pictures with his first compact camera in 2004. His work mainly focuses on architecture and landscape. Sojka expresses that he has a deep interest in architecture and civil engineering of all kinds of towers. As a result his pictures are mostly of churches, television towers and the historic, as well as modern architecture of bridges that are special to him. Besides contributing photos to Commons, Sojka also spends time arranging and rearranging categories to sort pictures on the gigantic media repository.

“Sometimes I also study some buildings that are commonly classified as ugly but have a unique attraction for me. The object itself and its reproduction close to its naturalness, for me, is often more important than an artistic picture,” said Sojka.

(View more of Sojka’s photos)

Jordon Hu, Communications intern

Jordan Hu — 06. May 2012, 19:41

05. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

Bringing the wonder of Wikipedia to rural Kenyan schools

If you’re fortunate enough to have access to a computer and the Internet, you can get the sum of the world’s knowledge for free thanks to Wikipedia. But if you’re like the majority of the world that isn’t online, how can you access this amazing resource?

Alex Wafula, Wikipedian from Kenya

What if someone brings it to you by hand?

Alex Wafula, a Wikipedia editor and 3rd year student at Strathmore University, in Nairobi, Kenya, has helped start the Wikimedia Project for Kenyan Schools, where he and a team of volunteers travel to remote parts of Kenya to share offline Wikipedia with students. Wafula and the team install offline versions of English Wikipedia from a disc or memory stick in schools that have computers, and they have provide both teachers and students with tutorials on how to operate the database.

“I’ve always been fascinated in discovering new things and knowing why things work the way they do, like why’s the sky blue and not red, what makes planes fly and boats float etc. Now what’s even more fascinating for me is sharing this,” said Wafula.

Organizers of the project began by procuring a list of schools with computer labs from the Kenyan Ministry of Education. From that list they divided the schools into 3 distinct regions: Kakamega Town (Western Kenya), Nyeri Town (Central Kenya) and Mombasa City (South Eastern Kenya) with 10 schools per region and a total of 30 schools.

Project members were mostly based in Nairobi and traveled 7 hours by bus to reach Kakamega and Mombasa City, and 4 hours to Nyeri. The teams typically stay 5 days in each region. To reach their goal of 2 schools per day, the team utilize an assortment of transportation including mini-buses, tuk-tuks and motorbikes to travel from school to school, which in some cases are considerable distances apart.

In Kenyan public schools that have computers, Wafula said, students take turns in time-allotted sessions in the computer lab and share a single computer with as many as three other classmates. Wafula noted that some of the computers he dealt with at these labs were too old and needed repair before they could install the offline Wikipedia.

“In high school, I spent many hours reading encyclopedias and from the knowledge gained, I found hope of making something out of my life,” said Wafula. “It’s my hope that students who get access to offline Wikipedia will find hope of a better future for themselves and their families as well from the knowledge they gain.”

In 2003, education in public schools in Kenya became free and universal. According to Wafula, however, the number of students enrolled in the public school system has exceeded the capacity of the system, with as many 60 students (or more) being taught by a single teacher. Schools in rural areas lack enough desks and chairs to facilitate all the students and in some cases students attend class in half-built classrooms or under trees. Textbooks are shared between 2-4 students and school supplies are treasured commodities.

In this context, gaining access to hundreds of thousands of Wikipedia articles is a marvel.

“There is so much promise in these kids, despite the adversities they face,” said Wafula. “In their world full of challenges and uncertainties, I’m happy that I got to deliver one of their solutions.”

Story and reporting by Jordan Hu, Communications Intern

Jordan Hu — 05. May 2012, 04:35

04. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

Commons Picture of the Day: Flèche attack in the Trophée Monal

Looking at today’s Wikimedia Commons Picture of the Day (POTD), it may come as a surprise that the photographer, Marie-Lan Nguyen (User:Jastrow), is not a specialist in sports photography. In fact, Nguyen had never even shot at a sporting event prior to the day she took this amazing picture at the Trophée Monal, an event of the 2012 Fencing World Cup circuit.

Nguyen took this photo during the final match of the tournament. “The audience was plunged in the dark while the piste was brilliantly lit, naturally providing a dramatic lighting,” she said. “I was lying on the floor propped on my elbows to get a good angle, with my photo backpack used as a bean bag to support my camera. To change position I had to military crawl on the floor.”

Since fencing is such a fast-paced sport, Nguyen had to shoot at a shutter speed of less than 1/640 sec just to catch the movements without any blurring. During the match, she usually focused on the glove or the bell guard of one of the contestants using continuous autofocus mode.

Nguyen had the advantage of having some knowledge of fencing, since she used to fence as a student. “Though I was useless at it, it helped with understanding what kind of action was going to get interesting,” she said. “The key was to spot the beginning of an attack, press the shutter-release button long enough to get the whole action, and hope for the best.” She took approximately 100 pictures in just the final match alone, and her POTD came from a single burst of 8 shots. The action depicted in the photo is a flèche attack, which she had several bursts of, but she especially likes this one because of the “nice symmetry in the bend of the blades. I hope [the photo] reflects the speed, accuracy and elegance of a fencing bout.”

She got the opportunity to photograph this sporting event through the support of Wikimédia France, the national Wikimedia chapter. She is a member of the Paris “cabal,” a joking designation for the informal local chapters. At one of their meetings, they discussed the success that the Swiss chapter and the Toulouse cabal had in gaining official accreditation to do sports photography for Commons. These successes and others are reflective of a growing trend toward recognizing Wikimedians as storytellers, allowing them the accreditation to attend events as photographers or reporters. The Paris cabal had never tried anything of the sort, but Nguyen had no trouble getting accredited for the Trophée Monal just by mentioning her affiliation with Wikimédia France.

“I don’t usually take the kind of pictures that get nominated as a QI [quality image] or a FP [featured picture],” said Nguyen, who typically takes photos of museum objects. “I’m glad this one made it, as it’s the result of a collaborative process” with assistance from other Commons users, Wikipedians, Wikimédia France, and the French WikiProject:Fencing. “I’m aware I’m lucky to get a FP for my first sports event. I learnt a lot covering it, and the FP is great encouragement for me to keep doing sports photography.”

Nguyen first started taking pictures in 2004 using her 1.3 megapixel compact camera. She was a contributor to the French language Wikipedia at the time, and she noticed that there was a severe lack of quality pictures in her field of interest–ancient Greek and Roman history–so she went to the Louvre to take some pictures. She quickly developed a greater interest in photography and bought an entry-level DSLR. Nguyen now owns two Nikon DSLRs (a D300s and a D200) and “a whole array of lenses,” and her primary project is now Commons. She estimates that more than three-quarters of her photos are shot for Commons.

“Beyond its use as a common repository for Wikimedia projects, I see [Commons] as a project in its own right,” she said. “I believe Commons is a great way to increase academics’, teachers’ and students’ awareness about free content, and to get them to contribute to all Wikimedia projects.”

(View more of Nguyen’s photos)

Elaine Mao, Communications Intern

Elaine Mao — 04. May 2012, 18:43

Mobile milestone: Two billion page views

Page views to the Wikipedia mobile site (red: non-English versions) compared to the 2 billion target from the annual plan

One of the annual plan targets of the Wikimedia Foundation for 2011-2012 was to reach 2 billion monthly page views to the Wikipedia mobile site by June 2012. We’re happy to say that we hit the mark sooner, on the second-to-last day of April to be exact. April clocked in at 2.089 billion, a year-over-year increase of 187%. The mobile site now attracts 12.6% of all page views for Wikipedia, more than twice of its 5.1% share in April 2011.

How did it happen? As internet usage shifts from a desktop-centric environment to a more mobile-centric one, there’s a migration to smaller screens. Various industries and factors have made that happen, and several things have been done at the Wikimedia Foundation to move with the change. We can’t do justice to all the individual work by attempting to list it here, but amongst the many changes and contributions, a few highlights include the launch of the new mobile site last October, better device detection, and the official Android app announced in January.

Also notable about the 2 billion mark is the way use has evolved globally. A year ago, 67% of all visits to the Wikipedia mobile site were to the English Wikipedia; now that number is 54%. In the Global South in particular, traffic to the mobile sites for certain languages has grown tremendously. Some examples include Portuguese (from 3.9M to 27.4M), Arabic (from 1.7M to 10.2M), and Turkish (from 1.0M to 9.0M). As our partnership programs roll out to allow hundreds of millions to access Wikipedia on their mobile devices without incurring data charges, we expect mobile use to be even more globally distributed over the coming year.

The work on mobile, from both the tech and global development side, is not slowing down in the least however. There’s a lot more to come, but it’s worth taking a moment to recognize the mark we’ve reached, and to thank every community and staff member who played a part.

On behalf of the Mobile Team (Tomasz Finc, Patrick Reilly, Arthur Richards, Jon Robson, André Engels, Kul Wadhwa, Mani Pande, Amit Kapoor, Yuvaraj Pandian, Max Semenik, Phil Chang, Dan Foy):

Amit Kapoor, Senior Manager, Mobile Partnerships

Amit Kapoor — 04. May 2012, 01:00

03. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

Reaching out to the world, one embassy at a time

Washington, DC, is a global hub for culture and knowledge. This is embodied in its numerous colleges and universities, more than 30 museums and the world’s largest library, containing over 29 million books.

But there is a fourth kind of cultural and educational resource within the city that is sometimes overlooked—Washington‘s 170+ embassies and diplomatic missions. These embassies are hidden gems of knowledge, housing cultural artifacts and works of art, and hosting numerous educational and cultural events, particularly in May, which DC Mayor Vincent Gray has declared as the city’s “International Cultural Awareness Month,” in order to showcase the value these embassies bring to the city.

In an effort to capture the intellectual energy and highlight the cultural and educational resources of these international institutions, Wikimedia District of Columbia (Wikimedia DC) last week kicked off its Embassy Outreach Initiative (EOI) with an inaugural event held in partnership with the Washington European Society and the Estonian Embassy in Washington.

The event, hosted at the Estonian Embassy, featured a discussion on global Internet freedom efforts with Danny Weitzner, Deputy CTO for Internet Policy at the White House Office of Science & Technology Policy; Chairman Marko Mihkelson, Chairman of the Foreign Affairs Committee of the Estonian Parliament; Ian Schuler, Senior Manager for Internet Freedom Programs at the US State Department; and Rebecca MacKinnon, Bernard L. Schwartz senior fellow at the New America Foundation and a member of the Wikimedia Foundation Advisory Board. Adam Kushner, Deputy Editor of the National Journal, moderated the discussion.

From L to R, Weitzner, Mihkelson, Schuler, and MacKinnon. CC-BY-SA

At the heart of EOI is an effort to foster an international dialogue around Wikimedia DC’s and the Foundation’s vision. In that sense, the choice of the Estonian Embassy as the debut venue for EOI was not coincidental. Estonia currently ranks as the number one country for Internet freedom by the DC-based NGO Freedom House. Not only do tech, Internet companies, startups (think Skype) and knowledge initiatives thrive in Estonia, but so does the Estonian Wikipedia. Its nearly 95,000 articles, and 8.1 million monthly page views, may seem small compared to the English Wikipedia, but considering the country’s population stands at slightly over 1.3 million, these numbers are very substantial.

The global Wikimedia community will be coming to Washington, DC, this summer for Wikimania 2012, providing the city with an opportunity to witness how the world collaborates in pursuit of free global knowledge. Before these international delegates arrive, and long after they have returned home, Washington, DC has always been and will always remain a great place to promote international dialogue in support of shared knowledge. That is the ultimate goal of Wikimedia DC’s outreach and program efforts–like EOI and LibraryLab: they utilize the potential for collaboration that is present within the city and make a positive and lasting impact on global knowledge.

Nicholas Michael Bashour, President, Wikimedia District of Columbia

Nicholas Michael Bashour — 03. May 2012, 15:45

02. May 2012

- Wikimedia Foundation - (anglicky)

- Wikimedia Foundation - (anglicky)

Wikimedia Research Newsletter, April 2012

Wikimedia Research Newsletter
Wikimedia Research Newsletter Logo.png


Vol: 2 • Issue: 4 • April 2012 [archives] Syndicate the Wikimedia Research Newsletter feed

Barnstars work; Wiktionary assessed; cleanup tags counted; finding expert admins; discussion peaks; Wikipedia citations in academic publications; and more

With contributions by: Lambiam, Piotrus, Jodi.a.schneider, Amir E. Aharoni, DarTar, Tbayer, Steven Walling, Junkie.dolphin and Protonk

Contents

Recognition may sustain user participation

The relative number of edits by Wikipedians who had randomly received barnstars (red) and by the control group whose members hadn’t (blue).

To gain insight in what makes Wikipedia tick, two researchers from the Sociology Department at Stony Brook University conducted an experiment with barnstars.[1] They were surprised by what they found.

Professor Arnout van de Rijt and graduate student Michael Restivo wanted to test the hypothesis according to which receiving recognition for one’s work in an informal peer-based environment such as Wikipedia has a positive effect on productivity. To test their hypothesis, they determined the top 1% most productive English Wikipedia users among the currently active editors who had yet to receive their first barnstar. From that group they took a random sample of 200 users. Then they randomly split the sample into an experimental group and a control group, each consisting of 100 users. They awarded a barnstar to each user in the experimental group; the users in the control group were not given a barnstar. The researchers found their hypothesis confirmed: the productivity of the users in the experimental group was significantly higher than that of the control group. What really took the researchers by surprise was how long-lasting the effect was. They followed the two groups for 90 days, observing that the increase in contribution level for the group of barnstar recipients persisted, almost unabated, for the full observation period.

One major factor the experiment did not take into account was whether it mattered who delivered barnstars and whether they were anonymous, registered, or known members of the Wikipedia community. During the experiment, it was noted on the Administrator’s noticeboard/Incidents page that a seemingly random IP editor was “handing out barnstars”, which led to some suspicion from Wikipedians. The thread was closed after User:Mike Restivo confirmed he accidentally logged out when delivering the barnstars. He did not, however, declare his status as a researcher, and the group’s paper does not disclose that the behavior was considered unusual enough to warrant such a discussion thread.

Can Wiktionary rival traditional lexicons?

Wiktionary received an extensive assessment[2] as a potential rival to expert-built lexicons.

A chapter titled “Wiktionary: a new rival for expert-built lexicons?”[2] in a collection on electronic lexicography to appear with Oxford University Press contains a description and critical assessment of Wikipedia’s second oldest sister project (which will celebrate its 10th anniversary in December this year) – subtitled “Exploring the possibilities of collaborative lexicography”, which it calls a “fundamentally new paradigm for compiling lexicons”.

The article describes in detail the technical and community features of Wiktionary. Though it is not immediately clear, the article’s focus is on several language editions and not just English (as often happens in research about Wikipedia and its sister projects). The article gives a comprehensive account of the coverage of the world’s languages by the various Wiktionary language editions. There is a critical analysis of Wiktionary’s content, first with what appears to be a thorough statistical comparison with other dictionaries and wordnets, including an examination of the overlaps in the lexemes covered, which the authors found to be surprisingly small.

Number of native terms (p.17) Wiktionary wordnets Roget’s Thesaurus OpenThesaurus
English language 352,865 148,730 (WordNet) 59,391
German language 83,399 85,211 (GermaNet) 58,208
Russian language 133,435 130,062 (Russian WordNet)

The article notes an important characteristic of open wiki projects: they allow “updating of the lexicons immediately, without being restricted to certain release cycles as is the case for expert-built lexicons” (p. 18). Though this characteristic is obvious to experienced Wikimedians, it is frequently overlooked. The discussion of the organization of polysemy and homonymy is comprehensive, although limited to the English Wiktionary. Other language editions may do it differently. The article notes that “it is a serious problem to distinguish well-crafted entries from those that need substantial revision by the community”, which is good constructive criticism. The paragraphs about “sense ordering” make some vague claims (e.g. “Although there is no specific guideline for the sense ordering in Wiktionary, we observed that the first entry is often the most frequently used one”) which could be interesting and useful from a community perspective, but offers little actionable evidence and should be investigated further. The paper’s conclusions identify some of the features that enable Wiktionary to rival expert-built lexicons: “We believe that its unique structure and collaboratively constructed contents are particularly useful for a wide range of dictionary users”, listing eight such groups – among them “Laypeople who want to quickly look up the definition of an unknown term or search for a forum to ask a question on a certain usage or meaning.”

On a critical note, the last paragraph says “we believe that collaborative lexicography will not replace traditional lexicographic theories, but will provide a different viewpoint that can improve and contribute to the lexicography of the future. Thus, Wiktionary is a rival to expert-built lexicons – no more, no less”, which sounds a bit contradictory. The authors also note that “Lepore (2006: 87) raised a criticism about the large-scale import of lexicon entries from copyright-expired dictionaries such as Webster’s New International Dictionary”. It would be nice if the authors would write at least a short explanation of the problem that Lepore described. But the actual article[3] mentions Wiktionary only very briefly. For the most part, the article is a good academic-grade presentation of Wiktionary: it is very general and does not dive too much into details; it makes a few vague statements, but they present a good starting point for further research.

Wikipedia as an academic publisher?

Can Wikipedia integrate with open-access scholarly publishing?

Xiao and Askin (2012) looked at whether academic papers could be published on Wikipedia.[4] The paper compares the publishing process on Wikipedia to that of an open-access journal, concluding that Wikipedia’s model of publishing research seems superior, particularly in terms of publicity, cost and timeliness.

The biggest challenges for academic contributions to Wikipedia, they found, revolve around the level of acceptance of Wikipedia in academia, poor integration with academic databases, and technical and conceptual differences between an academic article and an encyclopedic one. However, the paper suffers from several problems. It correctly observes that the closest a Wikipedia article comes to a “final”, fully peer-reviewed status is after having passed the featured article candidate process, but makes no mention of intermediary steps in Wikipedia’s assessment project, such as B-class, Good Article and A-class reviews; nor is the the assessment project itself mentioned. Despite its focus on the featured-article process, no previous academic work on featured articles is cited (although quite a few have been published). Crucially, the paper disregards the most relevant of Wikipedia’s policies, no original research. Thus, the study fails to consider whether Wikipedia would want to publish academic articles without their undergoing changes to bring them closer to encyclopedic style – a topic that already has become an issue numerous times on the site, in particular regarding difficulties encountered by some educational projects. In the end, the paper, while a well-intentioned piece, seems to illustrate that university researchers can have a quite different understanding of what Wikipedia is than those more closely connected with the project.

In other news, however, a scientific journal appears to have found a viable way to publish peer-reviewed articles on Wikipedia: The open access journal PLoS Computational Biology has announced[5] that it is starting to publish “Topic Pages” – peer-reviewed texts about specific topics, which are published both in the journal and as a new article on Wikipedia. It is hoped that the Wikipedia versions will be updated and improved by the Wikipedia community. The first example is about circular permutation in proteins.

Wikipedia citations in American law reviews

Volume 1 of the Harvard Law Review (1887–1888).

The article “A Jester’s Promenade: Citations to Wikipedia in Law Reviews , 2002–2008″ concerns the issue of citations of Wikipedia in US law reviews and the appropriateness of this practice.[6] The article seems to be well researched, and its author, law reference/research librarian Daniel J. Baker, demonstrates familiarity with the mechanics of Wikipedia (such as the permanent links). For the period 2002–08, Baker identified 1540 law-review articles that contain at least one citation of Wikipedia – most in law reviews dealing with general and “popular” subject matter, with a significant proportion originating from authors with academic credentials.

The article notes that 2006 marked the peak of that trend, attributing it (thereby demonstrating some familiarity with Wikipedia’s history) to a delayed reaction to the Seigenthaler incident and the Essjay Controversy. (Since the article’s data analysis ends in 2008, the question of whether this trend has rebounded in recent years is left unanswered.)

The author is highly critical of Wikipedia’s reliability, arguing that a source that “anyone can edit” – and where much of the information is not verified – should not be used in works that may influence legal decisions. Thus Baker calls for stricter rules in legal publishing, in particular that Wikipedia should not be cited. In a more surprising argument, the paper suggests that if information exists on Wikipedia, it should be treated as common knowledge, and thus does not require referencing (a recommendation that follows a 2009 one – Brett Deforest Maxfield, “Ethics, politics and securities law: how unethical people are using politics to undermine the integrity of our courts and financial markets”, 35 OHIO N.U. L. REV. 243, 293 (2009)). This argument does, however, raise the question of whether no citation at all is truly better than a citation to Wikipedia; if such a recommendation were followed, it could lead to a proliferation of uncited claims in law review journals that would be assumed (without any verification) to rely on “common knowledge” as represented in the “do not cite” Wikipedia.

One in four of articles tagged as flawed, most often for verifiability issues

A paper titled “A Breakdown of Quality Flaws in Wikipedia”[7] examines cleanup tags on the English Wikipedia (using a January 2011 dump), finding that 27.53% of articles are tagged with at least one of altogether 388 different cleanup templates. In a 2011 conference poster [8] (a version of which was summarized in an earlier edition of this newsletter), the authors analyzed – together with a third collaborator – a 2010 dump of the English Wikipedia for a smaller set of tags, arriving at much lower ratio: “8.52% [of articles] have been tagged to contain at least one of the 70 flaws”. Using a classification of Wikipedia articles into 24 overlapping topic areas (derived from Category:Main topic classifications), the highest ratio of tagged articles were found in the “Computers” (48.51%), “Belief” (46.33%) and “Business” (39.99%) topics; the lowest were in “Geography” (19.83%), “Agriculture” (22.57%) and “Nature” (23.93%). Of the 388 tags on the more complete list, “307 refer to an article as a whole and 81 to a particular text fragment”. As another original contribution of the paper, the authors offer an organization of the existing cleanup tags into “12 general flaw types” – the most frequent being “Verifiability” (19.46% of articles have been tagged with one of the corresponding templates), “Wiki tech” (e.g. the “orphan”, “wikify” or “uncategorized” templates; 5.47% of articles) and “General cleanup” (2.01%).

Time evolution of Wikipedia discussions

Kaltenbrunner and Laniado look at the time evolution of Wikipedia discussions, and how it correlates to editing activity, based on 9.4 million comments from the March 12, 2010 dump.[9] Peaks in commenting and peaks in editing often co-occur (for sufficiently large peaks of 20 comments, 63% of the time) within two days. They show the articles with the longest comment peaks and most edit peaks, and the 20 slowest and 20 fastest discussions.

The authors note that a single, heavy editor can be responsible for edit peaks but not comment peaks; peaks in the discussion activity seem to indicate more widespread interest by multiple people. They find that “the fastest growing discussions are more likely to have long lasting edit peaks” and that some editing peaks are associated with event anniversaries. They use the Barack Obama article as a case study, showing peaks in comments and editing due to news events as well as to internal Wikipedia events (such as an editor poll or article protection). Current events are often edited and discussed in nearly real-time in contrast to articles about historical or scientific facts.

They use the h-index to assess the complexity of a discussion, and they chart the growth rate of the discussions. For instance, they find that the discussion pages of the three most recent US Presidents show a constant growth in complexity but that the rate of growth varies: Bill Clinton‘s talk page took 332 days to increase h-index by one, while George W. Bush‘s took only 71 days.

They envision more sophisticated algorithms showing the relative growth in edits and discussions. Their ideas for future work are intriguing – for instance, the question of how to determine article maturity and the level of consensus, based on the network dynamics. (AcaWiki summary)

APWeb2012 papers on admin networks, mitigating language bias and finding “minority information”

Several of the accepted papers of this month’s Asia-Pacific Web Conference APWeb2012 concerned Wikipedia:

  • Prototype tool searches for expert admins: In the article “Exploration and Visualization of Administrator Network in Wikipedia”,[10] four Chinese authors examine the collaboration graph of administrators on the English Wikipedia (where two of them are connected by an edge if they have edited the same article during the sampled time span from January 2010 to January 2011), and “define six features to reflect the characteristics of administrator’s work from different respects including diversity of the admin user interests, the influence & importance across the network, and longevity & activity in terms of contribution.” The authors observe that the recognition of an admin’s work by other users in the form of barnstars seems to agree with the overall rank they calculate from these quantities: “By analyzing the profiles of the top ranked fifty admin users as a test case, it has been observed that the number of barn stars received by them also follows the similar trend as we overall ranked the admin users.”To extract topics from an admin’s history and define diversity, Latent Dirichlet allocation (LDA) is used. The authors describe a prototype software called “Administrator Exploration Prototype System”, which displays these various quantitative measures for an admin and allows ranking them. In particular, it “will automatically find the expert authors based on the editing history of each admin user”. An example screenshot shows a list of results for a “Search for “Expert Admin User” for the keywords “Music, Songs, Singers”, topped by Michig, Mike Selinker and Bearcat. Analyzing the whole network, the authors find a “decreasing trend of the clustering coefficient [which] can also be seen as a symptom of the growing centralization of the network.” Overall, they observe that “the administrator network is a healthy small world community having a small average distances and a strong centralization of the network around some hubs/stars is observed. This shows a considerable nucleus of very active administrators who seems to be omnipresent.”

Not what the majority of readers search for under “football”: A goalball game

Detecting “minority information” on Wikipedia: A paper[11] by two Japanese researchers proposes “a method of searching for minority information that is less-acknowledged and has less popularity in Wikipedia” for a given keyword. “For example, if the user inputs ‘football’ as a majority information keyword, then the system seeks articles having a sentence of “….looks like football….” or similar content of articles about soccer in Wikipedia. It extracts as candidates for minority sports those articles which have few edits and few editors. Then, it performs sports filtering and extracts minority articles from the candidates. In this case, the results are ‘Bandy’, ‘Goalball’, and ‘Cuju’.” The authors constructed a prototype system and tested it.

  • Completing Wikipedia articles with information from other language versions: In an article titled “Extracting Difference Information from Multilingual Wikipedia”[12], four Japanese researchers describe a “method for extracting information which exists in one language version [of Wikipedia], but which does not exist in another language version. Our method specifically examines the link graph of Wikipedia and structure of an article of Wikipedia. Then we extract comparison target articles of Wikipedia using our proposed degree of relevance.” As motivating example, they note that the English Wikipedia’s coverage of the game cricket is much fuller than the Japanese Wikipedia’s, but spread over separate articles beyond just the main one at cricket. The goal is a system where a (Japanese) user can enter a keyword and will receive the “Japanese article with sections of English articles that do not appear in the Japanese article”.
  • Briefly

    The proportion of “good faith” and “golden” editors among new contributors over time has remained constant.[13]

    Unchanged quality of new user contributions over time. GroupLens PhD candidate Aaron Halfaker (who also collaborates with the Wikimedia Foundation as a contractor research analyst) shared some preliminary results on the quality of new user contributions,[13] part of a larger study currently submitted for publication. The results, based on an analysis of revert rates in the English Wikipedia combined with blind assessment of a new editor contribution history, indicate that new editors have produced the same level of quality in their first contributions since 2006. Despite the fact that “the majority of new editors are not out to obviously harm the encyclopedia (~80 percent), and many of them are leaving valuable contributions to the project in their first editing session (~40 percent)”, today’s user experience for a first-time editor is much more hostile than it used to be, as “the rate of rejection of all good-faith new editors’ first contributions has been rising steadily, and, accordingly, retention rates have fallen. These results challenge the hypothesis that today’s newbies produce much lower quality contributions than in earlier years.

  • Modeling Wikipedia’s community formation processes. An important factor behind the success of Wikipedia is its own internal culture. Like any social group, a community of peer production has its own rules, norms, and customs. Unlike traditional social groups — a recently-defended doctoral dissertation in computer science argues — the process of formation of these traits involves, and often determines, how contents are being produced. The dissertation, defended by former Summer of Research fellow and Wikimedia Foundation contractor analyst Giovanni Luca Ciampaglia,[14] uses computer simulation to study how the community of Wikipedia may have formed its specific cultural traits and distinctive sociological features. Starting from the distribution of user account lifespan in five of the largest Wikipedia communities (English, German, Italian, French, and Portuguese) this work shows how the statistical patterns of the data can be reproduced by a simple model of cultural formation based on principles taken from self-categorization theory and social judgment theory.
  • Distribution of AFT ratings for articles in different project quality assessment categories.[15]

    The research finds that an important factor to determine whether a community will be able to sustain itself and thrive is the degree of openness of individual users towards differing points of view, which may be critical in the early stages of user participation, when a newcomer first enters in contact with the body of social norms that the community has devised. The thesis concludes that simulation techniques, when supplemented with empirical methods and quantitative calibration, may become an important tool for conducting sociological studies.

  • Matching reader feedback via the Article Feedback Tool to editor peer review: An upcoming presentation at Wikimania 2012[15] compares data gathered from the Article Feedback Tool (AFT) version 4 on the English Wikipedia over summer 2011 to ratings assigned by various peer review processes, e.g. good and featured articles. As might be expected, articles at any point in the peer review process tend to be rated more highly by reviewers, but this distinction is highly sensitive to the article length. Once length is accounted for (using a variety of methods), the differences between demoted or not promoted articles and unrated articles disappears. The research also offers a broad snapshot of the AFT dataset as well as some suggestions for future AFT design. Future revisions of the draft as well as the presentation will approach the dynamic relationship between peer reviewed status and reader feedback, exploiting entry and exit into various categories for identification.
  • Referencing of Wikipedia in academic works is continuing unabated: An article in the “Research Trends” newsletter published by the bibliographical database Scopus, titled “The influence of free encyclopedias on science”[16] charts the number of papers in Scopus that are either about Wikipedia or cite it. Considering that Wikipedia was only founded in 2001 (i.e. that these numbers have necessarily started from zero right before the observed timespan), the author’s astonishment at the compound annual growth rates for both kinds of papers from 2002 to 2011 (which she calls “staggering” and “unbelievable”, respectively) is somewhat surprising, but the article also gives the growth rates for the five years from 2007 to 2011 (ca. 19% per year for Wikipedia as a subject, ca. 31% per year for Wikipedia as a reference). Interestingly, Scholarpedia is showing itself to be the second most popular online encyclopedia to be cited, if lagging significantly behind Wikipedia (5%).
  • Using Wikipedia to drive traffic to library collections: In an article titled “Wikipedia Lover, Not a Hater: Harnessing Wikipedia to Increase the Discoverability of Library Resources”[17] in the Journal of Web Librarianship, two librarians from the University of Houston Libraries and a former intern report how they had successfully used Wikipedia to drive traffic to the collection of the institution’s digital services department (UHDS), proceeding from merely inserting links into articles to uploading images from the collection to Commons (which still contain such link on the file description pages): “Originally, UHDS intended to contribute exclusively to the External Links section of existing Wikipedia articles. [However, over time] UHDS staff found it was much more effective to match digital items with Wikipedia articles and to share those items in Wikimedia Commons (WMC) rather than (or in addition to) the External Links section of the articles.” While few statistics are given, the authors emphasize the effectiveness of their actions, observed already for the very first attempts: “Within hours of posting external links to existing Wikipedia articles, the digital library received hits to those collections at a surprisingly high rate.” As an example of an article enriched with such images, the entry 1915 Galveston hurricane is named. Among the successful additions to external links section is the article about former US president George H. W. Bush, where the student intern linked a photograph showing Bush shaking hands with former University of Houston chancellor Philip G. Hoffman (as already noted in the Signpost’s April 2011 coverage after the authors had presented their project at the annual meeting of the Association of College and Research Libraries: “Experts and GLAMs – contributing content or ‘just’ links to Wikipedia?“). Much of the paper describes basic technicalities of Wikipedia: The uploading of image, the use of contributions lists, talk pages and watchlists. While Wikipedia’s external links guidelines are not cited in the paper, it notes that “contributing effectively to Wikipedia and WMC entailed a steep learning curve in order to align contributions with the granular and well-enforced Wikipedia guidelines for use”, and among them notices policies against advertising. As one unresolved problem for such institutional usage of Wikipedia and Commons, the paper describes the prohibition “to share an editor username with other editors, and [that] organizational usernames are considered a violation of Wikipedia guidelines forbidding the promotion of organizations. When the pilot project transitioned into a permanent departmental program, UHDS staff struggled to devise a way that others on staff could continue to monitor previous edits and uploads and create new ones”, e.g. due to the lack of shared watchlists.
  • Weekly and daily activity patterns discern Wikipedia from commercial sites: Two Finnish researchers analyzed[18] the distribution of timestamps in the recent changes RSS feed from four different language versions (Arabic, Finnish, Korean, and Swedish – Arabic having been chosen because its speakers are spread over “a very wide range of timezones”, in contrast to the other three), and RSS new feeds from BBC World News “and the leading Finnish daily newspaper Helsingin Sanomat“. As the main difference between the activity on those two sites (which the authors describe as “commercial news sites”) and on the Wikipedias, it was found that Wikipedia edits “distribute fairly equally over all days in all cases. The drop of activity on weekends that occurred with the commercial news services is not visible in the Wikipedias, quite the opposite, with Sundays typically seeing the highest average level of activity. Only the Arabic version has a slightly lower activity rate in Sundays, however, we should remember the fact that in Arabic countries the weekend falls on Friday-Saturday or in some countries on Thursday-Friday”. The diurnal patterns are found to be “more spread out” on Wikipedia, where “the activity levels follow natural diurnal rhythms. Interestingly, a great number of changes are made during working hours, which leads us to 2 different, but not mutually exclusive, conjectures about the people who edit Wikipedia. Either, the editors are people with “free” time during the day, e.g., students, or people actually edit Wikipedia during the working hours at work. Our methodology is not able to answer this question”.
    Furthermore, the authors offer a rather far-reaching but (if proven) significant conjecture based on their date: “Cultural and geographical differences in the Wikipedias we studied seemed to have very little effect on the level of activity. This leads us to speculate that the ‘trait’ of editing Wikipedia is something to which individuals are drawn, not something specific to certain cultures.”
    Last year, papers by two other teams (covered in the September issue of this newsletter: Wikipedians’ weekends in international comparison“, but missing from the “Related work” section of the present paper) had similarly examined daily and weekly patterns on Wikipedia, coming to other results – in particular, different language Wikipedias showed different weekly patterns.
  • Simple English Wikipedia is only partially simpler/controversy reduces complexity: “A practical approach to language complexity: a Wikipedia case study”[19] analyzed samples of articles from the English Wikipedia and the Simple English Wikipedia from the end of 2010 with respect to the Gunning fog index as well as other measures for language complexity. Comparing them with other corpora including Charles Dickens‘ books, they observe that “Remarkably, the fog index of Simple English Wikipedia is higher than that of Dickens, whose writing style is sophisticated but doesn’t rely on the use of longer latinate words which are hard to avoid in an encyclopedia. The British National Corpus, which is a reasonable approximation to what we would want to think of as ‘English in general’ is a third of the way between Simple and Main, demonstrating the accomplishments of Simple editors, who pushed Simple half as much below average complexity as the encyclopedia genre pushes Main above it.” However, the number of distinct tokens used (a measure for vocabulary richness) is almost the same on the English and Simple Wikipedia (the samples were chosen to be of the same size). Still “detailed analysis of longer units (n-grams rather than words alone) shows that the language of Simple is indeed less complex”. In another finding, the authors “investigate the relation between conflict and language complexity by analysing the content of the talk pages associated to controversial and peacefully developing articles, concluding that controversy has the effect of reducing language complexity.”
  • Contributions from South America. “Mapping Wikipedia edits from South America”,[20] the latest from a series of studies and visualizations by Oxford Internet Institute researcher Mark Graham and his team, reports that almost half of all edits to Wikipedia from South America come from Brazil, which is unsurprising considering that the largest population of Internet users in South America lives in Brazil. More interestingly, Chile –- a country with only 5-6% of the continent’s Internet population — contributes more than 12% of edits to Wikipedia.
  • Deaths generate edit bursts: A student paper titled “Death and Change Tracking : Wikipedia Edit Bursts”[21] examines the editing activity in nine articles about celebrity actors on the English Wikipedia after they died.
  • Searching by example: This month’s WWW 2012 conference in Lyon, France saw a demo titled “SWiPE: Searching Wikipedia by Example”[22], showcasing a tool where the user can search for articles similar to a given one by modifying entries in that article’s infobox, and also ask questions in natural language.
  • Wikipedia in the eyes of PR professionals. A study published in the journal of the Public Relations Society of America (PRSA)[23] surveyed public relations and communications professionals about their perception of Wikipedia contribution and conflict of interest. The online survey was pilot-tested with members of the Corporate Representatives for Ethical Wikipedia Engagement (many of whom have recently pushed for Wikipedia to let PR professionals edit articles about their clients to a greater extent) and produced 1,284 usable responses after being disseminated via various outlets. The results indicate that “of the 35% who had engaged with Wikipedia, most did so by making edits directly on the Wikipedia articles of their companies or clients”. The response time to issues reported on talk pages was found to be one of the important barriers in the interaction between Wikipedia community members and PR professionals. The author observes that “when the wait becomes too long, the content is defamatory, or a dispute with a Wikipedian needs to be elevated, there are resources to help. Unfortunately, only a small percentage of the respondents in this study had used them and many had never heard of these resources”. As another argument against the “bright line” rule advocated by Wikipedia’s Jimmy Wales (which says that PR professionals should not edit Wikipedia articles they are involved in), a separate result of the paper has been offered, which has met with heavy criticism by Wikimedians regarding statistical biases and other issues (see e.g. last week’s Signpost coverage: “Spin doctors spin Jimmy’s ‘bright line’“): 32% of the respondents said that “there are currently factual errors on their company or client’s Wikipedia articles”, corresponding to 41% of those respondents who said that such articles existed, or 60% of those respondents who said that such articles existed but did not reply “don’t know” to that question. The press releases of the author’s college and of PRSA interpreted the result as “Sixty percent of Wikipedia articles about companies contain factual errors”, although the latter was updated after the criticism “to clarify the survey findings described in this press release and help prevent any misinterpretation of the data that this release may have caused”.
  • Wikipedia coverage of marketing terms found accurate: The proceedings of the recent “International Collegiate Conference Faculty” of the American Marketing Association (AMA) offer a more positive view on Wikipedia from PR professionals: “Is Wikipedia A Reliable Tool for Marketing Educators and Students? A Surprising Heck Yes!”.[24] The paper chose a more systematic way to examine the quality of Wikipedia articles than the PRSA study and focused on AMA’s area of expertise, starting out from a “random sample of marketing glossary terms [that] were collected from 3 marketing management textbooks and 4 marketing principles textbooks”, and rating corresponding Wikipedia entries from 1 to 3 according to a standard procedure for content analysis: “Each textbook definition was compared to the corresponding Wikipedia definition and rated using a 3-point Likert scale where 1=Correct Definition, 2=Correct but difficult to find the term or the definition was not easy to decipher, or 3=Incorrect definition when compared to the textbook term.”. Of 459 items in the eventual sample only five were rated 3, and “the average score across all textbooks was a 1.18 demonstrating Wikipedia is an accurate source of marketing content.”
  • Wikipedia’s osteosarcoma coverage assessed: An abstract published in the Journal of Bone & Joint Surgery[25] finds “that the quality of osteosarcoma-related information found in English Wikipedia is good but inferior to the patient information provided by the National Cancer Institute”. The abstract refers to a study and results that appear to be identical to the one reported in a 2010 viewpoint article in the Journal of the American Medical Informatics Association (JAMIA) (Signpost coverage).
  • Wikipedia assignments for Finnish school students: A paper by three Finnish authors[26] describes course assignments to upper secondary school students (age 16–18) involving “writing articles for Wikipedia (a public wiki) and for the school’s own wiki”, in subject areas including biology, geography and Finnish history. In particular the paper reports that “a carefully planned library [visit] can help to activate students to use printed materials in their source-based writing assignments. [And that our] findings corroborate the generally held view that students tend to copy-paste and plagiarise, especially when exploiting Web sources.”
  • Wikipedia as a thermodynamic system – becoming more efficient over time: A paper titled “Thermodynamic Principles in Social Collaborations”[27] (presented at this month’s Collective Intelligence 2012 conference) applies principles and concepts from Statistical mechanics to the collaboration on (the English) Wikipedia. The analogy is based on interpreting the edit count of a user as the energy level of a particle, positing a “logarithmic energy model” for edits which assumes a “decreasing effort required for a given user to make additional edits in a relatively short period of time (e.g., one month) or to a particular page”. (According to the authors this contrasts with two other theories which also explain the observed power law distribution of edit counts: The “Wikipedia editors are ‘born’” notion, which assumes that different users need to expend different amounts of energy on the same kind of edits due to “an extreme heterogeneity of preference among the potential user population”, and the “Wikipedia editors are ‘made’” notion, which sees positive or negative feedback from other users as the defining influence.) Using the analogy, the authors define the entropy, free energy, temperature, entropy efficiency and entropy reduction of an editing community and their edits during a particular timespan. They then calculate the latter two for each month in the English Wikipedia’s history from January 2002 to December 2009. They conclude that “Wikipedia has become more efficient in terms of entropy efficiency, and more ordered according to entropy reduction. The increasing power-law coefficient causes the shift of the contributions from elites to crowd. The saturation of free energy reduction ratio may cause the saturation of the active editors.” The next section finds that “entropy efficiency is correlated with the quality of the social collaboration”, and one figure is interpreted as implying “that the nature of Wikipedia is a true media of the masses, where pages produced by crowd wisdom will have higher quality and thus more readership compared to that produced by a few elites.”
  • Too many docs don’t spoil the broth: Another paper[28] presented at the Collective Intelligence 2012 conference similarly found “that the number of contributors has a curvilinear relationship to information quality, more contributors improving quality but only up to a certain point” – based on an examination of 16,068 articles in the realm of the WikiProject Medicine.
  • The most influential biographies vary depending on the language/culture: Barcelona Media Foundation studied “the most influential characters” in the 15 largest language Wikipedias[29], by asking which biographies are the most linked to (“central”) from other Wikipedia biography articles. Political and artistic biographies are the most central, and the particular biographies depend on the language. They found, for instance, that Shakespeare’s biography is among the most important for Russian, Chinese, Spanish, and Dutch, but not for English. And they estimated the Jaccard similarity coefficient (similarity) between the social networks in different language editions: most similarity can be explained by language-family and geographical or historical ties. One interesting finding is that Dutch “seems to serve as a bridge between different language and cultural groups”. Some social connections are very common,and they produce a graph of the connections found in at least 13 of the language editions. The authors note that articles on people from non-Anglo-Saxon cultures may be missing if they are not known internationally, since the initial list of notable people is extracted from DBPedia. A blog post on Technology Review highlighted the fact that in the paper’s table of most connected biographies (listing the top 5 from 15 language versions), among the 75 entries “only three are women: Queen Elizabeth II, Marilyn Monroe and Margaret Thatcher” , which it interprets as one of “The Worrying Consequences of the Wikipedia Gender Gap“. (Summary at AcaWiki)
  • Biographical social network of the connections between persons present in at least 13 of the 15 largest language Wikipedias, as described in Aragón et al.[29]

    References

    1. Restivo, M. & van de Rijt, A. (2012). Experimental Study of Informal Rewards in Peer Production. PLoS ONE 7(3): e34358. PDFDOI Open access
    2. a b Meyer, C. M., & Gurevych, I. (2012). Wiktionary: a new rival for expert-built lexicons? Exploring the possibilities of collaborative lexicography. In S. Granger & M. Paquot (Eds.), Electronic Lexicography. Oxford: Oxford University Press. PDF Open access
    3. Lepore, J. (2006). Noah’s Mark, The New Yorker, November 6, 2006, pp. 78-86. HTML Open access
    4. Xiao, L., & Askin, N. (2012). Wikipedia for Academic Publishing: Advantages and Challenges. Online Information Review, 36(3), 2. Emerald Group Publishing Limited. HTML Closed access
    5. (2012) “Topic Pages: PLoS Computational Biology Meets Wikipedia”. PLoS Computational Biology 8 (3): e1002446. doi:10.1371/journal.pcbi.1002446.  Open access
    6. Baker, D. J. (2012). A Jester’s Promenade: Citations to Wikipedia in Law Reviews, 2002–2008. I/S: A Journal of Law and Policy for the Information Society, 7(2):1–44. PDF Open access
    7. Anderka, M., & Stein, B. (2012). A breakdown of quality flaws in Wikipedia. Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web Quality – WebQuality ’12 (p. 11). New York: ACM Press. DOIPDF Open access
    8. Anderka, M., Stein, B., & Lipka, N. (2011). Towards automatic quality assurance in Wikipedia. Proceedings of the 20th international conference companion on World Wide Web – WWW ’11. New York: ACM Press. DOIPDF Open access
    9. Kaltenbrunner, A., & Laniado, D. (2012). There is No Deadline – Time Evolution of Wikipedia Discussions. ArXiV. Computers and Society; Physics and Society. PDF Open access
    10. Yousaf, J., Li, J., Zhang, H., & Hou, L. (2012). Exploration and Visualization of Administrator Network in Wikipedia. In: Q. Z. Sheng, G. Wang, C. S. Jensen, & G. Xu (Eds.), Web Technologies and Applications, Lecture Notes in Computer Science 7235:46-59. Berlin, Heidelberg: Springer Berlin Heidelberg. DOI Closed access
    11. Hattori, Y., & Nadamoto, A. (2012). Search for Minority Information from Wikipedia Based on Similarity of Majority Information. In: Q. Z. Sheng, G. Wang, C. S. Jensen, & G. Xu (Eds.) Web Technologies and Applications, Lecture Notes in Computer Science 7235:158-169. Berlin, Heidelberg: Springer Berlin Heidelberg. DOI Closed access
    12. Fujiwara, Y., Suzuki, Y., Konishi, Y., Nadamoto, A., Sheng, Q., Wang, G., Jensen, C., et al. (2012). Extracting Difference Information from Multilingual Wikipedia. In: Q. Z. Sheng, G. Wang, C. S. Jensen, & G. Xu (Eds.) Web Technologies and Applications, Lecture Notes in Computer Science 7235:496-503. Berlin, Heidelberg: Springer Berlin Heidelberg. DOIClosed access
    13. a b Halfaker, A. (2012). Kids these days: the quality of new Wikipedia editors over time. Wikimedia Foundation blog. HTML Open access
    14. Ciampaglia, G. L. (2011). User participation and community formation in peer production systems. PhD Thesis, Università della Svizzera Italiana PDF Open access
    15. a b Hyland, A. (2012). Comparing article quality by article class and article feedback ratings. Wikipedia. HTML Open access
    16. Huggett, S. (2012). The influence of free encyclopedias on science. Research Trends, (27). HTMLOpen access
    17. Elder, D., Westbrook, R. N., & Reilly, M. (2012). Wikipedia Lover, Not a Hater: Harnessing Wikipedia to Increase the Discoverability of Library Resources. Journal of Web Librarianship, 6(1), 32-44. Routledge. DOI Closed access
    18. Karkulahti, O., & Kangasharju, J. (2012). Surveying Wikipedia activity: Collaboration, commercialism, and culture. The International Conference on Information Network 2012 (pp. 384-389). IEEE. DOI Closed access
    19. Yasseri, T., Kornai, A., Kertész, J. (2012). A practical approach to language complexity: a Wikipedia case study. ArXiv. Computation and Language. PDF Open access
    20. Graham, M. (2012). Mapping Wikipedia edits from South America. Zero Geography. HTML Open access
    21. Lincoln, M. (2012). Death and Change Tracking : Wikipedia Edit Bursts. PDF Open access
    22. Atzori, M., & Zaniolo, C. (2012). SWiPE: Searching wikipedia by example. Proceedings of the 21st international conference companion on World Wide Web – WWW ’12 Companion (p. 309). New York, New York, USA: ACM Press. DOIPDF Open access
    23. DiStaso, M. W. (2012). Measuring Public Relations Wikipedia Engagement: How Bright is the Rule? Public Relations Journal, 6(2) HTML Open access
    24. Gray, D. M., & Peltier, J. (2012). Is Wikipedia Reliable Tool for Marketing Educators and Students? A Surprising Heck Yes! Marketing Always Evolving. 34th Annual International Collegiate Conference. PDF Open access
    25. Leithner, A., Maurer-Ertl, W., Glehr, M., Friesenbichler, J., Leithner, K., & Windhager, R. (2012). Wikipedia and Osteosarcoma: An educational opportunity and professional responsibility for Emsos. Journal of Bone & Joint Surgery, BR, 94-B(SUPP XIV), 13. British Editorial Society of Bone and Joint Surgery. HTML Closed access
    26. Sormunen, E., Eriksson, H., & Kurkipäa, T. (2012). Wikipedia and wikis as forums of information literacy instruction in schools. The Road to Information Literacy: Librarians as Facilitators of Learning. IFLA 2012 Congress Satellite Meeting (pp. 1-23). PDF Open access
    27. Peng, H.-K., Zhang, Y., Pirolli, P., & Hogg, T. (2012). Thermodynamic Principles in Social Collaborations. ArXiV. Physics and Society. PDF Open access
    28. Kane, G. C., & Ransbotham, S. (2012). Collaborative Development in Wikipedia. ArXiv. PDF Open access
    29. a b Aragón, P., Kaltenbrunner, A., Laniado, D., & Volkovich, Y. (2012). Biographical Social Networks on Wikipedia – A cross-cultural study of links that made history. ArXiV. Computers and Society; Physics and Society, PDF Open access

    Wikimedia Research Newsletter
    Vol: 2 • Issue: 4 • April 2012
    This newletter is brought to you by the Wikimedia Research Committee and The Signpost
    Subscribe: Syndicate the Wikimedia Research Newsletter feed Email @WikiResearch on Identi.ca WikiResearch on Twitter[archives] [signpost edition] [contribute] [research index]

    Tilman Bayer — 02. May 2012, 04:49

    01. May 2012

    - Wikimedia Foundation - (anglicky)

    - Wikimedia Foundation - (anglicky)

    From adding content to patrolling, Wikipedians do it all

    Contributing to the largest online encyclopedia is not as simple as it appears: it involves a gamut of activities from writing new articles to writing policies and guidelines to participating in the deletion process. But when participants in our survey of Wikipedia editors were asked how often they contributed to certain activities in the last one month, the top three activities they named most frequently as those to which they contribute very often/often are in the article namespace: fix formatting, spelling, grammar or make other minor edits (50 percent), add content to existing articles (48 percent) and write new articles (23 percent).

    Being a Wikipedian is not only about adding content to Wikipedia. Many Wikipedians work behind the scenes to ensure content on Wikipedia is of high quality and meets the standards of Wikipedia. More than one-fifth of editors (21 percent) patrol for vandalism, copyright violations or other problems with articles often/very often. A slightly smaller number (17 percent) participates in the discussion namespace often/very often, and nine percent participate in the deletion process including speedy and proposed deletion often/very often. Other popular activities include: doing translation work and uploading or editing images, media etc. (14 percent each).

    Q4a,4b: How often have you participated in the following in the last 30 days? (n=6348)

    With 3.9 million articles in March 2012, the English Wikipedia is one of the more mature and complete language Wikipedias. It’s no surprise that editors who edit other language Wikipedias are more likely to say that they write new articles or add content to existing articles often/very often. Editors who work on smaller Wikipedias are also more likely to do translation work. Compared to the English (4 percent) and German Wikipedia (5 percent), more editors from the Russian (18 percent), Spanish (28 percent), French (16 percent), Portuguese (35 percent) and Arabic (36 percent) Wikipedias stated that they did translation work often/very often.

    Q4a,4b: How often have you participated in the following in the last 30 days? (n=6348)

    If you are interested in more information about Wikipedia editors like age or country of residence, please check out the long-awaited topline findings from the survey.

    Mani Pande, Head of Global Development Research
    Ayush Khanna, Data Analyst, Global Development

    In December 2011, we conducted an online survey of Wikipedia editors in 17 languages. This is the second in a series of blog posts summarizing our findings. If you are interested, you can find out more about the methodology of the survey here.

    Mani Pande — 01. May 2012, 02:07

    30. April 2012

    - Wikimedia Foundation - (anglicky)

    - Wikimedia Foundation - (anglicky)

    Psychology class collaborates on two Good Articles

    When University of Alberta-Augustana Psychology Professor Paula Marentette thought about Wikipedia, it was in the context of reminding her students not to cite it in a paper. But then she read Association for Psychological Science President Mahzarin Banaji’s call for psychology professionals to improve the quality of Wikipedia articles on their discipline. The arguments compelled Dr. Marentette to assign her students to edit Wikipedia as part of their coursework, and this term she joined the Wikipedia Education Program in Canada with her Language Acquisition class. The upper-level seminar class had seven students, and Dr. Marentette thought it would be good to have students work jointly to improve two course-related articles: vocabulary development and joint attention, with the goal of bringing them both up to Good Article status.

    Some members of the Language Acquisition seminar at the Augustana Faculty of the University of Alberta.

    Some members of the Language Acquisition seminar at the Augustana Faculty of the University of Alberta.

    The students in the class found this writing assignment a bit scary, but they were curious and eager to see how it could turn out.

    “I was nervous about writing for Wikipedia because I didn’t know what to expect and the process seemed a little daunting,” said Alanna Lindsay, a fourth-year student from Wainwright, Alberta. “However, once we began, I quickly got used to Wikipedia and began to really enjoy the project.”

    Alanna’s classmate Erika Heiberg, a fourth-year from Kingman, Alberta, agreed, saying she was “slightly intimidated” at the beginning.

    “I was unfamiliar with the editing process and was overwhelmed at first by all the new interfaces and things like that, but I was excited to do something different,” said Heiberg.

    Dr. Marentette said the traditional research essay generally develops information literacy, critical thinking, and writing skills, but she appreciated the fact that the Wikipedia assignment addressed those and more.

    “The students learned a lot about evaluating sources. They learned how Wikipedia works (talk pages, standards etc.). They know now that they can figure out if an article is of reliable or not. It made the typical lesson about evaluating sources very relevant,” she said. “When writing, they need to constantly be aware of the potential audience. They have to address public response as soon as they begin to plan changes to an article. They have to defend their choices, or allow themselves to be persuaded to change their approach. They have to write and rewrite to achieve the tone and clarity and coverage for which they are aiming.”

    And her students noticed their skills blossoming as they worked together to improve the two articles.

    “The good thing about Wikipedia is that it teaches you to write in a more accessible manner and to leave out a lot of the unnecessary information that you include in a traditional paper to meet the page number requirement,” Heiberg said. ”This project helped me to make my writing much more succinct and easy to follow.”

    The students improved their articles, then reached out to their Online AmbassadorNeelix, members of WikiProject Psychology and other editors who contribute to psychology-related articles for feedback. When they felt ready, they submitted both articles to the Good Article review process, then collaborated to make edits based on the feedback of the reviewers. Students said they welcomed the productive feedback on their articles, and they were excited to have had their hard work recognized with the Good Article designation.

    All of the students said they preferred Wikipedia assignments to traditional assignments after this experience. In fact, third-year student Lianne Theelen, a native of Red Deer, Alberta, said she hoped to take another class next year where she writes Wikipedia articles for class.

    “We get more feedback on our writing and we’re more motivated to make it look good because it’s accessible to the public,” Theelen said. “My favorite part is not having it sit and collect dust or get thrown away after the term. Our article is still there, and it is useful for people.”

    Other classmates agreed, echoing the themes that drove APS to call on professors like Dr. Marentette to use Wikipedia in their classrooms in the first place.

    “In the future, I would prefer a Wikipedia assignment over a usual term paper, since this will help many Wikipedia users get credible information on psychology related topics,” said Juliet Brown, a fourth-year student born in Ghana and now living in Alberta.

    “My favorite part about writing for Wikipedia was knowing that the information being presented is valuable to someone, and in particular to parents,” said Alison Owens, a third-year student from Olds, Alberta. “The joint attention page may be a place where parents of children with developmental disabilities go to look in order to learn more about their child’s disability. I think its great that we can provide that information for them in an easy access format, with reliable sources that they can trust.”

    LiAnna Davis, Wikipedia Education Program Communications Manager

    LiAnna Davis — 30. April 2012, 21:54

    Nine out of ten Wikipedians continue to be men: Editor Survey

    As part of the Wikimedia Movement strategic plan, regular surveys among Wikipedia editors are an important way to take the pulse of the community and identify pressing concerns and needs. We are happy to share results from the second editor survey that was conducted in December 2011. We began survey efforts in April 2011, and results from the first survey are available here. We would like to point out that although this blog post and the following ones will be looking at some trends across the April and December survey, 7-8 months is a rather short time to see statistically significant change on important indicators like gender distribution resulting from Wikimedia Foundation initiatives. Here are some demographic data about Wikipedia editors:

    a. Wikipedia editors continue to be predominantly men

    The gender distribution of Wikipedia editors hasn’t changed since the last survey. Among those surveyed, 90 percent self-identified as males, 9 percent as females and 1 percent as transsexual or transgender. That being said, there was a greater amount of female editors among those respondents who had joined more recently: Among editors who had joined in 2011, 14 percent were female compared to 10 percent for 2010, 9 percent for 2009 and 8 percent for editors who had joined in 2008 and participated in this survey. Possible explanations include that Wikipedia has been attracting a higher ratio of women recently, or that female editors leave the project sooner. There were no significant variations across the major language Wikipedias, with the exception of the Russian Wikipedia, which reported only 6 percent female editors. Also, out of all editors in the US, 15 percent are women, which is significantly higher than any other country of residence. Conversely, there are fewer male editors in US (85 percent) compared to other countries (UK, India, Brazil, Canada) where 90 percent or more of editors are males.  With initiatives like the Teahouse project that engages new editors through outreach, we hope to increase the number of female editors on Wikipedia.

    (D15) What is your gender? (n=6503)

    D15. What is your gender? n=6503

    b. English Wikipedia continues to be the most read and edited Wikipedia

    As we had found in the April 2011 survey, a large majority of Wikipedia editors read and edit the English version. Many editors that primarily make edits to another language Wikipedia also edit the English Wikipedia. While only 30 percent primarily edit the English Wikipedia, 63 percent contribute to it. Almost half of English Wikipedia editors reported other language Wikipedias as their primary project. Similarly, 86 percent of Wikipedia editors read the English Wikipedia, though only 38 percent read it primarily.

    Q1a. Which language versions of Wikipedia do you CONTRIBUTE to? Please choose all that apply.

    Q1b. Which language version of Wikipedia do you PRIMARILY CONTRIBUTE to? Please choose ONE.

    Q2a. Which language versions of Wikipedia do you READ? Please choose all that apply.

    Q2b. Which language version of Wikipedia do you PRIMARILY READ? Please choose ONE.

    If you are interested learning more about Wikipedia editors – from age demographics to their editing experiences, please check out this space as we publish the long-awaited topline findings from the survey.

    Mani Pande, Head of Global Development Research
    Ayush Khanna, Data Analyst, Global Development

    In December 2011, we conducted an online survey of Wikipedia editors in 17 languages. This is the first in a series of blog posts summarizing our findings. If you are interested, you can find out more about the methodology of the survey here.

    Ayush Khanna — 30. April 2012, 18:25

    26. April 2012

    - Wikimedia Foundation - (anglicky)

    - Wikimedia Foundation - (anglicky)

    Can you help Wikipedians collaborate with Harvard University?

    Today, the Wikimedia Foundation is pleased to announce a new opportunity for Wikipedians to reach out to scholars at one of the world’s most prestigious educational institutions. We’re seeking an experienced Wikipedia editor for a one year, full time fellowship based at Harvard University’s Belfer Center for Science and International Affairs.

    Located at the Harvard campus in Cambridge Massachusetts, this Wikipedian will have a unique role facilitating collaboration between the faculty, staff, and fellows at the Center and the Wikipedia volunteer community, with the aim of improving the quality of encyclopedia articles.

    The Belfer Center is a focal point for research on international security and policy related to science, technology and the environment. It is also part of the John F. Kennedy School of Government at Harvard. While some experience with the subject matter is preferred, the goal of this fellowship is for a Wikipedian to help unlock the expertise at the Center and see that it is shared with the world. While English Wikipedia alone may have nearly four million articles, the depth and quality of our coverage of international affairs and policy — such as on global nuclear security — is not well known. What we do know is that we are still a long way from Wikipedia’s goal of the “sum of all human knowledge,” and that having a liaison to work with experts and volunteers will do much to improve the free encyclopedia.

    This position is funded by a generous grant from the Stanton Foundation. This philanthropic institution has supported both the Belfer Center and the Wikimedia Foundation in the past. Apply now!

    Siko BouterseHead of Community Fellowships Program, Wikimedia Foundation

    Siko Bouterse — 26. April 2012, 23:49

    Analyzing Mobile Browser Energy Consumption

    Recently, technology reporter Jacob Aron wrote a blog post on newscientist.com that talks about how bloated website code drains your smartphone’s battery.

    He mentions how Stanford computer scientist Narendran Thiagarajan and colleagues used an Android phone hooked up to a multimeter to measure the energy used in downloading and rendering popular websites. Using their experimental setup they measured the energy needed to render popular web sites as well as the energy needed to render individual web elements such as images, Javascript, and Cascading Style Sheets (CSS). They claim that complex Javascript and CSS can be as expensive to render as images. Moreover, dynamic Javascript requests (in the form of XMLHttpRequest) can greatly increase the cost of rendering the page, since it prevents the page contents from being cached. Finally, they show that on the Android browser, rendering JPEG images is considerably cheaper than other formats, such as GIF and PNG for comparably sized images.

    One example that is cited is that simply loading the mobile version of Wikipedia over a 3G connection consumed just over 1 per cent of the phone’s battery, while browsing apple.com, which does not have a mobile version, used 1.4 per cent.
    Yet, in the summary of the paper they find that the results from this study are not meaningful except for the initial loading of just a single page resource. It would be interesting to extend these results in a meaningful way, and study the energy signature of an entire browsing session at a site such as Wikipedia, where a user typically moves from page to page. So, during that session, downloaded web elements such as Javascript, CSS and images would mostly be cached locally. Therefore, we really can’t estimate the energy cost of a total session by simply summing the energy usage of pages visited during that session. Measuring an entire typical session may help optimize the power signature of the entire site. Custom CSS that is applicable to every page of a site would easily outweigh the cost of the apparently excessive CSS download for the render of just the first page.
    So, one of the ways that we are looking to improve our mobile browser energy consumption is by implementing the MediaWiki ResourceLoader in order to improve the load times for JavaScript and CSS. ResourceLoader is the delivery system in MediaWiki for the optimized loading and managing of modules. Its purpose is to improve MediaWiki’s front-end performance and the experience by making use of strong caching while still allowing near-instant deployment of new code that all clients start using within 5 minutes. Modules are built of JavaScript, CSS and interface messages; it was first released in MediaWiki 1.17.
    On Wikimedia wikis, every page view includes hundreds of kilobytes of JavaScript. In many cases, some or all of this code goes unused due to browser support or because users do not make use of the features on the page. In these cases, bandwidth and loading time spent on downloading, parsing and executing JavaScript code are wasted. This is especially true when users visit MediaWiki sites using older browsers, like Internet Explorer 6, where almost all features are unsupported, and parsing and executing JavaScript is extremely slow.
    ResourceLoader solves this problem by loading resources on demand and only for browsers that can run them. Although there is too much to summarize in a simple list, the major improvements for client-side performance are gained by:
    • Minifying and concatenating
    • → which reduces the code’s size and parsing/download time
    • JavaScript files, CSS files and interface messages are loaded in a single special formatted “ResourceLoader Implement” server response.
    • Batch loading
    • → which reduces the number of requests made
    • The server response for module loading supports loading multiple modules so that a single response contains multiple ResourceLoader Implements, which in itself contain the minified and concatenated result of multiple javascript/css files.
    • Data URIs embedding
    • → which further reduces the number of requests, response time and bandwidth
    • Optionally images referenced in stylesheets can be embedded as data URIs. Together with the gzippping of the server response, those embedded images, together, function as a “super sprite”.

    Patrick Reilly, Senior Software Developer, Mobile

    Patrick Reilly — 26. April 2012, 23:18

    Commons Picture of the Day: Skull from a Téviec tomb

    Skull from a tomb dating to the Mesolithic period. Photo by Didier Descouens

    Wikimedia Commons photographer Didier Descouens (User:Archaeodontosaurus) has the unique opportunity to take photos of natural history and paleontology: he is the chair of the Institute of Natural Sciences “Picot de Lapeyrouse” at the Natural History Museum in Toulouse, France. As part of a special arrangement with Wikimedia France and Toulouse Mayor Pierre Cohen, Descouens and his colleagues have spent a year adding quality encyclopedic images to the Commons database as part of Projet Phoebus.

    The skull above was part of a famous tomb, from the Téviec region in Brittany, and is dated to approximately 7000 BC, during the Mesolithic era. According to Descouens, the tomb has undergone a complete restoration with a study of paleoanthropology and forensic medicine and is part of an exhibition that originally appeared in Toulouse, but is currently in Paris. The exhibition will then move to the Transvaal Museum in Pretoria, South Africa.

    Descouens said there were two skeletons in the tomb, which were originally believed to be of a man and a woman. Through their analysis, they discovered that they were both women and that they had been violently killed. “The skull of the photo shows an impact of arrow, above the right orbital,” said Descouens. “The right side of the skull shows a blow with a hammer.”

    As for the significance of the finding, Descouens said, “The image of two women murdered and buried with great care, asks us about our inner nature. 7000 years: that is both far away but frighteningly ‘modern.’”

    Didier Descouens self portrait.

    Descouens is a doctor, a scientific photographer and a naturalist. He has been taking photographs since he was 10 years old and learned scientific photography techniques from François Seguy, who is in charge of photographic works from the Muséum de Toulouse and Université Paul Sabatier. The Muséum de Toulouse has mandated that the ”Institute Piot de Lapeyrouse” share photographs from its collections freely. After testing various options, they found Commons to be the most reliable and easiest to use.

    In addition to the value of the photos for Wikipedia and Commons, Descouens said scientific organizations have repurposed the images for their materials. The Projet Phoebus images have been used on 283 of the Wikipedia language projects and have also appeared in media stories throughout Europe and in Canada.

    The ubiquitous use of the photos pleases Descouens. “Knowledge is the cement that unites us. It is valuable if it is freely shared by all,” he said.

    (View more of Descouens’ featured photos)

    Story by Matthew Roth, Global Communications Manager
    Reporting by Elaine Mao, Communications Intern
    Translation by Guillaume Paumier, Technical Communications Manager 

    Matthew Roth — 26. April 2012, 21:30

    Crown Prince Haakon of Norway celebrates Wikipedia Zero

    Kristen Skogen Lund, Crown Prince Haakon, Jimmy Wales, and Minister of Development Holmas looking on as Wikipedians demonstrate editing.

    On Monday, at the annual Wikipedia Academy in Oslo, Norway, Crown Prince Haakon of Norway joined with Wikimedians to promote free knowledge and to highlight the cultural institutions and businesses that have embraced Wikipedia. They focused on the Wikipedia Zero agreement between the Wikimedia Foundation and Telenor, which enables more than 135 million customers in Asia to access Wikipedia without any additional charge on their data plans.

    The celebration was headed by local Wikipedians, who introduced Crown Prince Haakon, Jimmy Wales, Minister of Development Heikki Holmas, and Telenor Executive Vice President Kristin Skogen Lund. Lund opened the celebration by advocating for Wikipedia and its open source format, identifying it as the “main pillar” of Telenor’s policy of openness in Asia.

    “It’s an important development we put on track together with the Wikimedia Foundation and the Wikipedia movement, and we are proud to contribute to this,” said Lund

    Lund then demonstrated Telenor’s commitment to the free knowledge mission by announcing that Telenor had contributed 200 photos to Wikimedia Commons with Creative Commons licensing. “There is so much work to be done globally, but we can also contribute at home,” she said. “We hope that others will do the same.”

    The academy broke past participation records by attracting 99 sign-ups and 23 high school assistants, as well as the first ever royal participant. Half of the attendees belonged to GLAM institutions, mainly museums and archives. It marked the beginning of the sector’s national policy of officially acclaiming Wikipedia as a preferred channel of communicating cultural heritage.

    Minister Holmas, Crown Prince Haakon, and Jimmy Wales

    The 100 chairs of the academy room were all filled up, with people standing along the walls during the award ceremony for Wiki Loves Monuments 2011 and the local Wikipedian of the Year. The latter prize went to meteorologist Frode Korneliussen and the Catholic parish priest Claes Tande, who has 180,000 edits and more than 13,000 new articles.

    During a course on how to edit Wikipedia, we learned that Minister Holmaas is an active Wikipedia user, and editor. Ten Wikipedia editors from the local Drömtorp High School were recruited to help teach the course. The students, with assistance from Wales, taught Crown Prince Haakon, Lund and Holmaas how to patrol, supervise RSS feeds, and recognize vandalism. The Crown Prince followed the lectures intensely and expressed admiration for the elaborate tools that administrators and patrollers use on Wikipedia.

    This year’s academy coincided with the last day of the trial of Utøya terrorist Anders Behring Breivik in Oslo City court house, only a kilometer away. In the local media’s coverage of the academy, they focused on the efforts of the Norwegian Wikipedia community to keep extremists from making their imprint on articles. Jimmy Wales explained the Wikipedia model as one of openness and democratic debate, which helps the project attain a neutral point of view.

    The morning after, Wikimedia Norway vice chairman Erlend Bjørtvedt appeared on the morning news, explaining how a corps of 150 norwegian patrollers and administrators on four continents, have managed to uphold the neutrality of disputed articles by a mix of patrolling, reverting and limited article blocking.

    Erlend Bjørtvedt, Vice Chairman, Wikimedia Norway 


    Matthew Roth — 26. April 2012, 17:27

    23. April 2012

    - Wikimedia Foundation - (anglicky)

    - Wikimedia Foundation - (anglicky)

    Students see benefits from Wikipedia assignment

    Students at universities in the United States and Canada found that contributing to Wikipedia as a class assignment through the Wikipedia Education Program improved their media literacy and technology skills, according to survey results from the fall 2011 term. In the Wikipedia Education Program, professors assign students to contribute to Wikipedia, usually in the form of expanding a stub article, in place of a traditional research paper grade. At the end of the fall 2011 term, we asked students who participated in the U.S. and Canada program to fill out a survey on their experiences. A total of 132 students took the survey, with a little over three-quarters of the respondents from the United States. About 61 percent of the respondents were enrolled in undergraduate courses, while the remainder were enrolled in graduate courses.

    Learning outcomes

    A series of questions were designed around assessing student learning outcomes. About two-thirds of the respondents agreed that doing a Wikipedia assignment was a beneficial experience, with almost 20 percent of them strongly in favor of a Wikipedia assignment in place of a traditional term paper. Students from the United States and graduate students all reported higher beliefs in the benefits of a Wikipedia assignment. More than half of the respondents felt that doing a Wikipedia assignment improved (1) their ability to identify poor quality Wikipedia articles and (2) their ability to identify bias in documents. In addition, more than half the respondents felt their ability to write a neutral (i.e., balanced point-­of-­view) document improved through a Wikipedia assignment more than through a standard term paper.

    These findings indicate that students recognize the media literacy benefits in doing a Wikipedia assignment. As professors have noted, when Wikipedia is not the destination of the student’s research on a topic, but is instead the road, students are forced out of their research comfort zone. Students are required to evaluate the reliability of sources, find journal articles, and write from a neutral point of view to meet Wikipedia’s policy requirements.

    Support resources

    Student participants use a set of resources when they have questions about editing Wikipedia — online text, Campus Ambassadors, Online Ambassadors, and professors to name a few. Online text is the most commonly used resource, followed by printed materials. Nearly 93 percent of students who consulted their Campus Ambassador found him or her to be helpful, and 74 percent of students who consulted an Online Ambassador said he or she was helpful.

    We’ve found that having that support makes a big difference to students. Students can chat with their Campus Ambassadors in person on campus or on wiki, and they can interact with Online Ambassadors on-wiki and through an IRC chatroom where they can get immediate help for quick questions.

    Students had a positive interaction with the Wikipedia community of editors when they interacted with them. Students were asked to pick two adjectives to describe their views of the Wikipedia editing community; top responses included “helpful” (72 percent), “collaborative” (39 percent), and “intelligent” (27 percent).

    Motivations

    We asked students to identify the key motivations for their contributions to Wikipedia. Important factors students reported were getting a grade, interest in their Wikipedia article topic, and the usefulness of their work (i.e., it wasn’t another throwaway assignment). Graduate students reported a broader variety of motivations, when compared to undergraduates. In particular, more than 60 percent of the graduate students gave a high ranking to the fact that their work contributes to a freely accessible knowledge base.

    Final comments and looking ahead

    Although converting students into longterm editors is not an explicit goal of the Wikipedia Education Program, as many as 46 percent of our respondents expressed interest in continuing to edit Wikipedia.

    When students were asked to share the hardest thing about their Wikipedia editing experience, some common themes emerged. Many students mentioned the challenges of learning how Wikipedia works, and how editing an article was a lot more work than they imagined. Almost universally, they talked about how hard it was to learn wiki syntax. The Visual Editor will help alleviate many of these concerns.

    To sum up, here’s what one student had to say when asked about any memorable experiences:

    “Overall, a great learning experience. Having to really validate anything you say by backing it with a reputable source is incredibly beneficial and students should be exposed to this, especially if they have not had a research methods course in their undergraduate career.”

    Ayush Khanna
    Data Analyst, Global Development

    Ayush Khanna — 23. April 2012, 23:52

    Niklas Laxström, language engineer and Wikimedian

    University of HelsinkiThe average age of the MediaWiki developers is quite young. They often started contributing to the MediaWiki code while still in school or university. When their contributions show promise, they are sometimes asked to contribute to particular projects. This has resulted in the hiring of students and they continue to do professionally what they at first did as a hobby.

    While the Wikimedia Foundation is happy with the talent it gains in this way, it feels strongly that finishing formal education is very important. Some students only work for the WMF in their holidays while others manage regular contributions in their free time as well. Such relations are often strengthened through programs like the Google Summer of Code or through summer internships.

    Niklas Laxström recently finished University and this happy occasion is reason enough to interview him. As you may know, he works for the WMF Localisation Team and his claim to fame is that he started what became translatewiki.net. Niklas has been instrumental in much of the internationalisation and localisation development for the MediaWiki software.

    Thanks,
    Gerard Meijssen
    Internationalization / Localization outreach consultant

    Congratulations, master Niklas. You finished university !! What did you study and what is your exact title (in Finnish)
    I studied language technology with minors in Finnish language, Computer Science, East-Asian studies and collection of Russian language courses. I’m now Master of Arts, filosofian maisteri.

    You started with what became translatewiki.net before you started university. How did your study influence the development of translatewiki.net
    Before university I had a hobby project for inflecting Finnish nouns. It wasn’t successful nor had it a good design, but it started series of events, which caused me to start studying language technology.

    My studies were pretty heavily biased in hard language processing: for instance syntactic parsers, finite state technologies and morphologies.  however, the open source language technologies are not yet in a level where that kind of processing can just be plugged into any software.

    Learning about variation in languages has been very useful to me. It helps avoiding solutions that only work for limited number of similar languages. I learned most of that in linguistics courses but also by studing several dissimilar languages. l also liked the isolated courses about copyright, terminologies and string processing, which turned out to be useful in different situations.

    On the other hand, working with MediaWiki and translatewiki.net has given me enormous amounts of practical experience all over computer
    engineering, which helped me to perform better in engineering related courses.

    You are now a fully qualified linguist. What do you consider the biggest linguistic challenge we are facing you
    Not exactly a linguist, more like a language engineer. The language support in MediaWiki is nothing to be ashamed of. That said, there is a huge difference between languages on how readily they can be used on a computer.

    This involves everything from search to fonts and input method and from optical character recognition to speed synthesis. The biggest challenge is to identify solutions that help to bridge that gap for as many languages as possible.

    What is it that you do for the Wikimedia Foundation
    I continue doing what brought me to MediaWiki: making the translation process easy and fast. This includes a lot of i18n (making the software translatable) and facilitating l10n (adapting and translating the software to particular language and culture).

    It is absurd that localisation is simultaneously one of the biggest strengths and weakness of open source. While it enables using computer in anyone’s mother tongue, the processes and tools used for localisation tend to be difficult to use if not outright preventing good localisation.

    You participated at the Multilingual Web conference in Luxembourg. What value do we gain from visiting conferences like this
    Generally conferences are about discussions and sharing of experiences: we learn what others do and we let others know what we do. The ultimate purpose is to learn new things and find ways to work together to to reach goals we wouldn’t be able to reach otherwise. Meeting new people and enjoying the local culture makes it fun and interesting.

    In this case we had a chance to learn, discuss and affect standards that on the other hand let us support languages better and dictate the limits of what we can do in the near future.

    Standards play an important part in translatewiki.net and in MediaWiki. What standards do we use and for what
    We usually make our own best practices for what we do, to get stuff done. That said, we are not looking to reinvent the wheel just because we can. We rely on standards all over the place starting from basic things HTTP and HTML but specifically we do use and support standard i18n file formats, language codes and other common practices like tagging our content with language codes.

    The CLDR repository is especially problematic, because on the one hand it contains lots of data we don’t want to duplicate, but on the other
    hand it is missing so much data that we need and could provide, but are currently not able to push back into CLDR.

    As a member of the WMF Localisation team you care about language support. How much of a difference does the team make ?
    We have talked a lot about how to measure the impact of what we do. We do not have any numbers currently. There are so many things we would
    want to fix but we can’t possibly do them all. We have chosen issues that we want to fix: two examples of that are the spreading use of Narayam for inputting text and WebFonts for displaying text. The fact that communities request those extensions to be enabled on new projects says to me that we have made something that is useful and wanted.

    It is also worth mentioning that we are raising awareness of i18n and l10n inside Wikimedia Foundation. WMF can sometimes be ignorant about
    i18n, especially when it comes to software. Our efforts are not a replacement for the language communities making noise themselves to make sure they are heard. We advise and provide WMF the tools to tackle the complex issues of deep multilingualism.

    What language support challenges do you want tackled
    I am very much looking towards to implemeting the planned language selector, which has been specifically desgined to fit our special needs.

    In slightly longer term, I would be happy to see our lack of support for proper collation (sorting) sorted out.

    Now that you finished university, what would you like to study next
    I want to upkeep my language skills, especially Russian. I do not have immediate plans to continue with postgraduate studies. But one never stops learning.

    Can you tell us a joke about linguists
    Not exactly a joke, but I’m not sure whether I should be proud or ashamed of this.

    Gerard Meijssen — 23. April 2012, 20:07

    19. April 2012

    - Wikimedia Foundation - (anglicky)

    - Wikimedia Foundation - (anglicky)

    Primary data about languages

    For MediaWiki, the CLDR or Common Locale Data Repository, is a primary source of information. The information about languages Unicode maintains in this standard is what is most relevant to us. It registers its name in English, as well as the autonym or the name in its own language, as well as information like what a date and a number look like,  the script or scripts used for a language and the names of other languages in that language.

    We prefer to use standardised information, not only because it is stable and reliable, but because we do not have to collect the data ourselves and also because the data is used by many other organisations and in many other applications. We love the CLDR and we want it to be even better. To make it better we need your help.

    Many of the languages that have a Wikipedia and many of the languages that want to have a Wikipedia are not represented in the CLDR. Many Wikipedians know their language really well. They can provide the information about their language and they can verify that the existing information is correct. When there is a need to change things, you will need to create a user.

    When a language is not yet supported, you will have to request for the new locale or language to be added. It is expected that you provide at least the core data when you make your request and that you at least complete the minimal data required. One of the questions is: where the language is official, it may be that a language does not have any official status. This does not prevent people from reading or writing that language and it does not mean that information about such a language is not important to us.

    When a language is already supported, we want you to verify if the names for other languages exist and are correctly written. There can be issues in any language including English; using the Auracana name for the Mapundungun language is considered an insult.

    When you are able and happy to help us in this way, you may be interested in joining our “language support team.” Because of your interest you belong to the group of people we first want to turn to when we have questions about supporting your language. More structured information and room for your reports can be found here. When there are any issues, do not hesitate to report them.

    Thanks,
    Gerard Meijssen
    Internationalization / Localization outreach consultant

    Gerard Meijssen — 19. April 2012, 05:16

    17. April 2012

    - Wikimedia Foundation - (anglicky)

    - Wikimedia Foundation - (anglicky)

    When putting a photo on the web is an act of generosity

    Ken Thomas. Photo CC0

    Life in the small town of Jodie, West Virginia, revolved around coal mining. Opportunities were scarce and young men like Ken Thomas were expected to get a job in the mines.

    He remembers sitting in his 4th grade class, listening to the responses of classmates as their teacher asked them, “What do you want to be when you grow up?” Most of the girls said housewife, while the boys agreed that they wanted to become coal truck drivers.

    Thomas said he wanted to be an astronaut.

    When he graduated high school, Thomas worked a short stint mining coal. He liked it so much he soon joined the Army, where he spent four years. After trying a number of professions, including radio broadcasting, he eventually got into construction safety, which he does currently in North Carolina.

    “Because of where we were and the low income nature, we had schools that weren’t always great, teachers that weren’t always great,” said Thomas. “We did not have access to resources, libraries, learning opportunities that I think people might have elsewhere.”

    The lack of educational opportunities led Thomas to embrace open access to knowledge, and he supports organizations that promote it like the Electronic Frontier Foundation and the Wikimedia Foundation.

    “When I first got involved in the Internet and with Wikipedia, what I kept thinking about was myself when I was 7 or 8 years old and logging on and finding a resource like that, how huge that would have been and how transformative that could have been for me as a child,” he said.

    He added, a “free education as a way to break what remains of our class system in the US, or an economic class system.”

    For Thomas, Wikipedia is “the most important experiment going on right now.” He also calls it a mirror that reflects, “the best of us and the worst of us. If we can’t make it work, then maybe we’re not worthy of our technology.”

    Sachem butterfly. Photo by Ken Thomas, CC0

    Thomas first began contributing to Wikimedia projects by way of Wikimedia Commons. His passion for photography was cultivated by combining it with his love for, as he describes it, “typical redneck pastimes,” such as fishing, hunting and kayaking, among other activities.

    Thomas recalled a particular event that marked a “weird point in [his] evolution” from a casual hobbyist to passionate photographer.

    “I was deer hunting one year, saw what I thought was a nice deer, and there was this flinch, do I go for the camera, do I go for the rifle?” he said. “You know, which am I going to do? If he steps out of there, the light’s perfect, and I got the 300 mm lens on… man it would be such a good photo.”

    Thomas does sell prints of some of his amazing images through his personal website, including a very nice calendar, but he is committed to giving away his photographs for the public good. While most photos on Commons are licensed Creative Commons Share Alike, which allows re-use so long as the original photographer is credited and derivative works are licensed under the same terms, Thomas donates his photos to the public domain.

    “What I have told people in the past is putting a photo on the web is an act of generosity,” he said. Emblazoned on the bottom of his userpage is the simple phrase: Give freely or don’t give.

    “I don’t own the bird. I don’t own the light. I don’t own the tree branch that the bird was sitting on,” he said. ”I take these pictures because I want people to see how beautiful these things are. Who am I to charge for that?”

    (View more of Ken Thomas’s photos)

    Story by Jordan Hu, Communications Intern, and Matthew Roth, Global Communications Manager
    Reporting by Victor Grigas, Storyteller

    Jordan Hu — 17. April 2012, 19:36

    Opening our operations with Wikimedia Labs

    For the past year and a half we’ve been working on a project named Wikimedia Labs, which enables us to invite our community to contribute to how our sites are run. Labs is a cloud computing environment using OpenStack for development, testing and deployment of Wikimedia’s infrastructure as a whole, enabling us to treat our infrastructure as an open source software project.

    The problems we’re solving

    When Wikipedia and its sister projects started, volunteers had root level access on our infrastructure. They were the only roots and most of the infrastructure they built is still in use today. Our lenient access policy made us flexible, so changes could happen quickly. Also, the sites were smaller, had far fewer users, and large, fundamental changes could be made in production.

    Growth has made us less willing to give out root access to volunteers. Because of the size of our sites, downtime is less acceptable. But having fewer volunteers means we have less ideas, and due to that, our ability to make changes quickly is decreased. We haven’t had a new volunteer root in years. We haven’t even had a new volunteer with shell access. Engaging volunteers and enabling them to easily contribute is a wider problem as well.

    Our software development community scales with volunteers. Unfortunately, operations doesn’t scale in a similar way right now. We’re limited to the staff operations engineers we currently have. The staff is great, but the fact that operations can’t scale to meet the needs of a large growth of developers means that operations is a bottleneck. Furthermore, our access policy prevents volunteer developers from learning how our infrastructure works.

    This leads to a situation where our staff developers and volunteer developers can’t easily collaborate. Our volunteers also have no way of appropriately testing their changes, since our infrastructure is complex and difficult to replicate. This means it’s harder to take contributions, which further slows the pace of changes on our sites.

    The approach to solving the problem

    The solution we are taking for this is to open access to our infrastructure up as widely as possible. We are making it possible for volunteers to use and modify our infrastructure like they do our software.

    Operations work can be done via configuration management tools, orchestration, and cloud computing. Thanks to this, operations can be treated like a software development project. If it can be a software project, then we can make it an open source project, allowing us to open infrastructure development to the wider community.

    Opening up infrastructure development to the software development community allows developers to bypass operations for work when operations is too busy to get to a task quickly enough. It also makes it possible to take contributions from volunteer operations engineers, bringing back the flexibility we originally had.

    However, going staff-only to volunteer-based is hard. We still have the same concerns about site availability, so we must take a new approach.

    Our first step was to release our puppet configuration. We spent nearly a month going through the configuration, pulling out sensitive info, ensuring we don’t release anything that would open us to major vulnerabilities.

    Next we created an infrastructure, called Labs, where volunteers can test their work, document it, and eventually have it deployed live to production. Our goal is to build a self-sustaining operations community using this infrastructure in a way similar to how the operations staff team currently works.

    A technical overview of what Labs is

    TL;DR: Labs is an environment where users can create and manage entire computing and networking infrastructures.

    Labs has its own set of terminology, so to understand this technical overview, it’s likely good to read the terminology guide.

    Labs allows users to create instances (virtual machines) that multiple users can use and fully control. It does so inside of projects, where projects are maintained by a community and the projects reflect real world projects, like testing and developing MediaWiki extensions for Incubator.

    Projects are a security concept. A project is a grouping of resources, like instances, firewall rules (via security groups), IP addresses, DNS entries, etc. It’s also a grouping of users and roles. A project has membership and roles that allow users to perform certain actions. Membership to a project generally allows a user to access instances that exist in the project, and often that access is full root level privileges. A member of a sysadmin role in a project allows that user to create, delete, and reconfigure instances. A member of a sysadmin role in a project allows that user to manage IP addresses, DNS records, and security groups.

    Here’s an example workflow in Labs, where a user wants to create and test a new search infrastructure:

    1. Request a new project called search, where they have sysadmin and netadmin roles. When the project is created, document the project’s purpose on the project’s page.
    2. Create a web security group, with ports 80 and 443 open to the world.
    3. Create three small instances (with 2 CPUs and 2GB RAM), with one of the three instances using the web security group. The instance with the web security group will be the search frontend, and the other two will be indexers.
    4. Configure the three instances, by hand, documenting the process on the project’s page.
    5. When the service is ready to be demoed, a floating IP address will be requested. Once the IP address is given, it’ll be associated with the search frontend.
    6. After the IP address is associated, add a DNS entry to it, allowing end-users to access the search infrastructure by host name.
    7. If the demoed infrastructure is successful, the configuration of the instances will then be puppetized and the puppet configuration will be pushed to the puppet repository for review.
    8. Upon successful review, the code will be merged, and will be deployed to production. The project will continue to be used for maintenance and feature testing before production.

    The reception and current use of Labs

    We launched Labs as a closed beta in October 2011, and are still in closed beta at this time. At the time of this writing, Labs has 79 projects, 129 instances, and 264 users.

    So far the community is very vibrant. Here’s a small select list of community maintained projects:

    • Bots: A bots infrastructure, for hosting bots used to edit Wikimedia sites.
    • Deployment-prep: A clone of our production infrastructure, meant to be used for testing software deployments before they go to production. You can be a beta tester for changes on the beta wikis.
    • Nagios: A volunteer maintained nagios monitoring system for Labs.
    • Hugglewa: A project used to develop a web based version of huggle.
    • Wikidata-dev: A project used to develop and demo the exciting new Wikidata project.
    • Mobile: A project used to develop and demo changes to our mobile infrastructure and software.
    • Maps: A project used to build OpenStreetMap infrastructure. This project is a collaboration of our staff and volunteer developers and developers of OpenStreetMap to add OpenStreetMap support to Wikimedia projects.

    There are many, many more, and we add new projects every day.

    How to help

    Labs is community built and maintained. We’d love for you to help out! If you’d like to help, please request an account and come talk with us about what you’d like to help with.

    We can easily be found on #wikimedia-labs on Freenode (IRC); look for Ryan_Lane, andrewbogott, sarasmollett, paravoid, and sumanah. Also you can subscribe and send us email on our mailing lists, labs-l.

    Ryan Lane
    Operations Engineer

    Ryan Lane — 17. April 2012, 00:24