Finding (and Studying) Wikipedia Trolls

Published: March 13, 2017, 1:44 a.m.

You may be shocked to hear this, but sometimes, people on the internet can be mean. \xa0For some of us this is just a minor annoyance, but if you're a maintainer\xa0or contributor of a large project like Wikipedia, abusive users can be a huge problem. \xa0Fighting the problem starts with understanding it, and understanding it starts with measuring it; the thing is, for a huge website like Wikipedia, there can be millions of edits and comments where abuse might happen, so measurement isn't a simple task. \xa0That's where machine learning comes in: by building an "abuse classifier," and pointing it at the Wikipedia edit corpus, researchers at Jigsaw and the Wikimedia foundation are for the first time able to estimate abuse rates and curate a dataset of abusive incidents. \xa0Then those researchers, and others, can use that dataset to study the pathologies and effects of Wikipedia trolls.