Harnessing Context for Vandalism Detection in Wikipedia

Journal Title: EAI Endorsed Transactions on Collaborative Computing - Year 2015, Vol 1, Issue 1

Abstract

The importance of collaborative social media (CSM) applications such as Wikipedia to modern free societies can hardly be overemphasized. By allowing end users to freely create and edit content, Wikipedia has greatly facilitated the democratization of information. However, over the past several years, Wikipedia has also become susceptible to vandalism, which has adversely affected its information quality. Traditional vandalism detection techniques that rely on simple textual features, such as spammy or abusive words, have not been very effective against sophisticated vandal attacks that contain no common vandalism markers. In this paper, we propose a context-based vandalism detection framework for Wikipedia. We first propose a context-enhanced finite state model for representing the context evolution of Wikipedia articles. We then identify two distinct types of context that are potentially valuable for vandalism detection, namely content-context and contributor-context, and discuss their distinguishing power with empirical results. We design two novel metrics for measuring how well the content-context of an incoming edit fits into the topic and the existing content of a Wikipedia article. We outline machine learning-based vandalism identification schemes that utilize these metrics. Our experiments indicate that utilizing context can substantially improve vandalism detection accuracy.
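To make the idea of content-context fit concrete, the following is a minimal illustrative sketch in Python. It is not the paper's metric: the abstract does not define the two metrics, so this example assumes a plain bag-of-words cosine similarity between the text an edit inserts and the article's existing text. The function names (`tokenize`, `cosine_similarity`, `content_fit`) and the sample strings are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters (0.0 if either is empty)."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def content_fit(article_text, inserted_text):
    """Hypothetical fit score (an assumption, not the paper's metric):
    word overlap between the inserted text and the existing article text."""
    return cosine_similarity(
        Counter(tokenize(article_text)), Counter(tokenize(inserted_text))
    )


# Made-up sample data for illustration only.
article = "The violin is a wooden string instrument played with a bow."
on_topic = "A violin bow is strung with horsehair."
off_topic = "Buy cheap watches online today."

# An edit that shares vocabulary with the article scores higher than one
# that shares none; the off-topic edit has no words in common.
print(content_fit(article, on_topic) > content_fit(article, off_topic))
```

A score like this could serve as one input feature to a classifier alongside contributor-based features. A realistic system would likely need term weighting and topic modeling rather than raw word overlap.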

Authors and Affiliations

Lakshmish Ramaswamy, Raga Sowmya Tummalapenta, Deepika Sethi, Kang Li, Calton Pu

  • EP ID EP45680
  • DOI http://dx.doi.org/10.4108/cc.1.1.e7

How To Cite

Lakshmish Ramaswamy, Raga Sowmya Tummalapenta, Deepika Sethi, Kang Li, Calton Pu (2015). Harnessing Context for Vandalism Detection in Wikipedia. EAI Endorsed Transactions on Collaborative Computing, 1(1), -. https://europub.co.uk/articles/-A-45680