Harnessing Context for Vandalism Detection in Wikipedia

Journal Title: EAI Endorsed Transactions on Collaborative Computing - Year 2015, Vol 1, Issue 1

Abstract

The importance of collaborative social media (CSM) applications such as Wikipedia to modern free societies can hardly be overemphasized. By allowing end users to freely create and edit content, Wikipedia has greatly facilitated democratization of information. However, over the past several years, Wikipedia has also become susceptible to vandalism, which has adversely affected its information quality. Traditional vandalism detection techniques that rely upon simple textual features such as spammy or abusive words have not been very effective in combating sophisticated vandal attacks that do not contain common vandalism markers. In this paper, we propose a context-based vandalism detection framework for Wikipedia. We first propose a contextenhanced finite state model for representing the context evolution ofWikipedia articles. This paper identifies two distinct types of context that are potentially valuable for vandalism detection, namely content-context and contributor-context. The distinguishing powers of these contexts are discussed by providing empirical results. We design two novel metrics for measuring how well the content-context of an incoming edit fits into the topic and the existing content of a Wikipedia article. We outline machine learning-based vandalism identification schemes that utilize these metrics. Our experiments indicate that utilizing context can substantially improve vandalism detection accuracy.

Authors and Affiliations

Lakshmish Ramaswamy, Raga Sowmya Tummalapenta, Deepika Sethi, Kang Li, Calton Pu

Keywords

Related Articles

A Hybrid Model Ranking Search Result for Research Paper Searching on Social Bookmarking

Social bookmarking and publication sharing systems are essential tools for web resource discovery. The performance and capabilities of search results from research paper bookmarking system are vital. Many researchers use...

A Framework for Performance Evaluation of Decentralized Eventual Consistency Algorithms

Eventual Consistency (EC) model is adopted by numerous large-scale distributed systems. To ensure performance and scalability, this model allows any replica to accept updates without remote synchronization. Nowadays, man...

Design of Pet Robots with Limitations of Lives and Inherited Characteristics

In this paper, we propose a framework of life duration and inheritance for pet robots to make them have original characteristics in their limited lives. The purpose of our research is to develop a pet robot that enables...

A method to determine the transient capacitance of the bifacial solar cell considering the cylindrica grain and the dynamic junction velocity (Sf)

In this paper, we present a new techninic based on the dynamic junc velocity (Sf) conconce ept for the evaluation of the transient diffusion capacitance of the bbiifacial solar cell considering cylindrical model of th he...

Guest Editorial: Selected Papers from IEEE IEEE/EAI CollaborateCom 2013

This issue of EAI Transactions on Collaborative Computing includes extended versions of articles selected from the program of the 9th IEEE International Conference on Collaborative Computing: Networking, Applications...

Download PDF file
  • EP ID EP45680
  • DOI http://dx.doi.org/10.4108/cc.1.1.e7
  • Views 507
  • Downloads 0

How To Cite

Lakshmish Ramaswamy, Raga Sowmya Tummalapenta, Deepika Sethi, Kang Li, Calton Pu (2015). Harnessing Context for Vandalism Detection in Wikipedia. EAI Endorsed Transactions on Collaborative Computing, 1(1), -. https://europub.co.uk/articles/-A-45680