Using Big Data Technique for Building Edit Alert System for Wikipedia Infoboxes Based on MapReduce Method

Abstract

Wikipedia is an online encyclopedia and has become a vital information resource for users as well as for many knowledge bases derived from it. This information requires manual editing for update. Wikipedia provides an infobox on the right hand side of many articles. An infobox of a Wikipedia article generally contains key facts in thearticle and is organized as attribute-value pairs. All the Wikipedia’s content is manually updated or maintained by contributors. This leads to the fact that its information is not updated regularly and completely. In this paper, we present a novel system that focuses onprediction of data items that are most likely to be updated, based on the category of page, record key, last time updated, etc. for alerting Wikipedia editors, about the data items that might need update soon, using Time series modeling. Concept of Bipartite graph is used to perform user based collaborative filtering to find similar editors who might be interested in editing the infobox. The update alert is sent to editors found using Bipartite graph along with the past editors of a particular infobox. The technique to deal with vandalic and erroneous edits is also discussed and its analysis is given. We have also presented various tasks that can be carried out on infoboxes

Authors and Affiliations

Khushboo Bhatia, Arnab Halder, Yashi Yadav, Ankush Sarsewar, Priyanka Singh, Khushboo Khurana

Keywords

Related Articles

A Critical Review on Improving the Productivity of Microalgae Cultivated in Wastewater for Biofuel Production

Microalgae has been recognized as a possible feedstock for the assembling of biofuels. On account of their natural nonpartisanship and adaptability underway, microalgae have arisen as a potential feasible biomass asset....

The Electricity Creation by the Means of Hydro Power Plant

Energy can be produced in a variety of ways, and electricity is one of them. Hydropower plants produce electricity from water, thermal power plants produce electricity from heat, wind energy power plants produce electric...

IoT in Agriculture: Ongoing Developments and Emerging Issues

The developing requirement for food, both as far as amount and quality, has required horticultural area improvement and industrialization. The "Web of Things" (IoT) is a promising group of innovations fit for giving an a...

Review on Detecting DDoS Attacks using Map Reduce in Haddop

An assault on a network that overflows it with so many requests that regular traffic is either decelerated or entirely interrupted. Unlike a virus or worm, this can cause severe damage to databases. A Distributed Denial...

Use of Shear Wall and Reinforced Cement Concrete Bracing System Both in High Rise Commercial Buildings Using Staad Pro Software

In earthquake-prone zones the structures are designed to withstand seismic or lateral forces along with gravity loads. In that respect shear wall system, bracing system, diagrid, etc., have been suggested over a period o...

Download PDF file
  • EP ID EP748140
  • DOI 10.21276/ijircst.2018.6.4.2
  • Views 56
  • Downloads 0

How To Cite

Khushboo Bhatia, Arnab Halder, Yashi Yadav, Ankush Sarsewar, Priyanka Singh, Khushboo Khurana (2018). Using Big Data Technique for Building Edit Alert System for Wikipedia Infoboxes Based on MapReduce Method. International Journal of Innovative Research in Computer Science and Technology, 6(4), -. https://europub.co.uk/articles/-A-748140