Using Big Data Technique for Building Edit Alert System for Wikipedia Infoboxes Based on MapReduce Method
Journal Title: International Journal of Innovative Research in Computer Science and Technology - Year 2018, Vol 6, Issue 4
Abstract
Wikipedia is an online encyclopedia and has become a vital information resource for users as well as for many knowledge bases derived from it. This information requires manual editing for update. Wikipedia provides an infobox on the right hand side of many articles. An infobox of a Wikipedia article generally contains key facts in thearticle and is organized as attribute-value pairs. All the Wikipedia’s content is manually updated or maintained by contributors. This leads to the fact that its information is not updated regularly and completely. In this paper, we present a novel system that focuses onprediction of data items that are most likely to be updated, based on the category of page, record key, last time updated, etc. for alerting Wikipedia editors, about the data items that might need update soon, using Time series modeling. Concept of Bipartite graph is used to perform user based collaborative filtering to find similar editors who might be interested in editing the infobox. The update alert is sent to editors found using Bipartite graph along with the past editors of a particular infobox. The technique to deal with vandalic and erroneous edits is also discussed and its analysis is given. We have also presented various tasks that can be carried out on infoboxes
Authors and Affiliations
Khushboo Bhatia, Arnab Halder, Yashi Yadav, Ankush Sarsewar, Priyanka Singh, Khushboo Khurana
A Critical Review on Improving the Productivity of Microalgae Cultivated in Wastewater for Biofuel Production
Microalgae has been recognized as a possible feedstock for the assembling of biofuels. On account of their natural nonpartisanship and adaptability underway, microalgae have arisen as a potential feasible biomass asset....
The Electricity Creation by the Means of Hydro Power Plant
Energy can be produced in a variety of ways, and electricity is one of them. Hydropower plants produce electricity from water, thermal power plants produce electricity from heat, wind energy power plants produce electric...
IoT in Agriculture: Ongoing Developments and Emerging Issues
The developing requirement for food, both as far as amount and quality, has required horticultural area improvement and industrialization. The "Web of Things" (IoT) is a promising group of innovations fit for giving an a...
Review on Detecting DDoS Attacks using Map Reduce in Haddop
An assault on a network that overflows it with so many requests that regular traffic is either decelerated or entirely interrupted. Unlike a virus or worm, this can cause severe damage to databases. A Distributed Denial...
Use of Shear Wall and Reinforced Cement Concrete Bracing System Both in High Rise Commercial Buildings Using Staad Pro Software
In earthquake-prone zones the structures are designed to withstand seismic or lateral forces along with gravity loads. In that respect shear wall system, bracing system, diagrid, etc., have been suggested over a period o...