Big Data Analytics for Net Flow Analysis in Distributed Environment using Hadoop

Abstract

Network traffic measurement and analysis have usually been performed on a high-performance server that collects and analyzes packet flows. When a large volume of traffic data from a large-scale network must be monitored for detailed statistics, it is not easy to handle terabytes or petabytes of data with a single server, and thousands of machines may be needed. Distributed parallel processing schemes built on cluster file systems have recently been developed, and they can be applied beneficially to the analysis of big network traffic data. Hadoop is a popular parallel processing framework that is widely used for working with large datasets. We analyze NetFlow data on Hadoop clusters ranging from a single node to multiple nodes and provide an algorithm that calculates the packet count and packet size for each source IP address over every fixed time interval, with a low rate of false positives, in order to detect malicious activity. Finally, we highlight the performance and benefits of a distributed Hadoop cluster on large as well as small data sets.
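The abstract does not give the algorithm itself, so the following is only a minimal sketch of how per-source packet counts and byte totals per fixed interval could be expressed in map/reduce style. It runs locally in plain Python rather than on Hadoop. The record layout (`timestamp,src_ip,packets,bytes`), the 300-second interval, and all function names are assumptions made for illustration, not details taken from the paper.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple

# Hypothetical window length; the paper only says "fixed interval".
INTERVAL_SECONDS = 300

Key = Tuple[str, int]    # (source IP, window start in epoch seconds)
Value = Tuple[int, int]  # (packet count, byte count)


def map_flow(line: str, interval: int = INTERVAL_SECONDS) -> Iterator[Tuple[Key, Value]]:
    """Map step: emit ((src_ip, window_start), (packets, bytes)) for one record.

    Assumes a CSV record of the form: timestamp,src_ip,packets,bytes.
    Malformed records are skipped rather than raising.
    """
    fields = line.strip().split(",")
    if len(fields) != 4:
        return
    ts, src_ip, packets, size = fields
    try:
        # Round the timestamp down to the start of its interval.
        window_start = int(float(ts)) // interval * interval
        yield (src_ip, window_start), (int(packets), int(size))
    except ValueError:
        return


def reduce_flows(pairs: Iterable[Tuple[Key, Value]]) -> Dict[Key, Value]:
    """Reduce step: sum packet and byte counts for each (src_ip, window) key."""
    totals = defaultdict(lambda: [0, 0])
    for key, (packets, size) in pairs:
        totals[key][0] += packets
        totals[key][1] += size
    return {key: (t[0], t[1]) for key, t in totals.items()}


def run_job(lines: Iterable[str], interval: int = INTERVAL_SECONDS) -> Dict[Key, Value]:
    """Chain map and reduce locally, standing in for a cluster job."""
    pairs = (pair for line in lines for pair in map_flow(line, interval))
    return reduce_flows(pairs)


if __name__ == "__main__":
    sample = [
        "0,10.0.0.1,3,180",
        "299,10.0.0.1,2,120",
        "300,10.0.0.1,1,60",
        "10,10.0.0.2,5,500",
    ]
    for (ip, window), (packets, size) in sorted(run_job(sample).items()):
        print(ip, window, packets, size)
```

On a real cluster the map and reduce functions would run as separate tasks, with the framework grouping values by key between them. A detection step could then compare each per-interval total against a threshold, which is likewise left out here because the abstract does not specify one.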

Authors and Affiliations

Amreesh Kumar Patel, D. S. Bhilare, Sushil Buriya, Satyendra Singh Yadav

How To Cite

Amreesh Kumar Patel, D. S. Bhilare, Sushil Buriya, Satyendra Singh Yadav (2015). Big Data Analytics for Net Flow Analysis in Distributed Environment using Hadoop. International Journal of Research in Computer and Communication Technology, 4(7), -. https://europub.co.uk/articles/-A-28209