A Semantic Approach for Outlier Detection in Big Data Streams

Journal Title: Webology - Year 2019, Vol 16, Issue 1

Abstract

In recent years, the world faced a big revolution in data generation and collection technologies. The volume, velocity and veracity of data have changed drastically and led to new types of challenges related to data analysis, modeling and prediction. One of the key challenges is related to the semantic analysis of textual data especially in big data streams settings. The existing solutions focus on either topic analysis or the sentiment analysis. Moreover, the semantic outlier detection over data streams as one of the key problems in data mining and data analysis fields has less focus. In this paper, we introduce a new concept of semantic outlier through which the topic of the textual data is considered as the primary content of the data stream while the sentiment is considered as the context in which the data has been generated and affected. Also, we propose a framework for semantic outlier detection in big data streams which incorporates the contextual detection concepts. The advantage of the proposed concept is that it incorporates both topic and sentiment analysis into one single process; while at the same time the framework enables the implementation of different algorithms and approaches for semantic analysis.

Authors and Affiliations

Hussien Ahmad and Salah Dowaji

Keywords

Related Articles

Theories of Information, Communication and Knowledge: A Multidisciplinary Approach

This book includes a provocative collection of instructive, informative and inclusive topics of theories of information through an introduction and 11 chapters. The book attempts to cover a vast range of theories of info...

Editorial The International Scope of Webology

Since August 2004, the number of manuscripts submitted to Webology, an international open access journal, has increased. However, about 50 percent of the submitted manuscripts have been rejected by referees. This is one...

Theories of Information Behavior

Theoretically, information behaviour is one of the richest research areas in the field of library and information science (LIS). Since the calls for conceptual enrichment within the field of information behaviour in the...

Information Seeking Behaviour of Faculty Members of Rajabhat Universities in Bangkok

This article reports the results of a study of the information seeking behaviour of faculty members of Rajabhat Universities in Bangkok, Thailand. Data were collected by using a questionnaire from seven faculties in Raja...

Search Engines and Resource Discovery on the Web: Is Dublin Core an Impact Factor?

This study evaluates the effectiveness of the Dublin Core metadata elements on the retrieval of web pages in a suite of six search engines, AlltheWeb, AltaVista, Google, Excite, Lycos, and WebCrawler. The effectiveness o...

Download PDF file
  • EP ID EP687813
  • DOI -
  • Views 241
  • Downloads 0

How To Cite

Hussien Ahmad and Salah Dowaji (2019). A Semantic Approach for Outlier Detection in Big Data Streams. Webology, 16(1), -. https://europub.co.uk/articles/-A-687813