A Semantic Approach for Outlier Detection in Big Data Streams
Journal Title: Webology - Year 2019, Vol 16, Issue 1
Abstract
In recent years, the world faced a big revolution in data generation and collection technologies. The volume, velocity and veracity of data have changed drastically and led to new types of challenges related to data analysis, modeling and prediction. One of the key challenges is related to the semantic analysis of textual data especially in big data streams settings. The existing solutions focus on either topic analysis or the sentiment analysis. Moreover, the semantic outlier detection over data streams as one of the key problems in data mining and data analysis fields has less focus. In this paper, we introduce a new concept of semantic outlier through which the topic of the textual data is considered as the primary content of the data stream while the sentiment is considered as the context in which the data has been generated and affected. Also, we propose a framework for semantic outlier detection in big data streams which incorporates the contextual detection concepts. The advantage of the proposed concept is that it incorporates both topic and sentiment analysis into one single process; while at the same time the framework enables the implementation of different algorithms and approaches for semantic analysis.
Authors and Affiliations
Hussien Ahmad and Salah Dowaji
Students' sense of self-efficacy in searching information from the Web: A PLS approach
The role of self-efficacy in different task and organizational settings has largely been highlighted, especially in searching for information by web users. The current research was conducted to reemphasize the mentioned...
Marketing Research in India: A Scientometrics Study
Analyses the Indian publications output in marketing research during 1990-2018 on several parameters including contribution and citation impact of most productive countries, India’s overall contribution, its growth patte...
A Study of Web Search Trends
This article provides an overview of recent research conducted from 1997 to 2003 that explored how people search the Web. The article reports selected findings from many research studies conducted by the co-authors of th...
Detecting Fake Accounts on Twitter Social Network Using Multi-Objective Hybrid Feature Selection Approach
The frequency of fake accounts or social bots is considered as one of serious challenges of online social networks, which are controlled by automatic operators and often used for malicious purposes. The researchers have...
Correlation between references and citations
There are various opinions on the possible correlation between references and citations. The main question is that is there a positive correlation between the number of times a paper is cited (citations received) and t...