Identification of User Aware Rare Sequential Pattern in Document Stream - An Overview

Abstract

Documents created and distributed on the Internet change constantly and take various forms. Most existing work is devoted to topic modeling and the evolution of individual topics, while the sequential relations between topics in successive documents published by a specific user are ignored. To characterize and detect personalized and abnormal behaviours of Internet users, we propose Sequential Topic Patterns (STPs) and formulate the problem of mining User-aware Rare Sequential Topic Patterns (URSTPs) in document streams on the Internet. These patterns are rare on the whole but relatively frequent for specific users, so they can be applied in many real-life scenarios, such as real-time monitoring of abnormal user behaviours. We present a solution to this new mining problem in three phases: pre-processing to extract probabilistic topics and identify sessions for different users; generating all STP candidates with expected support values for each user by pattern growth; and selecting URSTPs through user-aware rarity analysis of the derived STPs. Experiments on both real Twitter data and synthetic datasets show that our approach can discover special users and interpretable URSTPs effectively and efficiently, and that these patterns significantly reflect users' characteristics.

Rajeshri R. Shelke, "Identification of User Aware Rare Sequential Pattern in Document Stream- An Overview", published in International Journal of Trend in Scientific Research and Development (IJTSRD), ISSN: 2456-6470, Volume 3, Issue 4, June 2019. URL: https://www.ijtsrd.com/papers/ijtsrd24008.pdf Paper URL: https://www.ijtsrd.com/computer-science/data-miining/24008/identification-of-user-aware-rare-sequential-pattern-in-document-stream--an-overview/rajeshri-r-shelke
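The three phases named in the abstract can be illustrated with a small Python sketch. It is a simplified illustration and not the authors' implementation, and it rests on several assumptions that are not in the source: each document has already been reduced to its single most probable topic (so support is a plain fraction of sessions and not an expected value over probabilistic topics), candidates are found by brute-force enumeration in place of pattern growth, and rarity is taken as a user's support minus the average support over all users. The function names and the thresholds `max_global` and `min_rarity` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def session_patterns(session, max_len=2):
    """Return every ordered topic subsequence of length 1..max_len in one session."""
    found = set()
    for n in range(1, max_len + 1):
        for idx in combinations(range(len(session)), n):
            found.add(tuple(session[i] for i in idx))
    return found


def user_supports(sessions_by_user, max_len=2):
    """Per user, map each pattern to the fraction of that user's sessions containing it."""
    supports = {}
    for user, sessions in sessions_by_user.items():
        if not sessions:
            continue  # a user without sessions contributes no patterns
        counts = defaultdict(int)
        for session in sessions:
            for pattern in session_patterns(session, max_len):
                counts[pattern] += 1
        supports[user] = {p: c / len(sessions) for p, c in counts.items()}
    return supports


def find_urstps(sessions_by_user, max_global=0.3, min_rarity=0.5, max_len=2):
    """Select (user, pattern, rarity) triples that are rare overall but frequent for the user.

    Assumed definition: rarity = user's support - mean support over all users.
    """
    supports = user_supports(sessions_by_user, max_len)
    n_users = len(supports)
    global_support = defaultdict(float)
    for per_user in supports.values():
        for pattern, s in per_user.items():
            global_support[pattern] += s / n_users

    selected = []
    for user, per_user in supports.items():
        for pattern, s in per_user.items():
            rarity = s - global_support[pattern]
            if global_support[pattern] <= max_global and rarity >= min_rarity:
                selected.append((user, pattern, round(rarity, 3)))
    return sorted(selected)


if __name__ == "__main__":
    # Toy sessions: each session is a list of topic labels in publication order.
    toy = {
        "alice": [["sports", "news"], ["sports", "news"]],
        "bob": [["news", "sports"], ["news"]],
        "dave": [["news"], ["sports"]],
        "erin": [["news"], ["news"]],
    }
    for user, pattern, rarity in find_urstps(toy):
        print(user, pattern, rarity)
```

In this toy data the individual topics are common across users, while the ordered pair of "sports" followed by "news" is repeated only by one user, which is the kind of pattern the rarity analysis is meant to surface. Real use would replace the enumeration with a pattern-growth miner and the plain counts with expected supports over topic probabilities, as the abstract describes.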

  • EP ID EP595740
  • DOI 10.31142/ijtsrd24008

How To Cite

Rajeshri R. Shelke (2019). Identification of User Aware Rare Sequential Pattern in Document Stream - An Overview. International Journal of Trend in Scientific Research and Development, 3(4), 1340-1342. https://europub.co.uk/articles/-A-595740