Text Mining: Techniques, Applications and Issues
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2016, Vol 7, Issue 11
Abstract
Rapid progress in digital data acquisition tech-niques have led to huge volume of data. More than 80 percent of today’s data is composed of unstructured or semi-structured data. The discovery of appropriate patterns and trends to analyze the text documents from massive volume of data is a big issue. Text mining is a process of extracting interesting and non-trivial patterns from huge amount of text documents. There exist different techniques and tools to mine the text and discover valuable information for future prediction and decision making process. The selection of right and appropriate text mining technique helps to enhance the speed and decreases the time and effort required to extract valuable information. This paper briefly discuss and analyze the text mining techniques and their applications in diverse fields of life. Moreover, the issues in the field of text mining that affect the accuracy and relevance of results are identified.
Authors and Affiliations
Ramzan Talib, Muhammad Kashif Hanif, Shaeela Ayesha, Fakeeha Fatima
Implementation of Failure Enterprise Systems in Organizational Perspective Framework
Failure percentage of Enterprise Resource Planning (ERP) implementation projects stay high, even following quite a while of endeavours to diminish them. In this paper, the author proposes the exact exploration that plans...
BAAC: Bangor Arabic Annotated Corpus
This paper describes the creation of the new Bangor Arabic Annotated Corpus (BAAC) which is a Modern Standard Arabic (MSA) corpus that comprises 50K words manually annotated by parts-of-speech. For evaluating the quality...
Using the Sub-Game Perfect Nash Equilibrium to Deduce the Effect of Government Subsidy on Consumption Rates and Prices
Governments are interested in inducing positive habits and behaviors in its citizens and discouraging ones that are harmful to the individual or to the society. Taxation and legislation are usually used to discourage neg...
Estimation of Water Quality Parameters Using the Regression Model with Fuzzy K-Means Clustering
The traditional methods in remote sensing used for monitoring and estimating pollutants are generally relied on the spectral response or scattering reflected from water. In this work, a new method has been proposed to fi...
Inter Prediction Complexity Reduction for HEVC based on Residuals Characteristics
High Efficiency Video Coding (HEVC) or H.265 is currently the latest standard in video coding. While this new standard promises improved performance over the previous H.264/AVC standard, the complexity has drastically in...