Bringing Shape to Textual Data – A Feasible Demonstration
Journal Title: Mehran University Research Journal of Engineering and Technology - Year 2019, Vol 38, Issue 4
Abstract
The Internet has revolutionized the communication paradigm. This has led towards immense amount of unstructured data (i.e. textual data), which is a major source to get useful knowledge about people in several application domains. TM (Text Mining) extracts high quality information to discover knowledge by drawing patterns and relationships in textual data. This field has taken great attention of the research community. As a result, several attempts have been made to propose, introduce and refine techniques applied for uncovering knowledge from text data. This study aims at: (1) presenting existing TM techniques in the scientific literature, (2) reporting challenges/issues and gaps that still need attention, and (3) proposing a framework to bring shape to textual data. A prototype has been developed to demonstrate the effectiveness and potential worth of proposed approach to display how unstructured data (i.e. news articles in this study) has been brought to a shape representing interesting knowledge. The proposed framework implements basic NLP (Natural Language Processing) functions in combination of AYLIEN API (Application Programming Interface) functions. The results reveal the fact that how events, celebrities and popular news-items have been covered in the electronic media, and it also represents subjectivity of topical news events. The news coverage trends highlight the significance of daily news events, which may assist in getting insight about the media groups.
Authors and Affiliations
Anoud Shaikh, Naeem Ahmed Mahoto, Mukhtiar Ali Unar
Comparison of Effects of Entropy Coding Schemes Cascaded with Set Partitioning in Hierarchical Trees
WT (Wavelet Transform) is considered as landmark for image compression because it represents a signal in terms of functions which are localized both in frequency and time domain. Wavelet sub-band coding exploits the self...
Comparative Study of White Layer Characteristics for Static and Rotating Workpiece during Electric Discharge Machining
EDMed (Electric Discharge Machined) surfaces are unique in their appearance and metallurgical characteristics, which depend on different parameter such as electric parameters, flushing method, and dielectric type. Conven...
RICCI and Matter Collineations of SOM-ROY Chaudhary Symmetric Space Time
This paper is devoted to explore the RICCI and MCs (Matter Collineations of the Som-Ray Chaudhary spacetime. The spacetime under consideration is one of the spatially homogeneous and rotating spacetimes. Collineations ar...
A Novel Approach for Blind Estimation of Reverberation Time using Rayleigh Distribution Model
In this paper a blind estimation approach is proposed which directly utilizes the reverberant signal for estimating the RT (Reverberation Time).For estimation a very well-known method is used; MLE (Maximum Likelihood Est...
Integrated GIS-Based Site Selection of Hillside Development for Future Growth of Urban Areas
Urbanization is a challenging issue for developing countries, like Malaysia. Penang Island is one of the states of Malaysia selected as a study area where limited flat land exists. As a result, this would create urban en...