Bringing Shape to Textual Data – A Feasible Demonstration
Journal Title: Mehran University Research Journal of Engineering and Technology - Year 2019, Vol 38, Issue 4
Abstract
The Internet has revolutionized the communication paradigm. This has led to an immense amount of unstructured data (i.e. textual data), which is a major source of useful knowledge about people in several application domains. TM (Text Mining) extracts high-quality information and discovers knowledge by identifying patterns and relationships in textual data. This field has attracted considerable attention from the research community. As a result, several attempts have been made to propose, introduce and refine techniques for uncovering knowledge from text data. This study aims at: (1) presenting existing TM techniques in the scientific literature, (2) reporting challenges, issues and gaps that still need attention, and (3) proposing a framework to bring shape to textual data. A prototype has been developed to demonstrate the effectiveness and potential worth of the proposed approach by showing how unstructured data (i.e. news articles in this study) can be brought into a shape that represents interesting knowledge. The proposed framework implements basic NLP (Natural Language Processing) functions in combination with AYLIEN API (Application Programming Interface) functions. The results reveal how events, celebrities and popular news items have been covered in the electronic media, and they also represent the subjectivity of topical news events. The news coverage trends highlight the significance of daily news events, which may assist in gaining insight into the media groups.
Authors and Affiliations
Anoud Shaikh, Naeem Ahmed Mahoto, Mukhtiar Ali Unar