An Enhanced Malay Named Entity Recognition using Combination Approach for Crime Textual Data Analysis
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 9
Abstract
Named Entity Recognition (NER) is one of the tasks in information extraction. NER is used to extract and classify words or entities in text data that belong to the proper noun category, such as a person's name, location, organization, date and others. Today, social media and online sources such as web pages, blogs, Facebook, Twitter, Instagram and online newspapers are among the major contributors to the generation of information. This paper presents an enhanced Malay Named Entity Recognition model that combines fuzzy c-means with the K-Nearest Neighbours algorithm for crime analysis. The results showed that this combined method could improve the accuracy of entity recognition on crime data in Malay. The model is expected to provide a better method for recognizing named entities in text analysis, particularly in Malay.
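The abstract does not say how fuzzy c-means and K-Nearest Neighbours are combined. The sketch below shows one plausible arrangement, offered only as an assumption and not as the authors' method: fuzzy c-means membership degrees are appended to each token's feature vector, and KNN then votes on the entity label. The feature values, labels and function names (`fuzzy_c_means`, `memberships`, `augment`, `knn_predict`) are all hypothetical.

```python
import math
import random
from collections import Counter


def memberships(p, centers, m=2.0):
    """Fuzzy c-means membership of point p in each cluster (values sum to 1)."""
    # Clamp distances so a point sitting exactly on a center does not divide by zero.
    d = [max(math.dist(p, ctr), 1e-12) for ctr in centers]
    e = 2.0 / (m - 1.0)
    return [1.0 / sum((d[j] / d[k]) ** e for k in range(len(d)))
            for j in range(len(d))]


def fuzzy_c_means(points, c=2, m=2.0, iters=50, seed=0):
    """Return c cluster centers found by the standard alternating updates."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, c)]
    dim = len(points[0])
    for _ in range(iters):
        u = [memberships(p, centers, m) for p in points]
        for j in range(c):
            # Each center is the mean of all points, weighted by membership^m.
            w = [u[i][j] ** m for i in range(len(points))]
            total = sum(w)
            centers[j] = [sum(w[i] * points[i][k] for i in range(len(points))) / total
                          for k in range(dim)]
    return centers


def augment(points, centers, m=2.0):
    """Assumed combination step: append membership degrees to the raw features."""
    return [list(p) + memberships(p, centers, m) for p in points]


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k nearest training vectors (Euclidean distance)."""
    nearest = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], x))[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]


# Hypothetical 2-D token features; real NER features would be much richer.
train_X = [[0.0, 0.1], [0.1, 0.0], [0.2, 0.1], [1.0, 0.9], [0.9, 1.0], [1.0, 1.1]]
train_y = ["OTHER", "OTHER", "OTHER", "PERSON", "PERSON", "PERSON"]

centers = fuzzy_c_means(train_X, c=2)
aug_X = augment(train_X, centers)


def classify(token_features, k=3):
    """Label one unseen token using the augmented feature space."""
    return knn_predict(aug_X, train_y, augment([token_features], centers)[0], k)
```

Calling `classify([0.95, 0.95])` labels an unseen token by its nearest labelled neighbours in the augmented space. Other combinations are equally possible, for example using the clusters to pre-filter candidates before KNN; the abstract does not distinguish between them.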
Authors and Affiliations
Siti Azirah Asmai, Muhammad Sharilazlan Salleh, Halizah Basiron, Sabrina Ahmad
A Novel Design of Miniaturaized Patch Antenna Using Different Substrates for S-Band and C-Band Applications
In advanced communication technology, patch antennas are widely exploited due to their inexpensive and lightweight structure. This paper presents a novel design of miniaturized multiband patch antenna using different sub...
Crowdsensing: Socio-Technical Challenges and Opportunities
With the advancement in mobile technology, the sensing and computational capability of mobile devices is increasing. The sensors in mobile devices are being used in a variety of ways to sense and actuate. Mobile crowdsen...
RASP-FIT: A Fast and Automatic Fault Injection Tool for Code-Modification of FPGA Designs
Fault Injection (FI) is the most popular technique used in the evaluation of fault effects and the dependability of a design. Fault Simulation/Emulation (S/E) is involved in several applications such as test data generat...
A multi-scale method for automatically extracting the dominant features of cervical vertebrae in CT images
Localization of the dominant points of cervical spines in medical images is important for improving the medical automation in clinical head and neck applications. In order to automatically identify the dominant points of...
Assessment of High and Low Rate Protocol-based Attacks on Ethernet Networks
The Internet and Web have significantly transformed the world’s communication system. The capability of the Internet to instantly access information at anytime from anywhere has brought benefit for a wide variety of area...