Efficient Reduction of Overgeneration Errors for Automatic Controlled Indexing with an Application to the Biomedical Domain

Abstract

Studies on MetaMap and MaxMatcher has shown that both concept extraction systems suffer from overgeneration problems. Over-generation occurs when the extraction systems mistakenly select an irrelevant concept. One of the reasons for these errors is that these systems use the words to weight the terms of the concepts. In this paper, an Integer Linear Programming model is used to select the optimal subset of extracted concept mentions covering the largest number of important words in the document to be indexed. Then each concept mentions that this set is mapped to a unique concept in UMLS using an information retrieval model.

Authors and Affiliations

Samassi Adama, Brou Konan Marcellin, GOORE Bi Tra, Prosper Kimou

Keywords

Related Articles

Empirical Study of Segment Particle Swarm Optimization and Particle Swarm Optimization Algorithms

In this paper, the performance of segment particle swarm optimization (Se-PSO) algorithm was compared with that of original particle swarm optimization (PSO) algorithm. Four different benchmark functions of Sphere, Rosen...

A Two-Level Fault-Tolerance Technique for High Performance Computing Applications

Reliability is the biggest concern facing future extreme-scale, high performance computing (HPC) systems. Within the current generation of HPC systems, projections suggest that errors will occur with very high rates in f...

New Method of Faults Diagnostic based on Neuro-Dynamic Sliding Mode for Flat Nonlinear Systems

This paper addresses the problem of simultaneous actuator, process and sensor Fault Detection and Isolation (FDI) for nonlinear system having flatness properties with the presence of disturbances and which are operating...

IoT-Enabled Door Lock System

This paper covers the design of a prototype for IoT and GPS enabled door lock system. The aim of this research is to design a door lock system that does not need manual input from user for convenience purpose while also...

Design of a High Speed Architecture of MQ-Coder for JPEG2000 on FPGA

Digital imaging is omnipresent today. In many areas, digitized images replace their analog ancestors such as photographs or X-rays. The world of multimedia makes extensive use of image transfer and storage. The volume of...

Download PDF file
  • EP ID EP429152
  • DOI 10.14569/IJACSA.2018.091225
  • Views 104
  • Downloads 0

How To Cite

Samassi Adama, Brou Konan Marcellin, GOORE Bi Tra, Prosper Kimou (2018). Efficient Reduction of Overgeneration Errors for Automatic Controlled Indexing with an Application to the Biomedical Domain. International Journal of Advanced Computer Science & Applications, 9(12), 174-178. https://europub.co.uk/articles/-A-429152