Learning Approaches toward Title Word Selection on Indic Script

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 3

Abstract

Title is a compact representation of a document which distill the important information from the document. In this paper we studied the selection words as title words by using different learning approaches namely nearest neighbor approach (NN), Naive Bayes approach with limited-vocabulary (NBL), Naive Bayes approach with full vocabulary (NBF) and by using a term weighing approach (tf-idf). We compare the performance of these approaches by using F1 metric. We compare the F1 metric results both on English Script and Indic Script ' Telugu'. We concluded the influence of linguistic complexity in the process of Title word selection.

Authors and Affiliations

P. Vijayapal Reddy , A. Govardhan

Keywords

Related Articles

Comparison of performance analysis of 802.11a, 802.11b and 802.11g standard

Wireless local area networks (WLANs) based on the IEEE 802.11 standards has been successfully deployed in a variety of home, office and corporate environments and available in various flavors like 802.11a/b/g. In this pa...

A Mid – Point based k-mean Clustering Algorithm for Data mining

In k-means clustering algorithm, the number of centroids is equal to the number of the clusters in which data has to be partitioned which in turn is taken as an input parameter. The initial centroids in original k-means...

Two-Level Dynamic Load Balancing Algorithm Using Load Thresholds and Pairwise Immigration

This paper proposes a two-level dynamic load balancing scheme for grid and distributed systems. We focus on reducing average task response time. In order to achieve the goals, efficient dynamic load balancing is required...

Comparing Neural Network Approach with N-Gram Approach for Text Categorization

This paper compares Neural network Approach with N-gram approach, for text categorization, and demonstrates that Neural Network approach is similar to the N-gram approach but with much less judging time. Both methods dem...

UEP based on Proximity Pilot Subcarriers with QAM in OFDM

A novel UEP (Unequal Error Protection) method is proposed that utilizes the subcarrier positions relative to pilot subcarriers in an OFDM multicarrier frame along with QAM (Quadrature Amplitude Modulation) schemes. With...

Download PDF file
  • EP ID EP91913
  • DOI -
  • Views 135
  • Downloads 0

How To Cite

P. Vijayapal Reddy, A. Govardhan (2011). Learning Approaches toward Title Word Selection on Indic Script. International Journal on Computer Science and Engineering, 3(3), 1063-1067. https://europub.co.uk/articles/-A-91913