Efficient Feature Selection for Product Labeling over Unstructured Data

Abstract

The paper introduces a novel feature selection algorithm for labeling identical products collected from online web resources. Product labeling is important for clustering similar or same products. Products blindly crawled over the web sources, such as online sellers, have unstructured data due to having features expressed in different representations and formats. Such data result in feature vectors whose representation is unknown and non-uniform in length. Thus, product labeling, as a challenging problem, needs efficient selection of features that best describe the products. In this paper, an efficient feature selection algorithm is proposed for product labeling problem. Hierarchical clustering is used with the state of the art similarity metrics to assess the performance of the proposed algorithm. The results show that the proposed algorithm increases the performance of product labeling significantly. Furthermore, the method can be applied to any clustering algorithm that works on unstructured data.

Authors and Affiliations

Zeki YETGIN, Abdullah ELEWI, Furkan GÖZÜKARA

Keywords

Related Articles

 SVD Based Image Processing Applications: State of The Art, Contributions and Research Challenges

 Singular Value Decomposition (SVD) has recently emerged as a new paradigm for processing different types of images. SVD is an attractive algebraic transform for image processing applications. The paper proposes an...

JsonToOnto: Building Owl2 Ontologies from Json Documents

The amount of data circulating through the web has grown rapidly recently. This data is available as semi-structured or unstructured documents, especially JSON documents. However, these documents lack semantic descriptio...

An Enhanced Framework with Advanced Study to Incorporate the Searching of E-Commerce Products Using Modernization of Database Queries

This study aims to inspect and evaluate the integration of database queries and their use in e-commerce product searches. It has been observed that e-commerce is one of the most prominent trends, which have been emerged...

  OFW-ITS-LSSVM: Weighted Classification by LS-SVM for Diabetes diagnosis

 In accordance to the fast developing technology now a days, every field is gaining it’s benefit through machines other than human involvement. Many changes are being made much advancement is possible by this develo...

Online Incremental Rough Set Learning in Intelligent Traffic System

In the last few years, vehicle to vehicle communication (V2V) technology has been developed to improve the efficiency of traffic communication and road accident avoidance. In this paper, we have proposed a model for onli...

Download PDF file
  • EP ID EP260430
  • DOI 10.14569/IJACSA.2017.080750
  • Views 69
  • Downloads 0

How To Cite

Zeki YETGIN, Abdullah ELEWI, Furkan GÖZÜKARA (2017). Efficient Feature Selection for Product Labeling over Unstructured Data. International Journal of Advanced Computer Science & Applications, 8(7), 376-381. https://europub.co.uk/articles/-A-260430