Using Word Embeddings for Ontology Enrichment

Abstract

Word embeddings, distributed word representations in a reduced linear space, show a lot of promise for accomplishing Natural Language Processing (NLP) tasks in an unsupervised manner. In this study, we investigate if the success of word2vec, a Neural Networks based word embeddings algorithm, can be replicated in an aggluginative language like Turkish. Turkish is more challenging than languages like English for complex NLP tasks because of her rich morphology. We picked ontology enrichment, again a relatively harder NLP task, as our test application. Firstly, we show how ontological relations can be extracted automaticaly from Turkish Wikipedia to construct a gold standard. Then by running experiments we show that the word vector representations produced by word2vec are useful to detect ontological relations encoded in Wikipedia. We propose a simple but yet effective weakly supervised ontology enrichment algorithm where for a given word a few know ontologically related concepts coupled with similarity scores computed via word2vec models can result in discovery of other related concepts. We argue how our algorithm can be improved and augmented to make it a viable component of an ontoloy learning and population framework.

Authors and Affiliations

İzzet Pembeci*| Muğla Sıtkı Koçman University. Department of Computer Engineering

Keywords

Related Articles

The Control of A Non-Linear Chaotic System Using Genetic and Particle Swarm Based On Optimization Algorithms

In this study, the control of a non-linear system was realized by using a linear system control strategy. According to the strategy and by using the controller coefficients, system outputs were controlled for all referen...

The Classification of Eye State by Using kNN and MLP Classification Models According to the EEG Signals

What is widely used for classification of eye state to detect human’s cognition state is electroencephalography (EEG). In this study, the usage of EEG signals for online eye state detection method was proposed. In this s...

Matlab’s GA and Optimization Toolbox: A Fourbar Mechanism Application

This study presents an optimization approach for synthesis of planar mechanisms. A four bar mechanism is chosen for an application example. This mechanism is studied with the constraints assigned. Genetic Algorithm (GA)...

Design and Implementation of High Speed Artificial Neural Network Based Sprott 94 S System on FPGA

FPGA-based embedding system designs have been preferred for industrial applications and prototyping because of the advantages of parallel processing, reconfigurability and low cost. Due to having characteristic structure...

The Classification of Diseased Trees by Using kNN and MLP Classification Models According to the Satellite Imagery

In this study, the Japanese Oak and Pine Wilt in forested areas of Japan was classified into two group as diseased trees and all other land cover area according to the 6 attributes in the spectral data set of the forest....

Download PDF file
  • EP ID EP799
  • DOI 10.18201/ijisae.58806
  • Views 432
  • Downloads 25

How To Cite

İzzet Pembeci* (2016). Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering, 4(3), 49-56. https://europub.co.uk/articles/-A-799