MODELING MORPHOLOGICAL ANALYSIS BASED ON WORD-ENDING FOR UZBEK LANGUAGE

Journal Title: International scientific journal Science and Innovation - Year 2023, Vol 2, Issue 11

Abstract

Uzbek, an agglutinative language, forms words by combining affixes with roots, utilizing inflectional endings for various morphological features. This property makes a large number of combinations of word ending, and greatly increases the word-vocabulary size, and data sparseness problems for statistical models. This paper discusses a morphological analyzing model which includes stemming, lemmatizing and extraction of morphological information considering morpho-phonetic exceptions. A main point of the model involves developing a complete set of word-ending with assign morphological information, and additional datasets for morphological analysis. The proposed model was evaluated using a curated test set comprising 5.3K words. It achieved a word-level accuracy over 91%, as determined through manual verification of stem, lemma, and morphological feature corrections conducted by linguistic experts. The created tool based on the proposed methodology is available as an open-source Python package, as well as a web-based application including a public API

Authors and Affiliations

Ulugbek Salaev

Keywords

Related Articles

THE INFLUENCE OF HISTORICAL AND CULTURAL FACTORS IN THE TRANSITION TO UNIVERSITY AUTONOMY

The article examines the concept of university autonomy and its development is developed Western and Asian countries, including the prerequisites for the development of autonomy, as well as the components of autonomy, wh...

THE COURSE OF PREGNANCY AND CHILDBIRTH IN VIRAL HEPATITIS B

Viral hepatitis B is a significant global health concern, affecting millions of people worldwide. In pregnant women, hepatitis B virus (HBV) infection can have important implications for both maternal and fetal health. T...

BASED ON THE EDUCATIONAL PRACTICE 4+2 PROGRAM IN THE TRAINING OF FUTURE PRIMARY EDUCATION TEACHERS

This article explores the educational practice involved in the training of future primary education teachers, focusing on the 4+2 program. The 4+2 program is a specific teacher training initiative designed to equip aspir...

PREVALENCE OF MYOPIA DISEASE IN ADOLESCENTS AND MEASURES FOR ITS PREVENTION

In this article, the occurrence of myopia (nearsightedness) in adolescents and the social psychological changes that occur as a result of it. Data of 2018-2022 in the Termiz branch was conducted on the basis of retrospec...

USE OF PHRASEOLOGISMS IN POEMS

This article discusses the artist's ability to provide expressiveness through phraseology in poetic works. Scientific opinions on the subject are proved and analyzed on the example of Farida Afro'z's poetic works.

Download PDF file
  • EP ID EP725418
  • DOI 10.5281/zenodo.10155225
  • Views 46
  • Downloads 0

How To Cite

Ulugbek Salaev (2023). MODELING MORPHOLOGICAL ANALYSIS BASED ON WORD-ENDING FOR UZBEK LANGUAGE. International scientific journal Science and Innovation, 2(11), -. https://europub.co.uk/articles/-A-725418