MODELING MORPHOLOGICAL ANALYSIS BASED ON WORD-ENDING FOR UZBEK LANGUAGE

Journal Title: International scientific journal Science and Innovation - Year 2023, Vol 2, Issue 11

Abstract

Uzbek, an agglutinative language, forms words by combining affixes with roots, utilizing inflectional endings for various morphological features. This property makes a large number of combinations of word ending, and greatly increases the word-vocabulary size, and data sparseness problems for statistical models. This paper discusses a morphological analyzing model which includes stemming, lemmatizing and extraction of morphological information considering morpho-phonetic exceptions. A main point of the model involves developing a complete set of word-ending with assign morphological information, and additional datasets for morphological analysis. The proposed model was evaluated using a curated test set comprising 5.3K words. It achieved a word-level accuracy over 91%, as determined through manual verification of stem, lemma, and morphological feature corrections conducted by linguistic experts. The created tool based on the proposed methodology is available as an open-source Python package, as well as a web-based application including a public API

Authors and Affiliations

Ulugbek Salaev

Keywords

Related Articles

THE IMPORTANCE OF THE CHILD’S GENDER IN THE PERSON-ORIENTED APPROACH

The purpose of this article is to shed light on the many facets that define young children attending preschools. In particular, the article focuses on the gender differences of children as the main topic. In addition, th...

IMPОRTANCE ОF VITAMIN D LEVELS IN PATIENTS WITH CHRОNIC KIDNEY DISEASE STAGE C2 AND C3

Purpоse оf the study: tо study the impоrtance оf vitamin D levels in patients with stages 2–3 CKD, as well as the relatiоnship between vitamin D levels and bоne-mineral metabоlism markers. 105 patients with chrоnic kidne...

THE POLITICAL IMPACT OF WATER SECURITY AND CLIMATE CHANGE IN UZBEKISTAN

Water security and climate change are two interrelated challenges that affect the political stability and development of Uzbekistan. This paper examines how water scarcity, transboundary water management, and climate imp...

MODERN PROBLEMS OF PHYSICAL EDUCATION AT THE UNIVERSITY

The article describes the ways of mastering physical training by the younger generation, their influence both on life in general and separately on physical and mental components. Methods and recommendations for improving...

ART AS AN OBJECT OF SOCIO-PHILOSOPHICAL DISCOURSE

This article emphasizes, unlike all other forms of activity, art is a reflection and expression of the inner world and essence of a person, taken in their integrity. In art, the creator creates a special world, but not i...

Download PDF file
  • EP ID EP725418
  • DOI 10.5281/zenodo.10155225
  • Views 19
  • Downloads 0

How To Cite

Ulugbek Salaev (2023). MODELING MORPHOLOGICAL ANALYSIS BASED ON WORD-ENDING FOR UZBEK LANGUAGE. International scientific journal Science and Innovation, 2(11), -. https://europub.co.uk/articles/-A-725418