TOPIC CLASSIFICATION OF UZBEK TEXTS

Journal Title: International scientific journal Science and Innovation - Year 2024, Vol 3, Issue 9

Abstract

Topic classification is an important task in natural language processing whose main goal is to assign text data to predefined categories. In this study, we analyze dataset creation and methods for evaluating multi-tag news texts as part of topic classification. First, we present a corpus compiled for the classification of texts in the Uzbek language. The corpus was collected from 5 different news and press websites and contains 4 categories of news, press, and legal texts. We also trained a rule-based machine learning model and neural network models for topic classification on this newly created corpus. Experiments show that models based on recurrent and convolutional neural networks outperform the rule-based models.
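To make the idea of a rule-based baseline concrete, here is a minimal sketch of a keyword-matching topic classifier. It is an illustration only and not the authors' model: the abstract does not list the 4 categories or the rules, so the topic names, the Uzbek keywords, and the functions `tokenize` and `classify` are all assumptions. Prefix matching is used as a crude stand-in for stemming, since Uzbek words take suffixes.

```python
from collections import Counter

# Hypothetical topics and keywords, chosen for illustration only;
# the abstract does not name the corpus categories or the rules.
TOPIC_KEYWORDS = {
    "sport": ("futbol", "sport", "chempionat"),
    "law": ("qonun", "sud", "kodeks"),
    "economy": ("bank", "iqtisodiyot", "soliq"),
}

PUNCTUATION = ".,!?;:\"()"


def tokenize(text):
    """Lowercase the text, split on whitespace, strip edge punctuation."""
    tokens = (raw.strip(PUNCTUATION) for raw in text.lower().split())
    return [tok for tok in tokens if tok]


def classify(text, default="other"):
    """Return the topic whose keywords match the most tokens.

    A token matches a keyword when it starts with that keyword, so
    suffixed forms such as "sudda" still count toward "sud".
    """
    counts = Counter()
    for token in tokenize(text):
        for topic, keywords in TOPIC_KEYWORDS.items():
            if token.startswith(keywords):  # str.startswith accepts a tuple
                counts[topic] += 1
    if not counts:
        return default
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    print(classify("Futbol chempionati boshlandi"))
    print(classify("Yangi qonun sudda muhokama qilindi"))
```

A fixed keyword list of this kind cannot generalize beyond the words it contains, which is one plausible reason learned models such as the recurrent and convolutional networks mentioned in the abstract can perform better.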

Authors and Affiliations

Salaev U., Matlatipov S. G.

Download PDF file
  • EP ID EP747994
  • DOI 10.5281/zenodo.13882462

How To Cite

Salaev U., Matlatipov S. G. (2024). TOPIC CLASSIFICATION OF UZBEK TEXTS. International scientific journal Science and Innovation, 3(9), -. https://europub.co.uk/articles/-A-747994