COMPARATIVE ANALYSIS OF RELATED SEQUENCES AND THEIR INCREMENTS ON THE BASIS OF DISCRIMINANT ANALYSIS

Abstract

This article studies the relationship between the lengths of orthologous proteins in four organisms, one of which is taken as the base organism (more than 1200 proteins in total). Methods of multivariate statistical analysis are applied to pairs, triples, and quadruples (rows) of orthologous protein lengths; the number of such rows ranges from 200 to 400. Analysis of pairwise correlations, orthogonal transformation, and cluster analysis distinguished two homogeneous clusters of length quadruples. In parallel, we studied the increments of each orthologous protein's length relative to the base organism. We showed that the rows of lengths form a heterogeneous sample, whereas the increments form a homogeneous one. The next task was to extend the clusters with rows that have incomplete data. Cluster analysis proved inapplicable to this task, so we used discriminant analysis, with the clustering of the complete rows serving as the training sample. All incomplete rows were assigned to clusters (100 percent separation), and the dependence of the lengths in each cluster on the base lengths was then described by regression equations, whose adequacy was tested. The statistical analysis led to the following conclusions. For the set of orthologous length series, a generalizing factor was obtained, which we call the size of an orthologous object composed of four orthologous protein lengths. The sizes obtained for this task differ in their group means and form two separate ranges of values, one for each of the groups identified by the other methods. For the series of increments of orthologous protein lengths in the quadruples, all methods indicated that the set is homogeneous. The lengths of orthologous proteins were also shown to have significant autocorrelation, as is the case for rows associated with the same base series.
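The step of assigning incomplete rows to clusters learned from complete rows can be illustrated with a minimal sketch. It is not the author's procedure: it assumes a simplified diagonal linear discriminant (per-class means, pooled per-feature variances, equal priors) that scores an incomplete row on its observed lengths only. All protein lengths, cluster labels, and function names below are hypothetical and chosen purely for illustration.

```python
from statistics import mean

def increments(row):
    """Increments of each ortholog length relative to the base organism (first entry)."""
    base = row[0]
    return [None if x is None else x - base for x in row[1:]]

def fit_diagonal_lda(rows, labels):
    """Estimate per-class feature means and pooled per-feature variances
    from complete rows (simplifying assumption: features are uncorrelated)."""
    classes = sorted(set(labels))
    n_features = len(rows[0])
    means = {
        c: [mean(r[j] for r, l in zip(rows, labels) if l == c)
            for j in range(n_features)]
        for c in classes
    }
    pooled = []
    for j in range(n_features):
        ss = sum((r[j] - means[l][j]) ** 2 for r, l in zip(rows, labels))
        pooled.append(ss / (len(rows) - len(classes)))
    return means, pooled

def classify(row, means, pooled):
    """Assign a row to the class with the smallest variance-scaled squared
    distance, computed over the observed (non-None) features only."""
    def dist(c):
        return sum((x - means[c][j]) ** 2 / pooled[j]
                   for j, x in enumerate(row) if x is not None)
    return min(means, key=dist)

# Hypothetical complete rows: (base, organism 2, organism 3, organism 4).
complete = [
    (120, 118, 125, 130), (150, 149, 160, 155), (200, 190, 210, 205),
    (600, 640, 620, 610), (800, 790, 850, 820), (950, 900, 980, 990),
]
# Hypothetical cluster labels, as if produced by a prior cluster analysis.
labels = [0, 0, 0, 1, 1, 1]

means, pooled = fit_diagonal_lda(complete, labels)

# Hypothetical incomplete rows; None marks a missing ortholog length.
incomplete = [(140, None, 150, None), (870, 860, None, None)]
assigned = [classify(r, means, pooled) for r in incomplete]
print(assigned)
print(increments(complete[0]))
```

A full discriminant analysis would use the pooled covariance matrix rather than only its diagonal; the diagonal form is used here only to keep the sketch dependency-free.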

Authors and Affiliations

Svetlana Istomina

  • EP ID EP521850
  • DOI 10.25559/SITITO.14.201803.672-678

How To Cite

Svetlana Istomina (2018). COMPARATIVE ANALYSIS OF RELATED SEQUENCES AND THEIR INCREMENTS ON THE BASIS OF DISCRIMINANT ANALYSIS. Современные информационные технологии и ИТ-образование, 14(3), 672-678. https://europub.co.uk/articles/-A-521850