PROTEIN IDENTIFICATION USING SEQUENCE DATABASES

Journal Title: Scientific Journal of Astana IT University - Year 2020, Vol 4, Issue 4

Abstract

The bottom-up proteomics approach (also known as the shotgun approach), based on the digestion of proteins in peptides and their sequencing using tandem mass spectrometry (MS/MS), has become widespread. The identification of peptides from the obtained MS/MS data is most often done using available sequence databases. This paper presents a detailed overview of the peptide identification workflow and a description of the main protein bioinformatics databases. Choosing the correct search parameters and the sequence database is essential to the success of this method, and we pay special attention to the practical aspects of searching for efficient analysis of MS/MS spectra. We also consider possible reasons why database search tools cannot find the correct sequence for some MS/MS spectra and highlight the misidentification issues that can significantly reduce the value of published data. To help assess the assignment of peptides to MS/MS spectra, we will look at the scoring algorithms that are used in the most popular database search tools. We also analyze statistical methods and computational tools for validating peptide compliance with MS/MS data. The final part describes the process of determining the identity of protein samples from a list of peptide identifications and discusses the limitations of bottom-up proteomics.

Authors and Affiliations

Ye. Golenko, A. Ismailova, Ye. Rais

Keywords

Related Articles

INFORMATION TECHNOLOGIES AND THE FUTURE OF EDUCATION IN THE REPUBLIC OF KAZAKHSTAN

The aim of the article is a comprehensive approach to addressing the digitalization of education in the Republic of Kazakhstan based on identifying problems in this area, forming priority tasks and possible ways to sol...

MODELLING AGILE-TRANSFORMATION ORGANIZATIONAL DEVELOPMENT PROJECT PORTFOLIO

Agile transformation is a necessary process for companies in various fields of activity to ensure their competitiveness in modern business conditions when the uniformity of production processes and the growth of the le...

TRACKING OF NON-STANDARD TRAJECTORIES USING MPC METHODS WITH CONSTRAINTS HANDLING ALGORITHM

In recent decades, a Model-Based Predictive Control (MPC) has revealed its dominance over other control methods such as having an ability of constraints handling and input optimization in terms of the value function. H...

MATHEMATICAL SUPPORT OF THE INFORMATION SYSTEM FOR DECISION SUPPORT IN THE SPHERE OF HEALTHCARE

The relevance of the topic is that currently modern medical information systems are aimed at providing management, economic and in some cases medical practice in the collection and processing of anamnestic data, includ...

MODEL OF POPULATION MIGRATION IN AGGLOMERATIONS WITHIN THE FRAMEWORK OF THE “SMART CITY” PARADIGM

The urban agglomeration is a multitude of united settlements according to economic, labor, cultural, household, and recreational characteristics. Agglomeration is perceived as an integral territorial union; therefore,...

Download PDF file
  • EP ID EP711935
  • DOI 10.37943/AITU.2020.91.98.002
  • Views 82
  • Downloads 0

How To Cite

Ye. Golenko, A. Ismailova, Ye. Rais (2020). PROTEIN IDENTIFICATION USING SEQUENCE DATABASES. Scientific Journal of Astana IT University, 4(4), -. https://europub.co.uk/articles/-A-711935