PROTEIN IDENTIFICATION USING SEQUENCE DATABASES

Journal Title: Scientific Journal of Astana IT University - Year 2020, Vol 4, Issue 4

Abstract

The bottom-up proteomics approach (also known as the shotgun approach), based on the digestion of proteins in peptides and their sequencing using tandem mass spectrometry (MS/MS), has become widespread. The identification of peptides from the obtained MS/MS data is most often done using available sequence databases. This paper presents a detailed overview of the peptide identification workflow and a description of the main protein bioinformatics databases. Choosing the correct search parameters and the sequence database is essential to the success of this method, and we pay special attention to the practical aspects of searching for efficient analysis of MS/MS spectra. We also consider possible reasons why database search tools cannot find the correct sequence for some MS/MS spectra and highlight the misidentification issues that can significantly reduce the value of published data. To help assess the assignment of peptides to MS/MS spectra, we will look at the scoring algorithms that are used in the most popular database search tools. We also analyze statistical methods and computational tools for validating peptide compliance with MS/MS data. The final part describes the process of determining the identity of protein samples from a list of peptide identifications and discusses the limitations of bottom-up proteomics.

Authors and Affiliations

Ye. Golenko, A. Ismailova, Ye. Rais

Keywords

Related Articles

STUDY OF THE CRYPTOGRAPHIC STRENGTH OF THE S-BOX OBTAINED ON THE BASIS OF EXPONENTIATION MODULO

This article presents one of the main transformations of symmetric block ciphers used to protect confidential information, a new method for obtaining a non-linear S block, and an analysis of the results obtained. The S-b...

CONVERGENCE OF PROJECT MANAGERS COMPETENCIES IN HYBRID WORLD

Global trends that occur in various fields of knowledge with a significant acceleration affect the development of information technology and project management competencies, programs, and project portfolios. The paper...

SYSTEMATIC DATA PROCUREMENT IN AN OWL-EMBEDDED INFORMATION AND ANALYTICAL FRAMEWORK FOR THE MONITORING OF WATER RESOURCES IN THE ILE-BALKHASH BASIN

The world is facing an escalating water shortage crisis, with dire consequences for ecosystems, human health, and socio-economic development. This article explores the multifaceted nature of the water shortage problem of...

DEVELOPMENT OF A MARKOV MODEL OF THE INFORMATION ENVIRONMENT AS A COMMUNICATION SYSTEM IN THE SCIENTIFIC SPHERE

The paper presents the theoretical foundations of creating the educational environment of an educational institution using the project approach at the stage of building models and displaying communications in the infor...

DEEP LEARNING-BASED FACE MASK DETECTION USING YOLOV5 MODEL

Based on the background of rapid transmission of novel coronavirus and various pneumonia, wearing masks becomes the best solution to effectively reduce the probability of transmission. For a series of problems arising fr...

Download PDF file
  • EP ID EP711935
  • DOI 10.37943/AITU.2020.91.98.002
  • Views 50
  • Downloads 0

How To Cite

Ye. Golenko, A. Ismailova, Ye. Rais (2020). PROTEIN IDENTIFICATION USING SEQUENCE DATABASES. Scientific Journal of Astana IT University, 4(4), -. https://europub.co.uk/articles/-A-711935