A Novel Big Data Storage Model for Protein-Protein Interaction and Gene-Protein Associations

Abstract

NGS (Next Generation Sequencing) technology has resulted in huge amount of proteomics data that exists in the form of interactions (protein-protein, gene-protein, and gene-disease). ETL (Extraction, Transformation, and Loading) techniques are very useful for Databases. Existing Rational Databases are not unified and having SQL (Structured Query Language). Proteomics data requires improvement for Integration of different Data sources. With the usage of NoSQL (not only SQL), improve the efficiency and performance. For this, a novel based unified model has been designed for protein interactions data (P-P, G-G, and G-D) by using Apache HBase to evaluate given the model, different case studies have been used.

Authors and Affiliations

M. Atif Sarwar, Hira Yaseen, Javed Ferzund, Hina Farooq, Azka Mahmood

Keywords

Related Articles

The Dynamics of IT Workaround Practices - A Theoretical Concept and an Empirical Assessment

An interesting phenomenon that has received limited attention in the extant literature is that of IT workaround practices. Based on Ashby's Law of Requisite Variety, workarounds were found to be used to accomplish the ba...

Short Answer Grading Using String Similarity And Corpus-Based Similarity

Most automatic scoring systems use pattern based that requires a lot of hard and tedious work. These systems work in a supervised manner where predefined patterns and scoring rules are generated. This paper presents a di...

Hearing Aid Method by Equalizing Frequency Response of Phoneme Extracted from Human Voice

Hearing aid method by equalizing frequency response of phoneme which is extracted from human voice is proposed. One of the problems of the existing hearing aid is poor customization of the frequency response compensation...

Towards No-Reference of Peak Signal to Noise Ratio

The aim of this work is to define a no-referenced perceptual image quality estimator applying the perceptual concepts of the Chromatic Induction Model The approach consists in comparing the received image, presumably deg...

An approach for Teaching of National Languages and Cultures through ICT in Cameroon

This article describes the input of ICT to the modernization of teaching national languages and cultures in order to promote cultural diversity as well as dissemination of scientific knowledge through national languages....

Download PDF file
  • EP ID EP258801
  • DOI 10.14569/IJACSA.2017.080543
  • Views 98
  • Downloads 0

How To Cite

M. Atif Sarwar, Hira Yaseen, Javed Ferzund, Hina Farooq, Azka Mahmood (2017). A Novel Big Data Storage Model for Protein-Protein Interaction and Gene-Protein Associations. International Journal of Advanced Computer Science & Applications, 8(5), 346-357. https://europub.co.uk/articles/-A-258801