A Novel Big Data Storage Model for Protein-Protein Interaction and Gene-Protein Associations

Abstract

NGS (Next Generation Sequencing) technology has resulted in huge amount of proteomics data that exists in the form of interactions (protein-protein, gene-protein, and gene-disease). ETL (Extraction, Transformation, and Loading) techniques are very useful for Databases. Existing Rational Databases are not unified and having SQL (Structured Query Language). Proteomics data requires improvement for Integration of different Data sources. With the usage of NoSQL (not only SQL), improve the efficiency and performance. For this, a novel based unified model has been designed for protein interactions data (P-P, G-G, and G-D) by using Apache HBase to evaluate given the model, different case studies have been used.

Authors and Affiliations

M. Atif Sarwar, Hira Yaseen, Javed Ferzund, Hina Farooq, Azka Mahmood

Keywords

Related Articles

Survey of Error Correction Mechanisms for Video Streaming over the Internet

This overview is targeted at determining state-of-the-art on Error control mechanisms for video streaming over the Internet. The aims of error control mechanisms are to provide and protect the data from errors caused by...

Regression-Based Feature Selection on Large Scale Human Activity Recognition

In this paper, we present an approach for regression-based feature selection in human activity recognition. Due to high dimensional features in human activity recognition, the model may have over-fitting and can’t learn...

A Collaborative Process of Decision Making in the Business Context based on Online Questionnaires

This article is a component of a series of articles and scientific researches conducted by the research team which deals with the web 2.0 and its interactions with the different technology areas. During recent years, the...

Interactive Hypermedia Programs and its Impact on the Achievement of University Students Academically Defaulting in Computer Sciences

Traditional teaching practices through lecture series in a classroom have shown to have less universal efficacy in imparting knowledge to every student. Some students encounter problems in this traditional setting, espec...

A Web based Inventory Control System using Cloud Architecture and Barcode Technology for Zambia Air Force

Inventory management of spares is one of the activities Zambia Air Force (ZAF) undertakes to ensure optimal serviceability state of equipment to effectively achieve its roles. This obligation could only be made possible...

Download PDF file
  • EP ID EP258801
  • DOI 10.14569/IJACSA.2017.080543
  • Views 90
  • Downloads 0

How To Cite

M. Atif Sarwar, Hira Yaseen, Javed Ferzund, Hina Farooq, Azka Mahmood (2017). A Novel Big Data Storage Model for Protein-Protein Interaction and Gene-Protein Associations. International Journal of Advanced Computer Science & Applications, 8(5), 346-357. https://europub.co.uk/articles/-A-258801