Designing Novel Queries for Analysing NoSQL Data of Gene-Disease Associations

Abstract

Precisely identifying gene-associated diseases remains an open area of research for biological scientists, who need it to establish the clinical and psychological symptoms of human diseases and their treatment. Now that the whole human genome has been defined, the next step is to find, within such a complex data set, all the factors that cause gene mutations and hence lead to inherited and/or non-inherited diseases. Our research implementation combines the important factors from different biomolecular data sources into one integrated data set and defines new relationships among these factors for gene-associated diseases that were not present in existing platforms. This paper presents a novel query model for NoSQL data storage that can help researchers visualise relationships among gene factors and two new factors, termed “causative factors” and “drugs/treatment”, for associated diseases. Since no existing data source applies graphical querying to gene-associated diseases, the proposed Cypher query model can help researchers analyse the data set in depth and obtain results efficiently. The queries are written in Cypher against a graph data model implemented in Neo4j, which is a NoSQL (Not Only SQL) database. Using a NoSQL database and a NoSQL query language overcomes certain limitations of relational databases that the existing data platforms had to cope with. The paper gives a suitable new data storage format and effective search queries for large, complex, semi-structured and multi-dimensional gene-associated disease data, so that new relationships among factors can be defined efficiently and new horizons of research opened.
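
The kind of graph traversal the abstract describes can be illustrated with a minimal in-memory sketch. It is not the authors' model or their queries: the node labels, the relationship names (ASSOCIATED_WITH, CAUSED_BY, TREATED_WITH) and the placeholder entries (GENE_A, DISEASE_X, FACTOR_1, DRUG_1) are all assumptions made up for illustration, and the Cypher shown in the comment is only one hypothetical way such a query might be phrased.

```python
from collections import defaultdict


class Graph:
    """A tiny labelled directed graph; a stand-in for a graph database."""

    def __init__(self):
        self.labels = {}                 # node name -> label
        self.edges = defaultdict(list)   # (source, rel_type) -> [targets]

    def add_node(self, name, label):
        self.labels[name] = label

    def add_edge(self, source, rel_type, target):
        self.edges[(source, rel_type)].append(target)

    def neighbours(self, source, rel_type):
        # .get avoids creating empty entries for unknown keys.
        return list(self.edges.get((source, rel_type), []))


def disease_profile(graph, gene):
    """Follow gene -> disease, then disease -> causative factors and treatments.

    A hypothetical Cypher phrasing of the same traversal (labels assumed):
      MATCH (g:Gene {name: $gene})-[:ASSOCIATED_WITH]->(d:Disease)
      OPTIONAL MATCH (d)-[:CAUSED_BY]->(f:CausativeFactor)
      OPTIONAL MATCH (d)-[:TREATED_WITH]->(t:Drug)
      RETURN d.name, collect(DISTINCT f.name), collect(DISTINCT t.name)
    """
    profile = {}
    for disease in graph.neighbours(gene, "ASSOCIATED_WITH"):
        profile[disease] = {
            "causative_factors": graph.neighbours(disease, "CAUSED_BY"),
            "treatments": graph.neighbours(disease, "TREATED_WITH"),
        }
    return profile


def build_example_graph():
    """Build a graph from placeholder names only; no real biomedical data."""
    g = Graph()
    g.add_node("GENE_A", "Gene")
    g.add_node("DISEASE_X", "Disease")
    g.add_node("FACTOR_1", "CausativeFactor")
    g.add_node("DRUG_1", "Drug")
    g.add_edge("GENE_A", "ASSOCIATED_WITH", "DISEASE_X")
    g.add_edge("DISEASE_X", "CAUSED_BY", "FACTOR_1")
    g.add_edge("DISEASE_X", "TREATED_WITH", "DRUG_1")
    return g


if __name__ == "__main__":
    print(disease_profile(build_example_graph(), "GENE_A"))
```

Modelling causative factors and treatments as nodes of their own, rather than as columns of a gene table, is what lets a single traversal return them together with the disease; in a relational layout the same question would typically need several joins.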

Authors and Affiliations

Hira Yaseen, Muhammad Atif Sarwar, Javed Ferzund

  • EP ID EP258808
  • DOI 10.14569/IJACSA.2017.080546

How To Cite

Hira Yaseen, Muhammad Atif Sarwar, Javed Ferzund (2017). Designing Novel Queries for Analysing NoSQL Data of Gene-Disease Associations. International Journal of Advanced Computer Science & Applications, 8(5), 370-380. https://europub.co.uk/articles/-A-258808