Current Opportunities and Challenges of Next Generation Sequencing (NGS) of DNA; Determining Health and Diseases

Journal Title: Biotechnology Journal International - Year 2016, Vol 13, Issue 4

Abstract

Many publications have demonstrated the huge potential of NGS methods in terms of new species discovery, environment monitoring, ecological studies, etc. [24,35,92,97,103]. Undoubtedly, NGS will become one the major tools for species identification and for routine diagnostic use. While read lengths are still quite short for most existing systems ranging between 50 bp and 800 bp, they are likely to improve soon. This will enable easier, faster, and more reliable contig assembly and subsequent matching against reference databases. When data generation is no longer a bottleneck, the storage, speed of analysis, and interpretation of DNA sequence data are becoming the major challenges. Also, the integration or the use of data originating from diverse datasets and a variety of data providers are serious issues that need to be addressed. Poor sequence record annotations and species name assignments are known problems that should be instantly addressed and would allow the creation of reference databases used for routine diagnostics based on NGS. Samples with huge amounts of short DNA fragments need to be analyzed and compared against reference databases in an efficient and fast way. Although a number of solutions have been proposed by Industry; offering commercial software, there still remain hurdles to take. One of the challenges that we need to address is data upload from client’s computers to central or distributed data storage and analysis services. Another one is the efficient parallelization of analyses using cloud or grid solutions. The reliability and up-time of storage and analyses facilities is another important problem that need to be addressed if one wants to use it for routine diagnostics. Finally, the management, reporting and visualization of the analyses results are among the last issues, but not the least challenging ones. Considering the constant growth of computational power and storage capacity needed by different bioinformatics applications, working with single or a limited number of servers is no longer realistic. Using a cloud environment and grid computing is becoming a must. Even single cloud service provider can be restrictive for bioinformatics applications and working with more than one cloud can make the workflow more robust in the face of failures and always growing capacity needs. In this white paper we review the current state of the art in this field. We discuss the main limitations and challenges that we need to address such as; data upload from client’s computers to central or distributed data storage and analysis services; efficient parallelization of analyses using grid solutions; reliability and up-time of storage and analyses facilities for routine diagnostics; management, retrieving and visualization of the analyses results.

Authors and Affiliations

Carlo P. J. M. Brouwer, Thuy Duong Vu, Miaomiao Zhou, Gianluigi Cardinali, Mick M. Welling, Nathalie van de Wiele, Vincent Robert

Keywords

Related Articles

Compositional and Amino Acid Profile of Nicker Bean (Entada gigas) Seeds

The proximate, anti-nutritional factors, functional properties, minerals and amino acid composition of nicker bean (Entada gigas) were determined. The sample contained crude protein and carbohydrate of 24.8±0.02% and 47....

Antibacterial Potential of Magnesium Oxide Nanoparticles Synthesized by Aspergillus niger

A total of 280 urinary tract infection samples were collected in this investigation. Out of them 212(75.7%) samples showed a positive response.to bacterial isolates. Morphological, cultural and biochemical testes were co...

Purification and Characterization of α-Glucan Phosphorylase Isoform Pho 2 from Spinach Leaves

α-Glucan phosphorylase is an important enzyme of carbohydrate metabolism. In spinach leaves, it has been reported in two multiple forms viz. Pho 2 (cytosolic) and Pho 1 (plastidial). Here, we extracted and purified Pho 2...

In vitro and in silico Approach to Evaluate the Urease and Collagenase Inhibitory Activity of Embilica officinalis Gaertn Fruit

Aim: The key virulent factors of bacteria are enzymes. Urease and collagenase enzyme play a vital role in pathogenesis of wide array of bacterial strains and cause numerous diseases. So the aim of present study was to fi...

Alkaline Cellulase Production by Penicillium mallochii LMB-HP37 Isolated from Soils of a Peruvian Rainforest

Alkaline cellulases are demanded by the textile industry for several purposes but commercial preparations showing activity at alkaline conditions are very scarce. Aim: To characterize a Penicillium strain isolated form s...

Download PDF file
  • EP ID EP237921
  • DOI 10.9734/BBJ/2016/25662
  • Views 122
  • Downloads 0

How To Cite

Carlo P. J. M. Brouwer, Thuy Duong Vu, Miaomiao Zhou, Gianluigi Cardinali, Mick M. Welling, Nathalie van de Wiele, Vincent Robert (2016). Current Opportunities and Challenges of Next Generation Sequencing (NGS) of DNA; Determining Health and Diseases. Biotechnology Journal International, 13(4), 1-17. https://europub.co.uk/articles/-A-237921