An Empirical Evaluation of Error Correction Methods and Tools for Next Generation Sequencing Data

Apply

An Empirical Evaluation of Error Correction Methods and Tools for Next Generation Sequencing Data

Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 1

Abstract

Next Generation Sequencing (NGS) technologies produce massive amount of low cost data that is very much useful in genomic study and research. However, data produced by NGS is affected by different errors such as substitutions, deletions or insertion. It is essential to differentiate between true biological variants and alterations occurred due to errors for accurate downstream analysis. Many types of methods and tools have been developed for NGS error correction. Some of these methods only correct substitutions errors whereas others correct multi types of data errors. In this article, a comprehensive evaluation of three types of methods (k-spectrum based, Multi- sequencing alignment and Hybrid based) is presented which are implemented and adopted by different tools. Experiments have been conducted to compare the performance based on runtime and error correction rate. Two different computing platforms have been used for the experiments to evaluate effectiveness of runtime and error correction rate. The mission and aim of this comparative evaluation is to provide recommendations for selection of suitable tools to cope with the specific needs of users and practitioners. It has been noticed that k-mer spectrum based methodology generated superior results as compared to other methods. Amongst all the tools being utilized, Racer has shown eminent performance in terms of error correction rate and execution time for both small as well as large data sets. In multisequence alignment based tools, Karect depicts excellent error correction rate whereas Coral shows better execution time for all data sets. In hybrid based tools, Jabba shows better error correction rate and execution time as compared to brownie. Computing platforms mostly affect execution time but have no general effect on error correction rate.

Authors and Affiliations

Atif Mehmood, Javed Ferzund, Muhammad Usman Ali, Abbas Rehman, Shahzad Ahmed, Imran Ahmad

Keywords

Next generation sequencing; bioinformatics; errors; error correction; execution time; k-spectrum; suffix tree based; hybrid based

Multi Factor Authentication for Student and Staff Access Control

This paper proposes a model to improve security, by controlling who accesses the University of Zambia Campus, Student Hostels and Offices. The proposed model combines Barcode, RFID, and Biometrics Technology to automatic...

Wavelet-based Image Modelling for Compression Using Hidden Markov Model

Statistical signal modeling using hidden Markov model is one of the techniques used for image compression. Wavelet based statistical signal models are impractical for most of the real time processing because they usually...

High-Speed FPGA-based of the Particle Swarm Optimization using HLS Tool

The Particle Swarm Optimization (PSO) is a heuristic search method inspired by different biological populations on their swarming or collaborative behavior. This novel work has implemented PSO for the Travelling Salesman...

Blind Image Quality Evaluation of Stitched Image using Novel Hybrid Warping Technique

Image stitching is collection of sequential images captured at fixed camera center having considerable amount of overlap and produces aesthetically pleasing seamless panoramic view. But, practically it is very difficult...

Dynamic Modification of Activation Function using the Backpropagation Algorithm in the Artificial Neural Networks

The paper proposes the dynamic modification of the activation function in a learning technique, more exactly backpropagation algorithm. The modification consists in changing slope of sigmoid function for activation funct...

EP ID EP261664
DOI 10.14569/IJACSA.2018.090158
Views 95
Downloads 0

How To Cite

Atif Mehmood, Javed Ferzund, Muhammad Usman Ali, Abbas Rehman, Shahzad Ahmed, Imran Ahmad (2018). An Empirical Evaluation of Error Correction Methods and Tools for Next Generation Sequencing Data. International Journal of Advanced Computer Science & Applications, 9(1), 425-431. https://europub.co.uk/articles/-A-261664