CPLSTool: A Framework to Generate Automatic Bioinformatics Pipelines

Journal Title: Biomedical Journal of Scientific & Technical Research (BJSTR) - Year 2018, Vol 11, Issue 5

Abstract

Many bioinformatics tools have been developed for data analysis and focus on some specific problems. However, one program is not enough to complete the data mining. We developed CPLSTool (https://github.com/maoshanchen/CPLSTool) that can compress multiple bioinformatics tools and the produced pipeline can be used for data anlaysis repeatly. The most significant advantage of using CPLSTool is to save waiting time, compared to step-by-step analysis. In addition, some steps for the data analysis can be run parallely in order to save the program running time. We used CPLSTool to build an automatic pipeline based on QIIME and analyzed skin 16S rRNA data. The results showed that a total of 102 minutes can be saved using CPLSTool and the visualization of results improves our understanding of the results. CPLSTool can be applied in any kind of data analysis, including genomic, transcriptomic, proteomic and metagenomic data analysis. The use of CPLSTool will improve our understanding of data analysis and save time and computing resources.The last decade has witnessed the breaking development of Next-Generation Sequencing (NGS) tools, including Transcriptome Sequencing (RNA-Seq), Whole-Genome and Whole-Exome Sequencing (WGS/WXS), Metagenomics, Chromatin Immunoprecipitation or Methylated DNA Immunoprecipitation followed by Sequencing (ChIP-Seq or MeDIP-Seq), and a multitude of more specialized protocols, such as Cross-Linking Immunoprecipitation (CLIP-Seq), Assay for Transposase-Accessible Chromatin Using Sequencing (ATAC-Seq), and Formaldehyde-Assisted Isolation of Regulatory Elements (FAIRE-Seq) [1]. Every NGS tool was born with one or more analysis applications and now there are many bioinformatics tools developed for general and special research purposes, such as BWA [2], ExScalibur [3], Chipster [4], Churchill [5], NEAT [6], MG-RAST [7], TopHat [8] and QIIME [9]. However, there are some drawbacks for these tools. For example,i) Some tools concentrate on a single analysis step instead of completing all needed contents, such as BWA and Top Hat; ii) It is difficult to add new analysis contents to current integrated pipelines, such as NEAT; iii) Some tools are based on web server and the analysis is limited by the internet speed sometimes, such as MG-RAST; and iv) An automatic pipeline is necessary for the whole analysis rather than step-by-step operation, such as QIIME. Moreover, the tremendous amount of NGS output requires a possible way to speed up the analysis. Thus, it is important to develop a clever way to organize the related tools and software within reasonable time to get automatic pipelines and to speed up the overall procedure using parallelization and acceleration technologies [10]. To address this need, some features of a program should be considered when it is developed, such as i) Management of related tools and programs regardless of their own program language and input file formats, ii) Flexibility of adding new contents, iii) Generating an automatic pipeline instead of step-by-step operations, and iv) use of parallelization and acceleration technologies. We developed CPLSTool, which can conform to all the above features. CPLSTool is freely available for users from https://github.com/maoshanchen/CPLSTool.

Authors and Affiliations

Sifen Lu, Jing Song, Maoshan Chen

Keywords

Related Articles

Re-Implantation of Total Hip Replacement (THR) with Burch-Schneider (BS) plate in 2003–2007 – Midterm Results at Our Institution

Here, we report the midterm results of re-implantation using Burch-Schneider (BS) plate for Paprosky bone defects IIIa and IIIb. The optimal treatment of pelvis discontinuity remains controversial. Materials a...

Nucleotide Variations in microRNA-Binding Sites: The Need of Novel Tools for the Nucleic Acid Alignment

It has been proposed that naturally-occurring variations in microRNAs and their target sites on messenger RNAs may be linked with various human pathologies, including cancer. The growing number of reports clearly indicat...

Seizures Related to Antibiotic Use: Update

Objective: make a review about the risk of seizures related to the use of antibiotics. Method: We perform a non-systematic review using Google Scholar platform approaching the relationship between seizures and the most u...

H5N1 Associated Human Fatality in Nepal- A First Case

Ministry of Health and Population has confirmed the first case of human causality by H5N1.21-year-old patient hospitalized due to fever and cough died on 29th March 2019. Molecular diagnosis was performed in National Pub...

On Optimal Control Pair Treatment: Clinical Management of Viremia Levels In Pathogenic-Induced HIV-1 Infections

The quest to actively draw the attention of research scientist to alternative approach for the eradication of the menace of HIV and its associated pathogens, informed the decision of this present work. In this paper, we...

Download PDF file
  • EP ID EP592718
  • DOI 10.26717/BJSTR.2018.11.002172
  • Views 184
  • Downloads 0

How To Cite

Sifen Lu, Jing Song, Maoshan Chen (2018). CPLSTool: A Framework to Generate Automatic Bioinformatics Pipelines. Biomedical Journal of Scientific & Technical Research (BJSTR), 11(5), 8863-8867. https://europub.co.uk/articles/-A-592718