CPLSTool: A Framework to Generate Automatic Bioinformatics Pipelines
Journal Title: Biomedical Journal of Scientific & Technical Research (BJSTR) - Year 2018, Vol 11, Issue 5
Abstract
Many bioinformatics tools have been developed for data analysis and focus on some specific problems. However, one program is not enough to complete the data mining. We developed CPLSTool (https://github.com/maoshanchen/CPLSTool) that can compress multiple bioinformatics tools and the produced pipeline can be used for data anlaysis repeatly. The most significant advantage of using CPLSTool is to save waiting time, compared to step-by-step analysis. In addition, some steps for the data analysis can be run parallely in order to save the program running time. We used CPLSTool to build an automatic pipeline based on QIIME and analyzed skin 16S rRNA data. The results showed that a total of 102 minutes can be saved using CPLSTool and the visualization of results improves our understanding of the results. CPLSTool can be applied in any kind of data analysis, including genomic, transcriptomic, proteomic and metagenomic data analysis. The use of CPLSTool will improve our understanding of data analysis and save time and computing resources.The last decade has witnessed the breaking development of Next-Generation Sequencing (NGS) tools, including Transcriptome Sequencing (RNA-Seq), Whole-Genome and Whole-Exome Sequencing (WGS/WXS), Metagenomics, Chromatin Immunoprecipitation or Methylated DNA Immunoprecipitation followed by Sequencing (ChIP-Seq or MeDIP-Seq), and a multitude of more specialized protocols, such as Cross-Linking Immunoprecipitation (CLIP-Seq), Assay for Transposase-Accessible Chromatin Using Sequencing (ATAC-Seq), and Formaldehyde-Assisted Isolation of Regulatory Elements (FAIRE-Seq) [1]. Every NGS tool was born with one or more analysis applications and now there are many bioinformatics tools developed for general and special research purposes, such as BWA [2], ExScalibur [3], Chipster [4], Churchill [5], NEAT [6], MG-RAST [7], TopHat [8] and QIIME [9]. However, there are some drawbacks for these tools. For example,i) Some tools concentrate on a single analysis step instead of completing all needed contents, such as BWA and Top Hat; ii) It is difficult to add new analysis contents to current integrated pipelines, such as NEAT; iii) Some tools are based on web server and the analysis is limited by the internet speed sometimes, such as MG-RAST; and iv) An automatic pipeline is necessary for the whole analysis rather than step-by-step operation, such as QIIME. Moreover, the tremendous amount of NGS output requires a possible way to speed up the analysis. Thus, it is important to develop a clever way to organize the related tools and software within reasonable time to get automatic pipelines and to speed up the overall procedure using parallelization and acceleration technologies [10]. To address this need, some features of a program should be considered when it is developed, such as i) Management of related tools and programs regardless of their own program language and input file formats, ii) Flexibility of adding new contents, iii) Generating an automatic pipeline instead of step-by-step operations, and iv) use of parallelization and acceleration technologies. We developed CPLSTool, which can conform to all the above features. CPLSTool is freely available for users from https://github.com/maoshanchen/CPLSTool.
Authors and Affiliations
Sifen Lu, Jing Song, Maoshan Chen
Soil pH, Ca and Mg Stability and pH Association with Temperature and Groundwater Silicon
Objective: It is generally known that pH, Ca and Mg have changed remarkably during 1961-90, but their inter-areal variation seems not have been fully discussed nor explained. Parameters of cropland have been earlier asso...
Vermitechnology Based Tribal Women Empowerment for Economic Development in Himachal Pradesh
A novel technique of converting decomposable organic wastes into valuable manure (compost) through earthworm activity is a faster and beneficial process. The earthworms are used as the natural bioreactors for making deco...
Health and Lifestyle of University Freshmen: A CrossBorder Comparison among three Cities in China
Objectives: Limited work has been performed on intra-cultural, cross-societal variation among youth lifestyles and health status, and baseline data on university freshmen’s health status are lacking. In the present study...
Yoga in Mental Health
Yogic techniques, such as asana and pranayama of Hatha Yoga, and various meditations, have been trailed through clinical and other scientific procedures. The results have established the preven...
Methodology of Comparison and Classification of Eroided Land in Azerbaijan
The results of the conducted studies revealed that soil erosion agitatedly reduces the humus content. If the reserves of 0-50 cm of the layer are not washed away by mountain- Brown soils - 168 tons per hectare, in a very...