EGID: an ensemble algorithm for improved genomic island detection in genomic sequences

Journal Title: Bioinformation - Year 2011, Vol 7, Issue 6

Abstract

Genomicislands (GIs) are genomic regions that are originally transferred from other organisms. The detection of genomic islands in genomes can lead to many applications in industrial, medical and environmental contexts. Existing computational tools for GI detection suffer either low recall or low precision, thus leaving the room for improvement. In this paper, we report the development of our Ensemble algorithm for Genomic Island Detection (EGID). EGID utilizes the prediction results of existing computational tools, filters and generates consensus prediction results. Performance comparisons between our ensemble algorithm and existing programs have shown that our ensemble algorithm is better than any other program. EGID was implemented in Java, and was compiled and executed on Linux operating systems. EGID is freely available at http://www5.esu.edu/cpsc/bioinfo/software/EGID.

Authors and Affiliations

Dongsheng Che, Mohammad Hasan, Han Wang, John Fazekas, Jinling Huang, Qi Liu

Keywords

Related Articles

Monozygotic twins: genes are not the destiny?

Monozygotic twins are considered to be genetically identical, yet can show high discordance in their phenotypes and disease susceptibility. Several studies have emphasized the influence of external factors and the role o...

sRNATarget: a web server for prediction of bacterial sRNA targets.

In bacteria, there exist some small non-coding RNAs (sRNAs) with 40-500 nucleotides in length. Most of them function as posttranscriptional regulation of gene expression through binding to their target mRNAs, in which Hf...

Insights from the molecular docking of withanolide derivatives to the target protein PknG from Mycobacterium tuberculosis.

A crucial virulence factor for intracellular Mycobacterium tuberculosis survival is Protein kinase G (PknG), a eukaryotic-like serinethreonine protein kinase expressed by pathogenic mycobacteria that blocks the intracell...

Codon Usage Bias as a Function of Generation Time and Life Expectancy

It has recently been demonstrated that human natural codon usage bias is optimized towards a higher buffering capacity to mutations (measured as the tendency of single point mutations in a DNA sequence to yield the same...

Glaucoma database.

Glaucoma, a complex heterogenous disease, is the leading cause for optic nerve-related blindness worldwide. Primary open angle glaucoma (POAG) is the most common subset and by the year 2020 it is estimated that approxima...

Download PDF file
  • EP ID EP86040
  • DOI 10.6026/97320630007311
  • Views 179
  • Downloads 0

How To Cite

Dongsheng Che, Mohammad Hasan, Han Wang, John Fazekas, Jinling Huang, Qi Liu (2011). EGID: an ensemble algorithm for improved genomic island detection in genomic sequences. Bioinformation, 7(6), 311-314. https://europub.co.uk/articles/-A-86040