Alt-Splice Gene Predictor Using Multitrack-Clique Analysis: Verification of Statistical Support for Modelling in Genomes of Multicellular Eukaryotes

Journal Title: Informatics - Year 2017, Vol 4, Issue 1

Abstract

One of the main limitations of the typical hidden Markov model (HMM) implementation for gene structure identification is that only a single structure is identified on a given sequence of genomic data; that is, identification of overlapping structure is not directly possible, and certainly not within the confines of the optimal Viterbi path evaluation. This is a major limitation given that significant portions of eukaryotic genomes, particularly mammalian genomes, are now known to be alternatively spliced and thus have overlapping structure in the sense of the mRNA transcripts that result. Using the general meta-state HMM approach developed in prior work, however, more than one ‘track’ of annotation can be accommodated, thereby allowing a direct implementation of an alternative-splice gene-structure identifier. In this paper we examine the representation of alternative splicing annotation in the multi-track context, and show that the proliferation of states is manageable and has sufficient statistical support on the genomes examined (human, mouse, worm, and fly) for a full alt-splice meta-state HMM gene finder to be implemented. In performing the alternative splicing analysis on alt-splice event counts, we expected to see an increase in alternative splicing complexity as the organism becomes more complex, and this is seen, with the percentage of genes with alt-splice variants increasing from worm to fly to the mammalian genomes (mouse and human). Of particular note is an increase in alternative splicing variants at the start and end of coding in the more complex organisms studied (mouse and human), indicating rapid new first and last exon recruitment that is possibly spliceosome mediated. This suggests that spliceosome-mediated refinement (acceleration) of gene structure variation and selection, with increasing levels of sophistication, has occurred in eukaryotes and in mammals especially.
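The multi-track idea in the abstract can be pictured with a small sketch. This is an illustration only, not the authors' implementation: the label alphabet ('e' exon, 'i' intron, 'j' intergenic), the two toy tracks, and the helper names `multitrack_states` and `count_support` are all assumptions made for the example. It stacks two transcript annotations of the same locus into one combined state per base, then counts how often each combined state and each state change occurs, which is the kind of tally one would inspect to judge whether a state has enough observations to be modelled.

```python
from collections import Counter


def multitrack_states(tracks):
    """Stack equal-length per-base label strings into one tuple per position."""
    lengths = {len(t) for t in tracks}
    if len(lengths) != 1:
        raise ValueError("all tracks must cover the same positions")
    return list(zip(*tracks))


def count_support(columns):
    """Count occurrences of each combined state and of each state change."""
    state_counts = Counter(columns)
    transition_counts = Counter(
        (a, b) for a, b in zip(columns, columns[1:]) if a != b
    )
    return state_counts, transition_counts


# Hypothetical toy locus: 'e' = exon, 'i' = intron, 'j' = intergenic.
track_a = "jeeiieeiieej"  # transcript using all three exons
track_b = "jeeiiiiiieej"  # transcript that skips the middle exon

columns = multitrack_states([track_a, track_b])
state_counts, transition_counts = count_support(columns)

for state, n in sorted(state_counts.items()):
    print(state, n)
```

In this toy case the only combined state beyond the single-track labels is `('e', 'i')`, the skipped exon; on real annotation the same tally would be taken over a whole genome rather than one locus.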

Authors and Affiliations

Stephen Winters-Hilt and Andrew J. Lewis

  • EP ID EP44074
  • DOI https://doi.org/10.3390/informatics4010003

How To Cite

Stephen Winters-Hilt and Andrew J. Lewis (2017). Alt-Splice Gene Predictor Using Multitrack-Clique Analysis: Verification of Statistical Support for Modelling in Genomes of Multicellular Eukaryotes. Informatics, 4(1), -. https://europub.co.uk/articles/-A-44074