MITIE: Simultaneous RNA-Seq-based transcript identification and quantification in multiple samples Journal Article


Authors: Behr, J.; Kahles, A.; Zhong, Y.; Sreedharan, V. T.; Drewe, P.; Ratsch, G.
Article Title: MITIE: Simultaneous RNA-Seq-based transcript identification and quantification in multiple samples
Abstract: Motivation: High-throughput sequencing of mRNA (RNA-Seq) has led to tremendous improvements in the detection of expressed genes and reconstruction of RNA transcripts. However, the extensive dynamic range of gene expression, technical limitations and biases, as well as the observed complexity of the transcriptional landscape, pose profound computational challenges for transcriptome reconstruction. Results: We present the novel framework MITIE (Mixed Integer Transcript IdEntification) for simultaneous transcript reconstruction and quantification. We define a likelihood function based on the negative binomial distribution, use a regularization approach to select a few transcripts collectively explaining the observed read data and show how to find the optimal solution using Mixed Integer Programming. MITIE can (i) take advantage of known transcripts, (ii) reconstruct and quantify transcripts simultaneously in multiple samples, and (iii) resolve the location of multi-mapping reads. It is designed for genome- and assembly-based transcriptome reconstruction. We present an extensive study based on realistic simulated RNA-Seq data. When compared with state-of-the-art approaches, MITIE proves to be significantly more sensitive and overall more accurate. Moreover, MITIE yields substantial performance gains when used with multiple samples. We applied our system to 38 Drosophila melanogaster modENCODE RNA-Seq libraries and estimated the sensitivity of reconstructing omitted transcript annotations and the specificity with respect to annotated transcripts. Our results corroborate that a wellmotivated objective paired with appropriate optimization techniques lead to significant improvements over the state-of-the-art in transcriptome reconstruction. Availability: MITIE is implemented in C++ and is available from http://bioweb.me/mitie under the GPL license. Contact: Jonas-Behr@web.de and raetsch@cbio.mskcc.org Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2013. Published by Oxford University Press.
Journal Title: Bioinformatics
Volume: 29
Issue: 20
ISSN: 1367-4803
Publisher: Oxford University Press  
Date Published: 2013-10-15
Start Page: 2529
End Page: 2538
Language: English
DOI: 10.1093/bioinformatics/btt442
PROVIDER: scopus
PMCID: PMC3789545
PUBMED: 23980025
DOI/URL:
Notes: "Export Date: 1 November 2013" - "CODEN: BOINF" - "Source: Scopus"
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Gunnar Ratsch
    68 Ratsch
  2. Jonas Tahmoh Behr
    6 Behr
  3. Jan Philipp Jurgen Drewe
    13 Drewe
  4. Andre Kahles
    31 Kahles
  5. Yi Zhong
    18 Zhong