Detecting Differentially Expressed Genes in RNA-Seq Data with Unknown Conditions

Günter Klambauer, Thomas Unterthiner, Sepp Hochreiter

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Methods that identify differential expression in RNA-Seq data are currently limited to study designs in which two or more sample conditions are known a priori. However, these biological conditions like activated regulatory and metabolic pathways are typically unknown in genetic studies such as the HapMap or the 1000 Genomes project. We suggest DEXUS for detecting differential expression in RNA-Seq data for which the sample conditions are unknown. In a Bayesian framework DEXUS models read counts as a finite mixture of negative binomial distributions in which each mixture component corresponds to a condition. Evidence of differential expression is measured by the informative/non-informative (I/NI) value, which allows differentially expressed transcripts to be extracted at a desired specificity (significance level) or sensitivity (power). DEXUS performed excellently in identifying differentially expressed transcripts in data with unknown conditions. On 2,400 simulated data sets, I/NI value thresholds of 0.025, 0.05, and 0.1 yielded average specificities of 92%, 97%, and 99% at sensitivities of 76%, 61%, and 38% respectively. In cohorts with genetic and RNA-Seq data, DEXUS was able to detect differentially expressed transcripts that could be related to genetic variants via the identified conditions. These genetic variants can be classified into structural variants like copy number variations and single nucleotide variants, that is, eQTLs.
Original languageEnglish
Title of host publicationASHG 2013 Proceedings
Number of pages1
Publication statusPublished - 2013

Fields of science

  • 303 Health Sciences
  • 304 Medical Biotechnology
  • 305 Other Human Medicine, Health Sciences
  • 106013 Genetics
  • 106041 Structural biology
  • 102 Computer Sciences
  • 101029 Mathematical statistics
  • 102001 Artificial intelligence
  • 101004 Biomathematics
  • 102015 Information systems
  • 102018 Artificial neural networks
  • 106002 Biochemistry
  • 106023 Molecular biology
  • 301 Medical-Theoretical Sciences, Pharmacy
  • 302 Clinical Medicine
  • 106005 Bioinformatics

JKU Focus areas

  • Computation in Informatics and Mathematics
  • Nano-, Bio- and Polymer-Systems: From Structure to Function

Cite this