Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies. Author Young-suk Lee, Arjun Krishnan, Qian Zhu, Olga Troyanskaya Publication Year 2013 Type Journal Article Abstract MOTIVATION: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is needed. Such a method must be applicable across thousands of independent studies, hundreds of gene expression technologies and hundreds of diverse human tissues and cell-types.RESULTS: We present Unveiling RNA Sample Annotation (URSA) that leverages the complex tissue/cell-type relationships and simultaneously estimates the probabilities associated with hundreds of tissues/cell-types for any given gene expression profile. URSA provides accurate and intuitive probability values for expression profiles across independent studies and outperforms other methods, irrespective of data preprocessing techniques. Moreover, without re-training, URSA can be used to classify samples from diverse microarray platforms and even from next-generation sequencing technology. Finally, we provide a molecular interpretation for the tissue and cell-type models as the biological basis for URSA's classifications. Keywords Humans, Gene Expression Profiling, High-Throughput Nucleotide Sequencing, Computational Biology, Models, Statistical, Oligonucleotide Array Sequence Analysis, Bayes Theorem, Organ Specificity, Cells, Databases, Factual Journal Bioinformatics Volume 29 Issue 23 Pages 3036-44 Date Published 12/2013 Alternate Journal Bioinformatics Google ScholarBibTeXEndNote X3 XML