Tissue-Specific Alternative Splicing Analysis Reveals the Diversity of Chromosome 18 Transcriptome
datasetposted on 03.01.2014 by Alexander V. Shargunov, George S. Krasnov, Elena A. Ponomarenko, Andrey V. Lisitsa, Mikhail A. Shurdov, Vitaliy V. Zverev, Alexander I. Archakov, Vladimir M. Blinov
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
The Chromosome-centric Human Proteome Project (C-HPP) is aimed to identify the variety of protein products and transcripts of the number of chromosomes. The Russian part of C-HPP is devoted to the study of the human chromosome 18. Using widely accepted Tophat and SpliceGrapher, a tool for accurate splice sites and alternative mRNA isoforms prediction, we performed the extensive mining of the splice variants of chromosome 18 transcripts and encoded protein products in liver, brain, lung, kidney, blood, testis, derma, and skeletal muscles. About 6.1 billion of the reads represented by 450 billion of the bases have been analyzed. The relative frequencies of splice events as well as gene expression profiles in normal tissues are evaluated. Using ExPASy PROSITE, the novel features and possible functional sites of previously unknown splice variants were highlighted. A set of unique proteotypic peptides enabling the identification of novel alternative protein species using mass-spectrometry is constructed. The revealed data will be integrated into the gene-centric knowledgebase of the Russian part of C-HPP available at http://kb18.ru and http://www.splicing.zz.mu/.