Integration of quantitated expression estimates from polyA-selected and rRNA-depleted RNA-seq libraries

Background

The availability of fast alignment-free algorithms has greatly reduced the computational burden of RNA-seq processing, especially for relatively poorly assembled genomes. Using these approaches, previous RNA-seq datasets could potentially be processed and integrated with newly sequenced libraries. Confounding factors in such integration include sequencing depth and

Conclusion

A combination of reference transcriptome filtering and a ratio-based correction can create equivalent expression profiles from both polyA-selected and rRNA-depleted libraries. This approach will allow meta-analysis and integration of existing RNA-seq data into transcriptional atlas projects.

Results

The method was developed by comparing two RNA-seq datasets from ovine macrophages, identical except for RNA selection method. Gene-level expression estimates were obtained using a two-part process centred on the high-speed transcript quantification tool Kallisto. Firstly, a set of reference transcripts was defined that constitute a standardised RNA space, with expression from both datasets quantified against it. Secondly, a simple ratio-based correction was applied to the rRNA-depleted estimates. The outcome is an almost perfect correlation between gene expression estimates, independent of library type and across the full range of levels of expression.
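
As a rough illustration of the second step, the sketch below shows one way a ratio-based correction could be applied to gene-level estimates. The abstract does not say how the ratio is derived, so everything here is an assumption: the per-gene ratio of mean polyA TPM to mean rRNA-depleted TPM, the `min_tpm` cut-off, the function names `learn_ratios` and `apply_ratios`, and the toy gene names and values. It assumes both datasets have already been quantified against the same filtered reference transcript set and summarised to gene level.

```python
from statistics import mean


def learn_ratios(polya, ribo, min_tpm=1.0):
    """Return a per-gene ratio: mean polyA TPM / mean rRNA-depleted TPM.

    polya and ribo map gene -> list of TPM values from matched libraries.
    Genes missing from either dataset, or with mean rRNA-depleted TPM
    below min_tpm, are skipped (a hypothetical filtering rule).
    """
    ratios = {}
    for gene in polya.keys() & ribo.keys():
        ribo_mean = mean(ribo[gene])
        if ribo_mean >= min_tpm:
            ratios[gene] = mean(polya[gene]) / ribo_mean
    return ratios


def apply_ratios(sample, ratios):
    """Scale one rRNA-depleted sample (gene -> TPM) by the learned ratios.

    Genes without a learned ratio are dropped from the result.
    """
    return {g: tpm * ratios[g] for g, tpm in sample.items() if g in ratios}


# Toy values for illustration only, not data from the study.
polya = {"GENE_A": [10.0, 12.0], "GENE_B": [100.0, 90.0]}
ribo = {"GENE_A": [5.0, 6.0], "GENE_B": [200.0, 180.0], "GENE_C": [3.0, 4.0]}

ratios = learn_ratios(polya, ribo)
corrected = apply_ratios({"GENE_A": 4.0, "GENE_B": 150.0, "GENE_C": 2.0}, ratios)
```

In this sketch the ratios are learned once from the matched pair of datasets and then reused on further rRNA-depleted libraries. Whether a per-gene or a global ratio is appropriate depends on the data and is not something the abstract settles.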

Journal:	BMC Bioinformatics	Impact factor:	2.900
Year:	2017	Citation:	2017 Jun 13;18(1):301.
doi:	10.1186/s12859-017-1714-9
