Identifying genes and cell types with dynamic alternative polyadenylation in multi-cluster single-cell transcriptomics data

Published: July 31, 2020, 1:01 a.m.

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.07.30.229096v1?rss=1 Authors: Bai, Y., Qin, Y., Kim, S., Fan, Z., Nam, K., Koldamova, R., Padiath, Q. S., Park, H. J. Abstract: Alternative polyadenylation (APA) causes shortening or lengthening of the 3'-untranslated region (3'-UTR), widespread in complex tissues. Bioinformatic tools have been developed to identify dynamic APA in single cell RNA-Seq (scRNA-Seq) data, but with relatively low power and the lack of interpretability for multiple cell types. To address these limitations, we developed a model-based method, scMAPA. scMAPA increases power by building a regression model based on a sensitive quantification of 3'-UTR short and long isoforms. By developing a de novo simulation platform, we demonstrated that scMAPA shows a markedly better sensitivity (9.5% on average) than the previous method with a negligible loss in specificity (0.3% on average). scMAPA improves interpretability by modeling the direction and the degree of dynamic APA for each cell type for each gene, while allowing flexibility to control potential confounders. scMAPA enables to bring a systematic understanding of APA dynamics in complex tissues in human Peripheral Blood Monocellular Cells and mouse brain data. Copy rights belong to original authors. Visit the link for more info