THUNDER: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data

Published: Nov. 12, 2020, 9:03 p.m.

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.11.12.379941v1?rss=1 Authors: Rowland, B., Huh, R., Hou, Z., Hu, M., Shen, Y., Li, Y. Abstract: Hi-C data provide population averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. To effectively leverage Hi-C data for biological insights, we need to control for the confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive real data based simulations to test THUNDER constructed from published single-cell Hi-C (scHi-C) data. THUNDER more accurately estimates the underlying cell type proportions when compared to both supervised and unsupervised deconvolution methods including CIBERSORT, TOAST, and NMF. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER estimates cell-type-specific chromatin contact profiles for all cell types in bulk Hi-C mixtures. These estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data is still sparse. Copy rights belong to original authors. Visit the link for more info