Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.10.08.332346v1?rss=1 Authors: Cai, T., Lim, H., Abbu, K. A., Qiu, Y., Nussinov, R., Xie, L. Abstract: Molecular interaction is the foundation of biological process. Elucidation of genome-wide binding partners of a biomolecule will address many questions in biomedicine. However, ligands of a vast number of proteins remain elusive. Existing methods mostly fail when the protein of interest is dissimilar from those with known functions or structures. We develop a new deep learning framework DISAE that incorporates biological knowledge into self-supervised learning techniques for predicting ligands of novel unannotated proteins on a genome-scale. In the rigorous benchmark studies, DISAE outperforms state-of-the-art methods by a significant margin. The interpretability analysis of DISAE suggests that it learns biologically meaningful information. We further use DISAE to assign ligands to human orphan G-Protein Coupled Receptors (GPCRs) and to cluster the human GPCRome by integrating their phylogenetic and ligand relationships. The promising results of DISAE open an avenue for exploring the chemical landscape of entire sequenced genomes. Copy rights belong to original authors. Visit the link for more info