Detection of native and mirror protein structures based on Ramachandran plot analysis by interpretable machine learning models

Published: Sept. 3, 2020, 8:01 a.m.

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.09.03.280701v1?rss=1

Authors: Villmann, T., Abel, J., Bohnsack, K. S., Kaden, M., Weber, M., Leberecht, C.

Abstract: In this contribution, the discrimination between native and mirror models of proteins according to their chirality is tackled based on structural protein information. This information is contained in the Ramachandran plots of the protein models. We provide an approach to classify those plots by means of an interpretable machine learning classifier, the Generalized Matrix Learning Vector Quantizer. Applying this tool, we are able to distinguish with high accuracy between mirror and native structures just by evaluating the Ramachandran plots. The classifier model provides additional information regarding the importance of regions, e.g. $\alpha$-helices and $\beta$-strands, for discriminating the structures precisely. This importance weighting differs across the considered protein classes.

Copyright belongs to the original authors. Visit the link for more information.
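The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the binning, the synthetic angles, and every function name below are hypothetical, and the matrix `omega` is left as an untrained identity rather than learned. The sketch relies on two facts: mirroring a structure negates its backbone dihedral angles, so the Ramachandran plot is point-reflected through the origin, and a Generalized Matrix LVQ classifier assigns the label of the nearest prototype under the distance d(x, w) = (x - w)^T Omega^T Omega (x - w).

```python
import numpy as np

def ramachandran_histogram(phi, psi, bins=36):
    """Bin (phi, psi) angles in degrees into a normalized 2D histogram."""
    edges = np.linspace(-180.0, 180.0, bins + 1)
    hist, _, _ = np.histogram2d(phi, psi, bins=[edges, edges])
    return hist / hist.sum()

def mirror_angles(phi, psi):
    """Mirroring a structure negates every dihedral angle."""
    return -np.asarray(phi), -np.asarray(psi)

def gmlvq_distance(x, w, omega):
    """Adaptive squared distance (x - w)^T Omega^T Omega (x - w)."""
    diff = omega @ (x - w)
    return float(diff @ diff)

def classify(x, prototypes, labels, omega):
    """Return the label of the nearest prototype under the GMLVQ distance."""
    dists = [gmlvq_distance(x, w, omega) for w in prototypes]
    return labels[int(np.argmin(dists))]

# Synthetic angles clustered near the right-handed alpha-helix region
# (an assumption for illustration, not data from the paper).
rng = np.random.default_rng(0)
phi = rng.normal(-60.0, 10.0, size=500)
psi = rng.normal(-45.0, 10.0, size=500)

hist_native = ramachandran_histogram(phi, psi)
hist_mirror = ramachandran_histogram(*mirror_angles(phi, psi))

# One prototype per class; a trained model would learn these and omega.
prototypes = [hist_native.ravel(), hist_mirror.ravel()]
labels = ["native", "mirror"]
omega = np.eye(hist_native.size)  # untrained: plain Euclidean distance

query = ramachandran_histogram(
    rng.normal(-60.0, 10.0, size=300),
    rng.normal(-45.0, 10.0, size=300),
).ravel()
predicted = classify(query, prototypes, labels, omega)
```

In a trained model, the diagonal of Omega^T Omega weights each histogram bin, which is how region importance such as that of $\alpha$-helices and $\beta$-strands could be read off; here the identity matrix weights all bins equally.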