MultiPhATE2: Code for Functional Annotation and Comparison of Bacteriophage Genomes

Published: Oct. 7, 2020, 2:02 a.m.

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.10.05.324566v1?rss=1 Authors: Zhou, C. E., Kimbrel, J., Edwards, R. A., McNair, K., Souza, B. A., Malfatti, S. Abstract: To address the need for improved tools for annotation and comparative genomics of bacteriophage genomes, we developed multiPhATE2. As an extension of the multiPhATE code, multiPhATE2 performs gene finding and functional sequence annotation of predicted gene and protein sequences, and additional search algorithms and databases extend the search space of the original functional annotation subsystem. MultiPhATE2 includes comparative genomics codes for gene matching among sets of input bacteriophage genomes, and scales well to large input data sets with the incorporation of multiprocessing in the functional annotation and comparative genomics subsystems. MultiPhATE2 was implemented in Python 3.7 and runs as a command-line code under Linux or MAC-OS. MultiPhATE2 is freely available under an open- source GPL-3 license at https://github.com/carolzhou/multiPhATE2. Instructions for acquiring the databases and third party codes used by multiPhATE2 are found in the README file included with the distribution. Users may report bugs by submitting issues to the project GitHub repository webpage. Contact: zhou4@llnl.gov or multiphate@gmail.com. Supplementary materials, which demonstrate the outputs of multiPhATE2, are available in a GitHub repository, at https://github.com/carolzhou/multiPhATE2_supplementaryData/. Copy rights belong to original authors. Visit the link for more info