Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.11.15.363259v1?rss=1 Authors: Leonard, R. R., Leleu, M., Van Vlierberghe, M., Kerff, F., BAURAIN, D. Abstract: TQMD is a tool which downloads, stores and produces lists of dereplicated prokaryotic genomes. It has been developed to counter the ever-growing number of prokaryotic genomes and their uneven taxonomic distribution. It is based on word-based alignment-free methods (k-mers), an iterative single-linkage approach and a divide-and-conquer strategy to remain both efficient and scalable. We studied the performance of TQMD by verifying the influence of its parameters and heuristics on the clustering outcome. We further compared TQMD to two other dereplication tools (dRep and Assembly-Dereplicator). Our results showed that TQMD is optimized to dereplicate at high taxonomic levels (phylum/class), whereas the other dereplication tools are optimized for lower taxonomic levels (species/strain), making TQMD complementary to the existing dereplicating tools. TQMD is available at [https://bitbucket.org/phylogeno/tqmd]. Copy rights belong to original authors. Visit the link for more info