Evaluating geographic variation within molecular operational taxonomic units (OTUs) using network analyses in Scandinavian lakes

Published: Aug. 7, 2020, 12:08 a.m.

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2020.08.06.240267v1?rss=1 Authors: Forster, D., Lentendu, G., Wilson, M., Mahe, F., Leese, F., Andersen, T., Khomich, M., Dunthorn, M. Abstract: Operational taxonomic units (OTUs) are usually treated as if they are internally uniform in environmental metabarcoding studies of microbial and macrobial eukaryotes, even when the OTUs are being used to infer biogeographic patterns. The OTUs constructed by the program Swarm have underlying network topologies in which nodes represent amplicons and edges represent 1 nucleotide differences between nodes. Such networks can be exploited to search for biogeographic patterns within each OTU. To do this, here we used an available protistan metabarcoding dataset consisting of the hypervariable V4 region of the 18S rRNA locus amplified from 77 lakes collected across Norway and Sweden. The 82 most abundant and wide-spread OTUs constructed by Swarm were evaluated using shortest path, assortativity, and geographical analyses. We found that while pairs of amplicons from the same lake were usually connected directly to each other within the OTUs, these pairs of amplicons from the same lake did not form assortative clusters within the OTUs, and amplicons were not more connected with other amplicons occurring in neighboring lakes than expected by chance. This new approach to looking at within-OTU is applicable to other metabarcoding datasets and we provide code to perform these analyses. Copy rights belong to original authors. Visit the link for more info