KMAP Biosphere Gene Catalogue study includes clusterings and annotations of genes
found in the assembled contig sequences of ~62,500 metagenome sequencing runs of
~40,800 samples
from 924 public projects.
90% identity clustering includes 789 million clusters.
Total number of the
singletons 420 million.