The KMAP Biosphere Gene Catalogue study includes clusterings and annotations of
genes found in the assembled contig sequences of ~62,500 metagenome sequencing
runs of ~40,800 samples from 924 public studies.
Clustering at 90% identity yields 789 million clusters.
The total number of singletons is 420 million.
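
To put these figures in proportion, the short Python sketch below derives a few
ratios from them. It is illustrative only: the constants are the rounded values
quoted above, the variable and function names are hypothetical, and it assumes
that the 420 million singletons are counted among the 789 million clusters at
90% identity, which the description does not state explicitly.

```python
# Rounded figures quoted in the description; not exact counts.
CLUSTERS_90 = 789_000_000  # clusters at 90% identity
SINGLETONS = 420_000_000   # singletons (assumed to be part of CLUSTERS_90)
RUNS = 62_500              # metagenome sequencing runs (approximate)
SAMPLES = 40_800           # samples (approximate)


def ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, rejecting a zero denominator."""
    if denominator == 0:
        raise ValueError("denominator must be non-zero")
    return numerator / denominator


# Share of clusters that contain a single gene (under the assumption above).
singleton_fraction = ratio(SINGLETONS, CLUSTERS_90)

# Clusters with two or more members (under the same assumption).
multi_member_clusters = CLUSTERS_90 - SINGLETONS

# Average number of sequencing runs per sample.
runs_per_sample = ratio(RUNS, SAMPLES)

print(f"singleton fraction: {singleton_fraction:.1%}")
print(f"multi-member clusters: {multi_member_clusters:,}")
print(f"runs per sample: {runs_per_sample:.2f}")
```

Under that assumption, roughly half of the clusters would be singletons, and
each sample would correspond to about one and a half sequencing runs on average.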