KMAP Biosphere Gene Catalogue study includes clusterings and annotations of genes
found in the assembled contig sequences of ~62,500 metagenome sequencing runs of
~40,800 samples
from 924 public projects.
30% identity clustering includes 290 million clusters,
excluding the nonannotated singletons.
Total number of the
singletons 58 million.