KMAP Biosphere Gene Catalogue study includes clusterings and annotations of genes found in the assembled contig sequences of ~62,500 metagenome sequencing runs of ~40,800 samples from 924 public projects. 90% identity clustering includes 789 million clusters. Total number of the singletons 420 million.
Search with enzyme, taxon, InterPro identifiers, or with cluster queries

*** Query results for 'interproid=PF13592' ***

Facets:
  • Biomes
  • KEGG Annotations
    • GO Annotations
      • Enzymes
        • Filters
          • Protein Families
            • Number of clusters in biomes with single counting of cluster biome members
              ...
              Number of samples in biomes
              Number of clusters in superkingdom, phylum, class, order, and family levels
              Number of shared genes with single counting of cluster members
              Open a larger version of this graph in a new browser tab. This diagram presents biome sharing at main biomes level, together with distrubition of clusters sub biomes, represented with the Secondary-biome column. The diagram in the next tab presents biome sharing at sub biomes level with Sankey graphs.
              Most shared biomes with single counting of biomes of the cluster members
              Open a larger version of this graph in a new browser tab, with options to specify the number of top shared biomes
              Sample clusters in the query results
              Download the list of clusters in csv format : Download
              Representative gene's id Representative gene's project Representative gene's sample #Annotations #Member genes #Sub biomes
              Samples with most clusters in the query results
              Download the list of samples in csv format Download
              Compare/view annotations of selected samples:  KEGG  InterPro KEGG module compl.
              Select/un-select all samples

              This tab presents our earlier work for KMAP Ocean Gene Catalogue;  KEGG Modules Completeness numbers for two KEGG sulfate reduction modules are summarised for Ocean Gene Catalogue samples in the query results. We later implemented a generic method with EBI MGnify Team's KEGG Modules Completion project; available with the View option next to the Download option.  One particular feature of this tab; the completion calculations are restricted to the clusters selected by the running queries, as a result when a query is restrictive the samples processed and their completions could be few or none. Since our initial work, KEGG pathway definitions of these two modules were updated, we haven't updated our pathway models yet. We include this tab to have some history of our work.

              Download the detailed completenes numbers in csv format: Download
              Completeness numbers calculated based on the MGnify KEGG pathway graphs: View
              Ocean zones: