Class: GtdbTaxonomyR214v1
GTDB release 214 taxonomy with parsed rank assignments. Each genome has one row with full taxonomic lineage.
GTDB PHYLA (top 5 by genome count): - p__Pseudomonadota: 117,619 genomes (was Proteobacteria) - p__Bacillota: 67,072 genomes (was Firmicutes) - p__Actinomycetota: 26,949 genomes (was Actinobacteria) - p__Bacillota_A: 24,581 genomes (split from Firmicutes) - p__Bacteroidota: 20,615 genomes
URI: https://w3id.org/kbase/kbase_ke_pangenome/GtdbTaxonomyR214v1
classDiagram
class GtdbTaxonomyR214v1
click GtdbTaxonomyR214v1 href "../GtdbTaxonomyR214v1/"
GtdbTaxonomyR214v1 : class
GtdbTaxonomyR214v1 : domain
GtdbTaxonomyR214v1 --> "0..1" GtdbDomain : domain
click GtdbDomain href "../GtdbDomain/"
GtdbTaxonomyR214v1 : family
GtdbTaxonomyR214v1 : genome_id
GtdbTaxonomyR214v1 --> "0..1" Genome : genome_id
click Genome href "../Genome/"
GtdbTaxonomyR214v1 : genus
GtdbTaxonomyR214v1 : gtdb_taxonomy_id
GtdbTaxonomyR214v1 : order
GtdbTaxonomyR214v1 : phylum
GtdbTaxonomyR214v1 : species
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| genome_id | 0..1 Genome |
Genome this taxonomy applies to | direct |
| gtdb_taxonomy_id | 1 String |
Full semicolon-separated taxonomy string | direct |
| domain | 0..1 GtdbDomain |
Domain rank (d__Archaea or d__Bacteria) | direct |
| phylum | 0..1 String |
Phylum name with p__ prefix | direct |
| class | 0..1 String |
Class name with c__ prefix | direct |
| order | 0..1 String |
Order name with o__ prefix | direct |
| family | 0..1 String |
Family name with f__ prefix | direct |
| genus | 0..1 String |
Genus name with g__ prefix | direct |
| species | 0..1 String |
Species name with s__ prefix | direct |
Identifier and Mapping Information
Annotations
| property | value |
|---|---|
| source_table | gtdb_taxonomy_r214v1 |
Schema Source
- from schema: https://w3id.org/kbase/kbase_ke_pangenome
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | https://w3id.org/kbase/kbase_ke_pangenome/GtdbTaxonomyR214v1 |
| native | https://w3id.org/kbase/kbase_ke_pangenome/GtdbTaxonomyR214v1 |
LinkML Source
Direct
name: GtdbTaxonomyR214v1
annotations:
source_table:
tag: source_table
value: gtdb_taxonomy_r214v1
description: 'GTDB release 214 taxonomy with parsed rank assignments. Each genome
has one row with full taxonomic lineage.
GTDB PHYLA (top 5 by genome count): - p__Pseudomonadota: 117,619 genomes (was Proteobacteria)
- p__Bacillota: 67,072 genomes (was Firmicutes) - p__Actinomycetota: 26,949 genomes
(was Actinobacteria) - p__Bacillota_A: 24,581 genomes (split from Firmicutes) -
p__Bacteroidota: 20,615 genomes'
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
attributes:
genome_id:
name: genome_id
description: Genome this taxonomy applies to
comments:
- 'Foreign key: Genome.genome_id'
examples:
- value: RS_GCF_020034805.1
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
domain_of:
- Genome
- Gene
- GtdbTaxonomyR214v1
- Sample
- GapmindPathways
range: Genome
gtdb_taxonomy_id:
name: gtdb_taxonomy_id
description: Full semicolon-separated taxonomy string
examples:
- value: d__Bacteria;p__Pseudomonadota;c__Gammaproteobacteria;o__Enterobacterales;f__Enterobacteriaceae;g__Escherichia;s__Escherichia
fergusonii
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
identifier: true
domain_of:
- Genome
- GtdbTaxonomyR214v1
range: string
required: true
domain:
name: domain
description: Domain rank (d__Archaea or d__Bacteria)
examples:
- value: d__Bacteria
- value: d__Archaea
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: GtdbDomain
phylum:
name: phylum
description: Phylum name with p__ prefix. GTDB uses standardized names that may
differ from NCBI (e.g., Pseudomonadota vs Proteobacteria)
examples:
- value: p__Pseudomonadota
description: Formerly Proteobacteria
- value: p__Bacillota
description: Formerly Firmicutes
- value: p__Actinomycetota
description: Formerly Actinobacteria
- value: p__Halobacteriota
description: Archaeal phylum
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: string
class:
name: class
description: Class name with c__ prefix
examples:
- value: c__Gammaproteobacteria
- value: c__Bacilli
- value: c__Clostridia
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: string
order:
name: order
description: Order name with o__ prefix
examples:
- value: o__Enterobacterales
- value: o__Staphylococcales
- value: o__Lactobacillales
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: string
family:
name: family
description: Family name with f__ prefix
examples:
- value: f__Enterobacteriaceae
- value: f__Staphylococcaceae
- value: f__Pseudomonadaceae
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: string
genus:
name: genus
description: Genus name with g__ prefix
examples:
- value: g__Escherichia
- value: g__Klebsiella
- value: g__Staphylococcus
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: string
species:
name: species
description: Species name with s__ prefix
examples:
- value: s__Escherichia_coli
- value: s__Escherichia_fergusonii
- value: s__Klebsiella_pneumoniae
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
domain_of:
- GtdbTaxonomyR214v1
range: string
Induced
name: GtdbTaxonomyR214v1
annotations:
source_table:
tag: source_table
value: gtdb_taxonomy_r214v1
description: 'GTDB release 214 taxonomy with parsed rank assignments. Each genome
has one row with full taxonomic lineage.
GTDB PHYLA (top 5 by genome count): - p__Pseudomonadota: 117,619 genomes (was Proteobacteria)
- p__Bacillota: 67,072 genomes (was Firmicutes) - p__Actinomycetota: 26,949 genomes
(was Actinobacteria) - p__Bacillota_A: 24,581 genomes (split from Firmicutes) -
p__Bacteroidota: 20,615 genomes'
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
attributes:
genome_id:
name: genome_id
description: Genome this taxonomy applies to
comments:
- 'Foreign key: Genome.genome_id'
examples:
- value: RS_GCF_020034805.1
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
alias: genome_id
owner: GtdbTaxonomyR214v1
domain_of:
- Genome
- Gene
- GtdbTaxonomyR214v1
- Sample
- GapmindPathways
range: Genome
gtdb_taxonomy_id:
name: gtdb_taxonomy_id
description: Full semicolon-separated taxonomy string
examples:
- value: d__Bacteria;p__Pseudomonadota;c__Gammaproteobacteria;o__Enterobacterales;f__Enterobacteriaceae;g__Escherichia;s__Escherichia
fergusonii
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
identifier: true
alias: gtdb_taxonomy_id
owner: GtdbTaxonomyR214v1
domain_of:
- Genome
- GtdbTaxonomyR214v1
range: string
required: true
domain:
name: domain
description: Domain rank (d__Archaea or d__Bacteria)
examples:
- value: d__Bacteria
- value: d__Archaea
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: domain
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: GtdbDomain
phylum:
name: phylum
description: Phylum name with p__ prefix. GTDB uses standardized names that may
differ from NCBI (e.g., Pseudomonadota vs Proteobacteria)
examples:
- value: p__Pseudomonadota
description: Formerly Proteobacteria
- value: p__Bacillota
description: Formerly Firmicutes
- value: p__Actinomycetota
description: Formerly Actinobacteria
- value: p__Halobacteriota
description: Archaeal phylum
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: phylum
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: string
class:
name: class
description: Class name with c__ prefix
examples:
- value: c__Gammaproteobacteria
- value: c__Bacilli
- value: c__Clostridia
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: class
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: string
order:
name: order
description: Order name with o__ prefix
examples:
- value: o__Enterobacterales
- value: o__Staphylococcales
- value: o__Lactobacillales
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: order
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: string
family:
name: family
description: Family name with f__ prefix
examples:
- value: f__Enterobacteriaceae
- value: f__Staphylococcaceae
- value: f__Pseudomonadaceae
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: family
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: string
genus:
name: genus
description: Genus name with g__ prefix
examples:
- value: g__Escherichia
- value: g__Klebsiella
- value: g__Staphylococcus
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: genus
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: string
species:
name: species
description: Species name with s__ prefix
examples:
- value: s__Escherichia_coli
- value: s__Escherichia_fergusonii
- value: s__Klebsiella_pneumoniae
from_schema: https://w3id.org/kbase/kbase_ke_pangenome
rank: 1000
alias: species
owner: GtdbTaxonomyR214v1
domain_of:
- GtdbTaxonomyR214v1
range: string