Skip to content

Class: MetabolomicsGold

Metabolomics data linked to GOLD samples. Contains compound identifications with mass spec features and chemical identifiers.

TOTAL RECORDS: 3,129,061+ mass spec features

DATA INCLUDES: - Mass-to-charge ratios (m/z) for detected features - Retention times from chromatography - Peak intensities and areas - Molecular formulas (when determined) - Chemical identifiers (SMILES, InChI, InChIKey when known) - Links to KEGG and ChEBI compound databases

USAGE: For metabolomics analysis, join with study_table via file_id. Many features are unidentified (name is null). Use mz and retention time for feature matching across samples.

URI: https://w3id.org/kbase/nmdc_core/MetabolomicsGold

classDiagram class MetabolomicsGold click MetabolomicsGold href "../MetabolomicsGold/" MetabolomicsGold : chebi MetabolomicsGold : feature_id MetabolomicsGold : file_id MetabolomicsGold : file_name MetabolomicsGold : inchi MetabolomicsGold : inchikey MetabolomicsGold : Intensity MetabolomicsGold : kegg MetabolomicsGold : Molecular_Formula MetabolomicsGold : mz MetabolomicsGold : name MetabolomicsGold : Retention_Time_min MetabolomicsGold : smiles

Slots

Name Cardinality and Range Description Inheritance
file_id 0..1
String
NMDC data object ID for source file direct
file_name 0..1
String
Original CSV file name direct
feature_id 0..1
String
Metabolite feature ID (unique within file) direct
name 0..1
String
Compound name when identified direct
mz 0..1
Float
Mass-to-charge ratio (m/z) direct
Retention_Time_min 0..1
Float
Chromatographic retention time in minutes direct
Intensity 0..1
Float
Signal intensity (peak height) direct
Molecular_Formula 0..1
String
Chemical formula when determined from isotope patterns direct
inchi 0..1
String
InChI (International Chemical Identifier) string direct
inchikey 0..1
String
InChIKey - 27-character hash of InChI for database searching direct
smiles 0..1
String
SMILES notation for chemical structure direct
kegg 0..1
String
KEGG compound ID (C##### format) direct
chebi 0..1
String
ChEBI compound ID direct

Identifier and Mapping Information

Annotations

property value
source_table metabolomics_gold

Schema Source

  • from schema: https://w3id.org/kbase/nmdc_core

Mappings

Mapping Type Mapped Value
self https://w3id.org/kbase/nmdc_core/MetabolomicsGold
native https://w3id.org/kbase/nmdc_core/MetabolomicsGold

LinkML Source

Direct

name: MetabolomicsGold
annotations:
  source_table:
    tag: source_table
    value: metabolomics_gold
description: 'Metabolomics data linked to GOLD samples. Contains compound identifications
  with mass spec features and chemical identifiers.

  TOTAL RECORDS: 3,129,061+ mass spec features

  DATA INCLUDES: - Mass-to-charge ratios (m/z) for detected features - Retention times
  from chromatography - Peak intensities and areas - Molecular formulas (when determined)
  - Chemical identifiers (SMILES, InChI, InChIKey when known) - Links to KEGG and
  ChEBI compound databases

  USAGE: For metabolomics analysis, join with study_table via file_id. Many features
  are unidentified (name is null). Use mz and retention time for feature matching
  across samples.'
from_schema: https://w3id.org/kbase/nmdc_core
attributes:
  file_id:
    name: file_id
    description: NMDC data object ID for source file
    examples:
    - value: nmdc:dobj-12-3j09wy18
      description: Metabolomics data object ID
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  file_name:
    name: file_name
    description: Original CSV file name
    examples:
    - value: 4A8_2_41_pinto-63-s028-a03.csv
      description: Sample name encoded in filename
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  feature_id:
    name: feature_id
    description: Metabolite feature ID (unique within file)
    examples:
    - value: '16348'
    - value: '1653'
    - value: '19466'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  name:
    name: name
    description: Compound name when identified. NULL for unidentified features (most
      features are unidentified initially).
    from_schema: https://w3id.org/kbase/nmdc_core
    domain_of:
    - AnnotationTermsUnified
    - GoTerms
    - EcTerms
    - KeggKoTerms
    - KeggPathwayTerms
    - StudyTable
    - MetabolomicsGold
    - MetacycPathways
    range: string
  mz:
    name: mz
    description: Mass-to-charge ratio (m/z). Primary identifier for mass spec features.
      Precision typically to 4-6 decimal places.
    examples:
    - value: '493.3288007596016'
      description: Higher mass feature
    - value: '447.25243219640714'
      description: Medium mass feature
    - value: '322.23733159062687'
      description: Lower mass feature
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    slot_uri: nmdc:mz
    domain_of:
    - MetabolomicsGold
    range: float
    minimum_value: 0.0
  Retention_Time_min:
    name: Retention_Time_min
    description: Chromatographic retention time in minutes. Second dimension for feature
      identification after m/z.
    examples:
    - value: '6.8750589'
    - value: '7.5899288'
    - value: '5.0135949'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    slot_uri: nmdc:retention_time
    domain_of:
    - MetabolomicsGold
    range: float
    minimum_value: 0.0
  Intensity:
    name: Intensity
    description: Signal intensity (peak height). Proportional to compound abundance
      but not directly comparable across samples.
    examples:
    - value: '7355465.5'
      description: Moderate intensity
    - value: '51934152.0'
      description: High intensity feature
    - value: '6229143.0'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: float
    minimum_value: 0.0
  Molecular_Formula:
    name: Molecular_Formula
    description: 'Chemical formula when determined from isotope patterns. Format:
      Elements with counts and charges.'
    examples:
    - value: C23 H36 O3 N4 P1
      description: Larger molecule with phosphorus
    - value: C19 H32 O3 N1
      description: Medium-sized molecule
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  inchi:
    name: inchi
    description: InChI (International Chemical Identifier) string
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  inchikey:
    name: inchikey
    description: 'InChIKey - 27-character hash of InChI for database searching. Format:
      14 chars - 10 chars - 1 char checksum.'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
    pattern: '[A-Z]{14}-[A-Z]{10}-[A-Z]'
  smiles:
    name: smiles
    description: SMILES notation for chemical structure
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  kegg:
    name: kegg
    description: KEGG compound ID (C##### format)
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string
  chebi:
    name: chebi
    description: ChEBI compound ID
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    domain_of:
    - MetabolomicsGold
    range: string

Induced

name: MetabolomicsGold
annotations:
  source_table:
    tag: source_table
    value: metabolomics_gold
description: 'Metabolomics data linked to GOLD samples. Contains compound identifications
  with mass spec features and chemical identifiers.

  TOTAL RECORDS: 3,129,061+ mass spec features

  DATA INCLUDES: - Mass-to-charge ratios (m/z) for detected features - Retention times
  from chromatography - Peak intensities and areas - Molecular formulas (when determined)
  - Chemical identifiers (SMILES, InChI, InChIKey when known) - Links to KEGG and
  ChEBI compound databases

  USAGE: For metabolomics analysis, join with study_table via file_id. Many features
  are unidentified (name is null). Use mz and retention time for feature matching
  across samples.'
from_schema: https://w3id.org/kbase/nmdc_core
attributes:
  file_id:
    name: file_id
    description: NMDC data object ID for source file
    examples:
    - value: nmdc:dobj-12-3j09wy18
      description: Metabolomics data object ID
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: file_id
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  file_name:
    name: file_name
    description: Original CSV file name
    examples:
    - value: 4A8_2_41_pinto-63-s028-a03.csv
      description: Sample name encoded in filename
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: file_name
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  feature_id:
    name: feature_id
    description: Metabolite feature ID (unique within file)
    examples:
    - value: '16348'
    - value: '1653'
    - value: '19466'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: feature_id
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  name:
    name: name
    description: Compound name when identified. NULL for unidentified features (most
      features are unidentified initially).
    from_schema: https://w3id.org/kbase/nmdc_core
    alias: name
    owner: MetabolomicsGold
    domain_of:
    - AnnotationTermsUnified
    - GoTerms
    - EcTerms
    - KeggKoTerms
    - KeggPathwayTerms
    - StudyTable
    - MetabolomicsGold
    - MetacycPathways
    range: string
  mz:
    name: mz
    description: Mass-to-charge ratio (m/z). Primary identifier for mass spec features.
      Precision typically to 4-6 decimal places.
    examples:
    - value: '493.3288007596016'
      description: Higher mass feature
    - value: '447.25243219640714'
      description: Medium mass feature
    - value: '322.23733159062687'
      description: Lower mass feature
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    slot_uri: nmdc:mz
    alias: mz
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: float
    minimum_value: 0.0
  Retention_Time_min:
    name: Retention_Time_min
    description: Chromatographic retention time in minutes. Second dimension for feature
      identification after m/z.
    examples:
    - value: '6.8750589'
    - value: '7.5899288'
    - value: '5.0135949'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    slot_uri: nmdc:retention_time
    alias: Retention_Time_min
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: float
    minimum_value: 0.0
  Intensity:
    name: Intensity
    description: Signal intensity (peak height). Proportional to compound abundance
      but not directly comparable across samples.
    examples:
    - value: '7355465.5'
      description: Moderate intensity
    - value: '51934152.0'
      description: High intensity feature
    - value: '6229143.0'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: Intensity
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: float
    minimum_value: 0.0
  Molecular_Formula:
    name: Molecular_Formula
    description: 'Chemical formula when determined from isotope patterns. Format:
      Elements with counts and charges.'
    examples:
    - value: C23 H36 O3 N4 P1
      description: Larger molecule with phosphorus
    - value: C19 H32 O3 N1
      description: Medium-sized molecule
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: Molecular_Formula
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  inchi:
    name: inchi
    description: InChI (International Chemical Identifier) string
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: inchi
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  inchikey:
    name: inchikey
    description: 'InChIKey - 27-character hash of InChI for database searching. Format:
      14 chars - 10 chars - 1 char checksum.'
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: inchikey
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
    pattern: '[A-Z]{14}-[A-Z]{10}-[A-Z]'
  smiles:
    name: smiles
    description: SMILES notation for chemical structure
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: smiles
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  kegg:
    name: kegg
    description: KEGG compound ID (C##### format)
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: kegg
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string
  chebi:
    name: chebi
    description: ChEBI compound ID
    from_schema: https://w3id.org/kbase/nmdc_core
    rank: 1000
    alias: chebi
    owner: MetabolomicsGold
    domain_of:
    - MetabolomicsGold
    range: string