Class: SampleTokensV1
Sample-level token assignments from vocabulary. Sparse representation of sample features for ML models.
USAGE: Each row represents a token-sample-modality combination with an associated value. Use to reconstruct sample feature vectors or train models on tokenized representations.
URI: https://w3id.org/kbase/nmdc_core/SampleTokensV1
classDiagram
class SampleTokensV1
click SampleTokensV1 href "../SampleTokensV1/"
SampleTokensV1 : modality_id
SampleTokensV1 : sample_id
SampleTokensV1 : token_id
SampleTokensV1 : value
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| sample_id | 1 String |
Sample identifier | direct |
| token_id | 1 Integer |
Vocabulary token ID from vocab_registry_v1 | direct |
| modality_id | 0..1 String |
Data modality (taxonomy, trait, abiotic, biochemical) | direct |
| value | 0..1 Float |
Token value/weight (e | direct |
Identifier and Mapping Information
Annotations
| property | value |
|---|---|
| source_table | sample_tokens_v1 |
Schema Source
- from schema: https://w3id.org/kbase/nmdc_core
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | https://w3id.org/kbase/nmdc_core/SampleTokensV1 |
| native | https://w3id.org/kbase/nmdc_core/SampleTokensV1 |
LinkML Source
Direct
name: SampleTokensV1
annotations:
source_table:
tag: source_table
value: sample_tokens_v1
description: 'Sample-level token assignments from vocabulary. Sparse representation
of sample features for ML models.
USAGE: Each row represents a token-sample-modality combination with an associated
value. Use to reconstruct sample feature vectors or train models on tokenized representations.'
from_schema: https://w3id.org/kbase/nmdc_core
attributes:
sample_id:
name: sample_id
description: Sample identifier
examples:
- value: nmdc:bsm-11-3a6fv767
from_schema: https://w3id.org/kbase/nmdc_core
rank: 1000
domain_of:
- SampleTokensV1
range: string
required: true
token_id:
name: token_id
description: Vocabulary token ID from vocab_registry_v1
comments:
- Foreign key to VocabRegistryV1.token_id
from_schema: https://w3id.org/kbase/nmdc_core
domain_of:
- VocabRegistryV1
- SampleTokensV1
range: integer
required: true
minimum_value: 0
modality_id:
name: modality_id
description: Data modality (taxonomy, trait, abiotic, biochemical)
examples:
- value: taxonomy
- value: trait
- value: biochemical
- value: abiotic
from_schema: https://w3id.org/kbase/nmdc_core
rank: 1000
domain_of:
- SampleTokensV1
range: string
value:
name: value
description: Token value/weight (e.g., abundance, presence score)
from_schema: https://w3id.org/kbase/nmdc_core
rank: 1000
domain_of:
- SampleTokensV1
range: float
Induced
name: SampleTokensV1
annotations:
source_table:
tag: source_table
value: sample_tokens_v1
description: 'Sample-level token assignments from vocabulary. Sparse representation
of sample features for ML models.
USAGE: Each row represents a token-sample-modality combination with an associated
value. Use to reconstruct sample feature vectors or train models on tokenized representations.'
from_schema: https://w3id.org/kbase/nmdc_core
attributes:
sample_id:
name: sample_id
description: Sample identifier
examples:
- value: nmdc:bsm-11-3a6fv767
from_schema: https://w3id.org/kbase/nmdc_core
rank: 1000
alias: sample_id
owner: SampleTokensV1
domain_of:
- SampleTokensV1
range: string
required: true
token_id:
name: token_id
description: Vocabulary token ID from vocab_registry_v1
comments:
- Foreign key to VocabRegistryV1.token_id
from_schema: https://w3id.org/kbase/nmdc_core
alias: token_id
owner: SampleTokensV1
domain_of:
- VocabRegistryV1
- SampleTokensV1
range: integer
required: true
minimum_value: 0
modality_id:
name: modality_id
description: Data modality (taxonomy, trait, abiotic, biochemical)
examples:
- value: taxonomy
- value: trait
- value: biochemical
- value: abiotic
from_schema: https://w3id.org/kbase/nmdc_core
rank: 1000
alias: modality_id
owner: SampleTokensV1
domain_of:
- SampleTokensV1
range: string
value:
name: value
description: Token value/weight (e.g., abundance, presence score)
from_schema: https://w3id.org/kbase/nmdc_core
rank: 1000
alias: value
owner: SampleTokensV1
domain_of:
- SampleTokensV1
range: float