Add GLBRC UFMP fermentation community (closes #17)#22
Conversation
New community YAML for the GLBRC ultra-filtered milk permeate fermentation bioreactor (PMID:37324413). 282-day continuous reactor converting dairy coproducts to renewable chemicals via carboxylate platform. 10 MAG-resolved taxa with Actinobacteriota-Firmicutes cross-feeding network (lactose -> organic acids -> medium-chain fatty acids). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
There was a problem hiding this comment.
Pull request overview
This PR adds a new community YAML entry for the GLBRC ultra-filtered milk permeate (UFMP) fermentation bioreactor community, as requested in issue #17. The community models a 282-day continuous anaerobic bioreactor converting dairy coproducts into renewable chemicals via carboxylate platform fermentation, based on PMID:37324413.
Changes:
- Adds a new community YAML file with 10 MAG-resolved taxa, 3 ecological interactions modeling the Actinobacteriota-Firmicutes cross-feeding network, 8 metabolites with CHEBI terms, environmental factors, and a metagenome dataset reference.
- All schema enums are valid and the file structure matches the LinkML schema definition.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
| notes: Product of Leloir pathway and bifid shunt | ||
| - preferred_term: lactic acid | ||
| term: | ||
| id: CHEBI:422 |
There was a problem hiding this comment.
The CHEBI ID for lactate is inconsistent with the rest of the codebase. CHEBI:422 is specifically "(S)-lactate", whereas every other community file in the repository (DVM_Triculture.yaml, Desulfovibrio_Methanococcus_Syntrophy.yaml, Trichoderma_Lactate_Platform.yaml, Phormidium_Alkaline_Consortium.yaml, Polaromonas_Vanadium_Reduction_Community.yaml, Panzhihua_Vanadium_Titanium_Tailings.yaml) consistently uses CHEBI:24996 for the general "lactate" term. This should be changed to CHEBI:24996 in all three occurrences (lines 308, 336, 390) for consistency.
| id: CHEBI:422 | |
| id: CHEBI:24996 |
| id: CHEBI:422 | ||
| label: lactate |
There was a problem hiding this comment.
Same CHEBI ID inconsistency as noted at line 308. CHEBI:422 ("(S)-lactate") should be CHEBI:24996 ("lactate") to match the established codebase convention (see e.g. DVM_Triculture.yaml:115, Trichoderma_Lactate_Platform.yaml:140).
| metabolites: | ||
| - preferred_term: lactic acid | ||
| term: | ||
| id: CHEBI:422 |
There was a problem hiding this comment.
Same CHEBI ID inconsistency as noted at line 308. CHEBI:422 ("(S)-lactate") should be CHEBI:24996 ("lactate") to match the established codebase convention.
| id: CHEBI:422 | |
| id: CHEBI:24996 |
| notes: Primary substrate from ultra-filtered milk permeate | ||
| - preferred_term: acetic acid | ||
| term: | ||
| id: CHEBI:15366 |
There was a problem hiding this comment.
The CHEBI ID for acetate is inconsistent with the rest of the codebase. CHEBI:15366 is "acetic acid" (the protonated form), while every other community file in the repository consistently uses CHEBI:30089 for "acetate" (the conjugate base). See e.g. DVM_Triculture.yaml:120, Desulfovibrio_Methanococcus_Syntrophy.yaml:90, Trichoderma_Lactate_Platform.yaml:144, and many others. This should be changed to CHEBI:30089 with label acetate.
| id: CHEBI:15366 | |
| id: CHEBI:30089 |
| label: Spirochaetota | ||
| notes: > | ||
| Spirochaetota member (up to 16.1% relative abundance). Produces ethanol which serves | ||
| as substrate for chain elongation by Firmicutes. |
There was a problem hiding this comment.
The NCBITaxon ID and label are at the phylum level (NCBITaxon:203691 = Spirochaetota), but the preferred_term identifies this organism as "RUG023 sp." which is a genus-level designation (a GTDB-defined genus within Spirochaetota). Every other taxonomy entry in this file and across the codebase uses genus-level or species-level IDs (e.g., NCBITaxon:133926 for Olsenella, NCBITaxon:1485 for Clostridium). If a genus-level NCBI Taxonomy ID is not available for RUG023, the label should at minimum be updated to clarify this mismatch, or the notes should explain that RUG023 is a GTDB genus without an NCBI Taxonomy entry and that the phylum is used as a placeholder.
| as substrate for chain elongation by Firmicutes. | |
| as substrate for chain elongation by Firmicutes. RUG023 is a GTDB-defined genus within | |
| Spirochaetota that currently lacks a genus-level NCBI Taxonomy identifier; the | |
| phylum-level NCBITaxon:203691 is used here as a placeholder for this organism. |
| supports: SUPPORT | ||
| evidence_source: IN_VITRO | ||
| snippet: > | ||
| members of the Firmicutes phylum contribute to the chain-elongation-mediated production | ||
| of butyric, hexanoic, and octanoic acids, with different microbes using either lactose, | ||
| ethanol, or lactic acid as the growth substrate | ||
| explanation: RUG023 sp. SPH2 produces ethanol used as chain elongation substrate |
There was a problem hiding this comment.
The evidence snippet for this Spirochaetota/RUG023 taxon discusses Firmicutes chain elongation, not the claimed ethanol production by RUG023 sp. SPH2. The snippet says "members of the Firmicutes phylum contribute to the chain-elongation-mediated production of butyric, hexanoic, and octanoic acids" — this does not support the explanation that "RUG023 sp. SPH2 produces ethanol used as chain elongation substrate." A more relevant snippet specifically about ethanol production by Spirochaetota members should be used, or the supports value should be changed to PARTIAL with an explanation noting the indirect relationship.
| supports: SUPPORT | |
| evidence_source: IN_VITRO | |
| snippet: > | |
| members of the Firmicutes phylum contribute to the chain-elongation-mediated production | |
| of butyric, hexanoic, and octanoic acids, with different microbes using either lactose, | |
| ethanol, or lactic acid as the growth substrate | |
| explanation: RUG023 sp. SPH2 produces ethanol used as chain elongation substrate | |
| supports: PARTIAL | |
| evidence_source: IN_VITRO | |
| snippet: > | |
| members of the Firmicutes phylum contribute to the chain-elongation-mediated production | |
| of butyric, hexanoic, and octanoic acids, with different microbes using either lactose, | |
| ethanol, or lactic acid as the growth substrate | |
| explanation: This study shows that Firmicutes perform chain elongation using ethanol as one | |
| of several possible substrates; combined with the inferred fermentative role of RUG023 sp. | |
| SPH2 in this community, this provides indirect support that SPH2 produces ethanol that can | |
| serve as a substrate for Firmicutes chain elongators |
Summary
Test plan
just validate kb/communities/GLBRC_UFMP_Fermentation_Community.yamlpasses with no issuesCloses #17
🤖 Generated with Claude Code