Skip to content

Add GLBRC UFMP fermentation community (closes #17)#22

Merged
realmarcin merged 1 commit intomainfrom
add-glbrc-ufmp-fermentation-community
Mar 6, 2026
Merged

Add GLBRC UFMP fermentation community (closes #17)#22
realmarcin merged 1 commit intomainfrom
add-glbrc-ufmp-fermentation-community

Conversation

@cmungall
Copy link
Collaborator

@cmungall cmungall commented Mar 5, 2026

Summary

  • Adds new community YAML for the GLBRC ultra-filtered milk permeate (UFMP) fermentation bioreactor community (PMID:37324413, doi:10.3389/fbioe.2023.1173656)
  • 282-day continuous anaerobic bioreactor converting dairy coproducts into renewable chemicals via carboxylate platform fermentation
  • 10 MAG-resolved taxa spanning Actinobacteriota (Olsenella, Bifidobacterium, Pauljensenia), Firmicutes (Clostridium, Agathobacter, Acidaminococcus), and Spirochaetota
  • 3 ecological interactions modeling the Actinobacteriota-Firmicutes cross-feeding network: lactose degradation -> organic acid production -> chain elongation to medium-chain fatty acids (butyrate, hexanoate, octanoate)
  • 8 metabolites with CHEBI terms, environmental factors (35C, pH 5.5, anaerobic, 6-day HRT), and metagenome dataset (PRJNA768492)

Test plan

  • just validate kb/communities/GLBRC_UFMP_Fermentation_Community.yaml passes with no issues

Closes #17

🤖 Generated with Claude Code

New community YAML for the GLBRC ultra-filtered milk permeate
fermentation bioreactor (PMID:37324413). 282-day continuous reactor
converting dairy coproducts to renewable chemicals via carboxylate
platform. 10 MAG-resolved taxa with Actinobacteriota-Firmicutes
cross-feeding network (lactose -> organic acids -> medium-chain
fatty acids).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings March 5, 2026 15:05
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a new community YAML entry for the GLBRC ultra-filtered milk permeate (UFMP) fermentation bioreactor community, as requested in issue #17. The community models a 282-day continuous anaerobic bioreactor converting dairy coproducts into renewable chemicals via carboxylate platform fermentation, based on PMID:37324413.

Changes:

  • Adds a new community YAML file with 10 MAG-resolved taxa, 3 ecological interactions modeling the Actinobacteriota-Firmicutes cross-feeding network, 8 metabolites with CHEBI terms, environmental factors, and a metagenome dataset reference.
  • All schema enums are valid and the file structure matches the LinkML schema definition.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

notes: Product of Leloir pathway and bifid shunt
- preferred_term: lactic acid
term:
id: CHEBI:422
Copy link

Copilot AI Mar 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The CHEBI ID for lactate is inconsistent with the rest of the codebase. CHEBI:422 is specifically "(S)-lactate", whereas every other community file in the repository (DVM_Triculture.yaml, Desulfovibrio_Methanococcus_Syntrophy.yaml, Trichoderma_Lactate_Platform.yaml, Phormidium_Alkaline_Consortium.yaml, Polaromonas_Vanadium_Reduction_Community.yaml, Panzhihua_Vanadium_Titanium_Tailings.yaml) consistently uses CHEBI:24996 for the general "lactate" term. This should be changed to CHEBI:24996 in all three occurrences (lines 308, 336, 390) for consistency.

Suggested change
id: CHEBI:422
id: CHEBI:24996

Copilot uses AI. Check for mistakes.
Comment on lines +336 to +337
id: CHEBI:422
label: lactate
Copy link

Copilot AI Mar 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same CHEBI ID inconsistency as noted at line 308. CHEBI:422 ("(S)-lactate") should be CHEBI:24996 ("lactate") to match the established codebase convention (see e.g. DVM_Triculture.yaml:115, Trichoderma_Lactate_Platform.yaml:140).

Copilot uses AI. Check for mistakes.
metabolites:
- preferred_term: lactic acid
term:
id: CHEBI:422
Copy link

Copilot AI Mar 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same CHEBI ID inconsistency as noted at line 308. CHEBI:422 ("(S)-lactate") should be CHEBI:24996 ("lactate") to match the established codebase convention.

Suggested change
id: CHEBI:422
id: CHEBI:24996

Copilot uses AI. Check for mistakes.
notes: Primary substrate from ultra-filtered milk permeate
- preferred_term: acetic acid
term:
id: CHEBI:15366
Copy link

Copilot AI Mar 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The CHEBI ID for acetate is inconsistent with the rest of the codebase. CHEBI:15366 is "acetic acid" (the protonated form), while every other community file in the repository consistently uses CHEBI:30089 for "acetate" (the conjugate base). See e.g. DVM_Triculture.yaml:120, Desulfovibrio_Methanococcus_Syntrophy.yaml:90, Trichoderma_Lactate_Platform.yaml:144, and many others. This should be changed to CHEBI:30089 with label acetate.

Suggested change
id: CHEBI:15366
id: CHEBI:30089

Copilot uses AI. Check for mistakes.
label: Spirochaetota
notes: >
Spirochaetota member (up to 16.1% relative abundance). Produces ethanol which serves
as substrate for chain elongation by Firmicutes.
Copy link

Copilot AI Mar 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The NCBITaxon ID and label are at the phylum level (NCBITaxon:203691 = Spirochaetota), but the preferred_term identifies this organism as "RUG023 sp." which is a genus-level designation (a GTDB-defined genus within Spirochaetota). Every other taxonomy entry in this file and across the codebase uses genus-level or species-level IDs (e.g., NCBITaxon:133926 for Olsenella, NCBITaxon:1485 for Clostridium). If a genus-level NCBI Taxonomy ID is not available for RUG023, the label should at minimum be updated to clarify this mismatch, or the notes should explain that RUG023 is a GTDB genus without an NCBI Taxonomy entry and that the phylum is used as a placeholder.

Suggested change
as substrate for chain elongation by Firmicutes.
as substrate for chain elongation by Firmicutes. RUG023 is a GTDB-defined genus within
Spirochaetota that currently lacks a genus-level NCBI Taxonomy identifier; the
phylum-level NCBITaxon:203691 is used here as a placeholder for this organism.

Copilot uses AI. Check for mistakes.
Comment on lines +185 to +191
supports: SUPPORT
evidence_source: IN_VITRO
snippet: >
members of the Firmicutes phylum contribute to the chain-elongation-mediated production
of butyric, hexanoic, and octanoic acids, with different microbes using either lactose,
ethanol, or lactic acid as the growth substrate
explanation: RUG023 sp. SPH2 produces ethanol used as chain elongation substrate
Copy link

Copilot AI Mar 5, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The evidence snippet for this Spirochaetota/RUG023 taxon discusses Firmicutes chain elongation, not the claimed ethanol production by RUG023 sp. SPH2. The snippet says "members of the Firmicutes phylum contribute to the chain-elongation-mediated production of butyric, hexanoic, and octanoic acids" — this does not support the explanation that "RUG023 sp. SPH2 produces ethanol used as chain elongation substrate." A more relevant snippet specifically about ethanol production by Spirochaetota members should be used, or the supports value should be changed to PARTIAL with an explanation noting the indirect relationship.

Suggested change
supports: SUPPORT
evidence_source: IN_VITRO
snippet: >
members of the Firmicutes phylum contribute to the chain-elongation-mediated production
of butyric, hexanoic, and octanoic acids, with different microbes using either lactose,
ethanol, or lactic acid as the growth substrate
explanation: RUG023 sp. SPH2 produces ethanol used as chain elongation substrate
supports: PARTIAL
evidence_source: IN_VITRO
snippet: >
members of the Firmicutes phylum contribute to the chain-elongation-mediated production
of butyric, hexanoic, and octanoic acids, with different microbes using either lactose,
ethanol, or lactic acid as the growth substrate
explanation: This study shows that Firmicutes perform chain elongation using ethanol as one
of several possible substrates; combined with the inferred fermentative role of RUG023 sp.
SPH2 in this community, this provides indirect support that SPH2 produces ethanol that can
serve as a substrate for Firmicutes chain elongators

Copilot uses AI. Check for mistakes.
@realmarcin realmarcin merged commit 630b1fd into main Mar 6, 2026
4 checks passed
@realmarcin realmarcin deleted the add-glbrc-ufmp-fermentation-community branch March 6, 2026 09:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add GLBRC bioreactor fermentation community

3 participants