Refactor Sem multi-group support by alyst · Pull Request #317 · StructuralEquationModels/StructuralEquationModels.jl

alyst · 2026-03-09T22:47:46Z

This is a largest remaining part of #193, which changes some interfaces.

Refactoring of the SEM types

AbstractLoss is the base type for all functions
SemLoss{O,I} <: AbstractLoss is the base type for all SEM losses, it now requires to have observed::O and implied::I field
Since SemLoss ctor should always be given observed and implied (positional), meanstructure keyword is gone -- loss should always respect implied specification.
LossTerm is a thin wrapper around AbstractLoss that adds optional id of the loss term and optional weight
Sem is a container of LossTerm objects (accessible via loss_terms(sem), or loss_term(sem, id)), so it can handle multiple SEM terms (accessible via sem_terms(sem) -- subset of loss_terms(sem), or sem_term(sem, id)).
It replaces both the old Sem and SemEnsemble.
AbstractSingleSem, AbstractSemCollection and SemEnsemble are gone.

Method changes

Multi-term SEMs could be created like

model = Sem(
    :Pasteur => SemML(obs_g1, RAMSymbolic(specification_g1)),
    :Grant_White => SemML(obs_g2, RAM(specification_g2)),
    ...
)

Or with weights specification

model = Sem(
    :Pasteur => SemML(obs_g1, RAMSymbolic(specification_g1)) => 0.5,
    :Grant_White => SemML(obs_g2, RAM(specification_g2)) => 0.6,
)

The new Sem() and loss-term constructors rely less on keyword arguments and more on positional arguments, but some keywords support is present.

update_observed!() was removed. It was only used by replace_observed(),
but otherwise in-place model modification with unclear semantics is error-prone.
replace_observed(sem, data) was simplified by removing support of additional keywords or requirement to pass SEM specification.
It only creates a copy of the given Sem with the observed data replaced,
but implied and loss definitions intact.
Changing observed vars is not supported -- that is something use-case specific
that user should implement in their code.
check_single_lossfun() was renamed into check_same_semterm_type() as
it better describes what it does. If check is successful, it returns the specific
subtype of SemLoss.
bootstrap() and se_bootstrap() use bootstrap!(acc::BootstrapAccumulator, ...)
function to reduce code duplication
bootstrap() returns BootstrapResult{T} for better type inference
fit_measures() now also accepts vector of functions, and includes CFI by default (DEFAULT_FIT_MEASURES constant)
test_fitmeasures() was tweaked to handle more repetitive code: calculating the subset of fit measures, and compairing this subset against lavaan refs, checking for measures that could not be applied to given loss types (SemWLS).

- for SemImplied require spec::SemSpec as positional - for SemLossFunction require implied argument

deduplicate the correction scale methods and move to Sem.jl

remove update_observed!()

to suppress info about inv(obs_cov)

also add CFI to the list

Maximilian-Stefan-Ernst · 2026-03-24T16:25:45Z

docs/src/tutorials/collection/collection.md

+In this case, [`FiniteDiffWrapper`](@ref) method to generate a wrapper around the specific `SemLoss` term that only uses its objective
+to calculate the gradient using the finite difference approximation.


Suggested change

In this case, [`FiniteDiffWrapper`](@ref) method to generate a wrapper around the specific `SemLoss` term that only uses its objective

to calculate the gradient using the finite difference approximation.

In this case, [`FiniteDiffWrapper`](@ref) can be used to generate a wrapper around the specific `SemLoss` term. This wrapper only uses the `LossTerm`s objective, and calculates the gradient using finite difference approximation.

alyst · 2026-03-24T17:06:17Z

@Maximilian-Stefan-Ernst It might be a nice idea to use copilot for catching typos, incorrect sentences, but also potential bugs.
I cannot select copilot as a reviewer -- I'm not exactly sure why, whether it is the organization/repository-level setting, or it's my status in the repository.
But I'm also fine if SEM.jl is kept AI-free :)

Maximilian-Stefan-Ernst · 2026-03-25T12:44:27Z

src/frontend/fit/fitmeasures/chi2.jl

-function χ²(fit::SemFit, model::AbstractSemSingle)
-    check_single_lossfun(model; throw_error = true)
-    return χ²(model.loss.functions[1], fit::SemFit, model::AbstractSemSingle)
+    return χ²(typeof(term1), fit, model)


Is there a reason to pass typeof(term1) instead of term1? I personally find the syntax a bit cleaner without the extra typeof call.

Maximilian-Stefan-Ernst · 2026-03-25T12:54:12Z

src/frontend/fit/fitmeasures/chi2.jl

-############################################################################################
+function χ²(fit::SemFit, model::AbstractSem)
+    terms = sem_terms(model)
+    isempty(terms) && return 0.0


Maybe we should throw an error for a Sem with no terms?

Maximilian-Stefan-Ernst · 2026-03-25T12:56:11Z

src/frontend/fit/fitmeasures/chi2.jl

+    term1 = _unwrap(loss(terms[1]))
+    L = typeof(term1).name
+
+    # check that all SemLoss terms are of the same class (ML, FIML, WLS etc), ignore typeparams
+    for (i, term) in enumerate(terms)
+        lossterm = _unwrap(loss(term))
+        @assert lossterm isa SemLoss
+        if typeof(_unwrap(lossterm)).name != L
+            @error "SemLoss term #$i is $(typeof(_unwrap(lossterm)).name), expected $L. Heterogeneous loss functions are not supported"
+        end
+    end


I thought this is done in check_semterm_type?

Suggested change

term1 = _unwrap(loss(terms[1]))

L = typeof(term1).name

# check that all SemLoss terms are of the same class (ML, FIML, WLS etc), ignore typeparams

for (i, term) in enumerate(terms)

lossterm = _unwrap(loss(term))

@assert lossterm isa SemLoss

if typeof(_unwrap(lossterm)).name != L

@error "SemLoss term #$i is $(typeof(_unwrap(lossterm)).name), expected $L. Heterogeneous loss functions are not supported"

end

end

alyst changed the base branch from main to devel March 9, 2026 22:48

alyst force-pushed the refactor_sem_terms branch from 3c39941 to 32cea82 Compare March 11, 2026 20:31

Alexey Stukalov and others added 3 commits March 21, 2026 11:03

Project.toml: support Symbolics v7 & Utils v4

9c516fd

prepare_start_params(): tighten type check

6e1ffaa

SemImplied/SemLossFun: drop meanstructure kwarg

32068de

- for SemImplied require spec::SemSpec as positional - for SemLossFunction require implied argument

alyst force-pushed the refactor_sem_terms branch from 32cea82 to eb039a2 Compare March 23, 2026 04:49

alyst and others added 10 commits March 23, 2026 00:31

refactor Sem, SemEnsemble, SemLoss

0bcbf05

params/param_labels(): use both as synonyms for now

b98ff09

check_same_semterm_type(): refactor check_single_lossfun()

0f61c1b

update multi-group correction

8a2393c

deduplicate the correction scale methods and move to Sem.jl

replace_observed(): simplify & refactor

355d1bf

remove update_observed!()

bootstrap: sync with Sem updates

39d5854

CFI: sync with Sem refactor

4b6af2e

test/build_models: remove redundant model

0f1afca

revert using

32261a5

WLS: verbose option

93f424a

to suppress info about inv(obs_cov)

alyst force-pushed the refactor_sem_terms branch from eb039a2 to 88a1ff0 Compare March 23, 2026 07:33

alyst changed the title ~~Refactor Sem mult-group support~~ Refactor Sem multi-group support Mar 23, 2026

alyst marked this pull request as ready for review March 23, 2026 08:12

Alexey Stukalov added 5 commits March 23, 2026 10:51

docs: sync with Sem refactor

18c0758

test: fix formatting

f8cd368

fit_measures(): support vectors of funcs

abe6636

also add CFI to the list

test_fitmeasures(): refactor/simplify

7272871

test/multigroup: small tweaks

0406f29

alyst force-pushed the refactor_sem_terms branch from 88a1ff0 to 0406f29 Compare March 23, 2026 17:51

Maximilian-Stefan-Ernst reviewed Mar 24, 2026

View reviewed changes

Maximilian-Stefan-Ernst reviewed Mar 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor Sem multi-group support#317

Refactor Sem multi-group support#317
alyst wants to merge 18 commits intoStructuralEquationModels:develfrom
alyst:refactor_sem_terms

alyst commented Mar 9, 2026 •

edited

Loading

Uh oh!

Maximilian-Stefan-Ernst Mar 24, 2026

Uh oh!

alyst commented Mar 24, 2026

Uh oh!

Maximilian-Stefan-Ernst Mar 25, 2026

Uh oh!

Maximilian-Stefan-Ernst Mar 25, 2026

Uh oh!

Maximilian-Stefan-Ernst Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		In this case, [`FiniteDiffWrapper`](@ref) method to generate a wrapper around the specific `SemLoss` term that only uses its objective
		to calculate the gradient using the finite difference approximation.

Conversation

alyst commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Refactoring of the SEM types

Method changes

Uh oh!

Maximilian-Stefan-Ernst Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

alyst commented Mar 24, 2026

Uh oh!

Maximilian-Stefan-Ernst Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

Maximilian-Stefan-Ernst Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

Maximilian-Stefan-Ernst Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alyst commented Mar 9, 2026 •

edited

Loading