
(Feature) HDN minimum example #396

Draft

CatEek wants to merge 92 commits into main from hdn_config

Conversation

@CatEek (Contributor) commented Feb 10, 2025

Description

Note

TL;DR: a minimum working HDN integrated into CAREamist.

Unfortunately, many of the modifications are artifacts of the merge with main. Also, keep in mind that this is a minimal example that is not supposed to be used as is, so I believe we can leave certain things up to future refactoring.

Background - why do we need this PR?

HDN is currently not available in CAREamics, but most of the required features are already present (noise model, LVAE model).
It will use the existing LVAE model code. The difference from MicroSplit is that HDN is unsupervised (the targets for loss computation are the inputs themselves) and the loss function is different (but uses the MicroSplit loss under the hood).
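To make the unsupervised setup concrete, here is a minimal, hypothetical sketch (the names `microsplit_style_loss` and `hdn_loss` are illustrative, not the CAREamics API): the HDN loss feeds the input back in as the reconstruction target while delegating to a MicroSplit-style loss.

```python
import torch
import torch.nn.functional as F


def microsplit_style_loss(reconstruction: torch.Tensor,
                          target: torch.Tensor,
                          kl: torch.Tensor) -> torch.Tensor:
    # Stand-in for the reused MicroSplit loss:
    # a reconstruction term plus a KL-divergence term.
    return F.mse_loss(reconstruction, target) + kl.mean()


def hdn_loss(reconstruction: torch.Tensor,
             x: torch.Tensor,
             kl: torch.Tensor) -> torch.Tensor:
    # Unsupervised: the target is the input itself.
    return microsplit_style_loss(reconstruction, target=x, kl=kl)
```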

Overview - what changed?

New configurations for HDN, plus modifications to the LVAE code base to make it compatible.

New features or files

  • HDN-relevant code in careamist.py
  • relevant config in hdn_algorithm_model.py
  • relevant changes in vae_algorithm_model.py
  • relevant changes in lvae_model.py
  • HDN configuration code in configuration_factory.py and relevant modules
  • HDN-relevant code in lightning_module.py
  • HDN code in losses.py and the loss factory
  • etc.

Configuration:

  • Added: HDNAlgorithm algorithm configuration.
  • Modified: added hdn to all relevant configurations.
  • Added: hdn_loss, which uses the MicroSplit losses internally.

How has this been tested?

Created a notebook in the examples repo to check performance on the BSD dataset without a noise model.

Related Issues

  • Resolves #

Breaking changes

Additional Notes and Examples

  • BMZ export doesn't work; it raises NotImplemented because the model outputs are incompatible.
  • 3D isn't tested and would need another PR.
  • HDN with a noise model isn't tested.

Please ensure your PR meets the following requirements:

  • Code builds and passes tests locally, including doctests
  • New tests have been added (for bug fixes/features)
  • Pre-commit passes
  • PR to the documentation exists (for bug fixes / features)

@jdeschamps jdeschamps marked this pull request as draft February 10, 2025 15:02
@CatEek CatEek changed the title HDN minimum example (Feature) HDN minimum example Feb 27, 2025
Comment on lines +67 to +74
# hdn
if self.algorithm == SupportedAlgorithm.HDN:
if self.loss.loss_type != SupportedLoss.HDN:
raise ValueError(
f"Algorithm {self.algorithm} only supports loss `hdn`."
)
if self.model.multiscale_count > 1:
raise ValueError("Algorithm `hdn` does not support multiscale models.")
Member

Hmm I see there should probably be separate child classes for MuSplit, DenoiSplit, but I guess this can wait and we create an issue

Comment on lines +19 to +20
input_shape: tuple[int, ...] = Field(default=(64, 64), validate_default=True)
"""Shape of the input patch (Z, Y, X) or (Y, X) if the data is 2D."""
Member

Changing this to a tuple has some serialization issues with the current way we do the model dump.

Note: it can actually be solved by using .model_dump(mode="json"), which automatically casts iterable Python types to list.
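A quick sketch of that behavior with Pydantic v2 (`PatchConfig` is a hypothetical stand-in for the configuration model):

```python
from pydantic import BaseModel, Field


class PatchConfig(BaseModel):
    # Illustrative stand-in for the LVAE configuration model.
    input_shape: tuple[int, ...] = Field(default=(64, 64), validate_default=True)


cfg = PatchConfig()
# Default mode="python" keeps native Python types.
assert cfg.model_dump()["input_shape"] == (64, 64)
# mode="json" casts the tuple to a JSON-serializable list.
assert cfg.model_dump(mode="json")["input_shape"] == [64, 64]
```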

Member

Interesting, I didn't know. And I guess when reading the list back, it gets cast into a tuple without any issue?

Should we open an issue for refactoring the way we export the configuration? It would be nice to support tuples; that would also allow immutable defaults in function signatures.
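On the round-trip question: with Pydantic v2, a list read back from a dump is coerced to a tuple during validation. A minimal sketch (`PatchConfig` is illustrative, not the CAREamics model):

```python
from pydantic import BaseModel


class PatchConfig(BaseModel):
    input_shape: tuple[int, ...] = (64, 64)


dumped = PatchConfig().model_dump(mode="json")  # {"input_shape": [64, 64]}
restored = PatchConfig.model_validate(dumped)   # the list is cast back to a tuple
assert restored.input_shape == (64, 64)
```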

Member

Yeah, the default mode argument is "python", which keeps objects as Python types; using "json" converts them to JSON-serializable objects. And I guess there is overlap with YAML in the model_dump API.

],
predict_logvar: Literal[None, "pixelwise"],
analytical_kl: bool,
model_params: Optional[dict[str, Any]] = None,
Member

What is the point of having this model_params argument? All the arguments above are non-optional and will overwrite any arguments included in model_params.

Comment on lines +236 to +239
def _create_unet_based_algorithm(
axes: str,
algorithm: Literal["n2v", "care", "n2n"],
loss: Literal["n2v", "mae", "mse"],
algorithm: Literal["n2v", "care", "n2n", "hdn"],
loss: Literal["n2v", "mae", "mse", "hdn"],
Member

Remove "hdn" as algorithm option in UNet-based algorithm creation function.

Comment on lines +253 to +255
algorithm : {"n2v", "care", "n2n", "hdn"}
Algorithm to use.
loss : {"n2v", "mae", "mse"}
loss : {"n2v", "mae", "mse", "hdn"}
Member

Remove "hdn" from the docstring as well.

Comment on lines +290 to +291
def _create_vae_based_algorithm(
algorithm: Literal["hdn"],
Member

Should this function be used for MuSplit and DenoiSplit algorithms as well?

Member
@jdeschamps jdeschamps Mar 9, 2025

In the future, probably! Currently there is no use for the convenience functions, since the MicroSplit code has its own wrapper. But at some point we should have the equivalent here.

Comment on lines 554 to 558
# algorithm
algorithm_params = _create_algorithm_configuration(
algorithm_params = _create_unet_based_algorithm(
axes=axes,
algorithm=algorithm,
loss=loss,
Member

Maybe _create_supervised_config_dict can be renamed, since it now seems to relate only to the UNet-based configs? But the MicroSplit algorithms are also supervised.

Member

Agreed. Technically, we could consider that N2N is also a CARE approach. So we could choose _create_care_config_dict or the more obvious _create_unet_based_config_dict.

Comment on lines +131 to +133
if isinstance(model, VAEModule):
raise ValueError("Export of VAE models is not supported.")

Member

Should this be a NotImplementedError with a note that we are planning to support it in the future?

Member

I agree!
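A sketch of the suggested change (`VAEModule` is stubbed here for illustration; the real class lives in CAREamics, and `export_model` is a hypothetical name):

```python
class VAEModule:
    """Stand-in stub for careamics' VAEModule, for illustration only."""


def export_model(model) -> None:
    # NotImplementedError signals "planned but not yet supported",
    # whereas ValueError suggests the input itself is invalid.
    if isinstance(model, VAEModule):
        raise NotImplementedError(
            "Export of VAE models is not supported yet; support is planned."
        )
```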

algorithm_config=self.cfg.algorithm_config,
)
elif isinstance(self.cfg.algorithm_config, VAEBasedAlgorithm):
self.model = VAEModule(
Member

Model should not even be instantiated.


loss: LVAELossConfig

model: LVAEModel # TODO add validators
Member

Can you open an issue and state what these validators should be? Otherwise you will have to figure it out again.


HDN = "HDN"

HDN_DESCRIPTION = ""
Member

Can you add a TODO here?

from careamics.config.architectures import LVAEModel
from careamics.config.loss_model import LVAELossConfig

HDN = "HDN"
Member

It's actually "Hierarchical DivNoising"

Comment on lines +67 to +74
# hdn
if self.algorithm == SupportedAlgorithm.HDN:
if self.loss.loss_type != SupportedLoss.HDN:
raise ValueError(
f"Algorithm {self.algorithm} only supports loss `hdn`."
)
if self.model.multiscale_count > 1:
raise ValueError("Algorithm `hdn` does not support multiscale models.")
Member

Yes they should be moved to child classes. Can you open an issue (maybe first a general MicroSplit issue, then this as a sub-issue)?

Member

The python module should be renamed to vae_loss_model.py

gaussian_likelihood: Optional[GaussianLikelihood],
noise_model_likelihood: Optional[NoiseModelLikelihood],
) -> Optional[dict[str, torch.Tensor]]:
"""Loss function for DenoiSplit.
Member

This says "Loss function for DenoiSplit"; it should be HDN.

Comment on lines -398 to +402
x, target = batch
x, *target = batch
Member

Isn't MicroSplit using the VAEModule without this modification? In that case, why would we need it?

Also, this feels totally out of PR scope. It adds overhead in reading and understanding the code.
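For reference, the difference between the two unpacking forms (plain Python semantics, independent of CAREamics):

```python
# `x, target = batch` requires exactly two elements;
# `x, *target = batch` accepts one or more, collecting the rest into a list.
batch_with_target = ("input", "target")
x, *target = batch_with_target
assert target == ["target"]

batch_without_target = ("input",)
x, *target = batch_without_target
assert target == []  # no targets: the unsupervised case still unpacks cleanly
```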

@jdeschamps jdeschamps self-requested a review June 17, 2025 12:43
masked = torch.zeros_like(batch)
mask = torch.zeros_like(batch, dtype=torch.uint8)

self.rng = (
Member
@jdeschamps jdeschamps Sep 2, 2025

Why has this changed? It implies to me that every N2V Manipulate call will be the same, rather than just following the same steps each time (instantiating a new generator with the same seed each time, vs having a seeded generator created once at the beginning).
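The distinction can be illustrated with NumPy (used here purely for illustration; N2VManipulate may use a different RNG):

```python
import numpy as np

SEED = 42


def draw_reseeded() -> np.ndarray:
    # A fresh generator with the same seed on every call:
    # every call yields identical values.
    return np.random.default_rng(SEED).random(3)


rng_once = np.random.default_rng(SEED)


def draw_from_shared() -> np.ndarray:
    # One generator seeded once at start-up: reproducible across runs,
    # but successive calls advance the stream and therefore differ.
    return rng_once.random(3)


assert np.allclose(draw_reseeded(), draw_reseeded())
assert not np.allclose(draw_from_shared(), draw_from_shared())
```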

@jdeschamps jdeschamps marked this pull request as draft September 2, 2025 19:29
@jdeschamps jdeschamps mentioned this pull request Sep 2, 2025
Member

jdeschamps commented Sep 3, 2025

Superseded by #511, leaving open for reference until the other one is merged.

@jdeschamps (Member)

@CatEek Do we still need this PR? Can we close it?



4 participants