Latium Framework

Running ROME

ROME and its related commands are driven through the Hydra-based CLI in src/cli.py.

Single intervention:

python -m src.cli +command=rome model=gpt2-medium

Batch evaluation:

python -m src.cli +command=batch-rome model=gpt2-medium

Compute second-moment statistics (required before running ROME on a new model):

python -m src.cli +command=second-moment model=gpt2-medium

The default config is at src/config/config.yaml. Override any value on the command line using Hydra syntax (e.g. model=gpt2-large).

Alternatively, use the console fallback (no Hydra overhead):

python -m src.cli --console rome --config src/config/config.yaml
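Because second-moment statistics must exist before ROME runs on a new model, the two steps are easy to script together. The following is a minimal sketch, not part of the repository: the helper names `cli_command`, `rome_pipeline` and `run_pipeline` are hypothetical, and it assumes the script is launched from the repository root so that `python -m src.cli` resolves.

```python
import subprocess
import sys


def cli_command(command, model, overrides=None):
    """Build the argv list for one Hydra-style invocation of src.cli."""
    argv = [sys.executable, "-m", "src.cli", f"+command={command}", f"model={model}"]
    # Extra Hydra overrides are appended as key=value tokens.
    # Which keys exist depends on src/config/config.yaml.
    for key, value in (overrides or {}).items():
        argv.append(f"{key}={value}")
    return argv


def rome_pipeline(model):
    """Second-moment statistics first, then a single ROME intervention."""
    return [cli_command("second-moment", model), cli_command("rome", model)]


def run_pipeline(model):
    """Run both steps in order (hypothetical helper)."""
    for argv in rome_pipeline(model):
        # check=True raises on a non-zero exit status, so ROME is never
        # started if the second-moment step failed.
        subprocess.run(argv, check=True)
```

Calling `run_pipeline("gpt2-medium")` would then issue the same two commands shown above, in the required order.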

Running Causal Trace

python -m src.cli +command=causal-trace model=gpt2-medium

To inspect the computed noise multiplier without running a full trace:

python -m src.cli +command=compute-multiplier model=gpt2-medium

Running the Structural Benchmark

structural_benchmark.py applies ROME edits across a dataset and evaluates all structural detectors (MSD, blind MSD, spectral, IPR) on the modified weights. Results are written as JSON to analysis_out/.

python structural_benchmark.py \
    --model gpt2-large \
    --n-tests 30 \
    --start-idx 0 \
    --output-dir ./analysis_out \
    --spectral-top-k 50 \
    --trim-first-layers 2 \
    --trim-last-layers 2 \
    --spectral-neighbor-layers 1

Key arguments:

Argument	Default	Description
`--model`	`gpt2-large`	Model name (must match a config in `src/config/model/`)
`--n-tests`	`30`	Number of ROME edits to benchmark
`--start-idx`	`0`	Starting index in the facts dataset
`--output-dir`	`./analysis_out`	Directory for JSON result files
`--spectral-top-k`	`50`	Top-K singular values used by the spectral detector
`--trim-first-layers`	`2`	Layers to exclude from the head of the model
`--trim-last-layers`	`2`	Layers to exclude from the tail of the model
`--n-prompts`	auto	Number of ROME prefix prompts (scales with model size if omitted)
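For longer runs it can be convenient to split the benchmark into several invocations. The sketch below only builds the command lines. It assumes, without confirmation from the source, that `--start-idx` and `--n-tests` together select a contiguous slice of the facts dataset. The function name `benchmark_batches` and the per-batch output subdirectories are hypothetical.

```python
import sys


def benchmark_batches(model, total, batch_size, output_root="./analysis_out"):
    """Return one structural_benchmark.py argv per contiguous batch of edits."""
    commands = []
    for start in range(0, total, batch_size):
        # The last batch may be shorter than batch_size.
        n_tests = min(batch_size, total - start)
        commands.append([
            sys.executable, "structural_benchmark.py",
            "--model", model,
            "--n-tests", str(n_tests),
            "--start-idx", str(start),
            # Per-batch subdirectory: a precaution, since the naming of the
            # JSON result files is not documented here.
            "--output-dir", f"{output_root}/batch_{start}",
        ])
    return commands
```

Each returned list can be passed to `subprocess.run` as is. For example, `benchmark_batches("gpt2-large", 70, 30)` yields three commands covering start indices 0, 30 and 60.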

Detection Documentation

Detailed documentation for the detection methods is in the docs/ directory:

docs/structural-docs.md - structural detector metrics (L2 discrepancy, relative discrepancy, directional coherence, MSD, IPR, etc.)
docs/spectral-docs.md - spectral detector signals and the mathematics behind singular-value z-scores and ratio scores
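As a quick orientation for one of the listed metrics, here is a sketch of the textbook inverse participation ratio. It assumes that "IPR" in this project stands for that quantity. The definition actually used by the detectors is in docs/structural-docs.md and may differ in normalization or in which vector it is applied to.

```python
def inverse_participation_ratio(values):
    """Textbook IPR: sum(v^4) / (sum(v^2))^2.

    The result lies between 1/n (mass spread evenly over n entries)
    and 1 (all mass concentrated in a single entry).
    """
    squares = [v * v for v in values]
    total = sum(squares)
    if total == 0:
        raise ValueError("IPR is undefined for an all-zero vector")
    return sum(s * s for s in squares) / (total * total)


# A fully localized vector versus a fully spread one.
print(inverse_participation_ratio([1, 0, 0, 0]))  # 1.0
print(inverse_participation_ratio([1, 1, 1, 1]))  # 0.25
```

Under this definition, a higher value means the vector's mass is concentrated in fewer entries.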

Models roadmap

Supported Models	Causal Trace	Weight intervention
gpt2-medium	✔️	✔️
gpt2-large	✔️	✔️
gpt2-xl	✔️	✔️
gpt-j-6b	✔️	✔️
qwen3-0.6b	✔️	✔️
qwen3-1.7b	✔️	✔️
qwen3-4b	✔️	✔️
qwen3-8b	✔️	✔️
granite4-micro	✔️

Error codes:

Error code	Name of the error	Description
`1`	Help	Help invoked. Typically caused by incorrect script usage.
`2`	Resource already exists	Trying to create a resource that already exists.
`-1`	Unknown	An unknown error. Create a GitHub issue with the reproduction steps.
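When the tools are called from another script, the codes in the table can be turned back into names. This is a sketch only: it assumes the codes are surfaced as the process exit status, treats `0` as success by convention (it is not listed in the table), and the helper name `describe_exit` is hypothetical.

```python
ERROR_CODES = {
    1: "Help",
    2: "Resource already exists",
    -1: "Unknown",
}


def describe_exit(returncode):
    """Map a process return code to the error name from the table."""
    if returncode == 0:
        return "Success"  # convention, not listed in the table
    # On POSIX systems exit statuses are truncated to 8 bits, so a
    # process that exits with -1 is reported to its parent as 255.
    if returncode == 255:
        returncode = -1
    return ERROR_CODES.get(returncode, f"Unlisted code {returncode}")
```

A typical use is `describe_exit(subprocess.run(argv).returncode)`, where `argv` is one of the command lines from the sections above.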
