This repository is now organized around the CPMT paper line.
If you are reviewing the current KS32 / VGG method line, start here first:
docs/KS32_METHOD_HANDOFF_2026-03-22.mddocs/GPT_PRO_STRICT_METHOD_REVIEW_PROMPT.md
Important:
EXPERIMENT_HISTORY.mdis a full archive, not the best first-stop summary.- Some early KS32 interpretations in the archive were later corrected by targeted diagnostics.
The active line is:
- paper-facing method name:
CPMT - final retained configuration:
CPMT-Lite(Anchor Teacher + Conformal + CompLoss + ModGate + REM) - backbone/runtime:
CAV-MAEwith TSA-compatible adaptors - main experiment assets:
results/paper_experiments/results/overnight_ablation/results/cpmt_lite_followup/results/cpmt_lite_shuffle_ablation/results/table6_completion/
Older archived paper assets, protocol-alignment packs, reviewer packs, and exploratory branches are archived under past/.
The active runtime now lives under runtime/ rather than under an abandoned method name.
runtime/- neutral runtime and baseline code used by the current experiments
external/- local third-party dependencies such as TSA and AVRobustBench
paper/- current paper working directory
results/paper_experiments/- main
CPMTpaper runs and summary tables
- main
results/overnight_ablation/- full-to-lite ablations and multiseed stability runs
results/cpmt_lite_followup/- corrected shuffled CPMT-Lite multiseed and teacher decomposition
results/cpmt_lite_shuffle_ablation/- corrected shuffled five-component ablation
results/table6_completion/- active-pipeline READ/SAR completion for the missing Table 6 baselines
results/submission_readiness/- current benchmark-asset and runtime-surface audits for the next submission-strengthening tasks
scripts/- current entrypoints only
tools/- current CPMT and AV2C runners only
past/- archived non-mainline assets
Setup:
bash scripts/setup_external_tsa.sh <TSA_REPO_URL> [TSA_REF]
bash scripts/setup_external_avrobustbench.sh <AVROBUSTBENCH_REPO_URL> [REF]Run the main paper pack:
bash scripts/run_paper_experiments.shRun the overnight ablation pack:
bash scripts/run_overnight_ablation_multiseed.shRun the missing Table 6 completion pack:
bash scripts/run_table6_completion.shProgress helpers:
bash scripts/check_paper_progress.sh
bash scripts/check_overnight_progress.sh
bash scripts/check_table6_completion.shAudit the next-step experiment surface:
python3 scripts/audit_submission_readiness.pyDownload the official KINETICS-2C clean asset pack from Hugging Face:
python3 scripts/download_avrobust_kinetics_clean.pyFrom results/paper_experiments/tables/:
- VGGSound-2C:
Source:41.35READ:39.08SAR:30.88AV2C:47.45CPMT-REM (full):48.82CPMT-Lite:49.27
- KS50:
Source:61.85SAR:63.39READ:65.87AV2C:61.87CPMT-REM (full):65.06CPMT-Lite:65.01
From the corrected shuffled follow-up:
CPMT-Litemultiseed:VGGSound-2C:49.29 +- 0.02KS50:65.17 +- 0.13
- baseline multiseed follow-up:
VGG READ:39.31 +- 0.23VGG SAR:30.82 +- 0.08KS50 READ:65.76 +- 0.11KS50 SAR:63.25 +- 0.26
- teacher decomposition:
source-anchor-only:48.02cross-path-only:49.31both:49.31
past/ is intentional. It keeps previous method lines and exploratory assets available without letting them clutter the current repository surface.
If you are working on the active paper line, start from the root-level scripts/, tools/, results/, and paper/ directories and ignore past/ unless you need historical context.