Skip to content

Add runtime config for model-specific jobs#95

Open
Tooyosi wants to merge 1 commit intomainfrom
dynamic-workflow-config
Open

Add runtime config for model-specific jobs#95
Tooyosi wants to merge 1 commit intomainfrom
dynamic-workflow-config

Conversation

@Tooyosi
Copy link
Copy Markdown
Contributor

@Tooyosi Tooyosi commented Apr 7, 2026

Make BaJoR accept model-coupled batch runtime config from job requests instead of relying only on deployment env vars.

This allows Kade to pass context runtime settings such as script paths, container image, promote script, and pretrained checkpoint reference while keeping BaJoR responsible for Azure Batch scheduling.

What changed

  • Extended BaJoR job options to accept request-driven runtime fields:
    • container_image_name
    • training_script_path
    • prediction_script_path
    • promote_script_path
    • pretrained_checkpoint_url
  • Added a runtime config resolver to centralize request-first, env-fallback behavior
  • Updated finetuning scheduling to use request-driven:
    • training script path
    • pretrained checkpoint location
    • promote script path
    • container image
  • Updated prediction scheduling to use request-driven:
    • prediction script path
    • checkpoint target
    • container image

@Tooyosi Tooyosi requested a review from lcjohnso April 7, 2026 17:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant