Babulus (XML DSL for Remotion Audio + Timing)

Babulus turns a .babulus.xml file into timing JSON + generated audio for Remotion. It is a thin, narration-first layer that also handles TTS/SFX/music generation with environment-aware caching.

Quick Start

Requirements:

Node.js 18+
ffmpeg + ffprobe on PATH
Playwright (for PNG/MP4 rendering helpers)

Install (from a project that uses Babulus):

npm install -D babulus

Local dev (from this repo):

npm install
npm run babulus -- --help

Generate:

babulus generate content/intro.babulus.xml

Documentation

Branded documentation lives in the Studio Web app:

Web docs (local dev): apps/studio-web at /docs
Source of truth (HTML): apps/studio-web/lib/docs-content
Docs landing page route: apps/studio-web/app/(public)/docs/page.tsx
“What is Babulus?”: /docs/introduction (source: apps/studio-web/lib/docs-content/introduction.ts)
Technical overview: /docs/technical (source: apps/studio-web/lib/docs-content/technical.ts)
Roadmap: /docs/roadmap (source: apps/studio-web/lib/docs-content/roadmap.ts)

VideoML Website (apps/videoml-org)

This repo includes a Gatsby site for the VideoML spec.

Local dev (from repo root):

npm run videoml:develop

Build:

npm run videoml:build

Amplify deployment (multi-app):

App root: apps/videoml-org
Build spec: root amplify.yml (includes a videoml-org entry)
Node version: use .nvmrc (22.12.0)

The DSL (XML)

A .babulus.xml file declares one or more compositions. Each composition is a declarative tree of scenes, cues, and components.

<vml id="intro" title="Intro" fps="30" width="1920" height="1080">
  <voiceover provider="openai" voice="echo" />

  <scene id="paradigm" title="A New Kind of Computer Program">
    <music
      prompt="Warm ambient background music, energetic percussion, deep bass, no vocals"
      playThrough="true"
      volume="0.7"
      fadeTo='{"volume":0.12,"afterSeconds":6,"fadeDurationSeconds":3}'
      fadeOut='{"volume":0.7,"beforeEndSeconds":5,"fadeDurationSeconds":3}'
    />
    <cue id="paradigm-vo">
      <voice>Since the dawn of computing...</voice>
      <pause seconds="0.35s" />
      <voice>But tool-using agents flip the script.</voice>
    </cue>
  </scene>
</vml>
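
The fadeTo and fadeOut attributes in the example above carry JSON. As a rough illustration of how such values could translate into a volume envelope, here is a minimal sketch. Everything about the semantics is an assumption made for illustration: linear ramps, fadeTo starting afterSeconds into the scene, fadeOut starting beforeEndSeconds before the scene ends, and a scene long enough that the two ramps do not overlap. The names musicVolumeAt and ramp are hypothetical and not part of Babulus.

```typescript
// Hypothetical shapes mirroring the JSON in the fadeTo / fadeOut attributes.
interface FadeTo { volume: number; afterSeconds: number; fadeDurationSeconds: number }
interface FadeOut { volume: number; beforeEndSeconds: number; fadeDurationSeconds: number }

// Linear ramp from `from` to `to`; a non-positive duration jumps straight to `to`.
function ramp(from: number, to: number, elapsed: number, duration: number): number {
  if (duration <= 0) return to;
  const progress = Math.min(1, Math.max(0, elapsed / duration));
  return from + (to - from) * progress;
}

// Assumed semantics (not taken from Babulus itself):
//  - play at `base` until fadeTo.afterSeconds, then ramp to fadeTo.volume
//  - starting fadeOut.beforeEndSeconds before the scene ends, ramp to fadeOut.volume
function musicVolumeAt(
  t: number,
  sceneSeconds: number,
  base: number,
  fadeTo: FadeTo,
  fadeOut: FadeOut,
): number {
  const outStart = sceneSeconds - fadeOut.beforeEndSeconds;
  if (t >= outStart) {
    return ramp(fadeTo.volume, fadeOut.volume, t - outStart, fadeOut.fadeDurationSeconds);
  }
  if (t >= fadeTo.afterSeconds) {
    return ramp(base, fadeTo.volume, t - fadeTo.afterSeconds, fadeTo.fadeDurationSeconds);
  }
  return base;
}
```

Under these assumptions, a 30-second scene with the values from the example starts at 0.7, sits at roughly 0.12 from second 9 to second 25, and is back at roughly 0.7 by second 28.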

Pauses

Use <pause> inside a <cue> to insert silence between narration segments:

<pause seconds="0.4s" />
<pause seconds="600ms" />

Pause timing is resolved at generate time alongside other cue timing.
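
To make the two duration forms above concrete, the following sketch converts a pause value such as "0.4s" or "600ms" into seconds. parsePauseSeconds is a hypothetical helper written for illustration, not Babulus's own parser, and it accepts only the two suffixed forms shown above; whether Babulus accepts other forms (for example a bare number) is not covered here.

```typescript
// Hypothetical helper: converts "0.4s" or "600ms" into a number of seconds.
// Only the two suffixed forms from the examples above are handled.
function parsePauseSeconds(value: string): number {
  const match = /^(\d+(?:\.\d+)?)(ms|s)$/.exec(value.trim());
  if (!match) {
    throw new Error(`Unsupported pause duration: "${value}"`);
  }
  const amount = Number(match[1]);
  // Milliseconds are scaled down; seconds pass through unchanged.
  return match[2] === "ms" ? amount / 1000 : amount;
}
```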

CLI

# Generate audio + timing JSON
babulus generate content/intro.babulus.xml

# Watch mode
babulus generate --watch content/

# Force regeneration
babulus generate --fresh content/intro.babulus.xml

# Clean (dry run unless --yes is passed)
babulus clean
babulus clean --yes

# Execute a worker job (local execution plane)
babulus worker run --job job.json --result result.json

Renderer helpers (storyboard previews)

These commands render storyboard frames or MP4 previews from the generated script.json/timeline.json. PNG/MP4 output uses Playwright + ffmpeg. Frame rendering defaults to a small parallel worker pool; pass --workers 1 to disable parallelization. Use --ffmpeg-arg to forward custom ffmpeg options when you need to tune encoding.

# Verify toolchain versions
npm run render:toolchain -- --require-ffmpeg --require-playwright

# Render storyboard HTML frames (no Playwright required)
npm run render:storyboard:frames -- --script src/videos/intro/intro.script.json --frames out/intro-html

# Render storyboard PNG frames (Playwright required)
npm run render:storyboard:png -- --script src/videos/intro/intro.script.json --frames out/intro-png --start 0 --end 60

# Render storyboard MP4 (Playwright + ffmpeg required)
npm run render:storyboard -- --script src/videos/intro/intro.script.json --frames out/intro-frames --out out/intro.mp4

# Parallelize frame rendering (default is auto; set 1 to disable)
npm run render:storyboard -- --script src/videos/intro/intro.script.json --frames out/intro-frames --out out/intro.mp4 --workers 4

# Pass custom ffmpeg arguments (repeat --ffmpeg-arg for each token)
npm run render:storyboard -- --script src/videos/intro/intro.script.json --frames out/intro-frames --out out/intro.mp4 --ffmpeg-arg -preset --ffmpeg-arg ultrafast
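
When the render command is launched from a script, the "repeat --ffmpeg-arg for each token" rule can be applied mechanically. The sketch below shows one way to do that; toFfmpegArgFlags is a hypothetical helper name, and the function only builds the argument list without running anything.

```typescript
// Hypothetical helper: expands ffmpeg tokens into repeated --ffmpeg-arg flags,
// one flag per token, matching the command-line form shown above.
function toFfmpegArgFlags(tokens: string[]): string[] {
  const flags: string[] = [];
  for (const token of tokens) {
    flags.push("--ffmpeg-arg", token);
  }
  return flags;
}
```

For the tokens -preset and ultrafast, joining the result with spaces reproduces the trailing part of the command above.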

If Playwright is missing, install Chromium once:

npx playwright install chromium

Default outputs (for content/<video>.babulus.xml):

script: src/videos/<video>/<video>.script.json
timeline: src/videos/<video>/<video>.timeline.json
audio: public/babulus/<video>.wav
cache: .babulus/out/<video>/env/<environment>/
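
The mapping above can be written out as a small function. This is an illustrative sketch only: defaultOutputs and the DefaultOutputs shape are hypothetical names, the video name is assumed to be the file name with the .babulus.xml suffix removed, and paths are assumed to use forward slashes.

```typescript
// Hypothetical shape for the four default locations listed above.
interface DefaultOutputs {
  script: string;
  timeline: string;
  audio: string;
  cacheDir: string;
}

// Derives the default output paths for a source file such as
// "content/intro.babulus.xml" and a given environment name.
function defaultOutputs(sourcePath: string, environment: string): DefaultOutputs {
  const suffix = ".babulus.xml";
  const fileName = sourcePath.slice(sourcePath.lastIndexOf("/") + 1);
  if (fileName.length <= suffix.length || fileName.slice(-suffix.length) !== suffix) {
    throw new Error(`Expected a <video>${suffix} file, got "${sourcePath}"`);
  }
  const video = fileName.slice(0, -suffix.length);
  return {
    script: `src/videos/${video}/${video}.script.json`,
    timeline: `src/videos/${video}/${video}.timeline.json`,
    audio: `public/babulus/${video}.wav`,
    cacheDir: `.babulus/out/${video}/env/${environment}/`,
  };
}
```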

Environment-Aware Caching

Babulus caches per-environment to avoid burning API quotas. Cache layout:

.babulus/out/<video>/env/<environment>/

Environments: development, aws, azure, production, static

Fallback chain: development -> aws -> azure -> production -> static

BABULUS_ENV=development babulus generate content/intro.babulus.xml
BABULUS_ENV=production babulus generate content/intro.babulus.xml
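
One way to read the fallback chain is as a lookup order: start in the current environment's cache directory and, on a miss, try each later environment in the chain. The sketch below encodes that reading. The direction of the fallback is an assumption drawn from the arrow notation above, and cacheLookupOrder is a hypothetical function, not a Babulus API.

```typescript
// Environments in the order given by the fallback chain above.
const FALLBACK_CHAIN = ["development", "aws", "azure", "production", "static"];

// Assumed semantics: a lookup starts at `environment` and falls through to
// every later entry in the chain. Returns the cache directories in that order.
function cacheLookupOrder(video: string, environment: string): string[] {
  const start = FALLBACK_CHAIN.indexOf(environment);
  if (start === -1) {
    throw new Error(`Unknown environment: ${environment}`);
  }
  return FALLBACK_CHAIN.slice(start).map(
    (env) => `.babulus/out/${video}/env/${env}/`,
  );
}
```

Under this reading, a lookup for the intro video in the azure environment would check the azure, production, and static directories, in that order.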

Config

API keys live in .babulus/config.yml:

providers:
  openai:
    api_key: "..."
  elevenlabs:
    api_key: "..."
  aws_polly:
    region: "us-east-1"
  azure_speech:
    api_key: "..."
    region: "..."
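
A quick sanity check on the parsed config can catch a missing key before a generate run. The sketch below works on an already-parsed object (loading the YAML itself is left out) and treats the fields shown above as required for each provider. That requirement is an assumption based only on the example; missingProviderFields and EXAMPLE_FIELDS are hypothetical names.

```typescript
// Loose shape of the parsed .babulus/config.yml, based on the example above.
interface BabulusConfig {
  providers?: Record<string, Record<string, string | undefined> | undefined>;
}

// Fields that appear under each provider in the example config.
// Treating them as required is an assumption made for this sketch.
const EXAMPLE_FIELDS: Record<string, string[]> = {
  openai: ["api_key"],
  elevenlabs: ["api_key"],
  aws_polly: ["region"],
  azure_speech: ["api_key", "region"],
};

// Returns the names of fields that are absent or empty for a provider.
function missingProviderFields(config: BabulusConfig, provider: string): string[] {
  const expected = EXAMPLE_FIELDS[provider];
  if (!expected) {
    throw new Error(`Unknown provider: ${provider}`);
  }
  const section = (config.providers && config.providers[provider]) || {};
  return expected.filter((field) => !section[field]);
}
```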

Distribution

This repo builds a Node-based CLI package. The generated JSON and audio files are the build artifacts you commit or ship with your Remotion project.
