[Inworld] Flush to drain decoder on every audio chunk from server by ianbbqzy · Pull Request #4983 · livekit/agents

ianbbqzy · 2026-03-03T01:17:54Z

The emitter's non-PCM path still routes through _decode_task, which on FlushSegment does audio_decoder.end_input() + await decode_atask + audio_decoder = None. This means:

Without per-chunk flush(): All audio bytes accumulate in the decoder. The final output_emitter.flush() at line 1137 is the only FlushSegment. TTFB = time to receive all audio from server, not just the first chunk.
With per-chunk flush(): Each chunk triggers FlushSegment → decoder drains → _flush_frame() → SynthesizedAudio is emitted. TTFB = time to first chunk from server (correct).

This is an Inworld-specific issue because Inworld's HTTP API returns one JSON-line per audio chunk (each with its own WAV payload), unlike other providers which returns a single continuous raw PCM byte stream. For Inworld, each JSON line is a discrete audio segment, and delaying the flush until the end defeats the purpose of streaming.

====

Also updated timestamps parsing logic to always add a trailing space regardless if it's the end of a chunk or not. Because in Inworld case, end of a chunk could just be end of a phrase rather than end of a sentence

ianbbqzy · 2026-03-03T17:57:53Z

@tinalenguyen @davidzhao PTAL, Thanks!

tinalenguyen · 2026-03-04T01:55:36Z

Hi @ianbbqzy, thanks for the PR! Few notes and Q's:

Could you update the TTS docstring with the new defaults?
If each chunk is already flushed, might be worth removing the outer flush here

Also updated timestamps parsing logic to always add a trailing space regardless if it's the end of a chunk or not. Because in Inworld case, end of a chunk could just be end of a phrase rather than end of a sentence

Could you elaborate more on this? I also noticed that the last punctuation is not present, not sure if that was pre-existing behavior

This comment was marked as resolved.

Sign in to view

[Inworld] Flush to drain decoder on every audio chunk from server

daf64bb

ianbbqzy force-pushed the ian/inworld-flush branch from 839b7ba to daf64bb Compare March 3, 2026 01:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Inworld] Flush to drain decoder on every audio chunk from server#4983

[Inworld] Flush to drain decoder on every audio chunk from server#4983
ianbbqzy wants to merge 1 commit intolivekit:mainfrom
ianbbqzy:ian/inworld-flush

ianbbqzy commented Mar 3, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

ianbbqzy commented Mar 3, 2026

Uh oh!

tinalenguyen commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ianbbqzy commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

ianbbqzy commented Mar 3, 2026

Uh oh!

tinalenguyen commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ianbbqzy commented Mar 3, 2026 •

edited

Loading