Sunflower-Ultravox Deployment with Modal platform by huwenjie333 · Pull Request #1 · SunbirdAI/worker-vllm

huwenjie333 · 2025-12-05T11:47:05Z

This PR includes the codes to deploy Sunflower-Ultravox model to Modal platform.

Below the comparison with our current deployment platform Runpod:

Feature	RunPod (current)	Modal (new)
Support audio vLLM (e.g. Ultravox)	No	Yes
Costs (A100-80GB)	$0.00076 / s	$0.00069 / s
GPU availability	low when using network volumes	high
Deployment methods	Docker container	single python script
serverless cold start time	2-3 mins	2-3 mins

The startup time is 2-3 mins, which benefits from the attached network volume that stores 60GB+ model weigths:

Here's an example of transcription task inference from the deployed model (context_eng_7.wav):

modal-deploy/client.py

jqug · 2025-12-08T10:00:40Z

modal-deploy/Sunflower32b-Ultravox/client.py

+    response = get_completion(client, model_id, messages, args)
+    if response:
+        if args.stream:
+            print(Colors.BLUE + "\n🤖:", end="")


I'm trying to figure out what the emojis signify, lol

it was from the official template. See here:

huwenjie333 added 7 commits December 3, 2025 15:42

init

aa10d4a

Qwen3-8B-FP8

beb694b

Sunflower-14B-FP8

95272fc

default client script

6362b00

sunflower client updates

9700a58

deployed ultravox

daacae9

readme

78f3b5a

huwenjie333 requested review from PatrickCmd and jqug December 5, 2025 12:49

jqug approved these changes Dec 8, 2025

View reviewed changes

huwenjie333 added 2 commits December 8, 2025 14:40

update temperature

4acbdd8

reorganize files

7952177

huwenjie333 merged commit a1616dc into runpod-deploy Dec 15, 2025
1 check failed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sunflower-Ultravox Deployment with Modal platform#1

Sunflower-Ultravox Deployment with Modal platform#1
huwenjie333 merged 9 commits intorunpod-deployfrom
modal-deploy

huwenjie333 commented Dec 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

jqug Dec 8, 2025

Uh oh!

huwenjie333 Dec 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

huwenjie333 commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

jqug Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

huwenjie333 Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

huwenjie333 commented Dec 5, 2025 •

edited

Loading