Skip to content

Sunflower-Ultravox Deployment with Modal platform#1

Merged
huwenjie333 merged 9 commits intorunpod-deployfrom
modal-deploy
Dec 15, 2025
Merged

Sunflower-Ultravox Deployment with Modal platform#1
huwenjie333 merged 9 commits intorunpod-deployfrom
modal-deploy

Conversation

@huwenjie333
Copy link
Copy Markdown
Collaborator

@huwenjie333 huwenjie333 commented Dec 5, 2025

This PR includes the codes to deploy Sunflower-Ultravox model to Modal platform.

Below the comparison with our current deployment platform Runpod:

Feature RunPod (current) Modal (new)
Support audio vLLM (e.g. Ultravox) No Yes
Costs (A100-80GB) $0.00076 / s $0.00069 / s
GPU availability low when using network volumes high
Deployment methods Docker container single python script
serverless cold start time 2-3 mins 2-3 mins

The startup time is 2-3 mins, which benefits from the attached network volume that stores 60GB+ model weigths:
Screenshot 2025-12-05 at 3 44 53 PM

Here's an example of transcription task inference from the deployed model (context_eng_7.wav):
image

response = get_completion(client, model_id, messages, args)
if response:
if args.stream:
print(Colors.BLUE + "\n🤖:", end="")
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm trying to figure out what the emojis signify, lol

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it was from the official template. See here:

image

@huwenjie333 huwenjie333 merged commit a1616dc into runpod-deploy Dec 15, 2025
1 check failed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants