#

speechllm

Here are 12 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Mar 17, 2026
Python

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.

open-source transformer speech-recognition automatic-speech-recognition asr conformer llm industrial-grade multimodal-llm speechllm

Updated Feb 25, 2026
Python

FireRedTeam / FireRedASR2S

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.

open-source speech-recognition vad automatic-speech-recognition asr lid language-identification sota voice-activity-detection asr-pipeline punctuation-restoration audio-event-classification llm punctuation-prediction industrial-grade multimodal-llm speechllm audio-event-detection

Updated Mar 17, 2026
Python

PigeonDan1 / ps-slm

TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks

asr speechllm speech-understanding

Updated Jan 19, 2026
Python

aidayang / FunASR-OneClick

FunASR实时语音识别版，识别麦克风和电脑内播放的声音，电脑语音打字软件

pytorch speech-recognition vad pretrained-models punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer rnnt dfsmn paraformer speechgpt speechllm funasr

Updated Sep 12, 2025

Tencent / StableToken

[ICLR 2026] StableToken: A state-of-the-art noise-robust semantic speech tokenizer featuring Voting-LFQ for resilient SpeechLLMs.

iclr noise-robustness speechllm speech-tokenizer iclr2026

Updated Feb 27, 2026
Python

SALT-Research / SHALLOW

SHALLOW, the first hallucination benchmark for ASR models

benchmark speech-recognition asr shallow hallucination speech-evaluation speechllm speech-foundation-models

Updated May 23, 2025
Python

moziarnj07-sys / doubaoime-asr

🎤 Enable voice recognition for the Doubao input method using Python; ideal for learning and research with a focus on audio processing.

Updated Mar 19, 2026
Python

Nexdata-AI / 300-Hours-English-India-Spontaneous-Dialogue-Smartphone-speech-dataset

asr-model speechllm indian-english

Updated May 9, 2025

Pacjay / FireRedVAD

Provide accurate voice activity and audio event detection in 100+ languages with high-performance streaming and non-streaming capabilities.

open-source speech-recognition vad automatic-speech-recognition asr language-identification voice-activity-detection aed sound-event-detection asr-pipeline punctuation-restoration audio-event-classification punctuation-prediction industrial-grade multimodal-llm speechllm audio-event-detection

Updated Mar 19, 2026
Python

12alz / fun-with-clip-path

🎨 Explore clip-path techniques in HTML and CSS to create interactive menus and dynamic shapes without JavaScript for responsive design.

utility tech data-engineering speech-recognition airtable recommendation-algorithms algorithm-engineering audio-visual-speech-recognition data-quality speaker-diarization hiring-without-whiteboards mlops rnnt dfsmn paraformer speechgpt speechllm claude-code

Updated Mar 19, 2026
CSS

sanamid / Fun-ASR

audio python websocket speech-recognition asyncio pretrained-models punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection paraformer multimodal-large-language-models speechllm audio-language-model funasr-client

Updated Mar 19, 2026
Python

Improve this page

Add a description, image, and links to the speechllm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speechllm topic, visit your repo's landing page and select "manage topics."