From 22c596ce4aad64fbe2d690f5ba0016c81a3732e1 Mon Sep 17 00:00:00 2001 From: "github-actions[bot]" Date: Tue, 24 Mar 2026 15:04:02 +0000 Subject: [PATCH] docs: add domain parameter to AssemblyAI STT Settings Updated documentation for pipecat PR #4117 which adds a `domain` parameter to AssemblyAISTTSettings for specialized recognition modes like Medical Mode. --- server/services/stt/assemblyai.mdx | 31 +++++++++++++++--------------- 1 file changed, 16 insertions(+), 15 deletions(-) diff --git a/server/services/stt/assemblyai.mdx b/server/services/stt/assemblyai.mdx index 7fff0979..f405f4af 100644 --- a/server/services/stt/assemblyai.mdx +++ b/server/services/stt/assemblyai.mdx @@ -173,21 +173,22 @@ Connection-level parameters previously passed via the `connection_params` constr Runtime-configurable settings passed via the `settings` constructor argument using `AssemblyAISTTService.Settings(...)`. These can be updated mid-conversation with `STTUpdateSettingsFrame`. See [Service Settings](/guides/fundamentals/service-settings) for details. -| Parameter | Type | Default | Description | -| ---------------------------------- | ----------------- | ------------- | --------------------------------------------------------------------------- | -| `model` | `str` | `None` | STT model identifier. _(Inherited from base STT settings.)_ | -| `language` | `Language \| str` | `Language.EN` | Language for speech recognition. _(Inherited from base STT settings.)_ | -| `formatted_finals` | `bool` | `True` | Whether to enable transcript formatting. | -| `word_finalization_max_wait_time` | `int` | `None` | Maximum time to wait for word finalization in milliseconds. | -| `end_of_turn_confidence_threshold` | `float` | `None` | Confidence threshold for end-of-turn detection. | -| `min_turn_silence` | `int` | `None` | Minimum silence duration (ms) when confident about end-of-turn. | -| `max_turn_silence` | `int` | `None` | Maximum silence duration (ms) before forcing end-of-turn. | -| `keyterms_prompt` | `List[str]` | `None` | List of key terms to guide transcription. | -| `prompt` | `str` | `None` | Optional text prompt to guide transcription (u3-rt-pro only). | -| `language_detection` | `bool` | `None` | Enable automatic language detection. | -| `format_turns` | `bool` | `True` | Whether to format transcript turns. | -| `speaker_labels` | `bool` | `None` | Enable speaker diarization. | -| `vad_threshold` | `float` | `None` | VAD confidence threshold (0.0–1.0) for classifying audio frames as silence. | +| Parameter | Type | Default | Description | +| ---------------------------------- | ----------------- | ------------- | ------------------------------------------------------------------------------------------------ | +| `model` | `str` | `None` | STT model identifier. _(Inherited from base STT settings.)_ | +| `language` | `Language \| str` | `Language.EN` | Language for speech recognition. _(Inherited from base STT settings.)_ | +| `formatted_finals` | `bool` | `True` | Whether to enable transcript formatting. | +| `word_finalization_max_wait_time` | `int` | `None` | Maximum time to wait for word finalization in milliseconds. | +| `end_of_turn_confidence_threshold` | `float` | `None` | Confidence threshold for end-of-turn detection. | +| `min_turn_silence` | `int` | `None` | Minimum silence duration (ms) when confident about end-of-turn. | +| `max_turn_silence` | `int` | `None` | Maximum silence duration (ms) before forcing end-of-turn. | +| `keyterms_prompt` | `List[str]` | `None` | List of key terms to guide transcription. | +| `prompt` | `str` | `None` | Optional text prompt to guide transcription (u3-rt-pro only). | +| `language_detection` | `bool` | `None` | Enable automatic language detection. | +| `format_turns` | `bool` | `True` | Whether to format transcript turns. | +| `speaker_labels` | `bool` | `None` | Enable speaker diarization. | +| `vad_threshold` | `float` | `None` | VAD confidence threshold (0.0–1.0) for classifying audio frames as silence. | +| `domain` | `str` | `None` | Optional domain for specialized recognition modes (e.g., `"medical-v1"` for Medical Mode). | ## Usage