Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
31 changes: 16 additions & 15 deletions server/services/stt/assemblyai.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -173,21 +173,22 @@ Connection-level parameters previously passed via the `connection_params` constr

Runtime-configurable settings passed via the `settings` constructor argument using `AssemblyAISTTService.Settings(...)`. These can be updated mid-conversation with `STTUpdateSettingsFrame`. See [Service Settings](/guides/fundamentals/service-settings) for details.

| Parameter | Type | Default | Description |
| ---------------------------------- | ----------------- | ------------- | --------------------------------------------------------------------------- |
| `model` | `str` | `None` | STT model identifier. _(Inherited from base STT settings.)_ |
| `language` | `Language \| str` | `Language.EN` | Language for speech recognition. _(Inherited from base STT settings.)_ |
| `formatted_finals` | `bool` | `True` | Whether to enable transcript formatting. |
| `word_finalization_max_wait_time` | `int` | `None` | Maximum time to wait for word finalization in milliseconds. |
| `end_of_turn_confidence_threshold` | `float` | `None` | Confidence threshold for end-of-turn detection. |
| `min_turn_silence` | `int` | `None` | Minimum silence duration (ms) when confident about end-of-turn. |
| `max_turn_silence` | `int` | `None` | Maximum silence duration (ms) before forcing end-of-turn. |
| `keyterms_prompt` | `List[str]` | `None` | List of key terms to guide transcription. |
| `prompt` | `str` | `None` | Optional text prompt to guide transcription (u3-rt-pro only). |
| `language_detection` | `bool` | `None` | Enable automatic language detection. |
| `format_turns` | `bool` | `True` | Whether to format transcript turns. |
| `speaker_labels` | `bool` | `None` | Enable speaker diarization. |
| `vad_threshold` | `float` | `None` | VAD confidence threshold (0.0–1.0) for classifying audio frames as silence. |
| Parameter | Type | Default | Description |
| ---------------------------------- | ----------------- | ------------- | ------------------------------------------------------------------------------------------------ |
| `model` | `str` | `None` | STT model identifier. _(Inherited from base STT settings.)_ |
| `language` | `Language \| str` | `Language.EN` | Language for speech recognition. _(Inherited from base STT settings.)_ |
| `formatted_finals` | `bool` | `True` | Whether to enable transcript formatting. |
| `word_finalization_max_wait_time` | `int` | `None` | Maximum time to wait for word finalization in milliseconds. |
| `end_of_turn_confidence_threshold` | `float` | `None` | Confidence threshold for end-of-turn detection. |
| `min_turn_silence` | `int` | `None` | Minimum silence duration (ms) when confident about end-of-turn. |
| `max_turn_silence` | `int` | `None` | Maximum silence duration (ms) before forcing end-of-turn. |
| `keyterms_prompt` | `List[str]` | `None` | List of key terms to guide transcription. |
| `prompt` | `str` | `None` | Optional text prompt to guide transcription (u3-rt-pro only). |
| `language_detection` | `bool` | `None` | Enable automatic language detection. |
| `format_turns` | `bool` | `True` | Whether to format transcript turns. |
| `speaker_labels` | `bool` | `None` | Enable speaker diarization. |
| `vad_threshold` | `float` | `None` | VAD confidence threshold (0.0–1.0) for classifying audio frames as silence. |
| `domain` | `str` | `None` | Optional domain for specialized recognition modes (e.g., `"medical-v1"` for Medical Mode). |

## Usage

Expand Down
Loading