feat(router): add LLM routing with cost optimization and pretrained configs by bsbodden · Pull Request #476 · redis/redis-vl-python

bsbodden · 2026-02-16T22:27:14Z

Extends SemanticRouter with LLM model selection, cost-optimized routing, and pretrained configurations — routing queries to the right model using Redis vector search.

"hello, how are you?" → GPT-4.1 Nano ($0.10/M tokens)
"explain garbage collection" → Claude Sonnet 4.5 ($3/M tokens)
"architect a distributed system" → Claude Opus 4.5 ($5/M tokens)

Design

LLM routing is integrated directly into SemanticRouter. When a Route includes an optional model field, the router returns the LiteLLM-compatible model identifier alongside the match, with a confidence score derived from vector distance (1 - distance/2).

Schema extensions (all optional, no breaking changes):

Route gains model: Optional[str] and metadata: Dict
RouteMatch gains model, confidence, alternatives, and metadata fields
RoutingConfig gains cost_optimization, cost_weight, and default_route
Callable pattern: router(query) returns a RouteMatch
route_many() returns multiple ranked matches
Full async parity via AsyncSemanticRouter

Routes without a model field work exactly as before — existing SemanticRouter usage is unaffected.

Usage

Basic LLM routing:

from redisvl.extensions.router import SemanticRouter, Route

routes = [
    Route(name="simple", model="openai/gpt-4.1-nano",
          references=["hello", "hi"], distance_threshold=0.5),
    Route(name="expert", model="anthropic/claude-opus-4-5",
          references=["architect a system", "design an algorithm"],
          distance_threshold=0.7),
]
router = SemanticRouter(name="llm-router", routes=routes,
                        redis_url="redis://localhost:6379")

match = router("hello there")
print(match.model)       # openai/gpt-4.1-nano
print(match.confidence)  # 0.81

Pretrained config — ships with a 3-route config (simple/standard/expert) mapped to Bloom's Taxonomy levels, with pre-computed sentence-transformers/all-mpnet-base-v2 embeddings:

router = SemanticRouter.from_pretrained("default", redis_url="redis://localhost:6379")

Cost-optimized routing — when multiple routes match with similar distances, a cost penalty biases toward cheaper models:

from redisvl.extensions.router.schema import RoutingConfig

router = SemanticRouter(
    name="cost-router", routes=routes,
    routing_config=RoutingConfig(cost_optimization=True, cost_weight=0.3),
    redis_url="redis://localhost:6379",
)

Async:

router = await AsyncSemanticRouter.create(
    name="async-router", routes=routes, redis_url="redis://localhost:6379")
match = await router("hello")

Export/import with embeddings:

router.export_with_embeddings("my_router.json")
loaded = SemanticRouter.from_pretrained("my_router.json", redis_url="redis://localhost:6379")

Files changed

Area	Files
Core router	`redisvl/extensions/router/semantic.py`, `schema.py`, `__init__.py`
Pretrained configs	`redisvl/extensions/router/pretrained/__init__.py`, `default.json`
Query support	`redisvl/query/query.py`, `redisvl/utils/full_text_query_helper.py`
Tests	`tests/unit/test_llm_router_schema.py`, `tests/integration/conftest.py`, `tests/unit/conftest.py`
Docs	`docs/user_guide/13_llm_router.ipynb`
Tooling	`scripts/generate_pretrained_config.py`

Copilot

Pull request overview

Copilot reviewed 16 out of 18 changed files in this pull request and generated 10 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redisvl/extensions/router/semantic.py

redisvl/extensions/llm_router/__init__.py

docs/user_guide/13_llm_router.ipynb

redisvl/extensions/router/semantic.py

redisvl/extensions/llm_router/__init__.py

cursor

Cursor Bugbot has reviewed your changes and found 3 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

redisvl/extensions/llm_router/pretrained/__init__.py

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

cursor

Cursor Bugbot has reviewed your changes and found 3 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

rbs333

I think we still have some disconnects on the design of this feature. Let's maybe set up some time to talk through it.

redisvl/extensions/llm_router/router.py

docs/user_guide/13_llm_router.ipynb

redisvl/extensions/llm_router/__init__.py

redisvl/extensions/llm_router/DESIGN.md

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/router.py

redisvl/extensions/llm_router/schema.py

jit-ci · 2026-03-15T17:57:36Z

🛡️ Jit Security Scan Results

✅ No security findings were detected in this PR

^{Security scan by Jit}

Copilot

Pull request overview

Copilot reviewed 16 out of 18 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redisvl/extensions/router/semantic.py

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

Copilot

Pull request overview

Copilot reviewed 16 out of 18 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redisvl/extensions/router/semantic.py

redisvl/extensions/llm_router/__init__.py

docs/user_guide/13_llm_router.ipynb

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

bsbodden · 2026-03-15T19:42:38Z

@rbs333 Reworked the PR based on your feedback. Here's where things stand:

1. No separate class — SemanticRouter is the API

All LLM routing logic (cost optimization, confidence scoring, from_pretrained(), export_with_embeddings()) lives directly in SemanticRouter. No separate LLMRouter class.

2. "Tier" → "Route"

Route with an optional model field is the only concept. All code, tests, and the notebook use Route/route consistently.

3. Callable pattern

router(query) throughout — no router.route().

4. from_pretrained() on SemanticRouter

from redisvl.extensions.router import SemanticRouter

router = SemanticRouter.from_pretrained("default", redis_url="redis://localhost:6379")
match = router("hello")  # -> RouteMatch(name="simple", model="openai/gpt-4.1-nano", ...)

5. Dead code removed

llm_router/router.py (1,528 lines) — deleted
llm_router/schema.py (206 lines) — deleted
llm_router/DESIGN.md — deleted

6. Notebook fully rewritten

Every cell uses SemanticRouter, Route, router(), match.name.

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/llm_router/__init__.py

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

redisvl/extensions/router/pretrained/__init__.py

Copilot

Pull request overview

Copilot reviewed 12 out of 14 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redisvl/extensions/router/semantic.py

docs/user_guide/13_llm_router.ipynb

…onfigs Extend SemanticRouter to support LLM model selection by adding optional model, confidence, cost optimization, and multi-match capabilities to the existing routing infrastructure. When a Route includes a `model` field, the router returns the LiteLLM- compatible model identifier alongside the match, with a confidence score derived from vector distance. Cost-optimized routing biases toward cheaper models when semantic distances are close, using a configurable cost_weight penalty. Key additions to SemanticRouter: - Route.model (optional) for LiteLLM model identifiers - RouteMatch.confidence, .alternatives, .metadata fields - RoutingConfig.cost_optimization and .cost_weight settings - RoutingConfig.default_route for fallback when no match found - from_pretrained() to load routers with pre-computed embeddings - export_with_embeddings() to serialize routers with vectors - AsyncSemanticRouter with full async parity A built-in "default" pretrained config ships with 3 tiers (simple, standard, expert) mapped to GPT-4.1 Nano, Claude Sonnet 4.5, and Claude Opus 4.5, using pre-computed sentence-transformers embeddings. Backward compatibility: - LLMRouter/AsyncLLMRouter provided as deprecated wrappers - ModelTier subclass enforces required model field - Legacy field names (tiers/default_tier) mapped bidirectionally - Existing SemanticRouter usage is fully unaffected Includes integration tests, unit tests for schema validation, a user guide notebook, and a pretrained config generation script.

rbs333

Looking good! I think now it just needs an update on the docs and then a quick latency check to make sure we're not adding overhead vs previous router lookup speed.

rbs333 · 2026-03-16T19:55:14Z

docs/user_guide/13_llm_router.ipynb

Looks good 👍

rbs333 · 2026-03-16T20:03:05Z

redisvl/utils/full_text_query_helper.py

+                    last_error = e
+                    if attempt < 2:  # Don't download on last attempt
+                        try:
+                            nltk.download("stopwords", quiet=True)


are these changes for the router?

Copilot AI review requested due to automatic review settings February 16, 2026 22:27

Copilot started reviewing on behalf of bsbodden February 16, 2026 22:27 View session

bsbodden force-pushed the llm-router branch from 0c13644 to fda6eb6 Compare February 16, 2026 22:31

This comment was marked as outdated.

Sign in to view

bsbodden added the experimental label Feb 16, 2026

bsbodden requested review from abrookins, Copilot and tylerhutcherson February 16, 2026 23:19

Copilot started reviewing on behalf of bsbodden February 17, 2026 00:45 View session

bsbodden self-assigned this Feb 17, 2026

This comment was marked as outdated.

Sign in to view

bsbodden requested review from rbs333 and removed request for abrookins and tylerhutcherson February 25, 2026 20:21

This comment was marked as resolved.

Sign in to view

This comment was marked as outdated.

Sign in to view

This comment was marked as resolved.

Sign in to view

bsbodden force-pushed the llm-router branch from e1cd469 to fed62ee Compare March 2, 2026 19:45

bsbodden requested a review from Copilot March 2, 2026 19:49

Copilot started reviewing on behalf of bsbodden March 2, 2026 19:50 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

cursor bot reviewed Mar 2, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

cursor bot reviewed Mar 2, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/pretrained/__init__.py Outdated Show resolved Hide resolved

bsbodden force-pushed the llm-router branch from e9ac7cc to 49b9ed9 Compare March 2, 2026 21:07

cursor bot reviewed Mar 2, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

Copilot AI review requested due to automatic review settings March 2, 2026 21:19

Copilot started reviewing on behalf of bsbodden March 2, 2026 21:20 View session

This comment was marked as outdated.

Sign in to view

cursor bot reviewed Mar 3, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

rbs333 reviewed Mar 3, 2026

View reviewed changes

redisvl/extensions/llm_router/router.py Outdated Show resolved Hide resolved

docs/user_guide/13_llm_router.ipynb Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/DESIGN.md Outdated Show resolved Hide resolved

bsbodden force-pushed the llm-router branch from 8b54c53 to ffff970 Compare March 15, 2026 17:07

redis deleted a comment from jit-ci bot Mar 15, 2026

cursor bot reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/llm_router/router.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/schema.py Outdated Show resolved Hide resolved

bsbodden changed the title ~~feat: LLM Router extension for cost-optimized model selection~~ feat(router): add LLM routing with cost optimization and pretrained configs Mar 15, 2026

Copilot AI review requested due to automatic review settings March 15, 2026 17:55

bsbodden force-pushed the llm-router branch from ffff970 to d44287f Compare March 15, 2026 17:55

Copilot started reviewing on behalf of bsbodden March 15, 2026 17:55 View session

Copilot AI reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/router/semantic.py Show resolved Hide resolved

redisvl/extensions/router/semantic.py Outdated Show resolved Hide resolved

redisvl/extensions/router/semantic.py Outdated Show resolved Hide resolved

cursor bot reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

bsbodden force-pushed the llm-router branch from d44287f to 57ec336 Compare March 15, 2026 18:56

cursor bot reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

Copilot AI review requested due to automatic review settings March 15, 2026 19:25

bsbodden force-pushed the llm-router branch from 57ec336 to b131a1e Compare March 15, 2026 19:25

Copilot started reviewing on behalf of bsbodden March 15, 2026 19:25 View session

Copilot AI reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/router/semantic.py Show resolved Hide resolved

redisvl/extensions/router/semantic.py Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

docs/user_guide/13_llm_router.ipynb Outdated Show resolved Hide resolved

cursor bot reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

bsbodden force-pushed the llm-router branch from b131a1e to 8d95d0d Compare March 15, 2026 19:40

cursor bot reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

redisvl/extensions/llm_router/__init__.py Outdated Show resolved Hide resolved

Copilot AI review requested due to automatic review settings March 15, 2026 19:51

bsbodden force-pushed the llm-router branch from 8d95d0d to acb0267 Compare March 15, 2026 19:51

Copilot started reviewing on behalf of bsbodden March 15, 2026 19:52 View session

cursor bot reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/router/pretrained/__init__.py Show resolved Hide resolved

Copilot AI reviewed Mar 15, 2026

View reviewed changes

redisvl/extensions/router/semantic.py Show resolved Hide resolved

docs/user_guide/13_llm_router.ipynb Show resolved Hide resolved

bsbodden force-pushed the llm-router branch from acb0267 to 54b0ca1 Compare March 15, 2026 20:00

rbs333 reviewed Mar 16, 2026

View reviewed changes

Conversation

bsbodden commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Design

Usage

Files changed

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as outdated.

This comment was marked as resolved.

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rbs333 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jit-ci bot commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🛡️ Jit Security Scan Results

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

bsbodden commented Feb 16, 2026 •

edited

Loading

jit-ci bot commented Mar 15, 2026 •

edited

Loading

bsbodden commented Mar 15, 2026 •

edited

Loading