Refactor: Implement Async Semantic Routing to eliminate O(N) LLM bott…#390
Open
NEHAJAKATE wants to merge 1 commit into fireform-core:main from
Motivation
Currently, the extraction pipeline (e.g., in src/llm.py) iterates sequentially over form fields, creating an O(N) HTTP blocking bottleneck. Attempting to solve this by dumping all fields into a single monolithic prompt causes "Attention Dilution" ("Lost in the Middle" syndrome) in smaller, local SLMs, leading to hallucinated or omitted fields in the middle of the schema.

Changes Proposed
This PR introduces an asynchronous Semantic Router to handle extractions concurrently and deterministically.
Schema Chunking: Decomposed the master extraction requirement into logical, domain-specific Pydantic sub-schemas (e.g., Spatial, Medical, Tactical).
Asynchronous Concurrency: Replaced the synchronous blocking loop with aiohttp and asyncio.gather to fire focused extraction chunks concurrently without blocking the FastAPI event loop.

Non-Destructive Refactor: Implemented the new asynchronous router while maintaining the integrity of the existing project structure.
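The two steps above (schema chunking plus asyncio.gather fan-out) can be sketched together as follows. This is a minimal illustration, not the PR's code: the domain names and field lists are hypothetical stand-ins for the Pydantic sub-schemas, and the aiohttp round-trip to the local SLM is simulated with a sleep so the sketch runs without a model server.

```python
import asyncio

# Hypothetical domain sub-schemas as field-name groups; the PR models these
# as Pydantic sub-schemas (Spatial, Medical, Tactical) with real field types.
SUB_SCHEMAS = {
    "spatial": ["location", "grid_reference"],
    "medical": ["injury_count", "triage_level"],
    "tactical": ["unit", "threat_level"],
}

async def extract_chunk(domain: str, fields: list[str], report: str) -> dict:
    """Fire one focused extraction for a single domain.

    In the PR this is an aiohttp POST to the local SLM endpoint; here the
    HTTP round-trip is simulated with a sleep so the sketch is standalone.
    """
    await asyncio.sleep(0.05)  # stand-in for the HTTP request latency
    return {domain: {field: None for field in fields}}

async def route_extractions(report: str) -> dict:
    """Run all domain chunks concurrently with asyncio.gather.

    Wall-clock latency is bounded by the slowest chunk rather than the
    sum of all chunks, and the event loop is never blocked.
    """
    tasks = [
        extract_chunk(domain, fields, report)
        for domain, fields in SUB_SCHEMAS.items()
    ]
    results = await asyncio.gather(*tasks)
    merged: dict = {}
    for chunk in results:
        merged.update(chunk)
    return merged

if __name__ == "__main__":
    print(asyncio.run(route_extractions("incident report text")))
```

Because each task carries only one domain's fields, the prompt sent per request stays small and focused, which is what keeps the SLM's attention from diluting across the full schema.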
Impact
Latency Reduction: Reduces wall-clock latency from the sum of N sequential calls to roughly the latency of the single slowest chunk (effectively O(1) in the number of fields), drastically speeding up report generation.

Accuracy Maximization: Maximizes local SLM accuracy by keeping the context window hyper-focused on one specific domain per generation.
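The latency claim can be demonstrated with a self-contained timing sketch (simulated calls, no model server needed): sequential awaits total roughly the sum of the delays, while asyncio.gather totals roughly the maximum delay.

```python
import asyncio
import time

async def fake_llm_call(delay: float) -> float:
    """Stand-in for one chunked extraction request."""
    await asyncio.sleep(delay)
    return delay

async def sequential(delays: list[float]) -> list[float]:
    # Old behaviour: each call blocks the next, so total ~= sum(delays).
    return [await fake_llm_call(d) for d in delays]

async def concurrent(delays: list[float]) -> list[float]:
    # New behaviour: all chunks fire at once, so total ~= max(delays).
    return await asyncio.gather(*(fake_llm_call(d) for d in delays))

if __name__ == "__main__":
    delays = [0.03, 0.06, 0.09]
    t0 = time.perf_counter()
    asyncio.run(sequential(delays))
    t_seq = time.perf_counter() - t0
    t0 = time.perf_counter()
    asyncio.run(concurrent(delays))
    t_conc = time.perf_counter() - t0
    print(f"sequential={t_seq:.2f}s concurrent={t_conc:.2f}s")
```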
How to Test
mistral or llama3).

(Note: I am submitting this PR as part of my active contribution and exploration of the FireForm architecture for GSoC 2026. I would love any feedback from the maintainers on this approach!)