Software Engineer · Machine Learning Research · Production ML Systems
| Multi-Agent LLM Systems | Retrieval-Augmented Generation | Local & Cost-Aware Inference | Production ML Infrastructure |
I am a software engineer who builds production-grade AI systems — focusing on how models, data, and infrastructure behave under real-world constraints such as latency, cost, scale, and reliability.
| ICMLC 2026 |
When Graph Structure Hurts: Lightweight Path Ranking for Dense KG-RAG 93.9% AUC · 13× fewer parameters than GNN baselines · Designed for dense, production-scale knowledge graphs |
| Channel AI Conversational BI Platform |
Results System Stack → https://github.com/pranavkumaarofficial/newdhatu-enterprise |
| NLCLI Wizard Local LLM Tooling |
Results System Data |
| OneSKU Hybrid Retrieval System |
Implemented within a client-facing production environment; source code not publicly releasable. Results Engineering Notes |
| Efficient Agent Routing | Cost-aware agent selection for tool-heavy LLM workflows under strict latency budgets |
| Small Language Models for Analytics | Local inference, quantization, and structured reasoning for domain-specific business intelligence |
| AI / ML Systems Multi-agent orchestration · RAG · PEFT · Quantization · Model optimization |
Data & Infrastructure Apache Iceberg · PostgreSQL · Vector databases · Docker · Kubernetes · Cloud platforms |
Production Engineering FastAPI · Python · TypeScript · OAuth2 · PKI · HL7 / FHIR interoperability |

