8+ years building high-scale distributed systems · 2x Jeff Bezos Award Winner · Published IET Researcher · M.Tech AI/ML @ BITS Pilani
Lead Backend Engineer with 8+ years of experience delivering production systems across media and entertainment(UGC), SaaS, fintech, e-commerce, and renewable energy. Currently specializing in GenAI-first architectures: RAG pipelines, multi-agent systems, and LLM-powered applications at scale.
Co-architected Nojoto, India's leading multilingual storytelling platform, scaling it from 10K to 10M+ users and Rs 12Cr ARR. Built Elasticsearch infrastructure handling 225M documents at sub-50ms latency, a 30x performance improvement. Led a team of 8 engineers managing 200K+ daily messages and 50TB of media.
Pursuing an M.Tech in Artificial Intelligence & Machine Learning at BITS Pilani, bridging academic rigor with hands-on production experience.
- Scaled Nojoto from 10K to 10M+ users and 2M+ creators, raising Rs 26Cr in funding; recognized by Google (#WeArePlay), MeitY Top 100 Startups 2022, and YourStory Tech30
- Built Elasticsearch infrastructure for 225M documents delivering sub-50ms latency and 30x performance improvement, powering user and content discovery at scale
- Awarded Jeff Bezos Award for People & Leadership twice, recognizing outstanding impact in engineering and team development
- Published IoT research in IET Electronics Letters: a non-intrusive solar monitoring system for NISE & NIWE, Ministry of New and Renewable Energy · Read Paper
- Advised 50+ startups on MVP development, engineering team building, and go-to-market strategies
- Reduced infrastructure costs by 30% while improving performance by 60% through cloud architecture optimization
Production-grade Retrieval Augmented Generation systems and Generative AI applications.
| Project | Description | Stack |
|---|---|---|
| Hybrid RAG System | Advanced pipeline combining dense + sparse retrieval with Reciprocal Rank Fusion. Automated evaluation using MRR, NDCG, and BERTScore across 500 Wikipedia articles. | FAISS · BM25 · Flan-T5 · RRF |
| Customer Support Voice Agent | Voice-to-voice AI assistant using RAG for documentation-based knowledge retrieval with natural speech interaction. | GPT-4o · Qdrant · TTS |
| Elasticsearch RAG | Large-scale document retrieval using Elasticsearch's vector search capabilities integrated into a full RAG pipeline. | Elasticsearch · Vector Search · LLMs |
| LangChain RAG Chatbot | Production-ready enterprise knowledge retrieval system with customizable document ingestion and hybrid search pipelines. | LangChain · OpenAI · Vector DB |
| Resume Analyzer | Automated resume ingestion from Gmail with AI-powered analysis and structured output to Google Sheets. | LLMs · Gmail API · Google Sheets |
| Multi-Agent Researcher | Collaborative multi-agent system for automated research and information synthesis from diverse sources. | LangChain · Agent Framework · LLMs |
| Project | Description | Stack |
|---|---|---|
| Auto Jobs Applier AI Agent | Intelligent agent that automates job application workflows using AI reasoning and form interaction. | AI Agents · LLMs · Automation |
| AI Professional Headshot Generator | Fine-tuned image generation model for professional portraits. Live Demo | Stable Diffusion · LoRA · Vercel |
| AI Stock Comparison Agent | Financial analysis tool for Indian stocks with real-time data, sector insights, and analyst recommendations. | OpenAI · yFinance · Streamlit |
| AI Cold Email Generator | LLM-based tool for generating highly personalized outreach emails for job applications. | LLMs · LangChain · Python |
| AI Wedding Album | Smart AI-powered photo management and generation platform for weddings. Live Demo | GenAI · Vercel · Image Models |
M.Tech AI/ML at BITS Pilani: combining academic foundations with production-ready implementations.
| Repository | Description |
|---|---|
| M.Tech AI/ML Journey | Labs, research notes, and core AI/ML foundations: Statistical ML, Deep Learning, and NLP. |
| Churn Prediction ML Pipeline | End-to-end MLOps pipeline with experiment tracking, model versioning, and API serving using Airflow, DVC, and FastAPI. |
| NLP & Statistical Machine Translation | Implementation of statistical machine translation with BLEU score evaluation. |
Solar PV Monitoring System · Published in IET Electronics Letters (2017)
Non-intrusive IoT system developed for NISE & NIWE (Ministry of New and Renewable Energy) enabling real-time data capture, cloud processing, and automated fault detection for solar photovoltaic installations.
- Scaled platform from 10K to 10M+ users, 2M+ creators, Rs 12Cr ARR
- Built Elasticsearch infra for 225M documents with sub-50ms latency (30x improvement)
- Delivered AI/ML systems for content recommendations, personalization, and moderation
- Managed team of 8 engineers handling 200K+ daily messages and 50TB of media
- Reduced cloud costs 30%, improved performance 60%, grew organic traffic 70%+
- Awarded Jeff Bezos Award for People & Leadership twice
- Delivered 10+ e-commerce projects ahead of schedule with a team of 3 to 4 developers
- Managed payment integrations and AWS infrastructure, improving client revenue by 25%
- Provided architecture consulting to 50+ clients
- Built solar monitoring platform for government agencies NISE & NIWE under Ministry of New and Renewable Energy
- Developed real-time dashboards with automated anomaly detection pipelines
- Work led to published research in IET Electronics Letters
- From MySQL Hell to Elasticsearch Heaven: How We Built Instagram-Like Search · Scaling to 200M documents and 10M followers with sub-100ms search
- Nojoto System Design: Crafting a Scalable Storytelling Platform · Architecting a platform for millions of users across multiple media formats
- Mastering Elasticsearch: Unleashing the Power of Search · Technical guide on implementing efficient search in data-driven systems
- Leveraging FFmpeg in Content-Based Platforms with Python · Video and audio processing for modern media platforms
- AI Agent Deployment for Company-Wide Automation · Building enterprise-grade AI agent systems for departmental orchestration
- How to Stay Valuable as an Engineer in the AI Era · Shifting focus from code output to system reliability and engineering judgment
Gurugram, India · surya13493@gmail.com · linkedin.com/in/anandsuraj