Aman Mishra questinrest

About

AI/ML Engineer with hands-on experience building LLM systems, RAG pipelines, and backend services across industry and academic internships. Strong foundation in ML, DL, and GenAI, grounded in first-principles understanding, with demonstrated ability to evaluate LLM behavior and ship reliable infrastructure. Currently interested in designing reliability-aware, agent-ready GenAI systems with robust evaluation, monitoring, and scalable infrastructure.

Master's in Data Science — Defence Institute of Advanced Technology (DIAT), Pune

Bachelor's in Mathematics — Indira Gandhi National Open University (IGNOU), New Delhi

Experience

Aug 2025 Jan 2026	Founding Engineering Team · Observo Tech Studio Benchmarked email service providers, warm-up tools, and video analytics APIs for an AI-powered B2B marketing product. Built the early backend for multi-mailbox outbound campaigns with conversation-level state across 5 mailboxes (FastAPI + PostgreSQL). Implemented a real-time webhook pipeline capturing 16+ delivery and engagement signals. Deployed serverless APIs on Azure Functions and Logic Apps.
Apr 2025 Jul 2025	GenAI Engineer, R&D · Confedo AI Evaluated 4 LLM observability frameworks by implementing equivalent RAG workflows and comparing tracing quality, async context propagation, and metric reliability. Reverse-engineered tracing implementations across 3 open-source codebases. Prepared experimental RAG pipelines assessing retrieval quality and faithfulness using LLM-based and classical IR metrics.
Jan 2025 Jun 2025	Research Intern · Usable Security Group (USG) Lab @ IIIT Delhi Designed a zero-shot machine unlearning framework enabling class-level forgetting without original data. Used contrastive model inversion to generate 5K synthetic samples for targeted unlearning. Achieved 90%+ class-wise unlearning efficacy on SVHN with 77% runtime reduction vs. GKT. Benchmarked against 4 baselines. Explored coreset based strategies for effective unlearning.

Tech Stack

Languages
ML / DL
LLM / RAG
Backend
Cloud
Data

Featured Projects

DocStructRAG

Structure-aware documentation RAG pipeline built from scratch. Crawls 104 FastAPI pages, preserves section hierarchy, generates 971 metadata-rich chunks, and delivers citation-grounded answers via Pinecone + Groq.

Precision RAG

RAG system with SHA-256 document deduplication, namespace isolation, semantic reranking, and inline-cited answers. Upload a 200-page manual, ask a question, get a precise cited response.

Offline RAG Bot

Fully local, zero-cloud RAG system. Chat with your own PDFs using Ollama + Streamlit. Ships with a privacy-law corpus (GDPR, CCPA, LGPD, DPDP) for interactive legal research.

Transformer from Scratch

GPT-style decoder-only Transformer implemented from first principles in pure PyTorch. 18.9M-parameter model with ablation studies on Pre-LN vs Post-LN, sinusoidal vs RoPE, tracked via W&B.

YOLO Posture Estimation

Modular computer vision pipeline using YOLOv8 for object detection and pose estimation. Classifies postures (standing, sitting, bending) via geometric heuristics on skeletal keypoints.

CNN-LSTM Frame Classification

Sequential frame classification for gesture recognition using CNN feature extraction + LSTM temporal modeling. Built for TV gesture control with 5 distinct hand gesture classes.

Currently Working On

> Agentic systems design and evaluation workflows
> LLM observability and tracing infrastructure
> Production RAG with evaluation (RAGAS), deduplication, and structured retrieval
> Machine unlearning — zero-shot frameworks for class-level forgetting

Provide feedback

Saved searches

Use saved searches to filter your results more quickly