Skip to content
View questinrest's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report questinrest

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
questinrest/README.md
Aman Mishra — AI/ML Engineer

LinkedIn   Email   Portfolio   Newsletter

Profile Views

About

AI/ML Engineer with hands-on experience building LLM systems, RAG pipelines, and backend services across industry and academic internships. Strong foundation in ML, DL, and GenAI, grounded in first-principles understanding, with demonstrated ability to evaluate LLM behavior and ship reliable infrastructure. Currently interested in designing reliability-aware, agent-ready GenAI systems with robust evaluation, monitoring, and scalable infrastructure.

Master's in Data Science — Defence Institute of Advanced Technology (DIAT), Pune

Bachelor's in Mathematics — Indira Gandhi National Open University (IGNOU), New Delhi


Experience

Aug 2025
Jan 2026
Founding Engineering Team · Observo Tech Studio
Benchmarked email service providers, warm-up tools, and video analytics APIs for an AI-powered B2B marketing product. Built the early backend for multi-mailbox outbound campaigns with conversation-level state across 5 mailboxes (FastAPI + PostgreSQL). Implemented a real-time webhook pipeline capturing 16+ delivery and engagement signals. Deployed serverless APIs on Azure Functions and Logic Apps.
Apr 2025
Jul 2025
GenAI Engineer, R&D · Confedo AI
Evaluated 4 LLM observability frameworks by implementing equivalent RAG workflows and comparing tracing quality, async context propagation, and metric reliability. Reverse-engineered tracing implementations across 3 open-source codebases. Prepared experimental RAG pipelines assessing retrieval quality and faithfulness using LLM-based and classical IR metrics.
Jan 2025
Jun 2025
Research Intern · Usable Security Group (USG) Lab @ IIIT Delhi
Designed a zero-shot machine unlearning framework enabling class-level forgetting without original data. Used contrastive model inversion to generate 5K synthetic samples for targeted unlearning. Achieved 90%+ class-wise unlearning efficacy on SVHN with 77% runtime reduction vs. GKT. Benchmarked against 4 baselines. Explored coreset based strategies for effective unlearning.

Tech Stack

Languages Python  SQL
ML / DL PyTorch  Hugging Face  scikit-learn  YOLO  Weights & Biases
LLM / RAG LangChain  LangSmith  Pinecone  Groq  Ollama
Backend FastAPI  Docker  Celery  Redis  REST APIs
Cloud Azure  Supabase  Linux  Git
Data PostgreSQL  MongoDB  Pandas

Featured Projects

Structure-aware documentation RAG pipeline built from scratch. Crawls 104 FastAPI pages, preserves section hierarchy, generates 971 metadata-rich chunks, and delivers citation-grounded answers via Pinecone + Groq.

Python  LangChain  Pinecone  FastAPI

RAG system with SHA-256 document deduplication, namespace isolation, semantic reranking, and inline-cited answers. Upload a 200-page manual, ask a question, get a precise cited response.

Python  Pinecone  FastAPI  PyMuPDF

Fully local, zero-cloud RAG system. Chat with your own PDFs using Ollama + Streamlit. Ships with a privacy-law corpus (GDPR, CCPA, LGPD, DPDP) for interactive legal research.

Python  Ollama  Streamlit  FAISS

GPT-style decoder-only Transformer implemented from first principles in pure PyTorch. 18.9M-parameter model with ablation studies on Pre-LN vs Post-LN, sinusoidal vs RoPE, tracked via W&B.

PyTorch  W&B

Modular computer vision pipeline using YOLOv8 for object detection and pose estimation. Classifies postures (standing, sitting, bending) via geometric heuristics on skeletal keypoints.

Python  YOLO  OpenCV

Sequential frame classification for gesture recognition using CNN feature extraction + LSTM temporal modeling. Built for TV gesture control with 5 distinct hand gesture classes.

PyTorch  CNN+LSTM


Currently Working On

> Agentic systems design and evaluation workflows
> LLM observability and tracing infrastructure
> Production RAG with evaluation (RAGAS), deduplication, and structured retrieval
> Machine unlearning — zero-shot frameworks for class-level forgetting
Footer

Pinned Loading

  1. TierRAG TierRAG Public

    Multi-Tiered Retrieval-Augmented Generation System

    Python 1

  2. precision-rag-with-deduplication precision-rag-with-deduplication Public

    Jupyter Notebook

  3. doc-struct-rag doc-struct-rag Public

    Structure-aware documentation RAG pipeline with custom ingestion, hierarchical chunking, and citation-grounded answers.

    Jupyter Notebook

  4. offline-rag-bot offline-rag-bot Public

    Jupyter Notebook

  5. transformer_from_scratch transformer_from_scratch Public

    Reimplementation of decoder based transformer in pure PyTorch

    Jupyter Notebook