Skip to content
View vinzlercodes's full-sized avatar
⛰️
Scaling new Goals
⛰️
Scaling new Goals

Block or report vinzlercodes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
vinzlercodes/README.md

👋 Hi, I’m Vinayak Sengupta

Data Scientist & LLM Systems Engineer

I build LLM + ML systems that actually ship – from fine-tune-and-serve platforms and GraphRAG pipelines to model-agnostic explainability and data products.


LinkedIn

Medium

GitHub

Email

Resume


🧠 What I do

  • LLM platforms & agents

    • Fine-tune and serve open-source LLMs (Axolotl + QLoRA) behind vLLM for enterprise agent use cases.
    • Design guardrailed RAG / GraphRAG systems with content safety (OpenAI Moderation, NeMo Guardrails) and retrieval metrics baked in.
  • ML systems & explainability

    • Ship model-agnostic attribution and PDP pipelines using DuckDB + Arrow, aligned with LightGBM + SHAP outputs for production dashboards.
  • Data & infra

    • Build and harden FastAPI services, K8s reconciliation loops, and data pipelines in SparkSQL, PostgreSQL, and warehouses.

I care about robustness, observability, and measurable impact – not just getting a Jupyter notebook to “work.”

💼 Snapshot of recent work

Aible – Data Scientist (2023–Present)

  • Custom LLM fine-tune & serve platform

    • Architected a fault-tolerant fine-tune-and-serve platform (Axolotl + QLoRA + vLLM) for multiple open-source foundation models.
    • Automated checkpoint detection & recovery, cutting manual setup / monitoring by ~80% and enabling a Fortune 50 telecom to ship a security metadata classifier on time.
  • Model-agnostic feature attribution & PDP

    • Built a single-pass explanation pipeline using model predictions + univariate summaries with DuckDB pivoting over Arrow data.
    • Unified global + pairwise explanations across LightGBM and SHAP, and reduced per-feature PDP compute 17× while keeping curve fidelity (ρ≈0.90).
  • Model reconciliation controller

    • Re-architected a fragmented Flask process into a FastAPI proxy with an idempotent reconcile loop using DeepDiff + K8s helpers.
    • Consolidated four manual steps into one API and cut manual recovery effort by 90%, speeding pod rollouts and eliminating double-launch incidents.
  • Graph-structured document summarization

    • Implemented RAG + GraphRAG over Neo4j with embedding clustering, token-based splitting, BM25, MMR, and FlashRank-based re-ranking.
    • Integrated OpenAI Moderation + NeMo Guardrails and improved retrieval nDCG@k by 25%, shipping a modular retrieval suite into production.

Research & writing

  • PPO post-training for Llama text-to-SQL (3B)

    • Engineered a PPO pipeline that boosted F1-SQL from 16% → 84% using ~1k human feedback samples and ≈$11 of H100 compute, reaching task-bounded parity with an OpenAI o3-series model.
  • Prior ML work

    • SATD detection and refactoring recommendation (capstone @ RIT).
    • Histopathology carcinoma classification using multi-level spatial fusion (CCIS book series, FTNCT 2019).
    • Long-form writing on Medium (Towards Data Science, The Startup, etc.) on topics from customer segmentation to the last 40 years of gaming.

🛠 Tech stack

Languages
Python · SQL · Cypher

ML / LLM
PyTorch · TensorFlow · Keras · scikit-learn · LightGBM · SHAP · ONNX
Axolotl · QLoRA · vLLM · LangChain · LlamaIndex · NeMo Microservices · NeMo Guardrails
OpenAI · Vertex AI

Data & storage
DuckDB · PostgreSQL · MongoDB · Neo4j · Chroma · PySpark
AWS · GCP

Backend / infra
FastAPI · Flask · Kubernetes · Docker · GitHub Actions

📌 Selected projects (public repos)

Some older but representative public work:

✍️ Latest writing

This section is auto-updated from my Medium RSS feed.

🗣 Talks & community

  • Authored the core problem statement & evaluation metrics for the UC Berkeley AI Summit 2023 – Data Science Hackathon.

  • Represented Aible at Ai4 2023, Google Next 2024, and AWS Summit 2024, running technical demos and stakeholder-facing discussions.

  • Enjoy long-form writing on data, ML, and games whenever I can find the time.

📈 GitHub activity & stats

GitHub profile metrics for vinzlercodes


Vinayak's GitHub stats


⚡ Fun fact: I will absolutely over-analyze both fragrance notes and video-game industry trends.

Pinned Loading

  1. Gaming-Industry-Analysis Gaming-Industry-Analysis Public

    A Data Analysis of a Video-Game Industry Dataset

    Jupyter Notebook 1 1

  2. Disaster-Response-Pipeline-Web-App Disaster-Response-Pipeline-Web-App Public

    An end-end ETL pipeline utilizing both an NLP and a Machine Learning Pipeline systems to create a web application that on typing a form of disaster-related message, categorizes it into categories f…

    Python

  3. Customer-Segmentation-and-Prediction Customer-Segmentation-and-Prediction Public

    The main aim of this project is to predict those individuals who are likely to become customers of the company based on many of their attributes, from the general population.

    HTML

  4. Recommendation-of-Refactoring-Techniques-to-address-Self-Admitted-Technical-Debt Recommendation-of-Refactoring-Techniques-to-address-Self-Admitted-Technical-Debt Public

    Jupyter Notebook 1

  5. Java-OOPs--Algorithms Java-OOPs--Algorithms Public

    Java