
Hey, I'm Abdur 👋

17 y/o · Independent AI Researcher · India
Building artificial intelligence that goes beyond token prediction

X · Hugging Face · Email · ORCID · PyPI


About Me

I'm a self-taught AI researcher from Moradabad, India — no CS degree, no lab access, just curiosity and obsession.

I've spent the last year reverse-engineering how the brain works to build AI systems that are fundamentally different from today's transformers. My work spans novel architectures, training frameworks, and cognitive systems — all designed to be efficient, interpretable, and closer to how intelligence actually operates.


🔬 Published Work

🧭 Quatrix — Q-Compass Architecture

Replaces standard attention with Q-Compass — sequence mixing grounded in reinforcement learning navigation theory instead of geometric similarity. Three projections instead of four. 69% fewer attention parameters. One mechanism handles text, vision, audio, and world modeling.

   

🧠 Artificial Neural Mesh (ANM) V0

A modular multi-agent cognitive architecture featuring 12 specialized domain experts collaborating through Web-of-Thought (WoT) reasoning.

 

⚡ GEKO — Gradient-Efficient Knowledge Optimization

A plug-and-play fine-tuning framework that skips samples the model already knows — routing compute to hard samples and freezing mastered ones. Up to 80% compute savings at scale.
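As a rough illustration of the idea of skipping mastered samples and spending more on hard ones, here is a minimal sketch in plain Python. It is not GEKO's actual API: the function names, the loss thresholds, and the bucket rule are all assumptions made for this example, and the `hard_boost` value of 5 only echoes the "5× more compute" figure from the repository description.

```python
from typing import List


def sample_weights(
    losses: List[float],
    mastered_below: float = 0.1,  # hypothetical threshold: treat as "already known"
    hard_above: float = 1.0,      # hypothetical threshold: treat as "hard"
    hard_boost: float = 5.0,      # illustrative multiplier for hard samples
) -> List[float]:
    """Map per-sample losses to training weights.

    Mastered samples get weight 0 (skipped), hard samples get a boosted
    weight, and everything in between keeps weight 1.
    """
    weights = []
    for loss in losses:
        if loss < mastered_below:
            weights.append(0.0)
        elif loss > hard_above:
            weights.append(hard_boost)
        else:
            weights.append(1.0)
    return weights


def weighted_mean_loss(losses: List[float], weights: List[float]) -> float:
    """Weighted average loss; returns 0.0 when every sample was skipped."""
    total_weight = sum(weights)
    if total_weight == 0:
        return 0.0
    return sum(l * w for l, w in zip(losses, weights)) / total_weight


losses = [0.02, 0.5, 2.0, 0.05]
weights = sample_weights(losses)
batch_loss = weighted_mean_loss(losses, weights)
```

In a real training loop the skipped samples would also be dropped before the forward pass, which is where any compute saving would come from; the weighting above only shows the routing decision.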

     

🔥 Currently Training

Berry-Q0 Model

Berry-Q0 — ~50M parameters, trained from scratch on a single laptop GPU (RTX 4050, 6GB VRAM). Text + vision, currently in GRPO reasoning training (R1-style, math domain).
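For readers unfamiliar with GRPO, its core step is scoring each sampled answer relative to the other answers drawn for the same prompt. The sketch below shows that group-relative advantage computation in plain Python. It is a generic illustration, not Berry-Q0's training code: the function name, the use of the population standard deviation, and the 0/1 correctness rewards are assumptions.

```python
import statistics
from typing import List


def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize rewards within one group of answers to the same prompt.

    Each advantage is (reward - group mean) / (group std + eps), so answers
    better than the group average get a positive value and worse ones a
    negative value. eps avoids division by zero when all rewards are equal.
    """
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mean) / (std + eps) for r in rewards]


# Hypothetical group: four sampled answers to one math problem, reward 1.0 if correct.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

When every answer in a group gets the same reward, all advantages are zero and that prompt contributes no learning signal, which is one reason reward design matters for small models.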

The goal: push a 50M model as far as possible on reasoning. No cloud. No team. Just architecture.


Built from scratch · No lab · No shortcuts

Pinned

  1. quatrix (Public)

    Quatrix — Q-Compass Architecture: novel neural architecture replacing attention with value-based navigation. Base for the Quasar model series.

    Python

  2. GEKO (Public)

    Intelligent training framework that automatically skips mastered samples and gives 5× more compute to hard ones. Up to 80% compute savings on LLM fine-tuning.

    Python 1

  3. Artificial-Neural-Mesh-V0 (Public)

    A multi-agent reasoning system.

    Python