Skip to content
View DavidZha1994's full-sized avatar

Block or report DavidZha1994

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DavidZha1994/README.md

Hi there, I'm Yu Zha 👋

Senior ML Engineer & Data Scientist

  • 🔬 Building production ML systems — NLP, Computer Vision, and multimodal AI
  • 🏗️ Currently crafting data pipelines & ARR models at niologic GmbH (Germany)
  • 📄 Published researcher in multimodal NLP for the construction industry (CIB W78 2024)
  • 🌍 Languages: Chinese (native), German (fluent), English (fluent)
  • 🎓 M.Sc. Applied CS @ Ruhr University Bochum · B.Sc. IoT Engineering @ Beijing University of Technology

💁‍♂️ Connect with me

LinkedIn  Email  GitHub  ORCID  Profile views


🧠 ML & Data Science

Python PyTorch Transformers scikit-learn Pandas Jupyter

📀 Data & Infrastructure

Azure SQL dbt Airflow Superset Grafana Docker

🤖 AI Tooling & Dev

TypeScript Vue.js Claude Code MCP Linux Git


💼 Work Experience

gantt
    dateFormat YYYY-MM
    title Work Experience

    section Employment
        ML Researcher & CV/NLP Engineer — Jaeger Gruppe   :done, jg, 2021-07, 2025-01
        Senior Data Scientist & ML Engineer — niologic GmbH :active, nio, 2025-01, 2026-02

    section Education
        B.Sc. IoT Engineering — Beijing Univ. of Technology :done, bsc, 2016-09, 2020-07
        M.Sc. Applied CS — Ruhr University Bochum           :done, msc, 2020-10, 2023-03
Loading

🚀 Featured Projects

Research & ML

Project Description
digital-poststelle Multimodal NLP email classifier for construction — BERT+BiLSTM, 92.3% accuracy. CIB W78 2024
CondiNILM Conditioned multi-task learning framework for non-intrusive load monitoring
ViT-HyperSense Systematic hyperparameter sensitivity analysis of Vision Transformers with Optuna
ARPL-MRI Adversarial Reciprocal Points Learning for MRI Open Set Recognition

AI Tools & MCP Servers

Project Description
everything-claude-code Complete Claude Code config collection — agents, skills, hooks, commands. Anthropic hackathon winner.
superset-mcp-new Connect 50+ data stores via Apache Superset MCP server
metabase-mcp-server MCP integration layer for Metabase + AI assistants
PageIndex Vectorless, reasoning-based RAG — 98.7% accuracy on FinanceBench

📄 Publication

Optimizing Email Classification in the Construction Industry through a Multimodal NLP Approach Yu Zha, Sherief Ali, Sebastian Schumacher, Michael Schulte, Markus König CIB W78 2024, Marrakesh, MoroccoPDF


📊 GitHub Stats


⏱️ WakaTime Stats


Open to collaboration on ML research, AI tooling, and data infrastructure projects.

Popular repositories Loading

  1. ARPL-MRI ARPL-MRI Public

    Adversarial Reciprocal Points Learning for MRI Open Set Recognition

    Jupyter Notebook 1

  2. Gait-Analysis Gait-Analysis Public

    C++

  3. taojinbi taojinbi Public

    Forked from JavisPeng/taojinbi

    淘宝淘金币自动执行脚本,包含蚂蚁森林收取能量,芭芭农场全任务,解放你的双手

    JavaScript

  4. mimotion-1 mimotion-1 Public

    Forked from mixool/mimotion

    小米运动 微信步数 支付宝步数

    Shell

  5. JD_Sign_Action JD_Sign_Action Public

    Forked from zjs5201314/JD_Sign_Action

    基于github actions的京东签到、领京豆

    JavaScript

  6. car-price-prediction-master car-price-prediction-master Public

    Parallel Car Price Prediction in Python with Dask (Dataset: https://www.kaggle.com/ananaymital/us-used-cars-dataset)

    Jupyter Notebook