Skip to content
View Luckyseven1122's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Block or report Luckyseven1122

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Luckyseven1122/README.md

Hej, Moi, ๐Ÿ‘‹, I'm Yaoxuan (Seven) Zhu

MLOps Engineer ย |ย  ML Engineer ย |ย  Data Scientist
Building production-grade ML systems from data pipelines to deployment & monitoring

Open to Data Scientist ยท ML Engineer ยท MLOps Engineer ยท Data Engineer opportunity


๐Ÿง  About Me

I'm a PhD researcher & MLOps Engineer at KTH Royal Institute of Technology (Stockholm) with 5+ years of end-to-end ML experience โ€” from raw sensor data to production APIs monitored 24/7.

My core focus is building intelligent process monitoring systems that fuse multi-modal signals (vibration + acoustics) to predict surface quality in real-time manufacturing. I treat every model as a product: versioned, containerized, tested, and observable.

  • ๐Ÿ”ฌ Research: Multimodal deep learning, LLMs for time-series, Vision Transformers, Diffusion Models for data augmentation
  • โš™๏ธ Engineering: End-to-end MLOps (MLflow, Docker, Kubernetes, GitHub Actions, Evidently, Grafana)
  • โ˜๏ธ Cloud: AWS (SageMaker, S3, Lambda, Kinesis, Step Functions), GCP, Azure Databricks
  • ๐Ÿ“ฆ Currently: Part-time ML Engineer @ NordML โ€” demand forecasting system for Nudient AB on GCP
  • ๐Ÿ“ Stockholm, Sweden

๐Ÿš€ What I Bring to DS / MLE / MLOps Roles

Area What I've Done
Data Science Feature engineering on time-series signals; statistical & automated extraction; multimodal fusion; uncertainty quantification
ML Engineering Trained & fine-tuned CNNs, ViTs, LLMs (LoRA/QLoRA), diffusion models; 15%+ error reduction vs. baselines; 95%+ accuracy in production
MLOps MLflow tracking + model registry; Docker + Kubernetes deployments; CI/CD with GitHub Actions; drift detection with Evidently; dashboards in Grafana

๐Ÿ› ๏ธ Tech Stack

๐Ÿค– ML / DL

Python PyTorch TensorFlow Keras scikit-learn HuggingFace

Specialties: LLMs (LoRA/QLoRA fine-tuning) ยท Vision Transformers (ViT) ยท CNNs (VGG, ResNet, InceptionResNet) ยท GANs ยท Diffusion Models ยท LSTM / RNN ยท NLP Transformers ยท SVM / RF / GBR

โš™๏ธ MLOps & Engineering

MLflow Docker Kubernetes GitHub Actions Apache Airflow DVC Grafana Evidently

โ˜๏ธ Cloud & Data

AWS GCP Databricks SQL Git


๐Ÿ’ผ Experience Highlights

๐ŸŽ“ Full-stack Data Scientist โ€” KTH Royal Institute of Technology (Nov 2019 โ€“ Present)

End-to-end intelligent process quality monitoring system

  • Built a multimodal ML pipeline fusing vibration + acoustic signals to predict surface roughness in real time
  • Achieved >15% error reduction vs. single-sensor baselines via ensemble fusion with attention mechanisms
  • Productionized with MLflow + Docker + Kubernetes + GitHub Actions CI/CD; drift detection via Evidently; real-time dashboards in Grafana
  • Deployed both batch and low-latency REST APIs; enforced data contracts for governed, auditable releases

๐Ÿข ML Engineer (Part-time) โ€” NordML / Nudient AB (June 2025 โ€“ Present)

Inventory demand forecasting system

  • Built multi-SKU demand forecasting models with Shopify data pipelines, feature engineering, and DVC versioning
  • Deployed batch + real-time inference on GCP with Docker + CI/CD
  • Monitored via Grafana with MAPE, service level, and fill rate alerts tied to safety-stock replenishment

๐Ÿ”ง Project Engineer โ€” Sandvik Coromant AB (Aug 2018 โ€“ Feb 2019)

ML-based milling runout & tool life prediction

  • Developed SVR, ANN, LR models on vibration time-series to predict tool wear impact from milling runout
  • End-to-end responsibility: data collection โ†’ preprocessing โ†’ modeling โ†’ presentation at R&D monthly conference

๐Ÿ“‚ Featured Projects

๐Ÿค– LLM for Surface Quality Prediction (Dec 2024 โ€“ Present)

Fine-tuned ChatGLM2 with LoRA/QLoRA for surface quality prediction from textualized vibration features. Improved cross-dataset accuracy by 10%; strong results in zero-shot and small-sample scenarios.

๐ŸŽจ Data Augmentation with Diffusion Models (Jan 2024 โ€“ Nov 2024)

Applied Conditional GANs & Classifier-Guidance Diffusion Models to synthesize training images. ๐Ÿ“„ Published in Procedia CIRP, 58th CIRP Conference on Manufacturing Systems

๐Ÿ”ญ Vision Transformer for Process Monitoring (Jun 2023 โ€“ May 2024)

Modified & fine-tuned ViT with multiple attention mechanisms for multimodal surface quality prediction. ๐Ÿ“„ Under review โ€” Journal of Advanced Engineering Informatics

๐Ÿ”„ Deep Transfer Learning for In-Process Monitoring (Feb 2022 โ€“ Feb 2023)

Leveraged pre-trained CNNs (VGG16/19, ResNet50V2, InceptionResNetV2) on audio data; hyperparameter tuning with Bayesian Optimization. ๐Ÿ“„ Published in Journal of Manufacturing Letters

๐Ÿญ Robust Hard Turning โ€” Vinnova 3.5M SEK Project (Feb 2019 โ€“ Apr 2021)

Industrial collaboration with Volvo Truck, Sandvik, Swerim AB โ€” deployed models achieving 95%+ accuracy, Rยฒ > 0.98 for surface quality prediction. ๐Ÿ“„ Published in Journal of Engineering Applications of Artificial Intelligence


๐ŸŽ“ Education

Degree Institution Period
๐ŸŽ“ Ph.D. โ€” Intelligent Manufacturing (ML Monitoring Systems) KTH Royal Institute of Technology, Sweden 2019 โ€“ Present
๐ŸŽ“ M.Sc. โ€” Production Engineering KTH Royal Institute of Technology, Sweden 2015 โ€“ 2018
๐ŸŽ“ M.Sc. โ€” Computer Science (Minor) Aalto University, Finland 2015 โ€“ 2018
๐ŸŽ“ B.Sc. โ€” Mechanical Engineering Savonia University of Applied Sciences, Finland 2012 โ€“ 2015

๐Ÿ“ Selected Publications

๐Ÿ’ก Setup note: Export the overview figure from each paper PDF โ†’ save as paper_images/paper1.png ... paper5.png in this repo to display the images below.


1 ยท Diffusion Model for Data-Augmented Surface Roughness Prediction

Leveraging classifier-guidance diffusion model for improved surface roughness prediction through synthesized audible sound signal

๐Ÿ“ฐ Procedia CIRP, 58th CIRP Conference on Manufacturing Systems, 2025 ย |ย  ๐Ÿ”— DOI: 10.1016/j.procir.2025.XX.XXX

Applied a classifier-guided diffusion model to synthesize 2D Mel-spectrogram images of audible sound signals from machining, tackling data scarcity in deep learning-based monitoring. Synthetic + real data boosted VGG16 prediction accuracy and generalization on enriched datasets.

Key topics: Diffusion Model Data Augmentation Transfer Learning VGG16 Mel-spectrogram Surface Roughness

Paper 1 Overview Figure-Diffusion


2 ยท Deep Transfer Learning for In-Process Surface Quality Monitoring

Surface quality prediction in-situ monitoring system: A deep transfer learning-based regression approach with audible signal

๐Ÿ“ฐ Journal of Manufacturing Letters, 2024 ย |ย  ๐Ÿ”— DOI: 10.1016/j.mfglet.2024.09.156

Built an in-process surface roughness monitoring system using pre-trained CNNs (VGG16, VGG19, ResNet50V2, InceptionResNetV2) on Mel-spectrogram images of audible sound. Model architectures fine-tuned via Bayesian optimization; compared across multiple input window lengths.

Key topics: Transfer Learning CNN Bayesian Optimization Audible Sound Mel-spectrogram In-process Monitoring

Paper 2 Overview Figure-Deep-TL


3 ยท Multi-channel Data-driven Monitoring for Hard Part Turning

Data-driven approaches for surface quality monitoring and prediction based on heterogeneous multi-channel signal fusion in hard part machining

๐Ÿ“ฐ Journal of Engineering Applications of Artificial Intelligence, 2025 ย |ย  ๐Ÿ”— Open Access โ€” CC BY

A comprehensive methodology evaluating signal selection, multi-sensor fusion, and ML model deployment for hard turning. Validated on two machining platforms; includes uncertainty quantification (NPKDE-based prediction intervals) and Bayesian hyperparameter tuning.

Key topics: Multi-sensor Fusion Hard Turning ML Models Uncertainty Quantification Bayesian Optimization Feature Engineering

Paper 3 Overview Figure-Data-driven


4 ยท Feature Extraction Methods Comparison for Turning Process Monitoring

Surface roughness monitoring and prediction based on audible sound signal with the comparison of statistical and automatic feature extraction methods in turning process

๐Ÿ“ฐ Euspen's 24th International Conference & Exhibition, 2024 ย |ย  ๐Ÿ”— Open Access โ€” CC BY

Benchmarked statistical (handcrafted) vs. automated feature extraction methods applied to audible sound signals for surface roughness prediction in CNC turning. Provided clear guidance on feature strategy selection for practical deployment.

Key topics: Feature Engineering Automated Feature Extraction Signal Processing Turning Process Time-series Analysis

Paper 4 Overview Figure โ€” replace with your exported figure


5 ยท Vision Transformer with Bottleneck Attention for Multimodal Monitoring

A multi-modality transformer approach: surface roughness monitoring and prediction with bottleneck attention mechanisms

๐Ÿ“ฐ Under Review โ€” Journal of Advanced Engineering Informatics

Modified and fine-tuned Vision Transformer (ViT) models combined with multiple bottleneck attention mechanisms for surface roughness prediction under multimodal information fusion (vibration + acoustic). Benchmarked across multiple fusion scenarios.

Key topics: Vision Transformer (ViT) Attention Mechanisms Multimodal Fusion Deep Learning Surface Quality Prediction

Paper 5 Overview Figure


6 ยท Data augmenation methods for improved surface quality monitoring and prediction

Data augmenation based surface quality monitoring with machine learning models

๐Ÿ“ฐ Oral presentation as the invited speaker at Digital Cluster Conference, Stockholm, 10-05-2023

ย |ย  ๐ŸŽฅ Video Link

Modified and fine-tuned Vision Transformer (ViT) models combined with multiple bottleneck attention mechanisms for surface roughness prediction under multimodal information fusion (vibration + acoustic). Benchmarked across multiple fusion scenarios.

Key topics: Data Augmentation Signal Synthesis ML Algorithms Surface Roughness Prediction Audible Sound

Paper 5 Overview Figure - DA


๐Ÿ“Š GitHub Stats

ย 

Pinned Loading

  1. mlops2024 mlops2024 Public

    Forked from BestGigaCat/mlops2024

    Jupyter Notebook

  2. Data-Science-Interview-Resources Data-Science-Interview-Resources Public

    Forked from rbhatia46/Data-Science-Interview-Resources

    A repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.

  3. llm-zoomcamp llm-zoomcamp Public

    Forked from DataTalksClub/llm-zoomcamp

    LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.

    Jupyter Notebook 1

  4. llm_interview_note llm_interview_note Public

    Forked from wdndev/llm_interview_note

    ไธป่ฆ่ฎฐๅฝ•ๅคง่ฏญ่จ€ๅคงๆจกๅž‹๏ผˆLLMs๏ผ‰ ็ฎ—ๆณ•๏ผˆๅบ”็”จ๏ผ‰ๅทฅ็จ‹ๅธˆ็›ธๅ…ณ็š„็Ÿฅ่ฏ†ๅŠ้ข่ฏ•้ข˜

    HTML

  5. MLOPs-Interview-Questions MLOPs-Interview-Questions Public

    Forked from SepidehHosseinian/MLOPs-Interview-Questions

    A collection of MLOps interview questions and answers for data scientists and engineers. Learn how to apply MLOps principles and practices to automate and streamline machine learning workflows.

  6. Time-LLM Time-LLM Public

    Forked from KimMeen/Time-LLM

    [ICLR 2024] Official implementation of " ๐Ÿฆ™ Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

    Python