-
Notifications
You must be signed in to change notification settings - Fork 326
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix[bug] ONNX models generated by llm_export.py are missing some i/o
#1157
opened Apr 1, 2026 by
Ratheesh1104
Loading…
[fix] AutoQuant: clamp instead of use fp64 in auto quant score
#1156
opened Apr 1, 2026 by
Fridah-nv
Loading…
[NVBug: 6038899] Fix MoE export crash on meta tensors with CPU offload
cherry-pick
After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates
#1155
opened Apr 1, 2026 by
cjluo-nv
Loading…
Include gpu and example tests also in codecov coverage reporting and enable omitted folder coverage
#1154
opened Apr 1, 2026 by
kevalmorabia97
Loading…
Bump the uv group across 6 directories with 5 updates
dependencies
Pull requests that update a dependency file
python:uv
Pull requests that update python:uv code
#1144
opened Mar 30, 2026 by
dependabot
bot
Loading…
Bug fix: disable weight quantizer rotation after weight fold during vLLM fakequant export
#1143
opened Mar 30, 2026 by
kinjalpatel27
Loading…
Add Claude Code plugin marketplace for ModelOpt agent skills
#1141
opened Mar 30, 2026 by
kaix-nv
Loading…
Fix parquet loading crash from datasets version mismatch
#1140
opened Mar 30, 2026 by
yeyu-nvidia
Loading…
2 tasks
[chore]: weekly bump of uv.lock on main (2026-03-30)
#1139
opened Mar 30, 2026 by
github-actions
bot
Loading…
[Speculative Decoding] Refactor EAGLE3 training to YAML-based config and recipe system
#1134
opened Mar 29, 2026 by
h-guo18
Loading…
Add clear instructions for generating quantized ONNX models (fixes #799)
#1129
opened Mar 28, 2026 by
Pritiks23
Loading…
[4/n] Add vLLM integration for modelopt sparse attention
#1127
opened Mar 27, 2026 by
kaix-nv
Loading…
Add DeepSeek MoE detection and export mapping in HF PTQ/export path
#1125
opened Mar 26, 2026 by
Charles-JCJ
Loading…
Refine _extract_layer_prefixes to better handle mtp modules
#1124
opened Mar 26, 2026 by
Edwardf0t1
Loading…
Add custom MoE quantization guide for HuggingFace models
#1118
opened Mar 25, 2026 by
cjluo-nv
Loading…
Add nvfp4_mse and nvfp4_local_hessian options to the ptq script.
cherry-pick
After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates
#1113
opened Mar 24, 2026 by
bkartal-dev
Loading…
Add bypass distillation (blockwise local KD) to puzzletron pipeline
#1111
opened Mar 24, 2026 by
Separius
Loading…
[OMNIML-3776]: add clear docs restrict the model types
#1105
opened Mar 23, 2026 by
shengliangxu
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.