NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 326
Star 2.3k

Code
Issues 67
Pull requests 126
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 30 Milestones 0

New pull request New

126 Open 688 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix[bug] ONNX models generated by llm_export.py are missing some i/o

#1157 opened Apr 1, 2026 by Ratheesh1104

Loading…

[fix] AutoQuant: clamp instead of use fp64 in auto quant score

#1156 opened Apr 1, 2026 by Fridah-nv

Loading…

[NVBug: 6038899] Fix MoE export crash on meta tensors with CPU offload cherry-pick

After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates

#1155 opened Apr 1, 2026 by cjluo-nv

Loading…

Include gpu and example tests also in codecov coverage reporting and enable omitted folder coverage

#1154 opened Apr 1, 2026 by kevalmorabia97

Loading…

fix spelling errors

#1153 opened Apr 1, 2026 by noeyy-mino

Loading…

Intermediate checkpointing for sequential calibration

#1152 opened Mar 31, 2026 by sugunav14 • Draft

Bump the uv group across 6 directories with 5 updates dependencies

Pull requests that update a dependency file

python:uv

Pull requests that update python:uv code

#1144 opened Mar 30, 2026 by dependabot bot

Loading…

Bug fix: disable weight quantizer rotation after weight fold during vLLM fakequant export

#1143 opened Mar 30, 2026 by kinjalpatel27

Loading…

Add Claude Code plugin marketplace for ModelOpt agent skills

#1141 opened Mar 30, 2026 by kaix-nv

Loading…

Fix parquet loading crash from datasets version mismatch

#1140 opened Mar 30, 2026 by yeyu-nvidia

Loading…

2 tasks

[chore]: weekly bump of uv.lock on main (2026-03-30)

#1139 opened Mar 30, 2026 by github-actions bot

Loading…

[Speculative Decoding] Refactor EAGLE3 training to YAML-based config and recipe system

#1134 opened Mar 29, 2026 by h-guo18

Loading…

Add Agent Deployment skill for model serving

#1133 opened Mar 28, 2026 by kaix-nv

Loading…

Add Agent Evaluation skill for accuracy benchmarking

#1132 opened Mar 28, 2026 by kaix-nv

Loading…

Add clear instructions for generating quantized ONNX models (fixes #799)

#1129 opened Mar 28, 2026 by Pritiks23

Loading…

add: DFlash block diffusion speculative decoding

#1128 opened Mar 27, 2026 by ChenhanYu

Loading…

[4/n] Add vLLM integration for modelopt sparse attention

#1127 opened Mar 27, 2026 by kaix-nv

Loading…

Add DeepSeek MoE detection and export mapping in HF PTQ/export path

#1125 opened Mar 26, 2026 by Charles-JCJ

Loading…

Refine _extract_layer_prefixes to better handle mtp modules

#1124 opened Mar 26, 2026 by Edwardf0t1

Loading…

Merge puzzletron compression algorithm

#1121 opened Mar 25, 2026 by danielkorzekwa

Loading…

Add custom MoE quantization guide for HuggingFace models

#1118 opened Mar 25, 2026 by cjluo-nv

Loading…

Add nvfp4_mse and nvfp4_local_hessian options to the ptq script. cherry-pick

After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates

#1113 opened Mar 24, 2026 by bkartal-dev

Loading…

Add bypass distillation (blockwise local KD) to puzzletron pipeline

#1111 opened Mar 24, 2026 by Separius

Loading…

Add Agent PTQ skill for model quantization

#1107 opened Mar 24, 2026 by mxinO

Loading…

[OMNIML-3776]: add clear docs restrict the model types

#1105 opened Mar 23, 2026 by shengliangxu

Loading…

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!