
Support MXINT4 scheme#1666

Open
mengniwang95 wants to merge 7 commits into main from mengni/mx_int4

Conversation

@mengniwang95
Contributor

Description

Support MXINT4 scheme
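For readers unfamiliar with the MX format family: MXINT4 quantizes weights in blocks (group size 32 in this PR, per the "w4g32" model name), sharing one power-of-two scale per block with 4-bit signed integer elements. The following is a minimal illustrative sketch under those assumptions, not the actual auto-round implementation; the function names and the exact rounding/scale-selection policy here are hypothetical.

```python
import numpy as np

def mxint4_quantize(x, block=32):
    """Sketch of block-wise INT4 quantization with a shared power-of-two scale."""
    x = np.asarray(x, dtype=np.float64).reshape(-1, block)
    amax = np.abs(x).max(axis=1, keepdims=True)
    # Pick the smallest power-of-two scale that maps amax into the int4 range [-8, 7].
    exp = np.where(amax > 0, np.ceil(np.log2(amax / 7)), 0.0)
    scale = 2.0 ** exp
    q = np.clip(np.round(x / scale), -8, 7)  # 4-bit signed integer codes
    return q, scale

def mxint4_dequantize(q, scale):
    """Reconstruct the (lossy) floating-point values from codes and block scales."""
    return q * scale
```

Round-trip error per element is bounded by half the block's scale, which is the usual trade-off of sharing one coarse scale across 32 values.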

How to use:

model quantization:

CUDA_VISIBLE_DEVICES=0 auto-round --model /models/Llama-3.2-3B/ --scheme MXINT4 --iters 0 --format auto_round

inference with transformers:

from transformers import AutoModelForCausalLM, AutoTokenizer

# Path produced by the quantization command above
model_name = "tmp_autoround/Llama-3.2-3B-mxint-w4g32/"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

# Tokenize a prompt and move it to the model's device
input_ids = tokenizer("Hello my name is", return_tensors="pt").input_ids.to(
    model.device
)
output = model.generate(input_ids, max_new_tokens=100)
print(tokenizer.decode(output[0]))

Type of Change

  • [ ] Bug fix
  • [x] New feature
  • [ ] Documentation update
  • [ ] Performance improvement
  • [ ] Code refactoring
  • [ ] Other (please specify):

mengniwang95 and others added 5 commits April 7, 2026 17:33
Signed-off-by: Mengni Wang <mengni.wang@intel.com>
