fix contiguous issue by xin3he · Pull Request #1594 · intel/auto-round

xin3he · 2026-03-23T03:26:11Z

Description

When using transformers 5.3.0, auto-round --model_name EleutherAI/gpt-j-6b --bits 4 --iters 0
ValueError: You are trying to save a non contiguous tensor: lm_head.weight which is not allowed. It either means y
ou are trying to save tensors which are reference of each other in which case it's recommended to save only the full
tensors, and reslice at load time, or simply call .contiguous() on your tensor to pack it before saving.

Type of Change

Related Issues

Fixes or relates to #

Checklist Before Submitting

My code has been tested locally.
Documentation has been updated as needed.
New or updated tests are included where applicable.

Signed-off-by: Xin He <xin3.he@intel.com>

for more information, see https://pre-commit.ci

Copilot

Pull request overview

Fixes a safetensors serialization error encountered with Transformers 5.3.0 where non-contiguous tensors (e.g., lm_head.weight) cannot be saved, impacting auto-round model export.

Changes:

Ensure tensors are contiguous before calling safetensors.torch.save_file during shard flush.
Keep the torch.save serialization path unchanged.

auto_round/compressors/shard_writer.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

xin3he · 2026-03-23T07:56:41Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-03-23T07:56:51Z

Azure Pipelines successfully started running 1 pipeline(s).

xin3he · 2026-03-23T09:06:47Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-03-23T09:06:57Z

Azure Pipelines successfully started running 1 pipeline(s).

fix contiguous issue

29378b0

Signed-off-by: Xin He <xin3.he@intel.com>

xin3he requested review from Kaihui-intel, XuehaoSun, Copilot and lvliang-intel March 23, 2026 03:26

[pre-commit.ci] auto fixes from pre-commit.com hooks

455a756

for more information, see https://pre-commit.ci

Copilot started reviewing on behalf of xin3he March 23, 2026 03:26 View session

Copilot AI reviewed Mar 23, 2026

View reviewed changes

auto_round/compressors/shard_writer.py Outdated Show resolved Hide resolved

auto_round/compressors/shard_writer.py Outdated Show resolved Hide resolved

Update auto_round/compressors/shard_writer.py

a4d61ff

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

chensuyue added this to the 0.12.0 milestone Mar 23, 2026

lvliang-intel approved these changes Mar 23, 2026

View reviewed changes

Update requirements_vllm.txt

cc18a55

xin3he merged commit 4689e93 into main Mar 23, 2026
40 checks passed

xin3he deleted the xinhe/3-23 branch March 23, 2026 11:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix contiguous issue#1594

fix contiguous issue#1594
xin3he merged 4 commits intomainfrom
xinhe/3-23

xin3he commented Mar 23, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

xin3he commented Mar 23, 2026

Uh oh!

azure-pipelines bot commented Mar 23, 2026

Uh oh!

xin3he commented Mar 23, 2026

Uh oh!

azure-pipelines bot commented Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

xin3he commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Related Issues

Checklist Before Submitting

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

xin3he commented Mar 23, 2026

Uh oh!

azure-pipelines bot commented Mar 23, 2026

Uh oh!

xin3he commented Mar 23, 2026

Uh oh!

azure-pipelines bot commented Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

xin3he commented Mar 23, 2026 •

edited

Loading