Conversation
Code Review
This pull request updates the forward method in src/mcore_bridge/patcher.py to include packed_seq_params, which is necessary for handling sequence packing. However, a critical issue was identified where the forward method returns a 3-tuple while the caller expects a single tensor, which will lead to a runtime error.
    input_ids=input_ids,
    position_ids=position_ids,
    embedding=embedding,
    packed_seq_params=packed_seq_params,
The addition of packed_seq_params to the _get_embeddings call is correct and necessary for MTP to properly handle sequence packing (THD format). This ensures that the internal roll_tensor call correctly handles sequence boundaries.
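A minimal sketch of the pattern being approved here, using stub classes rather than real Megatron-Core code: `_get_embeddings` and its parameters come from the diff above, while the stub bodies are illustrative assumptions. The point is simply that `packed_seq_params` must be threaded through the patched `forward` so the embedding path can see packed-sequence (THD) boundaries.

```python
# Hypothetical stub illustrating the propagation of packed_seq_params.
# Only the argument threading mirrors the patch; everything else is a stand-in.

class MTPStub:
    def _get_embeddings(self, input_ids, position_ids, embedding,
                        packed_seq_params=None):
        # In the real code, packed_seq_params carries sequence boundaries
        # (THD format) so the internal roll_tensor call does not shift
        # tokens across packed-sequence edges.
        return {"ids": input_ids, "packed": packed_seq_params}

    def forward(self, input_ids, position_ids, embedding,
                packed_seq_params=None):
        return self._get_embeddings(
            input_ids=input_ids,
            position_ids=position_ids,
            embedding=embedding,
            packed_seq_params=packed_seq_params,  # the argument added by this PR
        )

out = MTPStub().forward([1, 2], [0, 1], None, packed_seq_params="boundaries")
assert out["packed"] == "boundaries"  # forwarded, not dropped
```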
Critical Issue: Note that this patched forward method returns a 3-tuple (hidden_states, input_ids, position_ids) at line 470. However, the caller in GPTModel._postprocess (at src/mcore_bridge/model/gpt_model.py:398) expects a single return value: hidden_states = self.mtp(...). This mismatch will cause a TypeError or ValueError in subsequent operations (like torch.chunk at line 413 of gpt_model.py) because hidden_states will be a tuple instead of a tensor.
Since the caller handles its own label/mask shifting and logging, you should consider updating the return statement at line 470 to only return hidden_states to maintain compatibility with the standard Megatron-Core API and the existing caller logic.
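The mismatch and the suggested fix can be sketched as follows. The function names and the caller pattern are taken from the review; the bodies are hypothetical stand-ins, not the actual `patcher.py` implementation.

```python
# Sketch of the return-shape mismatch. The caller in GPTModel._postprocess
# does roughly `hidden_states = self.mtp(...)` and later chunks the result,
# so a tuple return breaks it.

def patched_mtp_forward(hidden_states, input_ids, position_ids):
    # ... embedding / transformer work elided ...
    # Current patched code: returns a 3-tuple, which the caller then
    # treats as a single tensor (e.g. passes it to torch.chunk) -> error.
    return hidden_states, input_ids, position_ids

def fixed_mtp_forward(hidden_states, input_ids, position_ids):
    # ... same computation ...
    # Suggested fix: return only hidden_states, matching the standard
    # Megatron-Core API and the existing caller logic.
    return hidden_states

assert isinstance(patched_mtp_forward([1.0], [0], [0]), tuple)   # the bug
assert not isinstance(fixed_mtp_forward([1.0], [0], [0]), tuple)  # the fix
```

Since the caller already performs its own label/mask shifting and logging, nothing downstream needs the extra `input_ids`/`position_ids` return values.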