[feat] Support multimodal MTP #14
Conversation
Code Review
This pull request introduces `decoder_input` propagation through the Multi-Token Prediction (MTP) layers and adds a `_get_embeddings` helper method to handle sequence rolling and embedding computation. Several critical issues need to be addressed:

- The `forward` method signature in `patcher.py` is missing the `decoder_input` parameter, which will cause a `NameError`.
- The `MultiTokenPrediction` container must be patched to accept the new argument, or it will raise a `TypeError`.
- The `make_viewless_tensor` function is used without being imported.
- The logic in `_get_embeddings` should preserve pre-computed `decoder_input` values instead of unconditionally overwriting them.
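The last point, preserving a pre-computed `decoder_input`, can be sketched as follows. This is a minimal illustration, not the PR's actual code: the function name, argument names, and the use of NumPy in place of the real tensor library are all assumptions made for the example.

```python
import numpy as np

def get_embeddings(input_ids, embed_fn, decoder_input=None):
    """Sketch (names assumed) of the suggested _get_embeddings fix.

    MTP shifts the target sequence by one token, so the ids are rolled
    along the sequence dimension. The guard below is the point of the
    review comment: only compute embeddings when no pre-computed
    decoder_input was supplied, instead of overwriting it.
    """
    rolled_ids = np.roll(input_ids, shift=-1, axis=1)  # [batch, seq]
    if decoder_input is None:
        # Compute embeddings only when the caller did not pass them in.
        decoder_input = embed_fn(rolled_ids)
    return decoder_input
```

The design point is simply that the `None` check must come before the embedding lookup; unconditionally calling `embed_fn` would discard a multimodal `decoder_input` that was already assembled upstream.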
/gemini review
Code Review
This pull request introduces support for multimodal Multi-Token Prediction (MTP) by modifying the GPT model's forward and post-processing logic to handle combined embedding and decoder inputs. It also adds a `_get_embeddings` method to `MultiTokenPredictionLayer` to manage tensor rolling and sequence parallelism. Review feedback suggests:

- avoiding a potential `IndexError` in `_get_embeddings` by rolling along the sequence dimension directly instead of transposing first;
- improving type safety by adding `Union` to the imports and updating the `embedding` argument's type hint.
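Both suggestions can be illustrated in one small sketch. The function name, argument names, and NumPy usage are assumptions for illustration; the real code operates on framework tensors laid out as `[seq, batch, hidden]`.

```python
from typing import Union
import numpy as np

def mtp_roll(
    hidden: np.ndarray,
    embedding: Union[np.ndarray, None] = None,  # hint widened per the review
) -> np.ndarray:
    """Sketch (names assumed) of the two suggested fixes.

    Rather than transposing to put the sequence axis first, rolling it,
    and transposing back, roll directly along axis 0 (the sequence axis
    of a [seq, batch, hidden] tensor). This avoids the IndexError risk
    the review flags for the transpose-based variant.
    """
    rolled = np.roll(hidden, shift=-1, axis=0)
    if embedding is not None:
        # Combine with a pre-computed embedding when one is provided.
        rolled = rolled + embedding
    return rolled
```

Rolling along the named axis in one step is also cheaper: the transpose pair forces two extra views (and, in some frameworks, copies) for no semantic gain.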
No description provided.