Conversation
Signed-off-by: Xin He <xin3.he@intel.com>
for more information, see https://pre-commit.ci
|
|
Thank you for the checking. @XuehaoSun |
Signed-off-by: Xin He <xin3.he@intel.com>
|
It's more complex than the original expectation. since it's an omni model, more time is needed to enable it. |
|
|
Signed-off-by: Xin He <xin3.he@intel.com>
|
@XuehaoSun Please run with |
There was a problem hiding this comment.
Pull request overview
Adds support for the meituan-longcat/LongCat-Next MLLM family by introducing a dedicated longcat_next processor/template and small loader behavior tweaks to avoid chat-template related failures.
Changes:
- Register a new
longcat_nextMLLM template and processor. - Adjust chat-template handling to avoid calling
apply_chat_templatewhen no template is present. - Add LongCat-specific tokenizer loading behavior (
fix_mistral_regex) and additional calibration/block-selection tweaks.
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.
Show a summary per file
| File | Description |
|---|---|
| auto_round/utils/model.py | Adds LongCat-related tokenizer loading flag and (per diff) updates multimodal block discovery behavior. |
| auto_round/envs.py | Adds a new environment toggle for calibration device behavior. |
| auto_round/compressors/mllm/template.py | Registers the new longcat_next template. |
| auto_round/compressors/mllm/processor.py | Adds LongCatNextProcessor and tightens chat-template checks. |
| auto_round/compressors/base.py | Adds (per diff) an env-controlled override for GPU vs CPU calibration caching behavior. |
Signed-off-by: Xin He <xin3.he@intel.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
|
/azp run Unit-Test-CUDA-AutoRound |
|
Azure Pipelines successfully started running 1 pipeline(s). |
|
/azp run Unit-Test-CUDA-AutoRound |
|
Azure Pipelines successfully started running 1 pipeline(s). |
Signed-off-by: Xin He <xin3.he@intel.com>
Description
ValueError: Cannot use apply_chat_template because this processor does not have a chat template.
To reproduce:
auto-round /storage/xinhe/meituan-longcat/LongCat-Next/Type of Change
Related Issues
Fixes or relates to #
Checklist Before Submitting