Skip to content

Add Qwen3.5 0.8B and 2B variants#115

Open
Lokimorty wants to merge 6 commits intomainframecomputer:mainfrom
Lokimorty:add-qwen35
Open

Add Qwen3.5 0.8B and 2B variants#115
Lokimorty wants to merge 6 commits intomainframecomputer:mainfrom
Lokimorty:add-qwen35

Conversation

@Lokimorty
Copy link
Copy Markdown

@Lokimorty Lokimorty commented Mar 3, 2026

Adds two new Qwen3.5 model options to the in-app model catalog: 0.8B 4bit and 2B 4bit. Updates include model definitions, model list wiring, reasoning classification, and memory size metadata used by onboarding filters. README supported-model links are updated to match the new options. This also switches package wiring from mlx-swift-examples to mlx-swift-lm (and aligns swift-transformers to 1.1.x) so qwen3_5 model_type is supported at runtime instead of failing with unsupportedModelType.

@Lokimorty Lokimorty marked this pull request as ready for review March 3, 2026 00:52
@missuo
Copy link
Copy Markdown

missuo commented Mar 3, 2026

Your PR has issues. Did you try to build and use the model? It will report an error: Failed: unsupportedModelType("qwen3_5")

@Lokimorty Lokimorty marked this pull request as draft March 3, 2026 12:25
@missuo
Copy link
Copy Markdown

missuo commented Mar 3, 2026

I've tested it, and the 9B model works. Can you add the 9B model?

@Lokimorty
Copy link
Copy Markdown
Author

Adding just the model entries was not enough for Qwen3.5 to work at runtime. To support them end-to-end, I switched package from mlx-swift-examples to mlx-swift-lm for MLXLLM/MLXLMCommon. With the old package path, runtime failed with: unsupportedModelType("qwen3_5")

In the initial commit I scoped it to model list changes assuming that was a known issue.

New commits are 2 dependency fixes:

  1. Bump swift-transformers to 1.1.x to satisfy the mlx-swift-lm dependency graph.
  2. Add direct mlx-swift / MLXRandom linkage because LLMEvaluator imports MLXRandom; without explicit linkage, Xcode fails with missing module dependency errors.
  3. Also added 9B model by missuo's request

So the extra changes are to unblock build + runtime support for the new Qwen3.5 variants.

@missuo
Copy link
Copy Markdown

missuo commented Mar 3, 2026

  1. Bump swift-transformers to 1.1.x to satisfy the mlx-swift-lm dependency graph.

You should remove the swift-transformers dependency, as mlx-swift-lm already includes it.

@Lokimorty Lokimorty marked this pull request as ready for review March 3, 2026 15:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants