A powerful Gradio web application for downloading, resharding, and re-uploading large Hugging Face models, with built-in optimizations for large Vision-Language (VL) models such as the Qwen family. This intuitive tool is designed to help engineers and researchers easily manage unwieldy model weights by breaking them into smaller, more manageable shards and pushing them directly to either a private or public Hugging Face repository—all from a clean UI.
It also helps migrate checkpoints: files saved with older Transformers versions can be converted into the formats expected by newer releases.
- Interactive UI: Fully built with Gradio, providing an easy-to-use visual interface to configure your source model, target repository, and architecture.
- Automated Resharding: Dynamically specify your desired `shard_size` (e.g., `4.4GB` or `2GB`) directly in the UI to optimize for different hardware constraints or storage limitations.
- Hardware Acceleration: Automatically detects CUDA availability and uses GPU acceleration for faster model loading and processing.
- Supported Architectures: Built-in support for Qwen3.5, Qwen3-VL, Qwen2.5-VL, and Qwen2-VL architectures.
- Dependency Isolation: Fully compatible with the `uv` package manager.
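Shard-size strings such as `4.4GB` follow the same convention Transformers uses for its `max_shard_size` argument: a decimal number followed by an SI unit. As a rough illustration of how such a string maps to a byte budget, here is a hypothetical helper (not code from `src/app.py`):

```python
# Hypothetical helper: convert a human-readable shard size such as "4.4GB"
# into a byte count, mirroring the string format Transformers accepts for
# max_shard_size (SI units: 1 GB = 10**9 bytes).
UNITS = {"KB": 10**3, "MB": 10**6, "GB": 10**9, "TB": 10**12}

def parse_shard_size(size: str) -> int:
    """Parse strings like '2GB' or '4.4GB' into a number of bytes."""
    size = size.strip().upper()
    for unit, factor in UNITS.items():
        if size.endswith(unit):
            return int(float(size[: -len(unit)]) * factor)
    # No unit suffix: treat the value as raw bytes.
    return int(size)

print(parse_shard_size("4.4GB"))  # → 4400000000
print(parse_shard_size("2GB"))    # → 2000000000
```

Smaller shard sizes produce more files but make each download/upload unit easier to handle on constrained disks and flaky connections.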
The application currently supports the following Hugging Face model architectures (extensible in src/app.py):
- `Qwen3_5ForConditionalGeneration` (Qwen 3.5)
- `Qwen3VLForConditionalGeneration` (Qwen 3 VL)
- `Qwen2_5_VLForConditionalGeneration` (Qwen 2.5 VL)
- `Qwen2VLForConditionalGeneration` (Qwen 2 VL)
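One way such an extensible list can be kept is a simple label-to-class-name mapping that is resolved against the installed `transformers` at call time, so missing architectures fail with a clear message rather than an `ImportError`. This is a hypothetical sketch of what a registry in `src/app.py` could look like, not the actual implementation:

```python
# Hypothetical registry: UI dropdown label -> Transformers class name.
ARCHITECTURES = {
    "Qwen 3.5": "Qwen3_5ForConditionalGeneration",
    "Qwen 3 VL": "Qwen3VLForConditionalGeneration",
    "Qwen 2.5 VL": "Qwen2_5_VLForConditionalGeneration",
    "Qwen 2 VL": "Qwen2VLForConditionalGeneration",
}

def resolve_architecture(label: str):
    """Look up the model class lazily from the installed transformers."""
    import transformers  # imported here so the mapping itself has no heavy deps

    cls_name = ARCHITECTURES[label]
    cls = getattr(transformers, cls_name, None)
    if cls is None:
        raise ValueError(
            f"{cls_name} is not available in transformers "
            f"{transformers.__version__}; upgrade the library to use it."
        )
    return cls
```

Adding support for a new architecture would then amount to adding one dictionary entry, provided the installed Transformers version ships the class.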
- Python: Version 3.10 or higher.
- uv: Recommended for fast, reliable dependency management.
- Hugging Face Token: A valid token with write access (`hf_...`) is required to create repositories and upload the sharded models.
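Before kicking off a long upload, it can be worth a cheap sanity check that the supplied string at least looks like a Hugging Face token; actually confirming write access requires an authenticated call such as `HfApi(token=token).whoami()` from `huggingface_hub`. A minimal, hypothetical format check:

```python
# Hypothetical pre-flight check: user access tokens issued by Hugging Face
# start with the "hf_" prefix. This validates only the format; it does not
# prove the token is live or has write access.
def looks_like_hf_token(token: str) -> bool:
    token = token.strip()
    return token.startswith("hf_") and len(token) > len("hf_")

print(looks_like_hf_token("hf_abc123"))    # True: plausible token format
print(looks_like_hf_token("my-password"))  # False: rejected before any upload starts
```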
- Clone the repository:

  ```bash
  git clone https://github.com/PRITHIVSAKTHIUR/model.resharder-transformers.git
  cd model.resharder-transformers
  ```

- Install dependencies. If using `uv` (recommended):

  ```bash
  uv sync
  ```
You can start the Gradio server directly using `uv` or standard Python:

```bash
uv run python src/app.py
```

The application will launch on your local network (typically `http://127.0.0.1:7860`).
- Open the UI in your browser.
- In the configuration panel, enter the Original Model Name (e.g., `Qwen/Qwen3-VL-2B-Instruct`).
- Enter the New Repository ID where you want the resharded model saved (e.g., `your-username/Qwen3-VL-2B-Sharded`).
- Enter your Hugging Face Write Token.
- Set the Max Shard Size to your preferred split limit (e.g., `4.4GB`).
- Select the correct Model Architecture from the dropdown menu.
- Click Shard & Upload Model.
- Watch the Process Logs panel for the final output once the operation completes.
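The workflow behind the Shard & Upload Model button can be sketched roughly as follows. All names here (`reshard_and_upload`, its parameters, the local folder, the concrete model class) are illustrative assumptions, not the actual code in `src/app.py`:

```python
# Hypothetical sketch of the end-to-end shard-and-upload flow.
def reshard_and_upload(source_repo: str, new_repo_id: str, token: str,
                       max_shard_size: str = "4.4GB",
                       work_dir: str = "resharded") -> None:
    # Heavy imports kept local so the sketch can be read without
    # torch/transformers installed.
    import torch
    from huggingface_hub import HfApi
    from transformers import Qwen2VLForConditionalGeneration  # the UI substitutes the selected class

    # 1. Download and load the original checkpoint, on GPU when available.
    model = Qwen2VLForConditionalGeneration.from_pretrained(
        source_repo,
        torch_dtype="auto",
        device_map="auto" if torch.cuda.is_available() else None,
    )

    # 2. Re-save locally; save_pretrained splits the weights into files of
    #    at most max_shard_size and writes the matching index file.
    model.save_pretrained(work_dir, max_shard_size=max_shard_size)

    # 3. Create the destination repo (a no-op if it already exists) and
    #    push the sharded folder.
    api = HfApi(token=token)
    api.create_repo(repo_id=new_repo_id, exist_ok=True)
    api.upload_folder(folder_path=work_dir, repo_id=new_repo_id)
```

A complete implementation would also save and upload the tokenizer/processor files alongside the weights so the new repository is directly loadable.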
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.