Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Training Framework

Wenhao Tang^, Heng Fang^, Ge Wu, Xiang Li^†, Ming-Ming Cheng^†

This repository provides the official PyTorch implementation of the PackMIL framework introduced in Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Training Framework.

What is PackMIL? | Usage | Citation

What is PackMIL?

Pack once, batch anywhere – we pack multiple variable-length WSI feature sequences into fixed-length “packs”, enabling true batched slide-level training while preserving slide-level heterogeneity.
Residual branch with hyperslides – discarded patches are reused to form hyperslides that supply multi-slide supervision, reducing feature loss from sampling.
Attention-driven downsampler – compresses redundant features on-the-fly without sacrificing discriminative regions.

Abstract

Computational pathology (CPath) digitizes pathology slides into whole slide images (WSIs), enabling analysis for critical healthcare tasks such as cancer diagnosis and prognosis. However, WSIs possess extremely long sequence lengths (up to 200K), significant length variations (from 200 to 200K), and limited supervision. These extreme variations in sequence length lead to high data heterogeneity and redundancy. Conventional methods often compromise on training efficiency and optimization to preserve such heterogeneity under limited supervision. To comprehensively address these challenges, we propose a pack-based MIL framework. It packs multiple sampled, variable-length feature sequences into fixed-length ones, enabling batched training while preserving data heterogeneity. Moreover, we introduce a residual branch that composes discarded features from multiple slides into a hyperslide which is trained with tailored labels. It offers multi-slide supervision while mitigating feature loss from sampling. Meanwhile, an attention-driven downsampler is introduced to compress features in both branches to reduce redundancy. By alleviating these challenges, our approach achieves an accuracy improvement of up to 8% while using only 12% of the training time in the PANDA(UNI). Extensive experiments demonstrate that focusing data challenges in CPath holds significant potential in the era of foundation models.

TODO

Add the code for the training and testing of the model.
Add the code for the preprocessing of the datasets.
Upload experiment Docker environment.
Upload the extracted features of the datasets (Huggingface, Baidu Netdisk, and ModelScope).
Add the code for the visualization of the results.
Improving README document.

Usage

1. Environment Preparation

We recommend using Docker for a reproducible environment. Alternatively, you can install dependencies via PyPI.

Option 1: Docker (Recommended)

Download the Docker Image from Google Drive or Baidu Netdisk (Password: 2025)
Load the Docker image:
```
docker load -i XXX.tar
```
(Replace XXX.tar with the downloaded file name.)

Run the Docker container:

docker run --gpus all -it --ipc=host --ulimit memlock=-1 --ulimit stack=67108864\
            -v /path/to/your_code:/workspace/code \
            -v /path/to/your_data:/workspace/dataset \
            -v /path/to/your_output:/workspace/output \
            --name packmil \
            --runtime=nvidia \
            -e NVIDIA_VISIBLE_DEVICES=all \
            -e NVIDIA_DRIVER_CAPABILITIES=compute,utility \
            -d packmil:latest /bin/bash

Option 2: PyPI

Create a new Python environment:

conda create -n packmil python=3.9
conda activate packmil

Install the required packages. A complete list of requirements can be found in requirements.txt.
```
pip install -r requirements.txt
```

2. Data Preprocessing

Download Preprocessed Feature

We provide preprocessed patch features for all datasets. You can download them from: Hugginface, ModelScope, Baidu Netdisk (Password: ujtq)

Preprocess Raw Data

If you have raw Whole-Slide Image (WSI) data, you can preprocess it as follows:

Patching (Following CLAM):

python CLAM/create_patches_fp.py --source YOUR_DATA_DIRECTORY \
                                 --save_dir YOUR_RESULTS_DIRECTORY \
                                 --patch_size 256 \
                                 --step_size 256 \
                                 --patch_level 0 \
                                 --preset YOUR_PRESET_FILE \
                                 --seg \
                                 --patch

Replace placeholders like YOUR_DATA_DIRECTORY with your actual paths and parameters. Preset files are officially provided by CLAM.

Feature Extraction (Modify on the official CLAM repository to support the encoders of CHIEF, UNI and GIGAP):

You can also extract all the required features following the process of TRIDENT.

CUDA_VISIBLE_DEVICES=$TARGET_GPUs python CLAM/extract_features_fp.py \
                                    --data_h5_dir DIR_TO_COORDS \
                                    --data_slide_dir DATA_DIRECTORY \
                                    --csv_path CSV_FILE_NAME \
                                    --feat_dir FEATURES_DIRECTORY \
                                    --slide_ext .svs \
                                    --model_name uni_v1/chief/gigap

3. Training

Grading

CUDA_VISIBLE_DEVICES=$TARGET_GPU bash single_train.sh train --datasets=panda --project=your_project --dataset_root=/path/to/your/dataset -c=../config/feat_pack_panda.yaml --title={fm}_panda_abmil_bs128_packdual40_lr10 --csv_path=/path/to/your/label --model=abmil --log_iter=100 --output_path=/path/to/your/output/ --batch_size=128 --token_dropout=0.4 --input_dim=feat_dim --wandb

Sub-typing

CUDA_VISIBLE_DEVICES=$TARGET_GPU bash single_train.sh train --datasets=brca --project=your_project --dataset_root=/path/to/your/dataset -c=../config/feat_pack_brca.yaml --title={fm}_brca_abmil_bs32_packdual60_dsr4_lr05 --csv_path=/path/to/your/label --model=abmil --log_iter=10 --output_path=/path/to/your/output/ --batch_size=32 --token_dropout=0.5 --input_dim=feat_dim --wandb

Survival

CUDA_VISIBLE_DEVICES=$TARGET_GPU bash single_train.sh train --datasets=surv_{dataset} --project=your_project --dataset_root=/path/to/your/dataset -c=../config/feat_surv.yaml --title=surv_{fm}_brca_abmil_bs32_packdual50_dsr4_lr05 --csv_path=/path/to/your/survival_label --model=abmil --log_iter=10 --output_path=/path/to/your/output/ --batch_size=32 --token_dropout=0.5 --input_dim=feat_dim --wandb

Citation

If you find this repository useful for your research, please consider citing our paper:

@misc{tang2025revisitingdatachallengescomputational,
      title={Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Framework}, 
      author={Wenhao Tang and Heng Fang and Ge Wu and Xiang Li and Ming-Ming Cheng},
      year={2025},
      eprint={2509.20923},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2509.20923}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
CLAM		CLAM
config		config
datasets		datasets
engines		engines
modules		modules
scripts		scripts
static		static
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
flops.ipynb		flops.ipynb
main.py		main.py
options.py		options.py
requirements.txt		requirements.txt
train_utils.py		train_utils.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Training Framework

Wenhao Tang^, Heng Fang^, Ge Wu, Xiang Li^†, Ming-Ming Cheng^†

What is PackMIL?

TODO

Usage

1. Environment Preparation

Option 1: Docker (Recommended)

Option 2: PyPI

2. Data Preprocessing

Download Preprocessed Feature

Preprocess Raw Data

3. Training

Citation

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Training Framework

Wenhao Tang*, Heng Fang*, Ge Wu, Xiang Li†, Ming-Ming Cheng†

What is PackMIL?

TODO

Usage

1. Environment Preparation

Option 1: Docker (Recommended)

Option 2: PyPI

2. Data Preprocessing

Download Preprocessed Feature

Preprocess Raw Data

3. Training

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages

Wenhao Tang^, Heng Fang^, Ge Wu, Xiang Li^†, Ming-Ming Cheng^†