Can you please suggest how to distribute the model across devices during inference? I did not manage to load the model on a single 40GB GPU.
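
For context, this is the kind of sharded loading I was hoping to use (a minimal sketch, not something I have verified; the checkpoint name and memory limits are placeholders). My understanding is that passing `device_map="auto"` to `from_pretrained` (with `accelerate` installed) splits the weights across the available GPUs and offloads the remainder to CPU RAM:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint; substitute the actual model in question.
model_id = "some-org/some-large-model"

tokenizer = AutoTokenizer.from_pretrained(model_id)

# device_map="auto" (via accelerate) shards the weights across all
# visible GPUs, spilling any remainder to CPU RAM if they don't fit.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision roughly halves memory vs fp32
    device_map="auto",
    # Optional per-device caps, e.g. to leave headroom on each 40GB card:
    # max_memory={0: "38GiB", 1: "38GiB", "cpu": "120GiB"},
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Is this the recommended approach, or should I be using something else (e.g. tensor parallelism) for inference at this scale?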