forked from NVIDIA/apex
Open
Labels
bug (Something isn't working)
Description
Please see the comment on the PR in which we enabled the --peer_memory and --nccl_p2p extensions: #87 (comment)
Some tests fail sporadically on ROCm when running the following test script:
cd apex/contrib/peer_memory
torchrun --nproc_per_node 2 peer_halo_exchange_module_tests.py
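Because the failures are sporadic, a single run may not reproduce them. Below is a minimal sketch of a helper that repeats a command and records which runs exit with a non-zero status. The function name `count_failures`, the run count, and the timeout are illustrative assumptions and not part of apex; it also assumes that the launcher returns a non-zero exit code when a test fails.

```python
import subprocess
import sys


def count_failures(cmd, runs, cwd=None, timeout=None):
    """Run `cmd` `runs` times and return [(run_index, returncode), ...]
    for every run that did not exit with status 0.

    A run that exceeds `timeout` seconds is recorded with returncode None.
    """
    failures = []
    for i in range(runs):
        try:
            result = subprocess.run(
                cmd,
                cwd=cwd,
                timeout=timeout,
                stdout=subprocess.DEVNULL,  # discard output; only the exit status matters here
                stderr=subprocess.DEVNULL,
            )
            code = result.returncode
        except subprocess.TimeoutExpired:
            code = None  # treat a hang as a failure
        if code != 0:
            failures.append((i, code))
    return failures


# Hypothetical usage for the command in this issue (20 runs and a
# 600-second timeout are arbitrary choices):
#
#   cmd = ["torchrun", "--nproc_per_node", "2",
#          "peer_halo_exchange_module_tests.py"]
#   failed = count_failures(cmd, runs=20,
#                           cwd="apex/contrib/peer_memory", timeout=600)
#   print(f"{len(failed)} of 20 runs failed: {failed}")
```

Reporting the number of failing runs out of the total gives a rough failure rate, which can help when comparing behavior before and after a candidate fix.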