Skip to content

Amos-Rai-KEYS/astra-sim

 
 

Repository files navigation

ASTRA-sim

ASTRA-sim is a distributed AI system simulator. It models the end-to-end software and hardware stack of modern AI systems - encompassing workload scheduling, collective communication algorithms, and hardware architectures (compute/memory/network). Through a suite of APIs, it enables plug-and-play of external open/proprietary components for modeling different parts of the AI system. This provides end-to-end multi-fidelity simulation capabilities for aiding in design and deployment of next-generation distributed AI systems.

Overview and Documentation

Here is a concise visual summary of ASTRA-sim, showing its layers and APIs: alt text

For a comprehensive understanding of the tool, and to gain insights into its capabilities, please visit our website.

For information on how to use ASTRA-sim, please visit our Wiki.

ASTRA-sim accepts MLCommons Chakra Execution Traces as workload-layer inputs. For details, please visit Chakra Github.

Releases and Contributions

ASTRA-sim is currently at version 2.0. The previous version, ASTRA-sim 1.0, is available in the ASTRA-sim-1.0 branch.

We encourage community contributions to ASTRA-sim via PRs.

Contact Us

For any questions about using ASTRA-sim, you can email the ASTRA-sim User Mailing List: astrasim-users@googlegroups.com

To join the mailing list, please fill out the following form: https://forms.gle/18KVS99SG3k9CGXm6

We appreciate your interest and support in ASTRA-sim!

About

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • C++ 93.4%
  • Shell 3.2%
  • CMake 2.5%
  • Other 0.9%