Auto Parallelism

TAPAS: Fast and Automatic Derivation of Tensor Parallel Strategies for Large Neural Networks

We present a framework that speeds up the derivation of tensor parallel schedules for large neural networks by 160x.
