← all terms
A widely used software toolkit that helps train very large models.
Microsoft's library for distributed training of large neural networks, including ZeRO sharding, CPU offloading, and gradient accumulation.