DeepSpeed
Description
Microsoft's open-source optimization library for distributed deep learning training. It combines ZeRO memory optimization, 3D parallelism, and mixed-precision training to make training models of up to trillion-parameter scale efficient.
Key Features
- ZeRO optimizer: Partitions optimizer states, gradients, and parameters across data-parallel GPUs so that each device stores only a shard, sharply reducing per-GPU memory
- 3D parallelism: Combines tensor, pipeline, and data parallelism in a single training run
- Ultra-long sequence training: DeepSpeed-Ulysses sequence parallelism for training on extremely long context sequences
- Mixture of Experts: Built-in DeepSpeed-MoE supports trillion-parameter model training
- Inference acceleration: DeepSpeed Inference optimizes inference performance for Transformer models
- Multi-hardware support: Works with NVIDIA, AMD, and Intel GPUs, and supports CPU training
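To make the ZeRO memory savings concrete, here is a rough back-of-the-envelope sketch of per-GPU model-state memory at each ZeRO stage. It assumes mixed-precision Adam with 2 bytes per parameter for fp16 weights, 2 for fp16 gradients, and 12 for fp32 optimizer states, and it ignores activations, temporary buffers, and fragmentation. The function name `zero_bytes_per_gpu` is hypothetical and is not part of the DeepSpeed API.

```python
def zero_bytes_per_gpu(num_params: int, num_gpus: int, stage: int) -> float:
    """Approximate model-state bytes per GPU under mixed-precision Adam.

    Assumption: 2 bytes/param for fp16 weights, 2 for fp16 gradients, and
    12 for fp32 optimizer states (master weights, momentum, variance).
    """
    params = 2 * num_params
    grads = 2 * num_params
    optim = 12 * num_params

    if stage == 0:
        # Plain data parallelism: every GPU holds a full replica.
        return params + grads + optim
    if stage == 1:
        # Optimizer states are partitioned across GPUs.
        return params + grads + optim / num_gpus
    if stage == 2:
        # Optimizer states and gradients are partitioned.
        return params + (grads + optim) / num_gpus
    if stage == 3:
        # Optimizer states, gradients, and parameters are all partitioned.
        return (params + grads + optim) / num_gpus
    raise ValueError("stage must be 0, 1, 2, or 3")


# Illustrative setup: a 7.5B-parameter model on 64 GPUs.
for stage in range(4):
    gb = zero_bytes_per_gpu(7_500_000_000, 64, stage) / 1e9
    print(f"ZeRO stage {stage}: {gb:.1f} GB per GPU")
```

Under these assumptions, the estimate drops from about 120 GB per GPU with no partitioning to about 1.9 GB at stage 3. Real usage will be higher once activations and buffers are included.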
Quick Start
# Install DeepSpeed
pip install deepspeed
# Verify installation
ds_report
# Run training script with DeepSpeed
deepspeed --num_gpus=4 train_script.py
# Or use DeepSpeed through the PyTorch Lightning integration (install Lightning, then select a DeepSpeed strategy on the Trainer)
pip install pytorch-lightning
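
A training script launched with the `deepspeed` command usually also needs a DeepSpeed configuration. The sketch below builds a minimal one in Python, assuming ZeRO stage 2 with fp16 on the 4 GPUs used above. The helper name `build_ds_config` and the batch-size values are illustrative assumptions; the keys shown are standard DeepSpeed config fields, but check the options for your installed version.

```python
import json


def build_ds_config(micro_batch_per_gpu: int, grad_accum_steps: int,
                    num_gpus: int, zero_stage: int = 2) -> dict:
    """Build a minimal DeepSpeed config dict (hypothetical helper).

    DeepSpeed expects train_batch_size to equal
    micro_batch_per_gpu * grad_accum_steps * num_gpus.
    """
    return {
        "train_batch_size": micro_batch_per_gpu * grad_accum_steps * num_gpus,
        "train_micro_batch_size_per_gpu": micro_batch_per_gpu,
        "gradient_accumulation_steps": grad_accum_steps,
        "zero_optimization": {"stage": zero_stage},
        "fp16": {"enabled": True},
    }


# Illustrative values only; tune them for your model and hardware.
config = build_ds_config(micro_batch_per_gpu=8, grad_accum_steps=2, num_gpus=4)
print(json.dumps(config, indent=2))
```

The printed JSON can be saved to a file such as `ds_config.json` (a name chosen here for illustration). Either the file path or the dict itself can then be passed to `deepspeed.initialize` through its `config` argument inside the training script.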