Key Features

✨

Enables building diverse reinforcement learning algorithms by constructing dataflows in a few lines of code using a hybrid programming model.

✨

Provides modular APIs for integration with existing LLM frameworks such as PyTorch FSDP, Megatron-LM, vLLM, SGLang, and HuggingFace models.

✨

Supports flexible device mapping and parallelism across different GPU sets and cluster sizes to optimize resource utilization.

✨

Leverages state-of-the-art LLM training and inference tools to achieve efficient generation and training throughput.

✨

Uses 3D-HybridEngine for actor model resharding to reduce memory redundancy and communication overhead during training-generation transitions.