Key Features

What you can do

Parallel Python Code Execution

Scales existing Python code across parallel workers for tasks such as simulations and backtesting, without requiring a rewrite.
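As a rough illustration of the idea, the sketch below fans an unchanged, serial simulation function out over several worker processes. It uses only Python's standard-library `concurrent.futures` as a stand-in and does not show this product's own API; the names `simulate_final_price` and `run_simulations`, and the random-walk model itself, are hypothetical and chosen only for the example.

```python
import random
from concurrent.futures import Executor, ProcessPoolExecutor


def simulate_final_price(seed: int, steps: int = 252) -> float:
    """Simulate one random-walk price path (hypothetical toy model).

    This is ordinary serial code; it knows nothing about parallelism.
    """
    rng = random.Random(seed)  # per-path generator keeps runs reproducible
    price = 100.0
    for _ in range(steps):
        price *= 1.0 + rng.gauss(0.0, 0.01)
    return price


def run_simulations(seeds: list[int], executor: Executor) -> list[float]:
    """Map the unchanged function over the executor's workers, preserving order."""
    return list(executor.map(simulate_final_price, seeds))


if __name__ == "__main__":
    seeds = list(range(8))
    # Standard-library process pool used as a stand-in for a distributed runtime.
    with ProcessPoolExecutor() as pool:
        parallel = run_simulations(seeds, pool)
    serial = [simulate_final_price(s) for s in seeds]
    # Same seeds produce the same results, whether run serially or in parallel.
    assert parallel == serial
    print(f"mean final price over {len(seeds)} paths: {sum(parallel) / len(parallel):.2f}")
```

The simulation function stays the same in both the serial and the parallel run; only the call site changes, which is the property the feature description refers to.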

Multimodal Data and GenAI Workflow Support

Processes multimodal data, including images, video, and audio, and supports end-to-end generative AI workflows such as retrieval-augmented generation (RAG) applications.
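For readers unfamiliar with RAG, the following is a minimal sketch of its retrieval step: rank stored documents by similarity to a query, then place the best match into the prompt sent to a model. Everything here is an assumption made for illustration. The bag-of-words `embed` function stands in for a real embedding model, and `retrieve` and `build_prompt` are hypothetical helpers, not part of this product.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words vector; a stand-in for a real embedding model."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]


def build_prompt(query: str, docs: list[str], k: int = 1) -> str:
    """Assemble the retrieved context and the question into one prompt string."""
    context = "\n".join(retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"
```

In a full workflow the resulting prompt would be passed to a language model; that generation step is omitted here.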

LLM Inference and Fine-Tuning

Runs inference and fine-tuning for any large language model on any accelerator.

Workload Observability and Debugging

Includes profiling tools and a dashboard for distributed debugging and dependency management across nodes.

Fault-Tolerant Cluster Deployment

Offers autoscaling, spot instance management, and cost governance for cluster deployments, improving reliability and reducing costs.