- Collects problems shortly after contests to avoid training data contamination.
- Includes 1055 problems across difficulty levels in the latest release.
- Evaluates multiple coding capabilities beyond code generation.
- Provides time-annotated problems for testing model generalization.