Strengths & Limitations

Balanced assessment

Strengths

  • Achieves state-of-the-art results on coding and scientific benchmarks without custom scaffolds.
  • Integrates multiple tools natively within its reasoning process including web search and Python execution.
  • Supports complex multi-step reasoning across text, code, math, science, and visual inputs.
  • Offers adjustable reasoning effort and full chain-of-thought access for debugging.
  • Available to ChatGPT Plus, Team, and Pro users with API access.

Limitations

  • o3-pro model has slower response times, potentially taking minutes per request.
  • Streaming is not supported for the o3-pro variant.
  • No support for fine-tuning or model distillation.
  • Enterprise ChatGPT access for o3-mini has been delayed.