Getting Started - terminal-bench-20

1

Run `uv tool install terminal-bench` or `pip install terminal-bench` to install the package.

2

Use the CLI commands `tb` or `tb run` to execute benchmark tasks and evaluate AI agents.

3

Set `use_prebuilt_image=false` in CLI commands or Python evaluation scripts to use custom Docker images.

4

Access the public leaderboard at https://www.tbench.ai/leaderboard/terminal-bench/2.0 to compare agent performance.

5

Follow documentation to add new tasks or adapters by placing files in the tasks folder and submitting a pull request.