Getting Started - tao-squared-bench

1

Run `git clone https://github.com/sierra-research/tau2-bench && cd tau2-bench` to download the source code.

2

Create and activate a Python 3.10+ virtual environment (optional but recommended).

3

Install required packages as specified in the repository setup instructions.

4

Use the command `tau2 env <domain>` and visit `http://127.0.0.1:8004/redoc` to access API documentation.

5

Execute provided scripts to run specific tasks by ID or evaluate agent performance.