Examples

Each example is a standalone uv project under examples/ with its own pyproject.toml, rl_app.py, reward.py, and evaluate.py.

  • New to the toolkit? Start with the math agent — the smallest end-to-end example.
  • Building a benchmark harness? Read the OfficeBench or AppWorld example.
  • Designing a custom reward? The migration agent is the clearest template.