4 months ago · arXiv
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains