Official library for AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
agent-reward-bench has limited data (2/6 signals) — verify manually before use
Get this data programmatically — free, no authentication.
curl https://depscope.dev/api/check/pypi/agent-reward-benchLast updated · 2025-07-11T10:09:28.086032Z