ToolRM Collection One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning • 6 items • Updated Nov 19 • 2