Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
RioLee 's Collections
ToolRM

ToolRM

updated Nov 19

One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

Upvote
2

  • One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

    Paper • 2510.26167 • Published Oct 30

  • RioLee/ToolRM-Qwen3-4B-Thinking-2507

    Text Generation • 4B • Updated Nov 10 • 16

  • RioLee/ToolPref-Pairwise-30K

    Viewer • Updated Nov 10 • 60k • 148

  • RioLee/TRBench-BFCL

    Viewer • Updated Nov 10 • 11.9k • 41
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs