Scale AI

company

Verified

https://scale.com/

scale_ai

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

pmannam updated a dataset 4 days ago

ScaleAI/SA2_bowlstack0

pmannam published a dataset 4 days ago

ScaleAI/SA2_bowlstack0

bhertz updated a dataset 7 days ago

ScaleAI/dummy_mcp

View all activity

Papers

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

View all Papers

ScaleAI 's datasets 19

ScaleAI/SA2_bowlstack0

Viewer • Updated 4 days ago • 200 • 7

ScaleAI/dummy_mcp

Viewer • Updated 7 days ago • 16 • 46

ScaleAI/PRBench

Viewer • Updated 19 days ago • 1.65k • 825 • 4

ScaleAI/researchrubrics

Viewer • Updated 26 days ago • 101 • 134 • 9

ScaleAI/swe-oec-claude-expert

Viewer • Updated Oct 20 • 1.27k • 62 • 1

ScaleAI/VisualToolBench

Viewer • Updated Oct 17 • 1.19k • 109 • 1

ScaleAI/TutorBench

Viewer • Updated Oct 8 • 1.47k • 193

ScaleAI/SWE-bench_Pro

Viewer • Updated Sep 25 • 731 • 14.7k • 39

ScaleAI/BioRiskEval

Viewer • Updated Sep 19 • 156k • 61

ScaleAI/TutorBench_sample

Viewer • Updated Sep 10 • 30 • 26

ScaleAI/mrt

Updated Aug 28 • 788 • 4

ScaleAI/stc

Updated Aug 6 • 8

ScaleAI/fortress_public

Viewer • Updated Aug 5 • 500 • 3.67k • 2

ScaleAI/MultiNRC

Viewer • Updated Jul 23 • 1.06k • 91 • 3

ScaleAI/EnigmaEval

Viewer • Updated Apr 10 • 1.18k • 63

ScaleAI/gsm1k

Viewer • Updated Apr 1 • 1.21k • 195 • 1

ScaleAI/BrowserART

Viewer • Updated Oct 21, 2024 • 2 • 148 • 7

ScaleAI/mhj

Viewer • Updated Sep 19, 2024 • 1 • 162 • 25

ScaleAI/mhj-wmdp-bio

Viewer • Updated Sep 19, 2024 • 43 • 57