Global PIQA
A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries.
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
Paper • 2510.24081 • Published Oct 28 • 16
mrlbenchmarks/global-piqa-nonparallel
Viewer • Updated Oct 29 • 11.6k • 4.65k • 27
Multilingual Leaderboards
Leaderboards for languages other than English
La Leaderboard 🌸 • 74
Evaluate open LLMs in the languages of LATAM and Spain.
Open Chinese LLM Leaderboard 🏆 • 123
Explore and submit LLM benchmarks
Open Arabic LLM Leaderboard 🏆 • 169
Track, rank and evaluate open Arabic LLMs and chatbots
OpenLLM French leaderboard 🇫🇷 🥇 • 40
Explore and submit LLM benchmarks