Spaces

The AI App Directory

MTEB Leaderboard

Embedding Leaderboard

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

UGI Leaderboard

Uncensored General Intelligence Leaderboard

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

MMEB Leaderboard

The massive multimodal embedding benchmark

GAIA Leaderboard

Submit and evaluate models on the GAIA leaderboard

Open ASR Leaderboard

View and request speech models benchmark data

Big Code Models Leaderboard

Explore and compare code generation models on a leaderboard

Agent Leaderboard

Ranking of LLMs for agentic tasks

WorldScore Leaderboard

Explore the WorldScore leaderboard for global world generation benchmarks

Physical AI Bench Leaderboard

Benchmark for Physical AI generation and understanding

HLE Leaderboard for Agents with Tools

Humanity's Last Exam Leaderboard for LLM Agents with Tools

Video Generation Leaderboard

Text to Video and Image to Video Arena & Leaderboard

BrowserGym Leaderboard

Tracks the performance of LLMs, VLMs, and agents on web navigation tasks

Leaderboard: Physical Reasoning from Video

Submit model evaluations and view leaderboard results

DeepResearch Bench

Generate and display a leaderboard

Leaderboard

View and compare telecom LLM benchmarks

Ukrainian LLM Leaderboard

Measuring LLM capabilities to process Ukrainian texts

Leaderboard Interspeech 2026

Submit and compare pronunciation assessment system results

Deep Reinforcement Learning Leaderboard

Display and search reinforcement learning leaderboard data

LLM Hallucination Leaderboard

View and filter LLM hallucination leaderboard

Demo Leaderboard

Hebrew LLM Leaderboard

Explore and submit models in the LLM Benchmark

LLM Performance Leaderboard

View LLM performance rankings