Spaces

·

The AI App Directory

New Space Get PRO Learn more

LLM Hallucination Leaderboard

View and filter LLM hallucination leaderboard

VBench Leaderboard

Submit video model evaluation results to a public benchmark

LLM Performance Leaderboard

View LLM performance leaderboard

Image Arena Leaderboard

Image Generation and Image Editing Arena & Leaderboard

Open-LLM performances are plateauing, let’s make the leaderboard steep again

Explore and compare advanced language models on a new leaderboard

PTEB Leaderboard

Persian Text Embedding Benchmark

MMEB Leaderboard

The massive multimodal embedding benchmark

Leaderboard: Physical Reasoning from Video

Submit model evaluations and view leaderboard results

Speech Arena Leaderboard

Text to Speech Arena & Leaderboard

DITING Leaderboard

Explore model performance with interactive radar charts

MADQA Leaderboard

Navigate, retrieve, and reason over PDFs collection.

Ko-FreshQA Leaderboard

Explore, submit, and download Korean QA leaderboard data

DISBench Leaderboard

Submit and view multimodal image‑search benchmark results

Leaderboard - FINAL Bench 'Metacognitive'

Metacognitive

Official Benchmarks Leaderboard 2026

Explore and compare AI model scores across official benchmarks

Open Agent Leaderboard

Explore AI agents' performance leaderboard and efficiency chart

EnterpriseRAG Bench Leaderboard

Explore and compare RAG system performance on a benchmark

MLX Benchmark V2 Leaderboard

Evaluating LLMs on Apple MLX framework

HAKARI-Bench Leaderboard

View model comparison leaderboard for multilingual retrieval

Sally Metabolic LLM Leaderboard

Sally-v1 vs GPT-5, Claude 4.7, Kimi K2.6 (medical)

Vietnamese ASR Leaderboard

Duplicate this leaderboard to initialize your own!

TSDecompose Benchmark Leaderboard

Explore TSDecompose benchmark rankings with interactive tables

WorldScore Leaderboard

View the WorldScore benchmark leaderboard online

Leaderboard