Runtime error 4 CaselawQA leaderboard (WIP) π 4 Browse and submit evaluations for CaselawQA benchmarks
Running on CPU Upgrade 13.9k Open LLM Leaderboard π 13.9k Track, rank and evaluate open LLMs and chatbots