Enhance model retrieval logic in read_evals.py to support dict-based model_args and improve org_and_model extraction; update .gitignore to exclude eval-results files. 5afbc08 djstrong commited on about 10 hours ago
Change ColumnContent dataclass to be immutable by adding frozen=True decorator 0cd768c djstrong commited on 29 days ago
Update requirements.txt to include setuptools and upgrade numpy to version 1.26.0; enhance check_validity.py to recognize 'Qwen3-' model names. 4ce3209 djstrong commited on 29 days ago
Add calc_avg.py for average score calculation and refactor task retrieval in about.py b9262b0 djstrong commited on Mar 25, 2025