This collection contains datasets and models related to "BLEUBERI: BLEU is a surprisingly effective reward for instruction following".
Yapei Chang PRO
yapeichang
AI & ML interests
NLP
Recent Activity
updated
a model
2 days ago
yapeichang/memo-32b-4tier
published
a model
2 days ago
yapeichang/memo-32b-4tier