causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new Viewer • Updated Sep 21 • 847 • 20
causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new Viewer • Updated Sep 21 • 847 • 20
GraPE: A Generate-Plan-Edit Framework for Compositional T2I Synthesis Paper • 2412.06089 • Published Dec 8, 2024 • 4
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Paper • 2404.16816 • Published Apr 25, 2024 • 3