KORMo SFT datasets Collection The SFT datasets for KORMo-10B were collected from diverse, publicly available source. • 5 items • Updated Oct 13 • 4
KORMo midtraining datasets Collection The midtraining datasets for KORMo-10B were collected from diverse, publicly available source. • 7 items • Updated Oct 13 • 3
KORMo pretraining datasets Collection The pretraining datasets for KORMo-10B were collected from diverse, publicly available source. • 14 items • Updated Oct 13 • 20
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation Paper • 2412.10151 • Published Dec 13, 2024 • 7
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean Paper • 2403.10882 • Published Mar 16, 2024 • 6
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment Paper • 2403.11399 • Published Mar 18, 2024 • 6