💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 19 items • Updated 4 days ago • 61
Ministral 3 - Additional Checkpoints Collection Different formats and Quantized versions of our Ministral 3 family; 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX and 14B/8B/3B Instruct BF16. • 13 items • Updated Dec 2, 2025 • 14
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 139
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 82
Trinity Collection Collection of Arcee AI models in the Trinity family • 8 items • Updated 30 days ago • 21
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 76
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 132
RLVE Collection Models for "RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments" - https://arxiv.org/abs/2511.07317 • 3 items • Updated Nov 12, 2025 • 5
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published Mar 10, 2025 • 45
C2S-Scale-Gemma-Models Collection C2S-Scale Gemma models trained using the Cell2Sentence framework, described in the C2S-Scale paper. • 2 items • Updated Oct 13, 2025 • 12
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 94
view article Article Synthetic data: save money, time and carbon with open source Feb 16, 2024 • 85
Dream-Coder 7B Collection https://hkunlp.github.io/blog/2025/dream-coder • 2 items • Updated Jul 15, 2025 • 6
Dream 7B Collection https://hkunlp.github.io/blog/2025/dream/ • 2 items • Updated Jul 16, 2025 • 6