56 437

sakuma

sakumaXIII

AI & ML interests

None yet

Recent Activity

upvoted a collection 4 days ago

liked a model 7 days ago

SicariusSicariiStuff/Impish_Bloodmoon_12B_GGUF

liked a model 7 days ago

SicariusSicariiStuff/Impish_Bloodmoon_12B_ARM

Organizations

None yet

upvoted a collection 4 days ago

💧 LFM2.5

Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 19 items • Updated 4 days ago • 61

upvoted a collection 9 days ago

Most of my models - in order

31 items • Updated 20 days ago • 17

upvoted a collection 10 days ago

AI censorship

21 items • Updated 1 day ago • 4

upvoted an article about 1 month ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 570

upvoted 4 collections about 1 month ago

Ministral 3 - Additional Checkpoints

Different formats and Quantized versions of our Ministral 3 family; 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX and 14B/8B/3B Instruct BF16. • 13 items • Updated Dec 2, 2025 • 14

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 139

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 82

Trinity

Collection of Arcee AI models in the Trinity family • 8 items • Updated 30 days ago • 21

upvoted 2 articles about 2 months ago

Article

Easily Build and Share ROCm Kernels with Hugging Face

Nov 17, 2025 • 36

Article

We’re open-sourcing our text-to-image model and the process behind it

Nov 12, 2025 • 76

upvoted a paper about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

upvoted a collection about 2 months ago

RLVE

Models for "RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments" - https://arxiv.org/abs/2511.07317 • 3 items • Updated Nov 12, 2025 • 5

upvoted a paper 2 months ago

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10, 2025 • 45

upvoted a collection 2 months ago

C2S-Scale-Gemma-Models

C2S-Scale Gemma models trained using the Cell2Sentence framework, described in the C2S-Scale paper. • 2 items • Updated Oct 13, 2025 • 12

upvoted 4 articles 2 months ago

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Feb 11, 2025 • 94

Article

Exploring Synthetic Data Generation with DataDreamer

Jan 21, 2025 • 9

Article

Synthetic dataset generation techniques: Self-Instruct

May 15, 2024 • 22

Article

Synthetic data: save money, time and carbon with open source

Feb 16, 2024 • 85

upvoted 2 collections 2 months ago

Dream-Coder 7B

https://hkunlp.github.io/blog/2025/dream-coder • 2 items • Updated Jul 15, 2025 • 6

Dream 7B

https://hkunlp.github.io/blog/2025/dream/ • 2 items • Updated Jul 16, 2025 • 6