Uploaded finetuned model
- Developed by: koutch
- License: apache-2.0
- Finetuned from model: unsloth/SmolLM3-3B
This SmolLM3 model was trained 2x faster with Unsloth and Hugging Face's TRL library.
Model tree for koutch/short_paper_smol_2.json_train_dpo_v2_train_no_think
- Base model: HuggingFaceTB/SmolLM3-3B-Base
- Finetuned: HuggingFaceTB/SmolLM3-3B
- Finetuned: unsloth/SmolLM3-3B