-
-
-
-
-
-
Inference Providers
Active filters: grpo
tuyentx/qwen-2.5-3b-r1-countdown
Text Generation
• 3B • Updated
pablo-chocobar/qwen-2.5-3b-r1-countdown
Text Generation
• 3B • Updated
• 1
mradermacher/Qwen2.5-1.5B-Open-R1-GRPO-GGUF
2B • Updated
• 86
Julian-Sheeper/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 0.1B • Updated
• 1
pullpull/qwen-2.5-3b-r1-countdown
Text Generation
• 3B • Updated
• 4
justinj92/Qwen2.5-1.5B-Thinking
Text Generation
• 2B • Updated
• 14
• 4
justinj92/Qwen2.5-1.5B-Thinking-Q8_0-GGUF
2B • Updated
• 3
justinj92/Qwen2.5-1.5B-Thinking-Q5_K_M-GGUF
spinech/qwen2.5-3b-r1-arc-train
Text Generation
• 3B • Updated
• 1
howardzhou/Qwen2.5-3B-Open-R1-GRPO
Text Generation
• 3B • Updated
justinj92/Qwen2.5-1.5B-Thinking-v1.1
Text Generation
• 2B • Updated
• 4
• 2
jainamit/qwen-2.5-3b-r1-countdown
Text Generation
• 3B • Updated
• 1
GitBag/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
• 2
justinj92/Qwen2.5-1.5B-Thinking-v1.1-Q8_0-GGUF
2B • Updated
• 2
justinj92/Qwen2.5-1.5B-Thinking-v1.1-Q5_K_M-GGUF
2B • Updated
• 2
Text Generation
• 8B • Updated
mradermacher/Qwen2.5-1.5B-Thinking-GGUF
2B • Updated
• 250
• 1
mradermacher/DeepSeek-R1-Qwen-2.5-1.5b-GGUF
2B • Updated
• 200
• 1
Text Generation
• Updated
• 14
• peulsilva/reasoning-qwen-epoch0
Text Generation
• 0.5B • Updated
• 1
peulsilva/reasoning-qwen-epoch1
Text Generation
• 0.5B • Updated
spinech/qwen2.5-3b-r1-arc-train-synthetic
Text Generation
• 3B • Updated
peulsilva/reasoning-qwen-epoch2
Text Generation
• 0.5B • Updated
• 2
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
• 8B • Updated
• 2
Text Generation
• 8B • Updated
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
• 2B • Updated
• 2
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
• 2B • Updated
• 6
peulsilva/reasoning-qwen-epoch3
Text Generation
• 0.5B • Updated
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF
8B • Updated
• 148
skzxjus/Qwen2.5-7B-Open-R1-GRPO
Text Generation
• 8B • Updated
• 1