alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Tags: Text Generation, Transformers, PyTorch, Safetensors, English, llama, text-generation-inference, alphaaico, qwen, reasoning, thought, reflection, lite, GRPO, conversational
Dataset: openai/gsm8k
License: apache-2.0
Branch: main, 12.9 GB, 3 contributors, History: 8 commits
Latest commit: Update README.md (53a99dd, verified) by alphaaico, about 1 year ago
File                              Size       Storage  Last commit                                      Updated
.gitattributes                    1.57 kB             Upload tokenizer                                 about 1 year ago
README.md                         5.67 kB             Update README.md                                 about 1 year ago
config.json                       991 Bytes           Trained with Unsloth                             about 1 year ago
generation_config.json            166 Bytes           Trained with Unsloth                             about 1 year ago
model-00001-of-00002.safetensors  4.97 GB    xet      Adding `safetensors` variant of this model (#1)  about 1 year ago
model-00002-of-00002.safetensors  1.46 GB    xet      Adding `safetensors` variant of this model (#1)  about 1 year ago
model.safetensors.index.json      21.9 kB             Adding `safetensors` variant of this model (#1)  about 1 year ago
pytorch_model-00001-of-00002.bin  4.97 GB    xet      Trained with Unsloth                             about 1 year ago
pytorch_model-00002-of-00002.bin  1.46 GB    xet      Trained with Unsloth                             about 1 year ago
pytorch_model.bin.index.json      20.9 kB             Trained with Unsloth                             about 1 year ago
special_tokens_map.json           454 Bytes           Upload tokenizer                                 about 1 year ago
tokenizer.json                    17.2 MB    xet      Upload tokenizer                                 about 1 year ago
tokenizer_config.json             54.7 kB             Upload tokenizer                                 about 1 year ago
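The 12.9 GB repository total can be cross-checked against the per-file sizes listed above. The sketch below is only an illustration: it assumes the page reports decimal units (1 GB = 10^9 bytes), and the names `FILES`, `to_bytes`, and `total_gb` are hypothetical helpers, not part of any Hugging Face API. It suggests the total is roughly double the size of one set of weights because the repository holds the same two shards in both `.safetensors` and `.bin` form.

```python
# Sizes copied verbatim from the file listing.
# Assumption: decimal units (1 kB = 1e3 bytes, 1 MB = 1e6, 1 GB = 1e9).
UNIT_BYTES = {"Bytes": 1, "kB": 1e3, "MB": 1e6, "GB": 1e9}

FILES = {
    ".gitattributes": "1.57 kB",
    "README.md": "5.67 kB",
    "config.json": "991 Bytes",
    "generation_config.json": "166 Bytes",
    "model-00001-of-00002.safetensors": "4.97 GB",
    "model-00002-of-00002.safetensors": "1.46 GB",
    "model.safetensors.index.json": "21.9 kB",
    "pytorch_model-00001-of-00002.bin": "4.97 GB",
    "pytorch_model-00002-of-00002.bin": "1.46 GB",
    "pytorch_model.bin.index.json": "20.9 kB",
    "special_tokens_map.json": "454 Bytes",
    "tokenizer.json": "17.2 MB",
    "tokenizer_config.json": "54.7 kB",
}


def to_bytes(size: str) -> float:
    """Convert a listing string such as '4.97 GB' to a byte count."""
    value, unit = size.split()
    return float(value) * UNIT_BYTES[unit]


def total_gb(names) -> float:
    """Sum the listed sizes of the given files, in decimal gigabytes."""
    return sum(to_bytes(FILES[name]) for name in names) / 1e9


safetensors_shards = [n for n in FILES if n.endswith(".safetensors")]
bin_shards = [n for n in FILES if n.endswith(".bin")]

print(f"safetensors shards: {total_gb(safetensors_shards):.2f} GB")
print(f"PyTorch .bin shards: {total_gb(bin_shards):.2f} GB")
print(f"all files: {total_gb(FILES):.1f} GB")
```

Because the listed sizes are rounded to three significant figures, the sum is approximate and only expected to agree with the page total after rounding.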