Qwen2.5-7B Full SFT Multi-hop
This model was fine-tuned using SFT on multi-hop tool-use tasks.
Training Details
- Base Model: Qwen/Qwen2.5-7B-Instruct
- Training Method: Supervised Fine-Tuning (SFT)
- Task: Multi-hop tool-use (3-6-9 hop)
- Checkpoint: Step 306
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("Anna4242/qwen25-7b-full-sft-multihop")
tokenizer = AutoTokenizer.from_pretrained("Anna4242/qwen25-7b-full-sft-multihop")
- Downloads last month
- 12
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support