Qwen2.5-7B Full SFT Multi-hop

This model was fine-tuned using SFT on multi-hop tool-use tasks.

Training Details

  • Base Model: Qwen/Qwen2.5-7B-Instruct
  • Training Method: Supervised Fine-Tuning (SFT)
  • Task: Multi-hop tool-use (3-6-9 hop)
  • Checkpoint: Step 306

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("Anna4242/qwen25-7b-full-sft-multihop")
tokenizer = AutoTokenizer.from_pretrained("Anna4242/qwen25-7b-full-sft-multihop")
Downloads last month
12
Safetensors
Model size
8B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Anna4242/qwen25-7b-full-sft-multihop

Base model

Qwen/Qwen2.5-7B
Finetuned
(2218)
this model