YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)
这个目录下的模型是我利用 https://huggingface.co/wh-zhu/qwen2.5-1.5b-cot 使用 verl 框架, 使用 PPO 算法在 GSM8K 数据集上训练 50 个 step 得到的 Actor 模型.
对应的参数见: https://wandb.ai/bohuang/qwen_2_5_1_5b_cot_PPO/runs/65v9s1wo?nw=nwuserbohuang
- Downloads last month
- 1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support