Instructions to use hermanda/ant-llm-grpo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use hermanda/ant-llm-grpo with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-0.6B") model = PeftModel.from_pretrained(base_model, "hermanda/ant-llm-grpo") - Notebooks
- Google Colab
- Kaggle

- Xet hash:
- 905753fc2a191076e1cf9264494bcdf0916971baef350ecc7e9195d2b9e21167
- Size of remote file:
- 239 kB
- SHA256:
- 123d4201ef13e93194ffebf8c411e914a5aec41cf133b5a381214255938e0dde
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.