Spaces: testingaccc/conflict-arbitration-env
Branch: main
Path: conflict-arbitration-env/training (22.7 kB)
2 contributors
History: 15 commits
Latest commit: ce00c50 by Jeeevan11, 28 days ago: ship submission: judge-aligned README, real training curves from 2000-step run, blog, scripts
File                 Scan  Size     Updated      Last commit
curriculum.py        Safe  2.45 kB  29 days ago  expand dataset: 40 templates, 8 domains, fixed injection edge cases
grpo_trainer.py      Safe  1.53 kB  29 days ago  disable unsloth fast_inference (vllm not installed)
job_entrypoint.sh    Safe  2.18 kB  29 days ago  bump transformers to >=4.50.3 (unsloth needs Qwen3 support)
metrics.py           Safe  2.31 kB  29 days ago  initial commit: conflict arbitration env
prompt_templates.py  Safe  2.38 kB  29 days ago  initial commit: conflict arbitration env
rollout.py           Safe  3.79 kB  29 days ago  optimize for hackathon time budget: 256 tokens, 200-step checkpoints
train.py             Safe  8.1 kB   28 days ago  ship submission: judge-aligned README, real training curves from 2000-step run, blog, scripts