Running 16 Defeating the trainer-generator precision mismatch in TRL 🎯 16 Download research PDF (Pro access required)
view post Post 723 Big update to llm-datasets, my curated list of datasets and tools for post-training LLMs.> Added many new datasets> New "thinking" column> Refreshed recommended tools.Thanks to everyone who told me they used it for their research at ICLR, you motivated this update! See translation 2 replies · 👀 2 2 🤗 1 1 👍 1 1 + Reply
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 137
Running Featured 51 LFM2.5-VL-450M WebGPU 📹 51 Live video captioning and object tracking in your browser