Liang Feng's picture

2

Liang Feng

lightslightusc

·

AI & ML interests

LLM and VLM

Organizations

upvoted 2 papers 2 months ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 75

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published Nov 5, 2025 • 12