sizhe's picture

8

sizhe

sizhe04

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 43

upvoted 2 papers 4 months ago

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29, 2025 • 30

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

upvoted a paper 5 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

upvoted 2 papers 6 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 130

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238

upvoted a paper 8 months ago

The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation

Paper • 2505.18759 • Published May 24, 2025 • 14

upvoted a paper about 1 year ago

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Paper • 2411.16594 • Published Nov 25, 2024 • 39