soso

chengliu

soso010816

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

nvidia/OpenCodeReasoning

upvoted a paper 6 months ago

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

liked a dataset 6 months ago

BAAI/JudgeLM-100K

View all activity

Organizations

None yet

liked a dataset about 2 months ago

nvidia/OpenCodeReasoning

Viewer • Updated May 4, 2025 • 753k • 2.99k • 523

upvoted a paper 6 months ago

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Paper • 2410.16184 • Published Oct 21, 2024 • 25

liked a dataset 6 months ago

BAAI/JudgeLM-100K

Preview • Updated Oct 27, 2023 • 81 • 51

liked a Space 6 months ago

FutureBench Leaderboard

🔮

Display and analyze prediction leaderboard data

liked 4 datasets 6 months ago

published a model 8 months ago

chengliu/cogdual-qwen-rl

Updated May 30, 2025

upvoted a collection 10 months ago

RLVR

Collection

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated Mar 31, 2025 • 13

upvoted a paper 10 months ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31, 2025 • 54

liked 2 datasets 10 months ago

inclusionAI/AReaL-boba-Data

Preview • Updated Mar 29, 2025 • 43 • 23

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 301 • 90

liked a dataset 11 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 206 • 215

authored a paper 11 months ago

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18, 2025 • 29

upvoted a paper 11 months ago

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18, 2025 • 29

upvoted a paper over 1 year ago

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24, 2024 • 57

liked a model almost 2 years ago

uer/chinese_roberta_L-4_H-512

Fill-Mask • Updated Aug 30, 2023 • 368 • 11

liked 2 datasets over 2 years ago

allenai/sciq

Viewer • Updated Jan 4, 2024 • 13.7k • 31.3k • 133

UCSD26/medical_dialog

Updated Sep 18, 2023 • 300 • 173

soso

AI & ML interests

Recent Activity

Organizations

chengliu's activity

FutureBench Leaderboard