5 8 3

Shixuan Liu

liusx

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

GRACE: Generative Representation Learning via Contrastive Policy Optimization

authored a paper 4 days ago

Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window

authored a paper 4 days ago

Soft Adaptive Policy Optimization

View all activity

Organizations

authored 4 papers 4 days ago

upvoted a paper 4 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 12 days ago • 110

upvoted 2 papers 6 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 15 days ago • 240

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 7 days ago • 78

upvoted a paper 12 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 13 days ago • 33

upvoted a paper 3 months ago

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published Aug 27 • 25

authored 6 papers 4 months ago

TeleChat Technical Report

Paper • 2401.03804 • Published Jan 8, 2024 • 8

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models

Paper • 2503.06260 • Published Mar 8

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

Stable Reinforcement Learning for Efficient Reasoning

Paper • 2505.18086 • Published May 23

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 314

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 314

upvoted a paper 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

upvoted a paper 11 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

updated a model over 1 year ago

Tele-AI/TeleChat-52B

Text Generation • Updated Aug 27, 2024 • 122 • 2

New activity in Tele-AI/TeleChat-52B over 1 year ago

Update config.json

#3 opened over 1 year ago by

shunxing1234

Shixuan Liu

AI & ML interests

Recent Activity

Organizations

liusx's activity

Update config.json