Wenkai Yang

Keven16

https://keven980716.github.io/

keven980716

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Mixture of Horizons in Action Chunking

upvoted a paper about 1 month ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

commented on a paper about 1 month ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Organizations

None yet

Collections 1

Papers 11

arxiv:2510.14943

arxiv:2505.00662

arxiv:2502.18080

arxiv:2406.11431

Models 14

Keven16/Qwen2.5-7B-LaSeR

8B • Updated Oct 15 • 8

Keven16/OctoThinker-3B-Short-LaSeR

4B • Updated Oct 15 • 5

Keven16/ORZ-7B-LaSeR

8B • Updated Oct 15 • 11 • 1

Keven16/DeepCritic-7B-RL1.5-PRM800K

8B • Updated Jun 25 • 5

Keven16/DeepCritic-7B-RL1.5-Numina

8B • Updated Jun 23 • 3

Keven16/Qwen2.5-32B-TOPS-Iter-DPO-Preview

33B • Updated May 15 • 6

Keven16/Qwen2.5-32B-TOPS

33B • Updated May 15 • 6

Keven16/Qwen2.5-32B-TOPS-Iter-DPO

33B • Updated May 15 • 2

Keven16/Qwen2.5-32B-Tag

33B • Updated May 15 • 2

Keven16/LLaMA3.1-8B-Tag

8B • Updated May 15 • 1

Datasets 4

Keven16/LaSeR_training_data

Viewer • Updated Oct 16 • 104k • 57 • 2

Keven16/TOPS-Data

Preview • Updated Oct 7 • 40

Keven16/DeepCritic-RL-Data

Viewer • Updated May 13 • 55k • 31

Keven16/DeepCritic-4.5K

Preview • Updated May 13 • 14