7 12 5

weize

weizechen

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

updated a model about 2 months ago

weizechen/RL-Compositionality-Stage-1-Model

updated a dataset about 2 months ago

weizechen/RL-Compositionality-Stage2-RL-Level8-TestData

View all activity

Organizations

upvoted a paper 24 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 25 days ago • 132

updated a model about 2 months ago

weizechen/RL-Compositionality-Stage-1-Model

8B • Updated Oct 17 • 12 • 1

updated 4 datasets about 2 months ago

updated a collection about 2 months ago

RL Compositionality

Collection

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated Oct 17 • 1

published a model about 2 months ago

weizechen/RL-Compositionality-Stage-1-Model

8B • Updated Oct 17 • 12 • 1

updated a collection about 2 months ago

RL Compositionality

Collection

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated Oct 17 • 1

published 4 datasets about 2 months ago

weizechen/RL-Compositionality-Stage2-RL-Level8-TestData

Viewer • Updated Oct 17 • 2.05k • 33 • 1

weizechen/RL-Compositionality-Stage2-RL-Level2-TrainData

Viewer • Updated Oct 17 • 500k • 41 • 1

weizechen/RL-Compositionality-Stage2-RL-Level1-TrainData

Viewer • Updated Oct 17 • 500k • 39 • 1

weizechen/RL-Compositionality-Stage1-RFT-Data

Viewer • Updated Oct 17 • 118k • 53 • 1

upvoted a paper 2 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29 • 20

commented a paper 2 months ago

From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29 • 20 •

authored a paper 3 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

liked a model 3 months ago

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19 • 2.31k • 775

weize

AI & ML interests

Recent Activity

Organizations

weizechen's activity