Reasoning Transfer

classroom

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yuexiang96 authored a paper 11 days ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

yuexiang96 authored a paper 11 days ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

yuexiang96 authored a paper 11 days ago

Simulating Environments with Reasoning Models for Agent Training

View all activity

yuexiang96

authored 4 papers 11 days ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28 • 27

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

Simulating Environments with Reasoning Models for Agent Training

Paper • 2511.01824 • Published Nov 3 • 2

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 19 days ago • 36

Ibisbill

updated a model 3 months ago

ReasoningTransferability/UniReason-Qwen3-14B-think-SFT

Text Generation • 15B • Updated Sep 28 • 13

aaabiao

authored a paper 4 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

Ibisbill

updated 2 models 4 months ago

ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT

Text Generation • 15B • Updated Aug 25 • 17 • 1

ReasoningTransferability/UniReason-Qwen3-14B-RL

Text Generation • 15B • Updated Aug 25 • 27 • 3

aaabiao

updated a dataset 6 months ago

ReasoningTransferability/math_rl_48k

Viewer • Updated Jul 11 • 48.8k • 14

aaabiao

published a dataset 6 months ago

ReasoningTransferability/math_rl_48k

Viewer • Updated Jul 11 • 48.8k • 14

aaabiao

authored a paper 6 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

Ibisbill

updated a dataset 6 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8 • 39.9k • 65 • 4

Ibisbill

published a dataset 6 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8 • 39.9k • 65 • 4

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-RL 6 months ago

Add `library_name` metadata and GitHub link to model card

#1 opened 6 months ago by

nielsr

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-think-SFT 6 months ago

Add library_name and prominent link to GitHub repository

#1 opened 6 months ago by

nielsr

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT 6 months ago

Add library name and GitHub link to model card

#1 opened 6 months ago by

nielsr

Ibisbill

published 3 models 6 months ago

yuexiang96

authored a paper 6 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 39

AI & ML interests

Recent Activity

Team members 4