Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2603.10165

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

about 7 hours ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 94
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
Very Large-Scale Multi-Agent Simulation in AgentScope

Paper • 2407.17789 • Published Jul 25, 2024 • 39

AI safety/Neptune

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published Feb 19 • 3
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published 24 days ago • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published 25 days ago
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 22 days ago • 188

about 8 hours ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 7 days ago • 127

about 19 hours ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

Paper • 2603.12228 • Published 12 days ago • 12
Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 47
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Paper • 2410.16144 • Published Oct 21, 2024 • 5

In-Context Reinforcement Learning for Tool Use in Large Language Models

Paper • 2603.08068 • Published 16 days ago • 41
OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Paper • 2603.03790 • Published 21 days ago • 121

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 202
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 275
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 283

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

about 7 hours ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 94
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

about 8 hours ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 7 days ago • 127

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
Very Large-Scale Multi-Agent Simulation in AgentScope

Paper • 2407.17789 • Published Jul 25, 2024 • 39

about 19 hours ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights

Paper • 2603.12228 • Published 12 days ago • 12
Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 47
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Paper • 2410.16144 • Published Oct 21, 2024 • 5

AI safety/Neptune

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139

In-Context Reinforcement Learning for Tool Use in Large Language Models

Paper • 2603.08068 • Published 16 days ago • 41
OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 14 days ago • 139
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Paper • 2603.03790 • Published 21 days ago • 121

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published Feb 19 • 3
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published 24 days ago • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published 25 days ago
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 22 days ago • 188

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 202
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 275
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 283

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs