Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

AI & ML interests

Agents, LLMs, Multimodal ML

Recent Activity

upvoted a paper 4 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 4 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

updated a Space 5 days ago

HKBU-NLP/README

upvoted 2 papers 4 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 7 days ago • 133

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 6 days ago • 78

updated a Space 5 days ago

README

upvoted a paper 6 days ago

Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking

Paper • 2601.02669 • Published 15 days ago • 2

authored a paper 11 days ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Paper • 2601.03559 • Published 13 days ago • 12

upvoted a paper 11 days ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Paper • 2601.03559 • Published 13 days ago • 12

liked 2 datasets 21 days ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25, 2025 • 25.7M • 5.38k • 171

ScaleAI/MCP-Atlas

Viewer • Updated Dec 19, 2025 • 500 • 448 • 6

upvoted a paper 22 days ago

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Paper • 2512.22047 • Published 25 days ago • 26

upvoted an article 22 days ago

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Article • Oct 23, 2025 • 145

upvoted a paper about 2 months ago

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24, 2025 • 12

upvoted 2 collections about 2 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 4

Elastic-Reasoning