孙子轩

matthewonssd5s

·

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

AimeeBingmouQu/ProtectBirds

liked a dataset 13 days ago

legacy-datasets/wikipedia

upvoted a paper 21 days ago

MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training

View all activity

Organizations

None yet

upvoted a paper 21 days ago

MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training

Paper • 2606.08788 • Published 26 days ago • 4

upvoted 3 papers about 1 month ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published Jun 1 • 236

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution

Paper • 2605.18401 • Published May 18 • 130

upvoted 2 papers about 2 months ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 274

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

upvoted a paper 2 months ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published Apr 30 • 59

upvoted 6 papers 3 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 638

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models

Paper • 2603.21460 • Published Mar 23 • 6

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

upvoted 2 papers 4 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248