weiliu

thinkwee

https://thinkwee.top/about/

AI & ML interests

LLM reasoning, agents

Organizations

None yet

Recent Activity

upvoted a paper 29 days ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published about 1 month ago • 26

upvoted 2 papers 2 months ago

Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States

Paper • 2510.11052 • Published Oct 13 • 51

CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding

Paper • 2509.23379 • Published Sep 27 • 14

upvoted 2 papers 3 months ago

Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance

Paper • 2510.03528 • Published Oct 3 • 17

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

Paper • 2509.00544 • Published Aug 30 • 11

upvoted 3 papers 4 months ago

IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Paper • 2509.06652 • Published Sep 8 • 24

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27 • 33

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158

upvoted a collection 4 months ago

NOVER1

Collection

NOVER-series models for general reasoning • 3 items • Updated Aug 21 • 2

upvoted 2 papers 4 months ago

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10 • 98

Spectrum Projection Score: Aligning Retrieved Summaries with Reader Models in Retrieval-Augmented Generation

Paper • 2508.05909 • Published Aug 8 • 21

upvoted a collection 5 months ago

NOVEReason

Collection

General Reasoning datasets for training the NOVER model • 4 items • Updated Aug 21 • 2

upvoted a paper 7 months ago

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21 • 4

upvoted a paper 10 months ago

Scaling Large-Language-Model-based Multi-Agent Collaboration

Paper • 2406.07155 • Published Jun 11, 2024 • 3

upvoted 2 papers about 1 year ago

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Paper • 2412.07720 • Published Dec 10, 2024 • 31

Autonomous Agents for Collaborative Task under Information Asymmetry

Paper • 2406.14928 • Published Jun 21, 2024 • 2
