siqi zhu's picture

3 13 7

siqi zhu

zsqzz

·

zhusq20

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

GTAlign/Qwen2.5-3B-Medium-110step

updated a model about 1 month ago

GTAlign/Qwen2.5-3B-Full-160step

updated a model about 1 month ago

GTAlign/Qwen2.5-3B-Math-140step

View all activity

Organizations

upvoted a paper about 1 month ago

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Paper • 2510.23595 • Published Oct 27 • 10

upvoted 3 papers about 2 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20 • 121

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published Oct 13 • 28

GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare

Paper • 2510.08872 • Published Oct 10 • 3

upvoted 2 papers 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 314

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16 • 18

upvoted a paper 9 months ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 76

upvoted a paper 10 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 51

upvoted a paper 11 months ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 37

upvoted a collection 12 months ago

Synthetic Data and Self-Improvement

113 items • Updated Sep 26 • 9

upvoted a paper about 1 year ago

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13

upvoted 2 papers over 1 year ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 67

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 20