Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2508.06471

Research Papers/Reviews/Literature

Daily Research papers and review including older relevant content.

about 16 hours ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30, 2025 • 61
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 153
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published Mar 19, 2025 • 46
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18, 2025 • 50

AI Paper of the Day

A collection of papers that I think are interesting, one added each day

about 8 hours ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 24
OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 152
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

zai-org/webglm-qa

Viewer • Updated Jul 12, 2023 • 45k • 376 • 61
zai-org/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 921 • 231
zai-org/CogVLM-SFT-311K

Preview • Updated Dec 26, 2023 • 58 • 51
zai-org/DeepDive

Viewer • Updated Sep 17, 2025 • 4.11k • 1.43k • 25

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 509
zai-org/GLM-4.6

Text Generation • 357B • Updated Sep 30, 2025 • 84.9k • • 1.2k
deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 994k • • 13.1k
deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • Updated Nov 18, 2025 • 51.2k • • 967

gradientai/Llama-3-8B-Instruct-Gradient-1048k

Text Generation • Updated Oct 29, 2024 • 9.02k • 680
Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 93
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published Dec 16, 2024 • 36
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 107

Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 19
Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8
Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 24
GPT-4 Technical Report

Paper • 2303.08774 • Published Mar 15, 2023 • 7

Agentic Reasoning Foundation Models

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Paper • 2507.02825 • Published Jul 3, 2025 • 1

Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

Paper • 2505.10597 • Published May 15, 2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44
nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 4.04k • 97
nvidia/Nemotron-RL-instruction_following

Preview • Updated Jan 12 • 99 • 11

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

Research Papers/Reviews/Literature

Daily Research papers and review including older relevant content.

about 16 hours ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30, 2025 • 61
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 153
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published Mar 19, 2025 • 46
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18, 2025 • 50

gradientai/Llama-3-8B-Instruct-Gradient-1048k

Text Generation • Updated Oct 29, 2024 • 9.02k • 680
Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 93
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published Dec 16, 2024 • 36
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 107

AI Paper of the Day

A collection of papers that I think are interesting, one added each day

about 8 hours ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 24
OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 152
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 19
Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8
Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 24
GPT-4 Technical Report

Paper • 2303.08774 • Published Mar 15, 2023 • 7

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

Agentic Reasoning Foundation Models

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Paper • 2507.02825 • Published Jul 3, 2025 • 1

zai-org/webglm-qa

Viewer • Updated Jul 12, 2023 • 45k • 376 • 61
zai-org/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 921 • 231
zai-org/CogVLM-SFT-311K

Preview • Updated Dec 26, 2023 • 58 • 51
zai-org/DeepDive

Viewer • Updated Sep 17, 2025 • 4.11k • 1.43k • 25

Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

Paper • 2505.10597 • Published May 15, 2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44
nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 4.04k • 97
nvidia/Nemotron-RL-instruction_following

Preview • Updated Jan 12 • 99 • 11

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 509
zai-org/GLM-4.6

Text Generation • 357B • Updated Sep 30, 2025 • 84.9k • • 1.2k
deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 994k • • 13.1k
deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • Updated Nov 18, 2025 • 51.2k • • 967

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

Previous
1
2
3
...
5
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs