10 8

Nek

Rob1234567

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 7 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published 11 days ago • 66

upvoted 2 articles 11 days ago

Article

What makes good reasoning data

Oct 30 • 34

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30 • 27

upvoted a collection 28 days ago

Gemma 3 Release

28 items • Updated Aug 11 • 549

upvoted a collection 3 months ago

Qwen3Guard

7 items • Updated Sep 30 • 59

liked a model 4 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 4.55M • 4.22k

liked a model 7 months ago

ai-sage/GigaChat-20B-A3B-instruct

Text Generation • 21B • Updated Jun 25 • 718 • 49

upvoted a paper 7 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 121

liked a dataset 7 months ago

logicreasoning/logi_glue

Viewer • Updated Oct 31, 2023 • 356k • 3.33k • 4

liked a model 7 months ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 7.84k • 1.22k

upvoted a collection 7 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666

liked a model 7 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26 • 7.59M • 854

upvoted a collection 8 months ago

LiveBench

Datasets for LiveBench • 8 items • Updated Mar 31 • 13

upvoted a paper 9 months ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 119

liked a Space 9 months ago

Agora Demo

A simple demo showcasing Agora

upvoted a collection 10 months ago

DeepSeek-R1

10 items • Updated 12 days ago • 821

liked a model 10 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 5.52M • 563

liked a Space 11 months ago

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots