Alex Rigler's picture

Alex Rigler

aldaleri

·

https://choochoo.cc

AI & ML interests

systems, security & governance

Recent Activity

liked a dataset 1 day ago

badlogicgames/pi-mono

liked a model 5 days ago

google/gemma-4-26B-A4B-it

upvoted an article 5 days ago

Welcome Gemma 4: Frontier multimodal intelligence on device

View all activity

Organizations

upvoted an article 5 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

7 days ago

•

786

upvoted a collection 22 days ago

Mistral Small 4

A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 23 days ago • 63

upvoted 2 collections 28 days ago

BitNet

🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1, 2025 • 62

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 2 days ago • 264

upvoted an article 28 days ago

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

about 1 month ago

•

189

upvoted 4 papers about 1 month ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 135

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 288

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 194

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 350

upvoted 2 articles about 1 month ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

Feb 18

•

18

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

Feb 20

•

501

upvoted a paper about 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 303

upvoted a collection about 2 months ago

NanoBEIR 🍺

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 26

upvoted an article about 2 months ago

Article

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

Feb 19

•

19

upvoted a collection about 2 months ago

ColBERT-Zero 🐶

First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 1 day ago • 20

upvoted an article about 2 months ago

Article

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

Feb 12

•

53

upvoted a collection about 2 months ago

LateOn-Code 💻

State-of-the-art late interaction code retrieval models • 6 items • Updated 1 day ago • 17

upvoted a collection 2 months ago

Open Coding Agents

13 items • Updated Mar 5 • 52

upvoted 2 papers 3 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 350

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 321