AI & ML interests
Collection of JS libraries to interact with the Hugging Face Hub
Recent Activity
View all activity
mishigΒ
updated a
Space about 1 month ago
Bump @huggingface/jinja to 0.5.5
#8 opened about 2 months ago
by
Xenova
julien-cΒ
submitted a
paper to Daily Papers about 2 months ago
Bump @huggingface/jinja to v0.5.4
#7 opened 2 months ago
by
Xenova
Post
3958
π What happened in AI in 2025? π
We prepared the 2025 version of the HF AI Timeline Grid, highlighting open vs API-based model releases, and allowing you to browse and filter by access, modality, and release type!
Play with it here:
2025-ai-timeline/2025-ai-timeline
Here's my personal quarterly TL;DR:
1οΈβ£ Q1 β Learning to Reason
Deepseek not only releases a top-notch reasoning model, but shows how to train them and compete with closed frontier models. OpenAI debuts Deep Research.
Significant milestones: DeepSeek R1 & R1-Zero, Qwen 2.5 VL, OpenAI Deep Research, Gemini 2.5 Pro (experimental)
2οΈβ£ Q2 β Multimodality and Coding
More LLMs embrace multimodality by default, and there's a surge in coding agents. Strong vision, audio, and generative models emerge.
Significant milestones: Llama 4, Qwen 3, Imagen 4, OpenAI Codex, Google Jules, Claude 4
3οΈβ£ Q3 β "Gold" rush, OpenAI opens up, the community goes bananas
Flagship models get gold in Math olympiads and hard benchmarks. OpenAI releases strong open source models and Google releases the much anticipated nano-banana for image generation and editing. Agentic workflows become commonplace.
Significant milestones: Gemini and OpenAI IMO Gold, gpt-oss, Gemini 2.5 Flash Image, Grok 4, Claude Sonnet 4.5
4οΈβ£ Q4 β Mistral returns, leaderboard hill-climbing
Mistral is back with updated model families. All labs release impressive models to wrap up the year!
Significant milestones: Claude Opus 4.5, DeepSeek Math V2, FLUX 2, GPT 5.1, Kimi K2 Thinking, Nano Banana Pro, GLM 4.7, Gemini 3, Mistral 3, MiniMax M2.1 π€―
Credits
π NHLOCAL for the source data https://github.com/NHLOCAL/AiTimeline
π«‘ @reach-vb for the original idea, design and recipe
π @ariG23498 and yours truly for compiling and verifying the 2025 edition
π₯³ Here's to 2026, wishing it becomes the best year ever for open releases and on-device-first use-cases! π₯
We prepared the 2025 version of the HF AI Timeline Grid, highlighting open vs API-based model releases, and allowing you to browse and filter by access, modality, and release type!
Play with it here:
2025-ai-timeline/2025-ai-timeline
Here's my personal quarterly TL;DR:
1οΈβ£ Q1 β Learning to Reason
Deepseek not only releases a top-notch reasoning model, but shows how to train them and compete with closed frontier models. OpenAI debuts Deep Research.
Significant milestones: DeepSeek R1 & R1-Zero, Qwen 2.5 VL, OpenAI Deep Research, Gemini 2.5 Pro (experimental)
2οΈβ£ Q2 β Multimodality and Coding
More LLMs embrace multimodality by default, and there's a surge in coding agents. Strong vision, audio, and generative models emerge.
Significant milestones: Llama 4, Qwen 3, Imagen 4, OpenAI Codex, Google Jules, Claude 4
3οΈβ£ Q3 β "Gold" rush, OpenAI opens up, the community goes bananas
Flagship models get gold in Math olympiads and hard benchmarks. OpenAI releases strong open source models and Google releases the much anticipated nano-banana for image generation and editing. Agentic workflows become commonplace.
Significant milestones: Gemini and OpenAI IMO Gold, gpt-oss, Gemini 2.5 Flash Image, Grok 4, Claude Sonnet 4.5
4οΈβ£ Q4 β Mistral returns, leaderboard hill-climbing
Mistral is back with updated model families. All labs release impressive models to wrap up the year!
Significant milestones: Claude Opus 4.5, DeepSeek Math V2, FLUX 2, GPT 5.1, Kimi K2 Thinking, Nano Banana Pro, GLM 4.7, Gemini 3, Mistral 3, MiniMax M2.1 π€―
Credits
π NHLOCAL for the source data https://github.com/NHLOCAL/AiTimeline
π«‘ @reach-vb for the original idea, design and recipe
π @ariG23498 and yours truly for compiling and verifying the 2025 edition
π₯³ Here's to 2026, wishing it becomes the best year ever for open releases and on-device-first use-cases! π₯
Add image assets for image-text-to-image and image-text-to-video tasks
#12 opened 4 months ago
by
multimodalart
Bump @huggingface/jinja to 0.5.3
#4 opened 4 months ago
by
Xenova
coyotte508Β
updated a
model 5 months ago
coyotte508Β
updated a
collection 5 months ago
coyotte508Β
published a
model 5 months ago
coyotte508Β
updated a
collection 5 months ago
Add video-to-video task demo input output files.
1
#10 opened 6 months ago
by
ShahzebKhoso
Post
10220
deepseek-ai/DeepSeek-OCR is out! π₯ my take ‡οΈ
> pretty insane it can parse and re-render charts in HTML
> it uses CLIP and SAM features concatenated, so better grounding
> very efficient per vision tokens/performance ratio
> covers 100 languages
> pretty insane it can parse and re-render charts in HTML
> it uses CLIP and SAM features concatenated, so better grounding
> very efficient per vision tokens/performance ratio
> covers 100 languages
Upload 99 files
#5 opened 6 months ago
by
fellybikush
Delete index.html
#3 opened 6 months ago
by
fellybikush
coyotte508Β
updated a
Space 5 months ago