LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper β’ 2602.08676 β’ Published 2 days ago β’ 53
Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars Paper β’ 2602.01538 β’ Published 9 days ago β’ 15
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation Paper β’ 2512.21252 β’ Published Dec 24, 2025 β’ 35
view article Article Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI Toolkit Dec 2, 2025 β’ 10
view article Article Weβre open-sourcing our text-to-image model and the process behind it Nov 12, 2025 β’ 85
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper β’ 2509.24002 β’ Published Sep 28, 2025 β’ 175
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency Paper β’ 2506.08343 β’ Published Jun 10, 2025 β’ 54
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 β’ 486
Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper β’ 2501.10020 β’ Published Jan 17, 2025 β’ 24
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper β’ 2501.08225 β’ Published Jan 14, 2025 β’ 20