Umar Azam

UmarAzam

Umar-Azam

AI & ML interests

Robotics and Simulations

Organizations

None yet

Recent Activity

upvoted a paper about 15 hours ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 4 days ago • 121

upvoted an article 2 days ago

Article

How I contributed a new model to the Transformers library using Codex

3 days ago

upvoted an article 6 days ago

Article

Holotron-12B - High Throughput Computer Use Agent

16 days ago

liked a model 9 days ago

allenai/MolmoBot-SPOC-DROID

Updated 8 days ago • 4

upvoted a paper 12 days ago

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

Paper • 2603.18002 • Published 15 days ago • 13

liked a model 20 days ago

THU-SI/Spatial-TTT-nano

Updated 24 days ago • 42 • 4

upvoted a paper 24 days ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 30 days ago • 184

liked a model 25 days ago

UWGZQ/TRASER

Video-Text-to-Text • 928k • Updated 25 days ago • 36 • 4

upvoted a paper about 2 months ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 61

liked a model about 2 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated 8 days ago • 554k • 1.02k

upvoted an article about 2 months ago

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Feb 4

upvoted a collection about 2 months ago

LingBot-VLA

Collection

Vision-Language-Action Foundation Model • 5 items • Updated 24 days ago • 13

upvoted 2 papers 2 months ago

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 74

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

liked a model 2 months ago

microsoft/OptiMind-SFT

Text Generation • 21B • Updated Jan 15 • 632 • 98

upvoted a paper 3 months ago

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Paper • 2601.02204 • Published Jan 5 • 63

liked a model 3 months ago

Tongyi-MAI/MAI-UI-8B

Image-Text-to-Text • 9B • Updated Jan 9 • 4.52k • 187

upvoted a paper 3 months ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

liked a Space 4 months ago

Chatterbox Turbo Demo

490

upvoted a paper 4 months ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published Dec 2, 2025 • 37
