4 5

Huang

GeekHuang

AI & ML interests

CV NLP

Recent Activity

upvoted a paper 5 days ago

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

liked a dataset 14 days ago

xlangai/RoboFine-bench

upvoted a paper 20 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

View all activity

Organizations

None yet

upvoted a paper 5 days ago

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Paper • 2605.27284 • Published 24 days ago • 8

liked a dataset 14 days ago

xlangai/RoboFine-bench

Viewer • Updated about 4 hours ago • 1k • 2.09k • 4

upvoted a paper 20 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 22 days ago • 143

updated a dataset 5 months ago

GeekHuang/Issac

Viewer • Updated Jan 4 • 1.75k • 8

published a dataset 5 months ago

GeekHuang/Issac

Viewer • Updated Jan 4 • 1.75k • 8

updated a model about 1 year ago

GeekHuang/CDLlamaGen

Updated Apr 25, 2025

liked a dataset about 1 year ago

timm/imagenet-1k-wds

Viewer • Updated Jan 7, 2024 • 80.7k • 5.55k • 35

liked a model about 1 year ago

stabilityai/sdxl-vae

Updated Aug 4, 2023 • 245k • 751

published a model about 1 year ago

GeekHuang/CDLlamaGen

Updated Apr 25, 2025

liked 2 models over 1 year ago

jadechoghari/mar

Unconditional Image Generation • Updated Nov 25, 2024 • 371 • 17

liuhaotian/llava-v1.5-13b-lora

Image-Text-to-Text • Updated May 9, 2024 • 11 • 28

upvoted 2 papers over 1 year ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6, 2025 • 56

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Paper • 2411.06558 • Published Nov 10, 2024 • 36

Huang

AI & ML interests

Recent Activity

Organizations

GeekHuang's activity