Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Paper • 2606.03985 • Published 10 days ago • 39
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents Paper • 2605.12481 • Published about 1 month ago • 28
Repurposing 3D Generative Model for Autoregressive Layout Generation Paper • 2604.16299 • Published Apr 17 • 12
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development Paper • 2603.27460 • Published Mar 29 • 70
EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models Paper • 2602.04515 • Published Feb 4 • 39
Future Optical Flow Prediction Improves Robot Control & Video Generation Paper • 2601.10781 • Published Jan 15 • 19
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization Paper • 2601.12993 • Published Jan 19 • 77
Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation Paper • 2512.23703 • Published Dec 29, 2025 • 7
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published Dec 18, 2025 • 49
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics Paper • 2512.13660 • Published Dec 15, 2025 • 37
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38
Geometrically-Constrained Agent for Spatial Reasoning Paper • 2511.22659 • Published Nov 27, 2025 • 41
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics Paper • 2510.07181 • Published Oct 8, 2025 • 1
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions Paper • 2510.08211 • Published Oct 9, 2025 • 23
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective Paper • 2509.18905 • Published Sep 23, 2025 • 31