arxiv:2506.02975
YC Xiao
EasonXiao-888
AI & ML interests
AI, Multimodal Large Model
Recent Activity
upvoted a paper 2 months ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models upvoted a paper 3 months ago
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance upvoted a paper 3 months ago
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO