Collections of ICLR 2026 paper: "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"
Zekun Qi
qizekun
AI & ML interests
Embodied Intelligence, Large Langugae Model, 3D Computer Vision
Recent Activity
upvoted a paper 3 days ago
PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception authored a paper 13 days ago
ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing? upvoted a paper 16 days ago
ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?