Submitted by
Jiayi Guo
AI & ML interests
Computer Vision, AI, Machine Learning
Recent Activity
View all activity
Papers
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation