SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation Paper • 2508.00782 • Published Aug 1 • 6
TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization Paper • 2408.03637 • Published Aug 7, 2024