Yuran Wang's picture

1 13 5

Yuran Wang

Ryann829

·

AI & ML interests

Multimodal Large Language Model

Recent Activity

authored a paper 1 day ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

upvoted a paper 1 day ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

updated a dataset 3 days ago

Ryann829/SconeEval

View all activity

Organizations

authored a paper 1 day ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published 3 days ago • 26

authored a paper about 1 month ago

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Paper • 2512.12675 • Published Dec 14, 2025 • 40

authored 3 papers 3 months ago

Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Paper • 2501.15558 • Published Jan 26, 2025 • 2

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Paper • 2503.14324 • Published Mar 18, 2025 • 2

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

Paper • 2509.24897 • Published Sep 29, 2025 • 46

authored a paper 12 months ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26, 2025 • 60