Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yuran Wang's picture
1 13 5

Yuran Wang

Ryann829
ZhaoXiyu123's profile picture
·

AI & ML interests

Multimodal Large Language Model

Recent Activity

authored a paper 1 day ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
upvoted a paper 1 day ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
updated a dataset 3 days ago
Ryann829/SconeEval
View all activity

Organizations

Wuhan Univeristy's profile picture WYR-Dataset's profile picture

authored a paper 1 day ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published 3 days ago • 26
authored a paper about 1 month ago

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Paper • 2512.12675 • Published Dec 14, 2025 • 40
authored 3 papers 3 months ago

Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Paper • 2501.15558 • Published Jan 26, 2025 • 2

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Paper • 2503.14324 • Published Mar 18, 2025 • 2

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

Paper • 2509.24897 • Published Sep 29, 2025 • 46
authored a paper 12 months ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26, 2025 • 60
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs