Hugging Face

chongyi

yuzaa

AI & ML interests

multimodal large language models

Organizations

OpenBMB

authored 2 papers 3 months ago

AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning

Paper • 2506.01391 • Published Jun 2

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

authored 5 papers 11 months ago

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants

Paper • 2310.00653 • Published Oct 1, 2023 • 3

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Paper • 2308.12038 • Published Aug 23, 2023 • 2

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9, 2024 • 24

GUICourse: From General Vision Language Models to Versatile GUI Agents

Paper • 2406.11317 • Published Jun 17, 2024 • 1

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 89