moondream2
a tiny vision language model
a tiny vision language model
MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Generate text from images and prompts
Generate images from text prompts
Meta Llama3 8b with Llava Multimodal capabilities
Generate text and segment images using PaliGemma
Generate descriptions and answers about images
Generate text descriptions from images
Microsoft Phi-3 Vision 128k with Multimodal capabilities
let's talk about the meaning of life
Convert images to grayscale
Generate captions and analyze images with various tasks
Generate detailed captions for images
Generate detailed image captions
Analyze images to detect objects, generate captions, or perform OCR
A private and powerful multimodal AI chatbot that runs local
Generate images from prompts or images
Generate text based on an image and prompt
Ask questions about images
Generate text from an image and question
GOT - OCR (from : UCAS, Beijing)
Generate text by uploading images and asking questions
Huggingface space for JanusFlow-1.3B
Generate text from images and queries
Generate captions for images