Images to Text - a ijohn07 Collection

ijohn07 's Collections

LoRA

Text to images NSFW

Justines's Llamafiles

Images to Text

updated Jun 7

Running

439

moondream2

🌔

439

a tiny vision language model
Running

Featured

37

Candle Moondream 2

🕯

37

MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Paused

Featured

146

Idefics 8b

🐠

146

Generate text from images and prompts
Runtime error

2.04k

Stable Diffusion XL on TPUv5e

🏋

2.04k

Generate images from text prompts
Running on Zero

88

Llava Llama-3 8B

🔥

88

Meta Llama3 8b with Llava Multimodal capabilities
Running

83

Paligemma HF

🤗

83

Generate text and segment images using PaliGemma
Running on Zero

Featured

150

Llava Next

🔥

150

Generate descriptions and answers about images
Running on Zero

Featured

219

Microsoft Phi-3-Vision-128k

😻

219

Generate text descriptions from images
Running on Zero

46

Microsoft Phi-3 Vision 128k

🔥

46

Microsoft Phi-3 Vision 128k with Multimodal capabilities
Runtime error

Featured

51

Contemplative moondream

🌜

51

let's talk about the meaning of life
Running

3

Gradio Lite

🖼

3

Convert images to grayscale
Running on Zero

Featured

811

Florence 2

📉

811

Generate captions and analyze images with various tasks
Running on Zero

Featured

260

SD3 Long Captioner

🏃

260

Generate detailed captions for images
Running

36

Florence 2 SD3 Captioner

⚡

36

Generate detailed image captions
Runtime error

Featured

198

Better Florence 2

🔥

198

Analyze images to detect objects, generate captions, or perform OCR
Running

21

LLaVA WebGPU

🌋

21

A private and powerful multimodal AI chatbot that runs local
Running on Zero

90

AuraFlow-v0.3 with Captioner

🖼

90

Generate images from prompts or images
Paused

Featured

102

Idefics3

📊

102

Generate text based on an image and prompt
Runtime error

31

Phi 3.5 Vision

👁

31

Ask questions about images
Running on Zero

Featured

224

Phi 3.5 Vision

🔥

224

Generate text from an image and question
Running

MCP

Featured

180

Tonic's GOT OCR

📲

180

GOT - OCR (from : UCAS, Beijing)
Running on Zero

Featured

390

Llama-Vision-11B

🚀

390

Generate text by uploading images and asking questions
Running on Zero

Featured

217

JanusFlow 1.3B

🏃

217

Huggingface space for JanusFlow-1.3B
Runtime error

144

SmolVLM

📊

144

Generate text from images and queries
Sleeping

1

SD3 Long Captioner

🏃

1

Generate captions for images