vLLM support and ONNX models
#5 by Napron
Thanks for a great model.
- When will vLLM support arrive?
- Can we serve the jina-vlm model with jina-serve?
- Are you going to share ONNX exports of the vision and language models?
Napron changed discussion title from "vLLM support" to "vLLM support and ONNX models"
Hey @Napron, vLLM support is coming very soon; we are working on it right now. For ONNX, do you mean separate models for vision and language?
That's great! Yes, I want to test inference for the vision and language models separately; it would be good to have quantized ONNX models too.
Thanks in advance.
I feel like they are very slow at adding vLLM support.
Hi @grozatech, it's taking a bit longer than expected, but I am back at it now. Will update this thread when it's ready, thanks!
Hi again, sorry for the delay, it should work now: https://huggingface.co/jinaai/jina-vlm#using-vllm
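For anyone else landing here, a minimal sketch of what serving this model with vLLM typically looks like, using vLLM's standard OpenAI-compatible server. This is an assumption based on vLLM's generic CLI, not taken from this thread; the linked model card's instructions take precedence.

```shell
# Sketch: launch an OpenAI-compatible server for the model (assumes vLLM is installed).
# --trust-remote-code is needed for models that ship custom modeling code on the Hub.
vllm serve jinaai/jina-vlm --trust-remote-code

# In another terminal, query the server on vLLM's default port (8000):
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "jinaai/jina-vlm", "messages": [{"role": "user", "content": "Describe this image."}]}'
```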
Hi @gmastparas, thank you for the update. Could you also provide pre-quantized AWQ or GPTQ weights for jina-vlm?