JP Balarini PRO

jpbalarini

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Overworld/Waypoint-1-Small

liked a model 4 days ago

lightonai/LightOnOCR-2-1B

liked a Space 4 days ago

lightonai/LightOnOCR-2-1B-Demo

upvoted a collection 2 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated Dec 23, 2025 • 160

upvoted 2 articles 3 months ago

Article

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

Nov 5, 2025 • 58

Article

Supercharge your OCR Pipelines with Open Models

Oct 21, 2025 • 299

upvoted a collection about 1 year ago

QVQ-72B-Preview

5 items • Updated Dec 24, 2024 • 7

upvoted an article about 1 year ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024 • 310

upvoted 2 collections about 1 year ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 301

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 50 items • Updated Dec 11, 2025 • 137

upvoted 5 collections over 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated 25 days ago • 227

Sapiens

Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 72 items • Updated Sep 18, 2024 • 60

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1, 2025 • 574

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 243

upvoted an article over 1 year ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024 • 205

upvoted 4 collections over 1 year ago

Florence

9 items • Updated May 1, 2025 • 173

DeepSeekCoder-V2

6 items • Updated Nov 27, 2025 • 112

4M Models

Multimodal models from https://4m.epfl.ch/ • 17 items • Updated Mar 7, 2025 • 31

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 4 days ago • 162

upvoted 2 papers almost 2 years ago

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6, 2024 • 89

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 627

upvoted a collection almost 2 years ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 25 days ago • 212