Suraj

ghishadow

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

LiquidAI/LFM2-2.6B-Exp

liked a model 20 days ago

Qwen/Qwen3-VL-2B-Thinking

liked a model 26 days ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

View all activity

Organizations

liked a model 6 days ago

LiquidAI/LFM2-2.6B-Exp

Text Generation • 3B • Updated 7 days ago • 6.16k • 292

liked a model 20 days ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20, 2025 • 36.6k • 97

liked a model 26 days ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated 17 days ago • 90.7k • 515

upvoted a collection about 1 month ago

Ministral 3

Collection

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 9 days ago • 26

liked a model about 1 month ago

litert-community/Gemma3-1B-IT

Text Generation • Updated Sep 22, 2025 • 18.6k • • 450

liked a model about 2 months ago

maya-research/maya1

Text-to-Speech • 3B • Updated Nov 12, 2025 • 81.3k • • 835

upvoted a paper 2 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

liked 2 models 2 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 743k • 1.17k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 6.65M • • 4.15k

upvoted an article 4 months ago

Article

The Hacker's Guide to Building an AI Supercluster

Aug 31, 2025

•

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 5 months ago

Gemma 3-270m

Collection

Collection of models for Gemma 3-270m • 4 items • Updated 17 days ago • 21

liked a Space 5 months ago

Wllama

🦙

Run GGUF directly on your browser!

liked a model 5 months ago

google/gemma-3-270m

Text Generation • 0.3B • Updated Aug 14, 2025 • 47.1k • 943

liked a Space 5 months ago

chat-ui

🔥

1.21k

Redirect to HuggingChat for conversations

liked a model 5 months ago

microsoft/Phi-3.5-mini-instruct

Text Generation • 4B • Updated 23 days ago • 305k • 940

upvoted a paper 5 months ago

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Paper • 2507.14111 • Published Jul 18, 2025 • 23

liked a model 5 months ago

tencent/HunyuanWorld-1

Image-to-3D • Updated Oct 20, 2025 • 11.8k • 591

liked 2 models 6 months ago

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated Sep 10, 2025 • 87.1k • • 859

apple/DiffuCoder-7B-cpGRPO

8B • Updated 25 days ago • 489 • 316

Suraj

AI & ML interests

Recent Activity

Organizations

ghishadow's activity

The Hacker's Guide to Building an AI Supercluster

The Ultra-Scale Playbook

Wllama

chat-ui