Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
21.0
TFLOPS
2
7
2
Nitesh Kumar Sharma
carbene101
Follow
21world's profile picture
tugrulkaya's profile picture
SelmaNajih001's profile picture
4 followers
ยท
53 following
95-percent-ci
AI & ML interests
LLMs, OCR
Recent Activity
upvoted
a
paper
4 days ago
Architecture Decoupling Is Not All You Need For Unified Multimodal Model
reacted
to
sergiopaniego
's
post
with ๐ฅ
16 days ago
we've just added several example scripts to TRL showing how to train models with GRPO using some of the new OpenEnv environments train a model to interact with a browser (๐ฎ BrowserGym Env), play Wordle (๐ฎ Wordle Env) and moooore! TRL (GRPO + vLLM) + OpenEnv! โก๏ธ ๐ go play with them: https://github.com/huggingface/trl/tree/main/examples/scripts/openenv ๐ examples list: https://huggingface.co/docs/trl/main/en/example_overview#scripts
reacted
to
Kseniase
's
post
with โค๏ธ
18 days ago
12 Types of JEPA Since Yann LeCun together with Randall Balestriero released a new paper on JEPA (Joint-Embedding Predictive Architecture), laying out its theory and introducing an efficient practical version called LeJEPA, we figured you might need even more JEPA. Here are 7 recent JEPA variants plus 5 iconic ones: 1. LeJEPA โ https://huggingface.co/papers/2511.08544 Explains a full theory for JEPAs, defining the โidealโ JEPA embedding as an isotropic Gaussian, and proposes the SIGReg objective to push JEPA toward this ideal, resulting in practical LeJEPA 2. JEPA-T โ https://huggingface.co/papers/2510.00974 A text-to-image model that tokenizes images and captions with a joint predictive Transformer, enhances fusion with cross-attention and text embeddings before training loss, and generates images by iteratively denoising visual tokens conditioned on text 3. Text-JEPA โ https://huggingface.co/papers/2507.20491 Converts natural language into first-order logic, with a Z3 solver handling reasoning, enabling efficient, explainable QA with far lower compute than large LLMs 4. N-JEPA (Noise-based JEPA) โ https://huggingface.co/papers/2507.15216 Connects self-supervised learning with diffusion-style noise by using noise-based masking and multi-level schedules, especially improving visual classification 5. SparseJEPA โ https://huggingface.co/papers/2504.16140 Adds sparse representation learning to make embeddings more interpretable and efficient. It groups latent variables by shared semantic structure using a sparsity penalty while preserving accuracy 6. TS-JEPA (Time Series JEPA) โ https://huggingface.co/papers/2509.25449 Adapts JEPA to time-series by learning latent self-supervised representations and predicting future latents for robustness to noise and confounders Read further below โ It you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe
View all activity
Organizations
carbene101
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
about 1 month ago
datalab-to/chandra
Image-to-Text
โข
9B
โข
Updated
Oct 21
โข
94.2k
โข
405
liked
a Space
7 months ago
Running
2
Tts Dhwani Usecase
๐
2
TTS Indian voice for different domain