Hugging Face
Open to Collab
138.4 TFLOPS
1
21
42
Soumik Rakshit
geekyrakshit
25 followers · 41 following
http://geekyrakshit.dev
soumikRakshit96
soumik12345
soumikrakshit
AI & ML interests
Computer vision
Recent Activity
Updated a dataset about 13 hours ago: geekyrakshit/indian-exam-questions
Published a dataset 1 day ago: geekyrakshit/indian-exam-questions
Reacted to m-ric's post 10 days ago:
Hugging Face releases Picotron, a microscopic lib that solves LLM training 4D parallelization
Llama-3.1-405B took 39 million GPU-hours to train, i.e. about 4.5 thousand years.
If they had needed all this time, we would have GPU stories from the time of Pharaoh: "Alas, Lord of Two Lands, the shipment of counting-stones arriving from Cathay was lost to pirates, this shall delay the building of your computing temple by many moons"
But instead, they just parallelized the training on 24k H100s, which made it take just a few months. This required parallelizing across 4 dimensions: data, tensor, context, pipeline. And it is infamously hard to do, making for bloated code repos that hold together only by magic.
But now we don't need huge repos anymore! Instead of building mega-training codes, Hugging Face colleagues cooked in the other direction, towards tiny 4D parallelism libs. A team has built Nanotron, already widely used in industry. And now a team releases Picotron, a radical approach to code 4D Parallelism in just a few hundred lines of code, a real engineering prowess, making it much easier to understand what's actually happening!
It's tiny, yet powerful: Counting in MFU (Model FLOPs Utilization, how much the model actually uses all the compute potential), this lib reaches ~50% on SmolLM-1.7B model with 8 H100 GPUs, which is really close to what huge libs would reach. (Caution: the team is leading further benchmarks to verify this)
Go take a look: https://github.com/huggingface/picotron/tree/main/picotron
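The figures quoted in the post can be sanity-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes "24k" means exactly 24,000 GPUs, that work splits perfectly evenly across them, and that a year has 365 days. The function names are hypothetical and are not part of Picotron or Nanotron.

```python
# Back-of-the-envelope check of the figures quoted in the post.
# Assumptions (not from the post): "24k" means exactly 24,000 GPUs,
# scaling is perfectly linear, and a year has 365 days.

HOURS_PER_DAY = 24
HOURS_PER_YEAR = HOURS_PER_DAY * 365


def gpu_hours_to_years(gpu_hours: float) -> float:
    """Time a single GPU would need for the whole job, in years."""
    return gpu_hours / HOURS_PER_YEAR


def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Idealized wall-clock days when the work splits evenly across GPUs."""
    return gpu_hours / num_gpus / HOURS_PER_DAY


def mfu(achieved_flops_per_s: float, peak_flops_per_s: float) -> float:
    """Model FLOPs Utilization: achieved model FLOP/s over hardware peak FLOP/s."""
    return achieved_flops_per_s / peak_flops_per_s


if __name__ == "__main__":
    total_gpu_hours = 39e6  # figure quoted in the post
    print(f"{gpu_hours_to_years(total_gpu_hours):,.0f} years on one GPU")
    print(f"{wall_clock_days(total_gpu_hours, 24_000):.1f} days on 24,000 GPUs")
```

Under these assumptions the single-GPU figure comes to roughly 4,450 years, matching the "about 4.5 thousand years" in the post, and the idealized parallel run takes about 68 days. Real training scales less than perfectly, so this is a lower bound, consistent with "a few months".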
Spaces (4)
Aryabhatta Inference · Agents · Sleeping
Line Art Data Annotation · Running · An app for annotation of line art data from books
MedRAG Multi-Modal · Runtime error · 2
Enhance Me · Runtime error · 1
Models (5)
geekyrakshit/binary-classifier • 67M • Updated Nov 29, 2024 • 1
geekyrakshit/grays-anatomy-index-medcpt • Updated Nov 3, 2024 • 1
geekyrakshit/grays-anatomy-index-contriever • Updated Nov 3, 2024 • 4
geekyrakshit/grays-anatomy-index • Updated Nov 3, 2024 • 2
geekyrakshit/DeepLabV3-Plus • Updated Jul 3, 2023 • 58
Datasets (23)
geekyrakshit/indian-exam-questions • Viewer • Updated about 12 hours ago • 1.44k • 32
geekyrakshit/art-images • Viewer • Updated 28 days ago • 12.6k • 43
geekyrakshit/hotpotqa_sft_traces • Viewer • Updated Dec 15, 2025 • 5 • 13
geekyrakshit/issues • Viewer • Updated Oct 17, 2025 • 1.89k • 11
geekyrakshit/rust-issues • Viewer • Updated Oct 10, 2025 • 1.89k • 40 • 2
geekyrakshit/hyperswitch • Viewer • Updated Oct 7, 2025 • 550 • 4
geekyrakshit/drawing-made-easy • Viewer • Updated Aug 12, 2025 • 95 • 7
geekyrakshit/prompt-injection-dataset • Viewer • Updated Nov 29, 2024 • 534k • 253 • 8
geekyrakshit/test-chunk-dataset • Viewer • Updated Nov 23, 2024 • 832 • 7
geekyrakshit/test-dataset • Viewer • Updated Nov 23, 2024 • 22 • 9