Data and models for the paper "How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models"
Kristian Schwethelm
KristianS7
AI & ML interests
Large Language Models
Recent Activity
updated a model 7 days ago
KristianS7/Ouro-1.4B new activity 7 days ago
KristianS7/Ouro-1.4B:Update tied weight metadata for Transformers 5 liked a model 10 days ago
KristianS7/Ouro-1.4B