RLXF the best collection of RLXF model including RLHF, RLAIF etc. Amu/dpo-phi2 Text Generation • 3B • Updated Mar 4, 2024 • 16 • 2 Amu/spin-phi2 Text Generation • 3B • Updated Mar 16, 2024 • 33 • 10 Amu/t1-3B-grpo Text Generation • 3B • Updated Apr 7 • 9 • 1
BABYLLM The baby-LLM is the future of LLM. Amu/supertiny-llama3-0.25B-v0.1 Text Generation • 0.3B • Updated Jul 8, 2024 • 12 • 6 Amu/t1-3B Text Generation • 3B • Updated Mar 11 • 32 • 1
RAG the best collection of RAG model, like embedding, ranker etc. Amu/tao-8k Sentence Similarity • Updated Dec 3, 2023 • 129 • 45
RLXF the best collection of RLXF model including RLHF, RLAIF etc. Amu/dpo-phi2 Text Generation • 3B • Updated Mar 4, 2024 • 16 • 2 Amu/spin-phi2 Text Generation • 3B • Updated Mar 16, 2024 • 33 • 10 Amu/t1-3B-grpo Text Generation • 3B • Updated Apr 7 • 9 • 1
RAG the best collection of RAG model, like embedding, ranker etc. Amu/tao-8k Sentence Similarity • Updated Dec 3, 2023 • 129 • 45
BABYLLM The baby-LLM is the future of LLM. Amu/supertiny-llama3-0.25B-v0.1 Text Generation • 0.3B • Updated Jul 8, 2024 • 12 • 6 Amu/t1-3B Text Generation • 3B • Updated Mar 11 • 32 • 1