-
CLEX: Continuous Length Extrapolation for Large Language Models
Paper • 2310.16450 • Published • 10 -
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper • 2401.06951 • Published • 26 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 25
Juan Herrera
juampahc
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago
juampahc/LFM2.5-230M-openvino published a model 1 day ago
juampahc/LFM2.5-230M-openvino liked a model 3 months ago
OuteAI/Llama-OuteTTS-1.0-1B