OCR PaddlePaddle/PP-OCRv5_server_rec Image-to-Text • Updated Jul 22, 2025 • 135k • 28 ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 228k • 1.19k
text<->image De-Diffusion Makes Text a Strong Cross-Modal Interface Paper • 2311.00618 • Published Nov 1, 2023 • 22
De-Diffusion Makes Text a Strong Cross-Modal Interface Paper • 2311.00618 • Published Nov 1, 2023 • 22
Machine Translation A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Paper • 2309.11674 • Published Sep 20, 2023 • 33 tencent/Hunyuan-MT-7B Translation • 8B • Updated Dec 30, 2025 • 11.4k • 731
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Paper • 2309.11674 • Published Sep 20, 2023 • 33
ASR Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Paper • 2309.15223 • Published Sep 26, 2023 • 23 utter-project/mHuBERT-147 Feature Extraction • 94.4M • Updated 13 days ago • 26.4k • • 105
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Paper • 2309.15223 • Published Sep 26, 2023 • 23
Multimodal LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper • 2311.05437 • Published Nov 9, 2023 • 52
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper • 2311.05437 • Published Nov 9, 2023 • 52
OCR PaddlePaddle/PP-OCRv5_server_rec Image-to-Text • Updated Jul 22, 2025 • 135k • 28 ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 228k • 1.19k
Machine Translation A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Paper • 2309.11674 • Published Sep 20, 2023 • 33 tencent/Hunyuan-MT-7B Translation • 8B • Updated Dec 30, 2025 • 11.4k • 731
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Paper • 2309.11674 • Published Sep 20, 2023 • 33
ASR Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Paper • 2309.15223 • Published Sep 26, 2023 • 23 utter-project/mHuBERT-147 Feature Extraction • 94.4M • Updated 13 days ago • 26.4k • • 105
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Paper • 2309.15223 • Published Sep 26, 2023 • 23
text<->image De-Diffusion Makes Text a Strong Cross-Modal Interface Paper • 2311.00618 • Published Nov 1, 2023 • 22
De-Diffusion Makes Text a Strong Cross-Modal Interface Paper • 2311.00618 • Published Nov 1, 2023 • 22
Multimodal LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper • 2311.05437 • Published Nov 9, 2023 • 52
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper • 2311.05437 • Published Nov 9, 2023 • 52