Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
-
PaddlePaddle/PaddleOCR-VL-1.5
Image-Text-to-Text β’ 1.0B β’ Updated β’ 296k β’ 509 -
PaddleOCR-VL-1.5 Online Demo
π»70PaddleOCR-VL-1.5_Online_Demo
-
PaddlePaddle/PP-DocLayoutV3
Image Segmentation β’ Updated β’ 20.2k β’ 62 -
PaddlePaddle/PP-DocLayoutV3_safetensors
Object Detection β’ Updated β’ 231k β’ 20