PP-OCRv6: PaddleOCR ships 50-language OCR across three model tiers

PaddleOCR releases PP-OCRv6, an OCR family with tiny/small/medium tiers (1.5M–34.5M parameters) and 50-language support in the small and medium models.

In detail

  • Three model tiers: tiny, small, medium; parameter range 1.5M–34.5M
  • Small and medium support 50 languages including Simplified/Traditional Chinese, English, Japanese and 46 Latin-script languages
  • PP-OCRv6_medium reaches 86.2% detection Hmean and 83.2% on recognition on in-house benchmarks, gains of 4.6 and 5.1 percentage points respectively over PP-OCRv5_server
  • Integration paths: PaddlePaddle, Transformers or ONNX Runtime; emphasis on lightweight, production-ready deployment
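
For readers unfamiliar with the detection metric: Hmean is the harmonic mean of precision and recall, the same quantity as F1. The sketch below shows the formula and also backs out the PP-OCRv5_server scores implied by the reported gains. The function names and the example precision/recall pair are illustrative assumptions, not figures from the release.

```python
def hmean(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (equivalent to F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def implied_baseline(new_score: float, gain_pp: float) -> float:
    """Baseline score implied by a new score and a gain in percentage points."""
    return round(new_score - gain_pp, 1)


# Hypothetical precision/recall pair, chosen only to illustrate the formula.
example = hmean(0.90, 0.80)

# Baselines implied by the figures quoted above (in-house benchmarks).
v5_detection = implied_baseline(86.2, 4.6)    # 81.6
v5_recognition = implied_baseline(83.2, 5.1)  # 78.1
```

Because the gains are stated in percentage points, the baselines follow by plain subtraction rather than by relative scaling.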

Why it matters

Small, accurate multilingual OCR models simplify automated document and label processing without relying on large VLMs, which is useful for SMEs handling diverse international documents.

For you

Run the small and medium variants on representative document samples to compare accuracy and latency, and prefer ONNX Runtime or PaddlePaddle deployments on low-resource servers.
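
One way to run such a comparison is a small harness that times each engine and scores its output against reference transcriptions. The sketch below is a minimal version under stated assumptions: each OCR engine is modelled as a plain callable from image path to text (the actual PaddlePaddle, Transformers, or ONNX Runtime pipeline would need to be wrapped to fit), character accuracy is defined here as one minus the normalised edit distance, and the engine names, file name, and sample text are hypothetical stand-ins.

```python
import time
from statistics import mean
from typing import Callable, Dict, List, Tuple

# Assumption: an OCR engine is any callable mapping an image path to text.
OcrFn = Callable[[str], str]


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance using a two-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def char_accuracy(predicted: str, reference: str) -> float:
    """One minus the normalised edit distance, clamped at zero."""
    if not reference:
        return 1.0 if not predicted else 0.0
    return max(0.0, 1.0 - edit_distance(predicted, reference) / len(reference))


def benchmark(
    engines: Dict[str, OcrFn], samples: List[Tuple[str, str]]
) -> Dict[str, Dict[str, float]]:
    """Mean character accuracy and mean latency per engine (samples must be non-empty)."""
    results: Dict[str, Dict[str, float]] = {}
    for name, run in engines.items():
        accuracies, latencies = [], []
        for image_path, reference in samples:
            start = time.perf_counter()
            predicted = run(image_path)
            latencies.append(time.perf_counter() - start)
            accuracies.append(char_accuracy(predicted, reference))
        results[name] = {
            "accuracy": mean(accuracies),
            "latency_ms": 1000 * mean(latencies),
        }
    return results


# Hypothetical stand-in engines so the sketch runs without any model installed.
def engine_a(path: str) -> str:
    return "INVOICE 2024"


def engine_b(path: str) -> str:
    return "INVOICE 2O24"  # one substituted character


samples = [("invoice_001.png", "INVOICE 2024")]
report = benchmark({"engine_a": engine_a, "engine_b": engine_b}, samples)
```

For meaningful latency numbers, discard a few warm-up runs per engine and use enough samples that model loading and caching effects do not dominate the mean.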

Summaries are generated automatically and link to the original source.