ToolsData

Mistral releases OCR 4 — layout aware OCR preferred in 72% of blind tests, supports 170 languages

Mistral launches OCR 4, which detects document layout blocks, emits confidence scores and was preferred over competitors in 72 percent of a 600+ document blind test.

In detail

  • OCR 4 extracts text from PDFs, Word and PowerPoint and classifies each element’s position and role (title, table, equation, signature).
  • Provides block classification and confidence scores per word/page to aid search systems and agent pipelines.
  • Supports 170 languages; independent reviewers preferred OCR 4 in 72% of blind test cases across 600+ documents.
  • Available via API, Mistral Studio and Microsoft Foundry; pricing: $4 per 1,000 pages or $2 in batch mode.

Why it matters

Layout awareness plus confidence estimates improve downstream indexing and automated processing of documents, which matters for enterprises digitizing multilingual archives and automated workflows.

For you Run a trial of OCR 4 on a slice of your multilingual document corpus and compare layout extraction and confidence handling to your current OCR solution.

← All news

Summaries are generated automatically and link to the original source.