PACKAGE CARD · MANJARO

tesseract-data-enm

Tesseract OCR data (enm)

  • Data
  • OCR
  • LOCALIZATION
  • Dependency only
official+codex · reviewed · May 29, 2026 · description in en

Description

Provides the Tesseract language data for Middle English (language code enm), which lets Tesseract recognize Middle English text in suitable scanned sources. It is aimed at researchers working with older English documents and editions, for example in historical language study and archival OCR workflows.
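
As a rough illustration of how this data might be selected, the sketch below builds a basic Tesseract command line with the -l enm option and checks whether the language shows up in tesseract --list-langs. The helper names (build_command, language_installed) and the file names page.png and page_out are hypothetical placeholders, not part of this package.

```python
import shutil
import subprocess

LANG = "enm"  # Tesseract language code for Middle English


def build_command(image_path, output_base, lang=LANG):
    """Return the argv list for a basic run: tesseract IMAGE OUTPUTBASE -l LANG."""
    return ["tesseract", image_path, output_base, "-l", lang]


def language_installed(lang=LANG):
    """Return True if `tesseract --list-langs` lists `lang`.

    Returns False when the language is missing or tesseract is not on PATH.
    """
    if shutil.which("tesseract") is None:
        return False
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Some versions print the list on stderr, so both streams are searched.
    listed = (result.stdout + result.stderr).split()
    return lang in listed


if __name__ == "__main__":
    # page.png and page_out are placeholder names (assumptions for illustration).
    command = build_command("page.png", "page_out")
    print("enm available:", language_installed())
    print("command:", " ".join(command))
```

The script only prints the command it would run, so it can be tried safely before pointing it at real scans.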

Historical OCR is fragile because spelling, typography, and page condition vary widely. Treat results as a draft that needs expert review.
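
One way to support that review step is to flag words Tesseract itself was unsure about. The sketch below assumes the text was produced in Tesseract's TSV output format (for example with the tsv output option), which includes conf and text columns. The function name low_confidence_words and the threshold of 60 are arbitrary assumptions, and the sample rows are made up purely for illustration.

```python
import csv
import io


def low_confidence_words(tsv_text, threshold=60.0):
    """Return (word, confidence) pairs whose confidence is below `threshold`.

    Rows without text, or with a negative confidence (layout rows), are skipped.
    """
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    flagged = []
    for row in reader:
        text = (row.get("text") or "").strip()
        try:
            conf = float(row.get("conf") or -1)
        except ValueError:
            continue  # malformed confidence value, ignore the row
        if text and 0 <= conf < threshold:
            flagged.append((text, conf))
    return flagged


# Made-up sample in the TSV column layout (illustration only, not real OCR output).
SAMPLE_TSV = (
    "level\tpage_num\tblock_num\tpar_num\tline_num\tword_num\t"
    "left\ttop\twidth\theight\tconf\ttext\n"
    "4\t1\t1\t1\t1\t0\t10\t10\t100\t20\t-1\t\n"
    "5\t1\t1\t1\t1\t1\t10\t10\t50\t20\t91.5\tWhan\n"
    "5\t1\t1\t1\t1\t2\t70\t10\t40\t20\t42.0\tthat\n"
)

if __name__ == "__main__":
    for word, conf in low_confidence_words(SAMPLE_TSV):
        print(f"review: {word!r} (conf {conf})")
```

A low score only suggests where to look first. High-confidence words can still be wrong, so this does not replace expert review.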

Permissions

Permissions not analysed for this source yet.