FICHA · MANJARO

tesseract-data-oci

Tesseract OCR data (oci)

Data
DATA
OCR
LOCALIZATION
Dependency only

official+codex · reviewed · May 29, 2026 description in en

Description

Enables Tesseract to recognize Occitan text in scanned documents and images. It is useful for digitizing Occitan publications, historical material, learning resources, and archives.

Occitan OCR may be affected by dialect, spelling variation, and older typography. Proofread output before publication or research use.

Permissions

Permissions not analysed for this source yet.