FICHA · MANJARO

tesseract-data-bod

Tesseract OCR data (bod)

  • Data
  • DATA
  • OCR
  • LOCALIZATION
  • Dependency only
official+codex · reviewed · May 29, 2026 description in en

Description

Enables Tesseract to recognize Tibetan text in scanned pages and images. It is useful for digitizing Tibetan documents, religious texts, study material, and archival content.

OCR for Tibetan can be sensitive to print quality, script style, and page layout. Human review remains important for preservation or publication work.

Permissions

Permissions not analysed for this source yet.