PACKAGE CARD · MANJARO

tesseract-data-san

Tesseract OCR data (san)

  • Data
  • OCR
  • LOCALIZATION
  • Dependency only
official+codex · reviewed · May 29, 2026 · description in en

Description

Provides the Sanskrit (san) language data that lets Tesseract recognize Sanskrit text in scanned documents and images of adequate quality. It is useful for digitizing classical texts, study material, manuscripts, and academic sources.

Recognition accuracy can be affected by script choice, diacritics, ligatures, and page condition. Have an expert review the output before scholarly or archival use.
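
As a rough illustration of how this data is typically used once installed, the sketch below calls the Tesseract command-line tool from Python with the `san` language selected. The helper names (`build_ocr_command`, `language_installed`, `ocr_sanskrit`) and the file name `page.png` are hypothetical and chosen only for this example; it assumes the `tesseract` executable is on the PATH.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocr_command(image: str, output_base: str, lang: str = "san") -> list[str]:
    """Build the Tesseract CLI call: tesseract IMAGE OUTPUTBASE -l LANG."""
    return ["tesseract", image, output_base, "-l", lang]


def language_installed(lang: str = "san") -> bool:
    """Return True if `tesseract --list-langs` reports the given language."""
    if shutil.which("tesseract") is None:
        return False
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Depending on the Tesseract version, the listing may appear on
    # stdout or stderr, so both streams are searched.
    listed = (result.stdout + result.stderr).split()
    return lang in listed


def ocr_sanskrit(image: str, output_base: str) -> Path:
    """Run OCR on one image and return the path of the text file written."""
    if not language_installed("san"):
        raise RuntimeError("Sanskrit data not found; install tesseract-data-san")
    subprocess.run(build_ocr_command(image, output_base), check=True)
    # By default Tesseract writes plain text to OUTPUTBASE.txt.
    return Path(output_base + ".txt")
```

A call such as `ocr_sanskrit("page.png", "page")` would write the recognized text to `page.txt`. The output should still be proofread as noted above.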

Permissions

Permissions not analysed for this source yet.