FICHA · AUR

tika-server

Detects and extracts metadata and text from over a thousand different file types, such as PPT, XLS, and PDF. (server)

  • document extraction service
  • SERVICE
  • DOCUMENTS
  • NETWORK
  • Launchable
  • Runs in terminal
  • Background service
official+codex · reviewed · Jun 5, 2026 description in en

Description

File metadata and text extraction can be exposed as a local or networked Apache Tika service. Administrators install it when other applications need an HTTP parser service for documents, media, and archives. Uploaded content, network access, parser bugs, temporary files, memory use, and service permissions are high-risk concerns.

How to run

java

Commands: java

Permissions

Permissions not analysed for this source yet.