FICHA · MANJARO

libexttextcat

N-Gram-Based Text Categorization library primarily intended for language guessing

  • Library
  • LIBRARY
  • LANGUAGE
  • TEXT
  • Dependency only
official+codex · reviewed · May 28, 2026 description in en

Description

An n-gram based text categorization library primarily used to guess the language of text.

It is localization and text-processing infrastructure. Detection helps applications choose dictionaries, spell checking, or language-specific behavior, but short text can still be misclassified.

Permissions

Permissions not analysed for this source yet.