FICHA · AUR

llama-benchy

A simple CLI tool for benchmarking llama.cpp and other LLM inference engines

benchmark-tool
CLI
Dev
Launchable
Runs in terminal

official+codex · reviewed · Jun 2, 2026 description in en

Description

Local LLM inference engines can be benchmarked from the terminal for speed and comparison.

This package is useful for developers and power users who tune llama.cpp or similar model runtimes. It measures performance; it does not provide a model by itself.

Benchmarks can consume high CPU, GPU, memory, and power. Run them when thermal and battery conditions are acceptable.

How to run

llama-benchy

Commands: llama-benchy

Permissions

Permissions not analysed for this source yet.