Whisper-Edge-Tiny

Edge fork of Whisper-Tiny with NPU-friendly attention rewriting. INT4 variant runs on the Pi alone.

Embedded ASRMITINT8INT4asrenglishtiny

412K downloads 25K deploymentsUpdated Mar 18, 2028

Headline:220ms · Raspberry Pi 5 + Hailo HAT · INT8

Overview Benchmarks8 Sim Results Deploy8 Files Discussion23

Deploy Whisper-Edge-Tiny

Pick a chip family. We hand you the artifacts (HEF, TRT engine, Core ML, ONNX) plus a one-click endpoint deploy. For private endpoints, on-prem deploy, or air-gapped distribution, see Enterprise.

HHailo-8

# Compile to Hailo HEF
$ pip install focsle
$ focsle pull openai-edge/whisper-edge-tiny --target hailo-8
$ focsle compile whisper-edge-tiny.onnx --target hailo-8 --quant int8

# Run on-device (HailoRT)
import focsle.runtime as fr
m = fr.load("whisper-edge-tiny.hef", target="hailo")
out = m.run(frame)

One-click endpoint

Spins up a managed endpoint in the closest region. Pro and above.

Or deploy yourself

Docs · Hailo backend
SDK on GitHub
CLI install