by OpenAI-Edge
Whisper-Edge-Tiny
8%
Edge fork of Whisper-Tiny with NPU-friendly attention rewriting. INT4 variant runs on the Pi alone.
Embedded ASR · MIT · INT8 · INT4 · asr · english · tiny
412K downloads · 25K deployments · Updated Mar 18, 2028
Headline: 220 ms · Raspberry Pi 5 + Hailo HAT · INT8
Deploy Whisper-Edge-Tiny
Pick a chip family. We hand you the artifacts (HEF, TRT engine, Core ML, ONNX) plus a one-click endpoint deploy. For private endpoints, on-prem deploy, or air-gapped distribution, see Enterprise.
Hailo-8
# Compile to Hailo HEF
$ pip install focsle
$ focsle pull openai-edge/whisper-edge-tiny --target hailo-8
$ focsle compile whisper-edge-tiny.onnx --target hailo-8 --quant int8
# Run on-device (HailoRT)
import focsle.runtime as fr
m = fr.load("whisper-edge-tiny.hef", target="hailo")
out = m.run(frame)

One-click endpoint
Spins up a managed endpoint in the closest region. Pro and above.
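
To compare your own board against the headline latency, the `m.run(frame)` call from the Hailo-8 snippet can be wrapped in a small timing harness. The following is a minimal sketch, not the method used to produce the headline figure, which this page does not describe. The helper name `measure_latency_ms`, its `warmup` and `iters` parameters, and the `fake_run` stand-in are all illustrative assumptions. On a device you would pass `m.run` and a real input frame instead.

```python
import math
import statistics
import time
from typing import Any, Callable, Dict


def measure_latency_ms(run: Callable[[Any], Any], frame: Any,
                       warmup: int = 5, iters: int = 50) -> Dict[str, float]:
    """Time repeated calls to run(frame) and summarize them in milliseconds."""
    # Warm-up calls are discarded so one-time setup cost does not skew results.
    for _ in range(warmup):
        run(frame)

    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        run(frame)
        samples.append((time.perf_counter() - start) * 1000.0)

    samples.sort()
    # Nearest-rank 95th percentile: rank = ceil(0.95 * n), converted to a 0-based index.
    p95_index = max(0, math.ceil(0.95 * len(samples)) - 1)
    return {
        "median_ms": statistics.median(samples),
        "p95_ms": samples[p95_index],
        "max_ms": samples[-1],
    }


if __name__ == "__main__":
    # Hypothetical stand-in for m.run so the sketch runs without hardware.
    def fake_run(frame):
        time.sleep(0.002)
        return frame

    print(measure_latency_ms(fake_run, frame=b"\x00" * 320, warmup=2, iters=10))
```

Reporting the median alongside a tail figure is a design choice: a single-number latency can hide occasional slow inferences, which matter for streaming ASR.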