Florence-2-Edge
12%Florence-2 distilled for edge inspection use cases — caption, detect, segment from a unified head.
Cross-chip benchmark matrix
Every supported chip, in matched-pair runs from the Fo’c’sle HIL lab. Sortable by any column — click a header. Cells where the chip can’t run this model show Not supported.
| Chip platform | Quant | Latency p50(ms) | Latency p95(ms) | Throughput | Acc. retention(%) | Power(W) | Memory(MB) | Tested by |
|---|---|---|---|---|---|---|---|---|
N NVIDIA Jetson AGX Orin 275 TOPS · Module | FP16 | 31.3 | 40.1 | 32 FPS | 99.6 | 44.8 | 543 | Fo’c’sle HIL |
Q Snapdragon 8 Gen 3 NPU 45 TOPS · SoC | INT8 | 42.3 | 55.8 | 24 FPS | 98.3 | 7.5 | 271 | Fo’c’sle HIL |
Q Qualcomm QCS8550 48 TOPS · SoC | INT8 | 58.5 | 74.4 | 17 FPS | 98.6 | 12.2 | 271 | Fo’c’sle HIL |
N NVIDIA Jetson Orin Nano 40 TOPS · SoM | INT8 | 65.4 | 78.9 | 15 FPS | 98.2 | 14.0 | 271 | Community |
Apple Neural Engine (M4) 38 TOPS · SoC | FP16 | 70.2 | 84.1 | 14 FPS | 99.8 | 7.2 | 543 | Fo’c’sle HIL |
H Hailo-8 26 TOPS · M.2 | Not supported | |||||||
H Hailo-10H 40 TOPS · M.2 | Not supported | |||||||
Q Qualcomm QCS6490 12 TOPS · SoC | Not supported | |||||||
N NVIDIA Jetson Thor 2070 TOPS · Module | Not supported | |||||||
π Raspberry Pi 5 + Hailo HAT 26 TOPS · HAT | Not supported | |||||||
A Ambarella CV5 16 TOPS · SoC | Not supported | |||||||
A Ambarella CV72 32 TOPS · SoC | Not supported | |||||||
M MediaTek Genio 700 4 TOPS · SoC | Not supported | |||||||
G Google Coral Edge TPU 4 TOPS · USB | Not supported | |||||||
I Intel Movidius Myriad X 4 TOPS · SoC | Not supported | |||||||
A AMD Versal AI Edge VE2302 22 TOPS · SoC | Not supported | |||||||
R Rockchip RK3588 6 TOPS · SoC | Not supported | |||||||
HIL conditions
All numbers measured on Fo’c’sle HIL rigs in Tel Aviv (primary), Munich (secondary), and Pittsburgh (robotics). Single-stream, batch-1, real preprocessing, real downstream consumer. p50/p95 are over 10,000-frame steady-state windows after a 30-second warm-up. Power draw is package power, not wall power. Memory footprint is the resident model + activations footprint at peak — not on-disk.
Submitted publisher numbers are accepted only if they reproduce within ±8% of an HIL-lab matched run on the same chip in the same input mode. Otherwise they live separately under the Discussion tab.