MobileCLIP-S2

19%
by Apple

Apple's reference vision-language embedding for mobile. The encoder behind a lot of on-device retrieval.

Multimodal · Apache-2.0 · INT8 · FP16 · clip · embedding · mobile
313K downloads · 18K deployments · Updated Mar 30, 2028
Headline: 6.4 ms · Apple Neural Engine (M4) · INT8

Cross-chip benchmark matrix

Every supported chip, in matched-pair runs from the Fo’c’sle HIL lab. Cells where the chip can’t run this model show Not supported.

| Chip platform | Quant | Latency p50 (ms) | Latency p95 (ms) | Throughput | Acc. retention (%) | Power (W) | Memory (MB) | Tested by |
|---|---|---|---|---|---|---|---|---|
| Hailo-10H (40 TOPS · M.2) | INT8 | 20.4 | 26.7 | 49 FPS | 98.3 | 3.2 | 42 | Fo’c’sle HIL |
| NVIDIA Jetson Orin Nano (40 TOPS · SoM) | INT8 | 21.4 | 26.0 | 47 FPS | 97.8 | 12.2 | 42 | Fo’c’sle HIL |
| Snapdragon 8 Gen 3 NPU (45 TOPS · SoC) | INT8 | 21.7 | 26.6 | 46 FPS | 97.6 | 7.4 | 42 | Community |
| Apple Neural Engine (M4) (38 TOPS · SoC) | FP16 | 34.9 | 43.5 | 29 FPS | 99.6 | 6.5 | 84 | Publisher |
| Qualcomm QCS8550 (48 TOPS · SoC) | Coming Q4 2028 | | | | | | | |
| Hailo-8 (26 TOPS · M.2) | Not supported | | | | | | | |
| Qualcomm QCS6490 (12 TOPS · SoC) | Not supported | | | | | | | |
| NVIDIA Jetson AGX Orin (275 TOPS · Module) | Not supported | | | | | | | |
| NVIDIA Jetson Thor (2070 TOPS · Module) | Not supported | | | | | | | |
| Raspberry Pi 5 + Hailo HAT (26 TOPS · HAT) | Not supported | | | | | | | |
| Ambarella CV5 (16 TOPS · SoC) | Not supported | | | | | | | |
| Ambarella CV72 (32 TOPS · SoC) | Not supported | | | | | | | |
| MediaTek Genio 700 (4 TOPS · SoC) | Not supported | | | | | | | |
| Google Coral Edge TPU (4 TOPS · USB) | Not supported | | | | | | | |
| Intel Movidius Myriad X (4 TOPS · SoC) | Not supported | | | | | | | |
| AMD Versal AI Edge VE2302 (22 TOPS · SoC) | Not supported | | | | | | | |
| Rockchip RK3588 (6 TOPS · SoC) | Not supported | | | | | | | |
MS-COCO captions · 224×224 image / 32-token text

HIL conditions

All numbers measured on Fo’c’sle HIL rigs in Tel Aviv (primary), Munich (secondary), and Pittsburgh (robotics). Single-stream, batch-1, real preprocessing, real downstream consumer. p50/p95 are over 10,000-frame steady-state windows after a 30-second warm-up. Power draw is package power, not wall power. Memory footprint is the resident footprint of model plus activations at peak, not the on-disk size.
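
For readers who want to compare their own runs against these conditions, here is a minimal Python sketch of the latency statistics described above. It is not the Fo’c’sle harness. The function name `latency_summary` is hypothetical, and warm-up is expressed as a sample count, not a 30-second timer. The FPS calculation (1000 / p50) is an inference from the published table, where throughput matches that ratio for every measured row; it is not a documented formula.

```python
import statistics

def latency_summary(latencies_ms, warmup_samples, window=10_000):
    """Summarise per-frame latencies (ms) over a steady-state window.

    Hypothetical helper: drops `warmup_samples` leading samples, then
    summarises the next `window` samples.
    """
    steady = latencies_ms[warmup_samples:warmup_samples + window]
    if len(steady) < window:
        raise ValueError("not enough steady-state samples")
    # quantiles(n=100) returns 99 cut points; index 49 is p50, index 94 is p95.
    cuts = statistics.quantiles(steady, n=100, method="inclusive")
    p50, p95 = cuts[49], cuts[94]
    # Single-stream, batch-1: one frame per inference, so FPS is taken
    # as 1000 / p50 (assumption inferred from the table, see lead-in).
    return {"p50_ms": p50, "p95_ms": p95, "fps": 1000.0 / p50}
```

The inclusive quantile method treats the window as the full population of interest, which suits a fixed-size measurement window; a different interpolation rule would shift p95 slightly on small windows.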

Submitted publisher numbers are accepted only if they reproduce within ±8% of an HIL-lab matched run on the same chip in the same input mode. Otherwise they live separately under the Discussion tab.
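
The ±8% acceptance rule can be sketched as a simple relative-difference check. This is an illustration under two assumptions: the tolerance is taken relative to the HIL-lab value, and the check is applied to one metric at a time. The function name `within_tolerance` is hypothetical, and the source does not say which metrics the rule covers.

```python
def within_tolerance(submitted, hil_reference, tolerance=0.08):
    """Return True if `submitted` is within ±tolerance of the HIL value.

    Hypothetical helper; the difference is measured relative to the
    HIL-lab reference (an assumption, see lead-in).
    """
    if hil_reference <= 0:
        raise ValueError("reference must be positive")
    return abs(submitted - hil_reference) / hil_reference <= tolerance
```

For example, a submitted p50 of 21.0 ms against an HIL run of 20.4 ms differs by about 3% and would pass, while 23.0 ms against 20.4 ms differs by about 13% and would not.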