DocLayout-Edge

Layout-aware document parser. Pairs with PaddleOCR-v5 for end-to-end PDF understanding on-device.

OCRApache-2.0INT8FP16doc-ailayout

36K downloads 1.2K deploymentsUpdated Mar 8, 2028

Headline:22.6ms · Snapdragon 8 Gen 3 NPU · INT8

Overview Benchmarks4 Sim Results Deploy4 Files Discussion23

About this model

Layout-aware document parser. Pairs with PaddleOCR-v5 for end-to-end PDF understanding on-device.

Authored by adept-edge. Curated into the Fo’c’sle reference set on 2028-03-08. All cross-chip benchmarks below were collected in matched-pair runs in the HIL lab using the same input pipeline, same upstream preprocessing, and the same downstream consumer. See the methodology page for the full protocol.

Task: OCR
Parameters: 16.8 M
Benchmarked on: 4 chips
Deployments: 1.2K

Architecture

Detection + recognition pipeline

Inferred from upstream weights · simplified

Headline benchmarks

QSnapdragon 8 Gen 3 NPUINT8

9.33ms p50

107 FPS98.5% acc6.6 W

QQualcomm QCS8550INT8

11.9ms p50

84 FPS98.7% acc12.9 W

NNVIDIA Jetson Orin NanoINT8

13.8ms p50

73 FPS97.6% acc12.0 W

Training data

Pretrained on the upstream maintainer’s released checkpoint. Edge-distillation pass uses 2.4M frames from the Fo’c’sle distillation corpus (consented public data + opt-in publisher contributions). Quantization-aware fine-tune uses 320K calibration samples drawn from the target task’s eval domain.

Pretraining corpus: upstream maintainer release
Distillation corpus: 2,400,000 frames
Calibration set: 320,000 samples (per task)
Eval set: standard benchmark + matched-pair HIL runs