Medical benchmark.
Focused benchmark across 30 healthcare PDFs and 87 pages: labs, mammography, imaging, colorectal screening, scanned forms, and fax-like documents.
Extract latency: 1.1s p50 wall-clock (rank #1)
Extract coverage: 100%, 30/30 documents in selected tags (rank #1)
speed vs consensus
Upper-left is better: lower latency with higher consensus against the majority-provider reference.
[Scatter plot. x-axis: latency (← faster, slower →); legend: Extract, other qualified provider.]
provider table
Exact measurements for the selected documents.
| provider | mode | success | consensus F1 | p50 latency | pages/sec | bbox |
|---|---|---|---|---|---|---|
| Extract | Hosted API, ocr=auto | 30/30 | 97% | 1.1s | 2.1 | 100% |
| Reducto | Parse API | 30/30 | 98.8% | 15.9s | 0.1 | 100% |
| Gemini Structured Output | gemini-2.5-flash via Vertex AI, structured JSON + bbox schema | 30/30 | 99.7% | 31.2s | 0 | 100% |
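The p50 latency and pages/sec columns can be illustrated with a minimal sketch. It assumes p50 is the median of per-document wall-clock times and that pages/sec is total pages divided by total wall-clock seconds. The page does not state the exact throughput formula, so treat that part as an assumption. The `summarize` helper and the sample timings are hypothetical, not benchmark data.

```python
from statistics import median

def summarize(runs):
    """Summarize one provider's runs.

    runs: list of (pages, seconds) tuples, one per document.
    Assumption: throughput = total pages / total wall-clock seconds.
    """
    latencies = [seconds for _, seconds in runs]
    total_pages = sum(pages for pages, _ in runs)
    return {
        "p50_latency_s": median(latencies),
        "pages_per_sec": total_pages / sum(latencies),
    }

# Hypothetical timings for three documents (pages, seconds).
runs = [(1, 0.9), (5, 1.4), (3, 1.1)]
summary = summarize(runs)
print(summary["p50_latency_s"], round(summary["pages_per_sec"], 1))
```

Under this definition, any throughput below 0.05 pages/sec rounds to 0 at one decimal place, which is presumably why the slowest provider shows 0 in the table.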
documents in this view
30 docs
| document | pages | tags |
|---|---|---|
| Fecal Fat Quantitative Sample Report | 1 | labs, medical, reports, tables |
| Hereditary Hemolytic Anemia Cbc Sample Report | 5 | labs, medical, reports, tables |
| Lipofit High Sample Report | 3 | labs, medical, reports, tables |
| Fit Ifobt Colorectal Screening Info | 2 | labs, medical, reports, tables |
| Hepatitis B Surface Antibody Sample Report | 1 | labs, medical, reports, tables |
| Lipid Panel Sample Report | 2 | labs, medical, reports, tables |
| Nmr Lipoprofile Sample Report | 8 | labs, medical, reports, tables |
| Abnormal Cholesterol Result Letter | 1 | labs, medical, reports, tables |
| Blood Per Rectum Cbc Chem7 Coags | 2 | labs, medical, reports, tables |
| Bilateral Mammogram Bi-Rads 0 | 1 | mammography, imaging, medical, reports |
| Breast Ultrasound Biopsy | 2 | mammography, imaging, medical, reports |
| Diagnostic Mammogram | 1 | mammography, imaging, medical, reports |
| Ffdm Mammogram Bi-Rads 2 | 1 | mammography, imaging, medical, reports |
| Mri Breast Bi-Rads 5 | 1 | mammography, imaging, medical, reports |
| Chest Xray Report | 1 | imaging, medical, reports |
| Ct Abdomen Pelvis | 1 | imaging, medical, reports |
| Modified Barium Swallow | 1 | imaging, medical, reports |
| Colonoscopy Report Format Examples Positive Fit | 1 | colorectal, medical, reports, tables |
| Iccr Colorectal Polypectomy Histopathology Reporting Guide | 22 | colorectal, medical, reports, tables |
| Colon Cancer Screening Consult | 1 | colorectal, medical, reports, tables |
| Colonoscopy Polypectomy | 1 | colorectal, medical, reports, tables |
| Egd Colonoscopy | 1 | colorectal, medical, reports, tables |
| Olympus Colonoscopy Procedure Report Sample | 1 | colorectal, medical, reports, tables |
| Plco Cqx Colonoscopy Questionnaire | 7 | scanned, rasterized, forms, medical, ocr_required, reports |
| Plco Dec3 Diagnostic Evaluation Form | 12 | scanned, rasterized, forms, medical, ocr_required, reports |
| Plco Fsg2 Flexible Sigmoidoscopy Form | 2 | scanned, rasterized, forms, medical, ocr_required, reports |
| Lipid Panel Sample Report Faxified | 2 | faxified, rasterized, medical, ocr_required, reports, tables |
| Colonoscopy Polypectomy Faxified | 1 | faxified, rasterized, medical, ocr_required, reports |
| Ct Abdomen Pelvis Faxified | 1 | faxified, rasterized, medical, ocr_required, reports |
| Ffdm Mammogram Bi-Rads 2 Faxified | 1 | faxified, rasterized, imaging, medical, ocr_required, reports |
Rasterized docs ship as page-image PDFs (no text layer). They test the OCR / layout path, not text-layer extraction, and are treated as a sibling class of the scanned-doc set rather than as equivalent to born-digital business PDFs.
methodology
- Focused customer benchmark for preventive-screening chart uploads, not the public all-provider benchmark.
- 30 public PDFs (synthetic, de-identified, or sample documents) across labs, mammography, imaging, colorectal screening, scanned forms, and fax-like documents.
- Only Extract (hosted API), Reducto, and Gemini Structured Output were evaluated, because those were the providers relevant to this customer comparison.
- Gemini used gemini-2.5-flash via Vertex AI with structured JSON output and page-aware [y_min, x_min, y_max, x_max] bounding boxes normalized to 0..1000.
- Each provider ran once per document (repeats=1). Latency is wall-clock from the benchmark runner.
- Consensus F1 is provider agreement against a pseudo-reference across the three successful providers, NOT human-labeled accuracy.
- BBox metrics are source-grounding agreement in PDF point space, not human-labeled layout accuracy.
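Two of the points above can be sketched in code: agreement against a majority pseudo-reference, and mapping Gemini's normalized boxes into page coordinates. This is only an illustration under stated assumptions, not the benchmark runner's implementation. It assumes whitespace tokenization, a reference made of tokens that at least two of three providers emit, and a top-left page origin for the converted boxes. The names `majority_reference`, `f1`, and `to_page_points` are hypothetical, as are the sample strings.

```python
from collections import Counter

def majority_reference(outputs):
    """Tokens (with multiplicity) that a strict majority of providers agree on."""
    counts = [Counter(text.split()) for text in outputs.values()]
    need = len(counts) // 2 + 1  # e.g. 2 of 3 providers
    ref = Counter()
    for tok in set().union(*counts):
        # The need-th largest count is the multiplicity a majority reaches.
        per_provider = sorted((c[tok] for c in counts), reverse=True)
        if per_provider[need - 1] > 0:
            ref[tok] = per_provider[need - 1]
    return ref

def f1(pred, ref):
    """Token-level F1 between two token multisets."""
    overlap = sum((pred & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

def to_page_points(box, page_w, page_h):
    """[y_min, x_min, y_max, x_max] in 0..1000 -> (x0, y0, x1, y1) in points.

    Assumption: origin at the top-left of the page. Native PDF user space
    has its origin at the bottom-left, so a y-flip would be needed there.
    """
    y_min, x_min, y_max, x_max = box
    return (x_min / 1000 * page_w, y_min / 1000 * page_h,
            x_max / 1000 * page_w, y_max / 1000 * page_h)

# Hypothetical provider outputs for one text span.
outputs = {
    "extract": "LDL 130 mg/dL high",
    "reducto": "LDL 130 mg/dL high",
    "gemini": "LDL 180 mg/dL",
}
ref = majority_reference(outputs)
scores = {name: f1(Counter(text.split()), ref) for name, text in outputs.items()}
box_pts = to_page_points([100, 250, 200, 750], page_w=612, page_h=792)
```

In this toy case the two agreeing providers score 1.0 and the outlier scores lower even if its value happened to be the correct one, which is why consensus F1 measures agreement rather than accuracy.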