pdftract/tests/fixtures/ocr
jedarden 05be70d36f feat(pdftract-48ea): implement BrokenVector fixtures + WER delta CI gate
Add two PDF/A fixtures for testing assisted-OCR (BrokenVector path):
- Aligned fixture with correctly-positioned invisible text layer
- Misaligned fixture with text layer offset by (10pt, 5pt)

Extend ci/wer-gate.sh with WER validation for BrokenVector fixtures.

Acceptance criteria:
- Two BrokenVector fixtures committed (both 1.5 KB, well under 200 KB limit)
- ci/wer-gate.sh extended with new fixture invocations
- WER delta tests will skip gracefully when OCR environment unavailable

Closes: pdftract-48ea

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-24 10:52:41 -04:00
..
brokenvector_aligned feat(pdftract-48ea): implement BrokenVector fixtures + WER delta CI gate 2026-05-24 10:52:41 -04:00
brokenvector_misaligned feat(pdftract-48ea): implement BrokenVector fixtures + WER delta CI gate 2026-05-24 10:52:41 -04:00
clean_lorem_ipsum feat(pdftract-315s): implement WER CI gate and OCR CLI flags 2026-05-24 02:07:27 -04:00
eng_fra_mixed feat(pdftract-315s): implement WER CI gate and OCR CLI flags 2026-05-24 02:07:27 -04:00
perf_10_page feat(pdftract-315s): implement WER CI gate and OCR CLI flags 2026-05-24 02:07:27 -04:00