pdftract/.needle-predispatch-sha
jedarden d723427da7 feat(pdftract-core): add run_tesseract integration and WER calculation
- Add run_tesseract() for full-page OCR with HOCR parsing
- Add run_tesseract_on_cell() for cell-local OCR with origin offset
- Add calculate_wer() for Word Error Rate measurement
- Export new functions in lib.rs
- Add comprehensive unit tests

Work from Phase 5.4.5 end-to-end Tesseract integration.
2026-05-24 01:12:33 -04:00

1 line
41 B
Text

d752df8c1e06ef4918bdc946cad953e8c13fefbd