pdftract/crates
jedarden d723427da7 feat(pdftract-core): add run_tesseract integration and WER calculation
- Add run_tesseract() for full-page OCR with HOCR parsing
- Add run_tesseract_on_cell() for cell-local OCR with origin offset
- Add calculate_wer() for Word Error Rate measurement
- Export new functions in lib.rs
- Add comprehensive unit tests

Work from Phase 5.4.5 end-to-end Tesseract integration.
2026-05-24 01:12:33 -04:00
..
pdftract-cer-diff docs(pdftract-aawrz): add LICENSE-MIT and LICENSE-APACHE files 2026-05-23 10:36:28 -04:00
pdftract-cli feat(pdftract-3zhf): add unified TableDetector::detect entry point 2026-05-24 00:51:59 -04:00
pdftract-core feat(pdftract-core): add run_tesseract integration and WER calculation 2026-05-24 01:12:33 -04:00
pdftract-libpdftract feat(pdftract-juc): implement Standard 14 font metrics registry 2026-05-23 14:04:02 -04:00
pdftract-py docs(pdftract-aawrz): add LICENSE-MIT and LICENSE-APACHE files 2026-05-23 10:36:28 -04:00