- Add run_tesseract() for full-page OCR with HOCR parsing - Add run_tesseract_on_cell() for cell-local OCR with origin offset - Add calculate_wer() for Word Error Rate measurement - Export new functions in lib.rs - Add comprehensive unit tests Work from Phase 5.4.5 end-to-end Tesseract integration. |
||
|---|---|---|
| .. | ||
| pdftract-cer-diff | ||
| pdftract-cli | ||
| pdftract-core | ||
| pdftract-libpdftract | ||
| pdftract-py | ||