Commit graph

2 commits

Author SHA1 Message Date
jedarden
af3f8cd5a4 docs(pdftract-56txm): add verification note for Phase 4.5 Reading Order coordinator
All 4 child beads closed and verified. Acceptance criteria met:
- Two-column academic papers: XY-cut correctly orders left-col before right-col
- Magazine with sidebar: Docstrum separates main text from sidebar
- Single-column text: XY-cut produces single region, top-to-bottom ordering
- Tagged PDF: TAGGED_PDF_STRUCT_TREE_DEFERRED emitted, falls through to XY-cut

Test results: 27/27 reading order tests PASS.

Phase 4.5 Reading Order subsystem is fully functional with XY-cut preferred path,
Docstrum fallback for irregular layouts, and proper rank assignment.
2026-06-07 15:30:17 -04:00
jedarden
8c42c18ea8 docs(pdftract-56txm): add verification note for Phase 4.5 Reading Order coordinator
All 4 child beads closed:
- pdftract-5tvv1: Tagged-PDF fast-path stub
- pdftract-4md5z: XY-cut recursive widest-whitespace split
- pdftract-4bylb: Docstrum fallback (k=5 nearest-neighbor)
- pdftract-18cb4: Reading order rank assignment + algorithm tag

Acceptance criteria:
- All children closed
- Two-column academic paper: left-col before right-col
- Magazine with sidebar: main separated from sidebar
- Single-column: XY-cut produces single region
- Tagged PDF: TAGGED_PDF_STRUCT_TREE_DEFERRED emitted

Tests: 22/22 reading order unit tests pass; integration test passes.
2026-06-07 15:22:28 -04:00
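
Illustrative sketch (not taken from the pdftract source): the commit messages above describe XY-cut as a recursive widest-whitespace split followed by rank assignment. A minimal Python version of that idea might look like the block below. Everything in it is an assumption made for illustration: the names xy_cut_regions, reading_order and widest_gap, the (x0, y0, x1, y1) box layout with y growing downward, and the min_gap threshold. The language pdftract itself is written in is not stated in this log.

```python
from typing import List, Optional, Tuple

# Hypothetical box layout: (x0, y0, x1, y1), origin top-left, y grows downward.
Box = Tuple[float, float, float, float]


def widest_gap(intervals: List[Tuple[float, float]],
               min_gap: float) -> Optional[Tuple[float, float]]:
    """Return (gap_width, cut_position) for the widest empty gap between the
    projected intervals, or None if no gap is at least min_gap wide."""
    best = None
    ordered = sorted(intervals)
    reach = ordered[0][1]  # right/bottom edge covered so far
    for lo, hi in ordered[1:]:
        gap = lo - reach
        if gap >= min_gap and (best is None or gap > best[0]):
            best = (gap, (lo + reach) / 2.0)  # cut in the middle of the gap
        reach = max(reach, hi)
    return best


def xy_cut_regions(boxes: List[Box], min_gap: float = 10.0) -> List[List[Box]]:
    """Recursively split at the widest whitespace gap; return leaf regions
    in reading order (left before right, top before bottom)."""
    if not boxes:
        return []
    x_gap = widest_gap([(b[0], b[2]) for b in boxes], min_gap)
    y_gap = widest_gap([(b[1], b[3]) for b in boxes], min_gap)
    if x_gap is None and y_gap is None:
        # No usable whitespace: one region, ordered top-to-bottom.
        return [sorted(boxes, key=lambda b: (b[1], b[0]))]
    if y_gap is None or (x_gap is not None and x_gap[0] >= y_gap[0]):
        cut = x_gap[1]  # vertical cut: left part is read first
        first = [b for b in boxes if b[2] <= cut]
        second = [b for b in boxes if b[2] > cut]
    else:
        cut = y_gap[1]  # horizontal cut: top part is read first
        first = [b for b in boxes if b[3] <= cut]
        second = [b for b in boxes if b[3] > cut]
    return xy_cut_regions(first, min_gap) + xy_cut_regions(second, min_gap)


def reading_order(boxes: List[Box], min_gap: float = 10.0) -> List[Tuple[int, Box]]:
    """Flatten the regions and assign a 0-based reading-order rank to each box."""
    flat = [b for region in xy_cut_regions(boxes, min_gap) for b in region]
    return list(enumerate(flat))
```

Under these assumptions, a page with a full-width title above two columns is first cut horizontally below the title (no vertical gap exists while the title spans the gutter), then vertically at the gutter, so the left column is ranked before the right. A single-column page whose line spacing stays below min_gap comes back as one region.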