pdftract

jedarden d14ec92fcb feat(pdftract-3zhf): add unified TableDetector::detect entry point		2026-05-24 00:51:59 -04:00

Add unified detect() method to TableDetector that combines both line-based and borderless table detection pipelines. This completes the coordinator bead for Phase 7.2: Table Detection and Structure Reconstruction.

All child beads (7.2.1-7.2.6) are closed:
- 7.2.1: Line-based detection (path segment clustering)
- 7.2.2: Borderless detection (x0 alignment heuristic)
- 7.2.3: Span-to-cell assignment (centroid containment)
- 7.2.4: Header row detection (bold + StructTree TH)
- 7.2.5: Merged cell detection (missing interior edges)
- 7.2.6: Table JSON output schema integration

Critical tests pass:
- 5x3 bordered table (15 cells extracted)
- Merged header cell colspan=3
- Borderless 3-column table detection
- Two-page table continuation detection

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

adr	feat(pdftract-bf-2y2rp): implement lazy stream decoding for PDF extraction	2026-05-23 12:30:26 -04:00
conformance	feat(pdftract-5omc): implement SDK conformance test runner pattern	2026-05-18 01:22:23 -04:00
notes	feat(pdftract-3zhf): add unified TableDetector::detect entry point	2026-05-24 00:51:59 -04:00
plan	feat(pdftract-3zhf): add unified TableDetector::detect entry point	2026-05-24 00:51:59 -04:00
research	feat(pdftract-ilen): implement header row detection with bold+TH support	2026-05-23 23:32:54 -04:00
schema/v1.0	feat(pdftract-3zhf): add unified TableDetector::detect entry point	2026-05-24 00:51:59 -04:00
security	docs(pdftract-58kz): add security policy documentation	2026-05-20 19:39:24 -04:00
user-docs	docs(pdftract-1g87): create mdBook scaffolding for user documentation	2026-05-18 00:38:51 -04:00
research-index.md	Add parallel extraction research and comprehensive research index	2026-05-16 16:30:35 -04:00