pdftract/docs
jedarden 94b02dedfe docs(pdftract-1tjn): finalize OpenType MATH and formula extraction research note v1.0
- Add Section 11: Formula-Region Detection Algorithm with pseudo-code
- Add Section 12: Inline vs Display Formula Classification rules
- Add Section 13: LaTeX-Like Reconstruction (Best-Effort) with feature-flag guidance
- Add Section 14: Profile Classifier Signal `structural.has_math` definition
- Add Section 15: Validation Methodology with arXiv fixture corpus strategy

File grows from 168 to 426 lines. All acceptance criteria PASS.

Closes: pdftract-1tjn
2026-05-24 10:41:39 -04:00
adr                feat(pdftract-bf-2y2rp): implement lazy stream decoding for PDF extraction  2026-05-23 12:30:26 -04:00
conformance        feat(pdftract-5omc): implement SDK conformance test runner pattern  2026-05-18 01:22:23 -04:00
integrations       feat(pdftract-vk0gc): implement markdown anchors with parser regex  2026-05-24 02:49:16 -04:00
notes              docs(pdftract-3b1x): finalize sdk-invocation.md with subprocess contract and TH-07 compliance  2026-05-24 07:48:09 -04:00
plan               feat(pdftract-3zhf): add unified TableDetector::detect entry point  2026-05-24 00:51:59 -04:00
research           docs(pdftract-1tjn): finalize OpenType MATH and formula extraction research note v1.0  2026-05-24 10:41:39 -04:00
schema/v1.0        feat(pdftract-j6yd): implement signatures array output + validation_status enum + schema integration  2026-05-24 04:05:34 -04:00
security           docs(pdftract-58kz): add security policy documentation  2026-05-20 19:39:24 -04:00
user-docs          docs(pdftract-1g87): create mdBook scaffolding for user documentation  2026-05-18 00:38:51 -04:00
research-index.md  Add parallel extraction research and comprehensive research index  2026-05-16 16:30:35 -04:00