pdftract/docs
jedarden 96b548ea18 docs(pdftract-19oy): add verification note for codespace parser + tokenizer
Implementation is complete. The codespace range parser and multi-byte
tokenizer exist in crates/pdftract-core/src/cmap/:
- codespace.rs: CodespaceParser for begincodespacerange blocks
- tokenize.rs: tokenize_cjk_bytes with widest-first matching

All acceptance criteria PASS. Compilation blocked by unrelated missing_docs
errors in parser/struct_tree.rs and other modules.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-28 12:26:25 -04:00
..
adr feat(pdftract-bf-2y2rp): implement lazy stream decoding for PDF extraction 2026-05-23 12:30:26 -04:00
conformance feat(pdftract-5omc): implement SDK conformance test runner pattern 2026-05-18 01:22:23 -04:00
integrations feat(pdftract-2u6q2): implement diagnostic infrastructure 2026-05-25 13:16:38 -04:00
notes docs(pdftract-19oy): add verification note for codespace parser + tokenizer 2026-05-28 12:26:25 -04:00
operations feat(pdftract-30ahi): configure maturin for 5-target wheel builds 2026-05-28 08:04:32 -04:00
plan feat(pdftract-3zhf): add unified TableDetector::detect entry point 2026-05-24 00:51:59 -04:00
research docs(pdftract-1tjn): finalize OpenType MATH and formula extraction research note v1.0 2026-05-24 10:41:39 -04:00
schema/v1.0 feat(pdftract-2qw5j): add explicit enum constraints to JSON Schema 2026-05-28 02:47:54 -04:00
security docs(pdftract-58kz): add security policy documentation 2026-05-20 19:39:24 -04:00
user-docs fix: resolve compilation errors across codebase 2026-05-25 08:38:04 -04:00
research-index.md Add parallel extraction research and comprehensive research index 2026-05-16 16:30:35 -04:00