pdftract/docs
jedarden 0dd761070d fix(pdftract-2rc4): regenerate JSON schema with enum constraints
Regenerates docs/schema/v1.0/pdftract.schema.json to include:
- page_type enum: text, scanned, mixed, broken_vector, blank, figure_only
- contentEncoding: base64 for AttachmentJson.data field

The gen_schema.rs tool already had the enum constraint logic, but the
checked-in schema was stale. This commit brings it in sync.

Closes pdftract-2rc4
2026-06-01 11:11:02 -04:00
..
adr feat(pdftract-bf-2y2rp): implement lazy stream decoding for PDF extraction 2026-05-23 12:30:26 -04:00
conformance feat(pdftract-5omc): implement SDK conformance test runner pattern 2026-05-18 01:22:23 -04:00
integrations feat(pdftract-2u6q2): implement diagnostic infrastructure 2026-05-25 13:16:38 -04:00
notes docs(pdftract-19oy): add verification note for codespace parser + tokenizer 2026-05-28 12:26:25 -04:00
operations feat(pdftract-30ahi): configure maturin for 5-target wheel builds 2026-05-28 08:04:32 -04:00
plan feat(pdftract-3zhf): add unified TableDetector::detect entry point 2026-05-24 00:51:59 -04:00
research docs(pdftract-1tjn): finalize OpenType MATH and formula extraction research note v1.0 2026-05-24 10:41:39 -04:00
schema/v1.0 fix(pdftract-2rc4): regenerate JSON schema with enum constraints 2026-06-01 11:11:02 -04:00
security docs(pdftract-58kz): add security policy documentation 2026-05-20 19:39:24 -04:00
user-docs fix(bf-1avnz): remove .code field access on String diagnostics in serve.rs 2026-06-01 04:14:05 -04:00
research-index.md Add parallel extraction research and comprehensive research index 2026-05-16 16:30:35 -04:00