pdftract

jedarden 9b5fbc9b5e	2026-05-23 12:30:26 -04:00
feat(pdftract-bf-2y2rp): implement lazy stream decoding for PDF extraction

- Add decode_page_content_streams() function for per-page lazy decode
- Update extract_page_from_dict() to support lazy stream decoding
- Modify extract_pdf() and extract_pdf_ndjson() to enable lazy decoding
- Fix borrow checker issue in LazyPageIter::next()

This ensures content streams are decoded lazily per page and dropped immediately after processing, keeping peak RSS flat across page count.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

adr	feat(pdftract-bf-2y2rp): implement lazy stream decoding for PDF extraction	2026-05-23 12:30:26 -04:00
conformance	feat(pdftract-5omc): implement SDK conformance test runner pattern	2026-05-18 01:22:23 -04:00
notes	feat(pdftract-5omc): implement per-language conformance test runner pattern	2026-05-18 01:32:24 -04:00
plan	docs(plan): SDKs are monorepo members, not separate repos	2026-05-22 07:21:45 -04:00
research	Add parallel extraction research and comprehensive research index	2026-05-16 16:30:35 -04:00
security	docs(pdftract-58kz): add security policy documentation	2026-05-20 19:39:24 -04:00
user-docs	docs(pdftract-1g87): create mdBook scaffolding for user documentation	2026-05-18 00:38:51 -04:00
research-index.md	Add parallel extraction research and comprehensive research index	2026-05-16 16:30:35 -04:00