pdftract/crates
jedarden f236d787e8 feat(pdftract-66dd8): implement DCTDecode passthrough with SOI/EOI validation
Implement the DCTDecode (JPEG) passthrough filter with marker validation
and /ColorTransform metadata parsing.

Changes:
- Add StreamInvalidJpeg diagnostic code for missing SOI/EOI markers
- Implement DCTDecoder struct with:
  - SOI (0xFFD8) marker validation
  - EOI (0xFFD9) marker validation
  - /ColorTransform parameter parsing
  - Raw byte passthrough with bomb limit enforcement
- Replace PassthroughDecoder with DCTDecoder in get_decoder()
- Add comprehensive test coverage (6 test cases)

The decoder validates JPEG markers but passes through data even when
markers are missing (INV-8 error recovery). Diagnostics are emitted
for missing markers but currently dropped due to trait limitations
(future enhancement will add diagnostics buffer to StreamDecoder).

Closes: pdftract-66dd8

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-24 11:42:09 -04:00
..
pdftract-cer-diff docs(pdftract-aawrz): add LICENSE-MIT and LICENSE-APACHE files 2026-05-23 10:36:28 -04:00
pdftract-cli feat(pdftract-67p2c): implement confidence heatmap layer renderer 2026-05-24 11:08:09 -04:00
pdftract-core feat(pdftract-66dd8): implement DCTDecode passthrough with SOI/EOI validation 2026-05-24 11:42:09 -04:00
pdftract-libpdftract feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
pdftract-py feat(pdftract-2nu0s): implement Python SDK contract conformance 2026-05-24 08:55:11 -04:00