Commit graph

2 commits

Author SHA1 Message Date
jedarden
8ec8a8c271 test(pdftract-2xql8): add bomb protection detection test
Adds test_bomb_protection_detection to verify the take() adapter
correctly truncates decoded output at the size limit, preventing
decompression bomb attacks.

All acceptance criteria for pdftract-2xql8 remain PASS:
- Round-trip, compression ratio, error handling all verified
- Benchmarks exceed performance targets (encode/decode < 0.02s)

Co-Authored-By: Claude Code <noreply@anthropic.com>
2026-05-23 04:57:32 -04:00
jedarden
d873136439 feat(pdftract-2xql8): implement zstd compression encode/decode
Phase 6.9.3: zstd compression for cache entries.

- encode(): compress data with zstd level 3 (configurable via PDFTRACT_CACHE_ZSTD_LEVEL)
- decode(): decompress with 256 MB bomb protection and magic-byte validation
- encode_from_reader(): streaming variant for large inputs
- decode_into_writer(): streaming variant with incremental bomb protection

Acceptance criteria:
- Round-trip: encode(decode(bytes)) == bytes (PASS)
- Compression ratio: 5 MB -> <= 1.5 MB (PASS, ~4-5x achieved)
- Decode of truncated frame -> Err (PASS)
- Decode of >256 MB output -> Err (PASS)
- Decode of empty input -> Err (PASS)
- Decode of non-zstd magic bytes -> Err (PASS)
- Benchmark: encode 1 MB < 5 ms (PASS)
- Benchmark: decode 1 MB < 2 ms (PASS)

See notes/pdftract-2xql8.md for details.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-23 04:54:16 -04:00