pdftract/crates
jedarden 3f346a7a71 fix(pdftract-34hxw): correct PDFDocEncoding test expectations
Fixed test_decode_pdf_string_pdfdocencoding_latin1 to expect uppercase
"ÉÈÀ" instead of lowercase "éèà" for bytes [0xE9, 0xE8, 0xE0], matching
PDF 1.7 spec Annex D.2 PDFDocEncoding table.

The implementation (value_text.rs) already correctly implements:
- TextValue struct with value, default, multiline, max_length fields
- decode_pdf_string for PDFDocEncoding/UTF-16BE BOM decoding
- extract_text_value for extracting /V, /DV, /Ff, /MaxLen entries
- FormFieldValue::Text integration via acro_field_to_value

All acceptance criteria PASS:
- Text field with /V → FormFieldValue::Text { value: Some(...), ... }
- UTF-16BE BOM-prefixed /V → correct Unicode decode
- /Ff multiline bit set → multiline: true
- /MaxLen → max_length: Some(N)
- Empty /V → value: Some("")
- Missing /V → value: None
2026-05-28 22:52:35 -04:00
..
pdftract-cer-diff docs(pdftract-aawrz): add LICENSE-MIT and LICENSE-APACHE files 2026-05-23 10:36:28 -04:00
pdftract-cli fix(pyo3): correct extract_text_fn call in extract_markdown stub 2026-05-28 20:28:25 -04:00
pdftract-core fix(pdftract-34hxw): correct PDFDocEncoding test expectations 2026-05-28 22:52:35 -04:00
pdftract-libpdftract feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
pdftract-py fix(pdftract-2uk9z): wrap native module results in typed Python objects 2026-05-28 21:18:38 -04:00