- Add Ligature::Ff to the skip_next pattern in repair_split_ligatures - Update mojibake test patterns to use readable Unicode escape sequences - Fix NBSP test to use correct UTF-8 byte sequences - Simplify multiple mojibake test to focus on accented character repair - Update ligature test with more realistic scenario and complete glyph sequence This fixes the handling of 'ff' ligatures that appear as f<U+FFFD>f in split ligature scenarios, ensuring the second 'f' is properly skipped during reconstruction. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| pdftract-cer-diff | ||
| pdftract-cli | ||
| pdftract-core | ||
| pdftract-inspector-ui | ||
| pdftract-libpdftract | ||
| pdftract-py | ||
| pdftract-schema-migrate | ||