- Add #page=N URL fragment routing for shareable inspector links
- Support browser back/forward navigation via hashchange event
- Persist overlay toggle state in localStorage with error handling
- Add isUpdatingFragment flag to prevent double-render on hash updates
- Update thumbnail click handler to rely on updateFragment()
- Clamp out-of-range page numbers with console warnings
- Default to page 0 for invalid/non-numeric page numbers
- Add vector fixture provenance entries

Acceptance criteria:
- URL #page=14 on load → starts on page 14 ✓
- Navigate via next button → URL updates to #page=15 ✓
- Browser back button → URL and view update correctly ✓
- Bookmark with #page=14 → reopens to page 14 ✓
- Overlay toggles persist across page refresh ✓
- Out-of-range #page=999 → clamps to last page ✓
- Invalid #page=abc → defaults to page 0 ✓

Closes pdftract-47e42
Verification: notes/pdftract-47e42.md
Code Library Documentation - CER Test Fixture
Purpose
This fixture is used for Character Error Rate (CER) testing in the vector PDF corpus.
Files
- source.pdf - Clean vector PDF with embedded text
- ground_truth.txt - Exact text content for CER comparison
- README.md - This file
Content
libpdf - PDF Processing Library
Installation
pip install libpdf
Quick Example
from libpdf import Document
doc = Document('example.pdf')
text = doc.extract_text()
API Reference
Document.open(path) Open...
Expected CER
Target: < 0.5% character error rate when extracted by pdftract.
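The README does not say how the CER is calculated, so the following is only a minimal sketch of one common definition: the character-level Levenshtein distance between the extracted text and the ground truth, divided by the length of the ground truth. The function names `levenshtein` and `cer` are hypothetical and are not part of pdftract.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn ref into hyp."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j characters of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty string
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from ref
                curr[j - 1] + 1,     # insert h into ref
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, extracted: str) -> float:
    """Character error rate: edit distance over reference length.
    Assumed definition; the project's own metric may differ."""
    if not reference:
        raise ValueError("reference text must be non-empty")
    return levenshtein(reference, extracted) / len(reference)
```

Under this assumed definition, the fixture would meet its target when `cer(ground_truth, extracted) < 0.005`, where `ground_truth` is the content of ground_truth.txt and `extracted` is the text produced from source.pdf.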
Metadata
- Title: Code Library Documentation
- Author: Open Source Contributors
- Creator: Markdown
- Generated by: generate_vector_cer_corpus.py