- Add #page=N URL fragment routing for shareable inspector links - Support browser back/forward navigation via hashchange event - Persist overlay toggle state in localStorage with error handling - Add isUpdatingFragment flag to prevent double-render on hash updates - Update thumbnail click handler to rely on updateFragment() - Clamp out-of-range page numbers with console warnings - Default to page 0 for invalid/non-numeric page numbers - Add vector fixture provenance entries Acceptance criteria: - URL #page=14 on load → starts on page 14 ✓ - Navigate via next button → URL updates to #page=15 ✓ - Browser back button → URL and view update correctly ✓ - Bookmark with #page=14 → reopens to page 14 ✓ - Overlay toggles persist across page refresh ✓ - Out-of-range #page=999 → clamps to last page ✓ - Invalid #page=abc → defaults to page 0 ✓ Closes pdftract-47e42 Verification: notes/pdftract-47e42.md
30 lines
743 B
Markdown
30 lines
743 B
Markdown
# Code Library Documentation - CER Test Fixture
|
|
|
|
## Purpose
|
|
This fixture is used for Character Error Rate (CER) testing in the vector PDF corpus.
|
|
|
|
## Files
|
|
- `source.pdf` - Clean vector PDF with embedded text
|
|
- `ground_truth.txt` - Exact text content for CER comparison
|
|
- `README.md` - This file
|
|
|
|
## Content
|
|
libpdf - PDF Processing Library
|
|
Installation
|
|
pip install libpdf
|
|
Quick Example
|
|
from libpdf import Document
|
|
doc = Document('example.pdf')
|
|
text = doc.extract_text()
|
|
API Reference
|
|
Document.open(path)
|
|
Open...
|
|
|
|
## Expected CER
|
|
Target: < 0.5% character error rate when extracted by pdftract.
|
|
|
|
## Metadata
|
|
- Title: Code Library Documentation
|
|
- Author: Open Source Contributors
|
|
- Creator: Markdown
|
|
- Generated by: generate_vector_cer_corpus.py
|