pdftract/tests/fixtures/vector/code-documentation/README.md
jedarden fe59fa9785 feat(pdftract-47e42): implement URL fragment routing for shareable links
- Add #page=N URL fragment routing for shareable inspector links
- Support browser back/forward navigation via hashchange event
- Persist overlay toggle state in localStorage with error handling
- Add isUpdatingFragment flag to prevent double-render on hash updates
- Update thumbnail click handler to rely on updateFragment()
- Clamp out-of-range page numbers with console warnings
- Default to page 0 for invalid/non-numeric page numbers
- Add vector fixture provenance entries

Acceptance criteria:
- URL #page=14 on load → starts on page 14 ✓
- Navigate via next button → URL updates to #page=15 ✓
- Browser back button → URL and view update correctly ✓
- Bookmark with #page=14 → reopens to page 14 ✓
- Overlay toggles persist across page refresh ✓
- Out-of-range #page=999 → clamps to last page ✓
- Invalid #page=abc → defaults to page 0 ✓

Closes pdftract-47e42

Verification: notes/pdftract-47e42.md
2026-06-01 08:23:59 -04:00

30 lines
743 B
Markdown

# Code Library Documentation - CER Test Fixture
## Purpose
This fixture is used for Character Error Rate (CER) testing in the vector PDF corpus.
## Files
- `source.pdf` - Clean vector PDF with embedded text
- `ground_truth.txt` - Exact text content for CER comparison
- `README.md` - This file
## Content
libpdf - PDF Processing Library
Installation
pip install libpdf
Quick Example
from libpdf import Document
doc = Document('example.pdf')
text = doc.extract_text()
API Reference
Document.open(path)
Open...
## Expected CER
Target: < 0.5% character error rate when extracted by pdftract.
## Metadata
- Title: Code Library Documentation
- Author: Open Source Contributors
- Creator: Markdown
- Generated by: generate_vector_cer_corpus.py