- Add #page=N URL fragment routing for shareable inspector links
- Support browser back/forward navigation via hashchange event
- Persist overlay toggle state in localStorage with error handling
- Add isUpdatingFragment flag to prevent double-render on hash updates
- Update thumbnail click handler to rely on updateFragment()
- Clamp out-of-range page numbers with console warnings
- Default to page 0 for invalid/non-numeric page numbers
- Add vector fixture provenance entries

Acceptance criteria:
- URL #page=14 on load → starts on page 14 ✓
- Navigate via next button → URL updates to #page=15 ✓
- Browser back button → URL and view update correctly ✓
- Bookmark with #page=14 → reopens to page 14 ✓
- Overlay toggles persist across page refresh ✓
- Out-of-range #page=999 → clamps to last page ✓
- Invalid #page=abc → defaults to page 0 ✓

Closes pdftract-47e42

Verification: notes/pdftract-47e42.md
Academic Paper on Machine Learning - CER Test Fixture
Purpose
This fixture is used for Character Error Rate (CER) testing in the vector PDF corpus.
Files
- source.pdf - Clean vector PDF with embedded text
- ground_truth.txt - Exact text content for CER comparison
- README.md - This file
Content
Abstract
This paper presents a novel approach to machine learning using deep neural networks. Our method achieves state-of-the-art results on several benchmark datasets.

Introduction
Machine learning ...
Expected CER
Target: < 0.5% character error rate when extracted by pdftract.
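For reference, CER is commonly defined as the Levenshtein (edit) distance between the extracted text and the ground truth, divided by the length of the ground truth. The sketch below illustrates that definition and the 0.5% target. It is an assumption about how the metric could be computed, not a description of the project's actual test harness; the function names `levenshtein`, `character_error_rate`, and `meets_target` are hypothetical, and any text normalization (whitespace, Unicode) the real tests apply is not shown.

```python
def levenshtein(reference: str, hypothesis: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    # prev[j] holds the distance between the reference prefix processed
    # so far and the first j characters of the hypothesis.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution_cost = 0 if ref_char == hyp_char else 1
            curr.append(min(
                prev[j] + 1,                      # delete ref_char
                curr[j - 1] + 1,                  # insert hyp_char
                prev[j - 1] + substitution_cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)


def meets_target(cer: float, threshold: float = 0.005) -> bool:
    """True when the CER is strictly below the threshold (0.5% by default)."""
    return cer < threshold
```

In use, `reference` would be the contents of ground_truth.txt and `hypothesis` the text extracted from source.pdf. With a strict 0.5% threshold, a 1,000-character ground truth tolerates at most four character edits.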
Metadata
- Title: Academic Paper on Machine Learning
- Author: Jane Doe
- Creator: LaTeX
- Generated by: generate_vector_cer_corpus.py