pdftract/tests/fixtures/vector/code-documentation/README.md
jedarden fe59fa9785 feat(pdftract-47e42): implement URL fragment routing for shareable links
- Add #page=N URL fragment routing for shareable inspector links
- Support browser back/forward navigation via hashchange event
- Persist overlay toggle state in localStorage with error handling
- Add isUpdatingFragment flag to prevent double-render on hash updates
- Update thumbnail click handler to rely on updateFragment()
- Clamp out-of-range page numbers with console warnings
- Default to page 0 for invalid/non-numeric page numbers
- Add vector fixture provenance entries

Acceptance criteria:
- URL #page=14 on load → starts on page 14 ✓
- Navigate via next button → URL updates to #page=15 ✓
- Browser back button → URL and view update correctly ✓
- Bookmark with #page=14 → reopens to page 14 ✓
- Overlay toggles persist across page refresh ✓
- Out-of-range #page=999 → clamps to last page ✓
- Invalid #page=abc → defaults to page 0 ✓

Closes pdftract-47e42

Verification: notes/pdftract-47e42.md
2026-06-01 08:23:59 -04:00

743 B

Code Library Documentation - CER Test Fixture

Purpose

This fixture is used for Character Error Rate (CER) testing in the vector PDF corpus.

Files

  • source.pdf - Clean vector PDF with embedded text
  • ground_truth.txt - Exact text content for CER comparison
  • README.md - This file

Content

libpdf - PDF Processing Library Installation pip install libpdf Quick Example from libpdf import Document doc = Document('example.pdf') text = doc.extract_text() API Reference Document.open(path) Open...

Expected CER

Target: < 0.5% character error rate when extracted by pdftract.

Metadata

  • Title: Code Library Documentation
  • Author: Open Source Contributors
  • Creator: Markdown
  • Generated by: generate_vector_cer_corpus.py