pdftract/crates/pdftract-cli/tests
jedarden 2f010c51fb feat(pdftract-206o6): implement scientific_paper profile with fixtures and tests
Author profiles/builtin/scientific_paper.yaml per Phase 7.10 YAML schema:
- Match predicates: text_contains (Abstract, References, doi:, arXiv:, Bibliography)
- Structural predicates: has_math, heading_depth, page_count
- Extraction tuning: xy_cut reading order for 2-column layout
- Fields: title, authors, abstract, doi, journal, publication_date, references

Add 5 fixtures covering diverse scientific paper types:
- arXiv preprint (CC-BY license)
- PLOS ONE journal article
- IEEE-style 2-column paper
- Nature-style single-column with sidebar
- ACM/IEEE conference proceedings

Add comprehensive regression tests in test_scientific_paper.rs:
- Profile schema validation
- Fixture structure verification
- Expected output consistency checks
- Match predicate validation
- Fixture diversity verification
- xy_cut reading order verification
- DOI regex format validation

Co-Authored-By: Claude Code (claude-opus-4-7) <noreply@anthropic.com>
2026-05-27 20:19:10 -04:00
..
conformance.rs feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
mcp-cli-args.rs feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
mcp-http.rs feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
mcp-stdio.rs feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
mcp-tools-integration.rs feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
root-path-protection.rs feat(pdftract-3s2i): implement Phase 5.5.2 validation filter 2026-05-24 04:57:17 -04:00
test_contract.rs feat(pdftract-dtpwa): implement contract profile per Phase 7.10 schema 2026-05-24 07:10:32 -04:00
test_form.rs feat(pdftract-3h9xo): implement threads JSON output + schema integration 2026-05-25 13:40:15 -04:00
test_scientific_paper.rs feat(pdftract-206o6): implement scientific_paper profile with fixtures and tests 2026-05-27 20:19:10 -04:00
TH-02-path-traversal.rs test(pdftract-4h06h): implement TH-02 path traversal security test 2026-05-25 13:03:45 -04:00
TH-09-inspector-xss.rs feat(pdftract-3b1mk): implement TH-09 inspector XSS test with CSP headers 2026-05-26 20:38:21 -04:00