pdftract/profiles/builtin
jedarden 71705ed77b feat(profiles): implement built-in classification profiles (5.6.4)
Add 9 built-in classification profile definitions as YAML files bundled
via include_str! for the document type classifier (Phase 5.6).

- Create profiles/builtin/classification/{invoice,receipt,contract,scientific_paper,slide_deck,form,bank_statement,legal_filing,book_chapter}.yaml
- Implement load_builtins() in profiles module with profiles feature gate
- Each profile uses MatchPredicate schema with text patterns, structural signals, page counts
- Add comprehensive unit tests for profile loading and feature gate

Closes: pdftract-5sdd

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-24 15:04:43 -04:00
..
bank_statement docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
book_chapter docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
classification feat(profiles): implement built-in classification profiles (5.6.4) 2026-05-24 15:04:43 -04:00
contract feat(pdftract-dtpwa): implement contract profile per Phase 7.10 schema 2026-05-24 07:10:32 -04:00
form docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
invoice docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
legal_filing docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
receipt docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
scientific_paper docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00
slide_deck docs(pdftract-4iier): complete per-profile README documentation 2026-05-18 00:35:35 -04:00