Implements the slide_deck document profile for PowerPoint/Keynote/Google Slides exports as PDF. Includes 5 fixtures, expected outputs, and regression tests. Components: - profiles/builtin/slide_deck/profile.yaml - Profile configuration - tests/fixtures/profiles/slide_deck/ - 5 PDF fixtures with expected outputs - crates/pdftract-cli/tests/test_slide_deck.rs - Regression tests (12 PASS) Fixtures cover: 1. pitch_deck - Sales pitch (10 slides) 2. academic_lecture - Academic lecture (40 slides) 3. corporate_kickoff - Corporate kickoff (15 slides) 4. bilingual_deck - Bilingual EN/ES (12 slides) 5. googleslides_handout - Google Slides handout mode (4 pages, 3 slides/page) Extracted fields: title, presenter, date, slide_titles Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
3.7 KiB
3.7 KiB
Slide Deck Profile Fixtures - Provenance
pitch_deck.pdf
Source: Synthetic template created for testing Type: Sales pitch deck (10 slides) License: MIT-0 (synthetic) PII: None Key Fields:
- Title: "Q3 2024 Product Roadmap"
- Presenter: "Jane Smith, VP Product"
- Date: 2024-09-15
- Slide Titles: ["Title", "Agenda", "Market Overview", "Product Vision", "Key Features", "Technical Architecture", "Go-to-Market Strategy", "Pricing & Packaging", "Next Steps", "Q&A"]
academic_lecture.pdf
Source: Synthetic template created for testing Type: Academic lecture slides (40 slides) License: MIT-0 (synthetic) PII: None Key Fields:
- Title: "Introduction to Machine Learning: Neural Networks Deep Dive"
- Presenter: "Prof. Robert Chen, PhD"
- Date: 2024-10-01
- Slide Titles: ["Title", "Overview", "What is a Neural Network?", "Perceptrons", "Multi-Layer Networks", "Activation Functions", "Backpropagation", "Loss Functions", "Optimization", "Regularization", "Convolutional Networks", "Recurrent Networks", "Transformer Architecture", "Attention Mechanisms", "Training Strategies", "Hyperparameter Tuning", "Evaluation Metrics", "Case Studies", "Current Research", "Future Directions", "Summary", "References", "Q1", "Q2", "Q3", "Q4", "Q5", "Q6", "Q7", "Q8", "Q9", "Q10", "Q11", "Q12", "Q13", "Q14", "Q15", "Q16", "Thank You"]
corporate_kickoff.pdf
Source: Synthetic template created for testing Type: Corporate kickoff presentation (15 slides) License: MIT-0 (synthetic) PII: None Key Fields:
- Title: "2025 Annual Kickoff: Accelerating Growth"
- Presenter: "Michael Johnson, CEO"
- Date: 2025-01-15
- Slide Titles: ["Welcome", "2024 Recap", "Financial Highlights", "Customer Success Stories", "Product Roadmap 2025", "Market Expansion", "Team Growth", "Strategic Priorities", "OKR Framework", "Investment Areas", "Culture & Values", "Events Calendar", "Leadership Team", "Key Dates", "Thank You"]
bilingual_deck.pdf
Source: Synthetic template created for testing Type: Bilingual English/Spanish presentation (12 slides) License: MIT-0 (synthetic) PII: None Key Fields:
- Title: "Informe Anual 2024 / Annual Report 2024"
- Presenter: "Maria Garcia / Director General"
- Date: 2024-12-01
- Slide Titles: ["Portada / Title", "Resumen Ejecutivo / Executive Summary", "Logros 2024 / 2024 Achievements", "Crecimiento de Ingresos / Revenue Growth", "Expansión Global / Global Expansion", "Productos Nuevos / New Products", "Sostenibilidad / Sustainability", "Compromiso Social / Social Commitment", "Perspectivas 2025 / 2025 Outlook", "Estrategia / Strategy", "Próximos Pasos / Next Steps", "Gracias / Thank You"]
googleslides_handout.pdf
Source: Synthetic template created for testing Type: Google Slides export with 3 slides per page (handout mode) - 4 pages total License: MIT-0 (synthetic) PII: None Key Fields:
- Title: "Team Onboarding Guide"
- Presenter: "HR Department"
- Date: 2024-08-20
- Slide Titles: ["Welcome!", "Company Values", "Our Mission", "Tools & Resources", "Benefits Overview", "Who's Who", "First Week Checklist", "Questions?", "Contact HR", "Thank You", "Insurance", "401k", "PTO Policy", "Remote Work", "Emergency Contacts"]
- Note: page_count (4) does NOT equal slide count (15) due to multi-slide-per-page handout format; this is a known limitation
Notes
- All fixtures are synthetic templates created for testing purposes
- No real presentations or PII are included
- Expected output JSON files document the ground truth for each fixture
- PDF files will be created following the slide deck profile schema validation
- Fixtures cover common export formats and edge cases (bilingual, handout mode)