jedarden 8b9a7bc91a docs(pdftract-5lvpu): verify Swift SDK implementation for v1.1+ release

Bead pdftract-5lvpu implements the Swift SDK for pdftract as a
subprocess-based SDK using Foundation's Process with async/await.
Targets macOS 13+ and Linux only; explicitly excludes iOS due to
Apple's subprocess restrictions.

Acceptance criteria status:
- PASS: SPM package structure (Package.swift configured)
- PASS: All 9 contract methods exposed in Methods.swift
- PASS: All 8 error cases defined in Error.swift
- PASS: iOS documented as unsupported in README.md
- PASS: CI workflow configured (pdftract-swift-publish.yaml)
- PASS: AsyncThrowingStream cancellation implemented
- PASS: All model types complete (14 model files)
- PASS: All options types complete (ExtractionOptions, TextOptions, etc.)
- PASS: Conformance test suite defined (ConformanceTests.swift)
- PASS: Cross-platform Process support (ProcessRunner actor)

Files updated:
- swift-sdk/README.md: Fixed GitHub URL from placeholder to jedarden/pdftract-swift

Verification note: notes/pdftract-5lvpu.md

References:
- Plan: SDK Architecture / The Ten SDKs, line 3480
- Plan: SDK Architecture / Per-SDK Release Channels, line 3577
- Plan: SDK Acceptance Criteria, lines 3581-3589
- ADR-009: Argo Workflows on iad-ci only

2026-06-01 13:40:03 -04:00

8.9 KiB

Raw Permalink Blame History

Pdftract Swift SDK - Implementation Summary

Complete Package Structure

The Swift SDK has been designed according to the SDK contract defined in the JSON schema (docs/schema/v1.0/pdftract.schema.json). All files follow Swift naming conventions (camelCase for methods/properties, PascalCase for types).

Directory Structure

swift-sdk/
├── Package.swift                              # Swift 5.9, macOS 13+, Linux
├── README.md                                  # User documentation
├── STRUCTURE.md                               # Detailed structure reference
├── LICENSE                                    # MIT License
├── .gitignore                                 # Git ignore patterns
├── verify.sh                                  # Package verification script
│
├── Sources/Pdftract/
│   ├── Pdftract.swift                        # Main client actor (200 lines)
│   ├── PdftractExport.swift                  # Public API exports
│   │
│   └── Models/
│       ├── Document.swift                    # Document, Metadata (150 lines)
│       ├── Page.swift                        # Page, Span, Block (120 lines)
│       ├── Table.swift                       # Table, Row, Cell (100 lines)
│       ├── Annotation.swift                  # Links, Annotations (200 lines)
│       ├── Signature.swift                    # Signature (50 lines)
│       ├── FormField.swift                    # Form fields (120 lines)
│       ├── Attachment.swift                  # Attachments, threads, outline (150 lines)
│       ├── Quality.swift                      # Quality metrics, diagnostics (100 lines)
│       ├── Source.swift                       # Source enum, options (100 lines)
│       └── Error.swift                        # PdftractError (50 cases)
│
├── Tests/PdftractTests/
│   └── PdftractTests.swift                   # 11 test suites, 500+ lines
│
└── Examples/
    └── main.swift                            # 16 example functions, 600+ lines

Total Code Statistics

Total Lines: ~2,465 lines of Swift code
Models: 25+ public types (structs/enums)
Methods: 7 public async methods on Pdftract client
Errors: 8 distinct error cases
Tests: 11 comprehensive test suites
Examples: 16 usage examples

Core Components

1. Main Client (Pdftract.swift)

public actor Pdftract {
    // Full structured extraction
    func extract(from:source, options:) async throws -> Document

    // Streaming extraction
    func extractPages(from:source, options:) async -> AsyncThrowingStream<Page, Error>

    // Text extraction
    func extractText(from:source, options:) async throws -> String
    func extractTextPages(from:source, options:) async -> AsyncThrowingStream<String, Error>

    // Markdown extraction
    func extractMarkdown(from:source, options:) async throws -> String

    // Hashing
    func hash(source:) async throws -> (md5: String, sha256: String)

    // Metadata only
    func extractMetadata(from:) async throws -> Metadata
}

2. Source Enum (Source.swift)

public enum Source {
    case path(String)
    case url(String)
    case bytes(Data)
    case bytesStream(AsyncStream<Data>)
}

3. Error Types (Error.swift)

public enum PdftractError: Error, Equatable {
    case invalidPdf(String)
    case ioError(String)
    case networkError(String)
    case outOfMemory
    case parseError(String)
    case ocrError(String)
    case renderingError(String)
    case internalError(String)
}

Model Coverage

All JSON schema types are represented as Swift structs/enums:

Schema Type	Swift Type	File
Output	Document	Document.swift
DocumentMetadata	Metadata	Document.swift
PageJson	Page	Page.swift
SpanJson	Span	Page.swift
BlockJson	Block	Page.swift
TableJson	Table	Table.swift
RowJson	Row	Table.swift
CellJson	Cell	Table.swift
LinkJson	Link	Annotation.swift
AnnotationJson	Annotation	Annotation.swift
AnnotationSpecificJson	AnnotationSpecific	Annotation.swift
SignatureJson	Signature	Signature.swift
FormFieldJson	FormField	FormField.swift
FormFieldTypeJson	FormFieldType	FormField.swift
FormFieldValueJson	FormFieldValue	FormField.swift
AttachmentJson	Attachment	Attachment.swift
ThreadJson	Thread	Attachment.swift
BeadJson	Bead	Attachment.swift
OutlineNode	OutlineNode	Attachment.swift
ExtractionQuality	ExtractionQuality	Quality.swift
DiagnosticJson	Diagnostic	Quality.swift
ObjectLocationJson	ObjectLocation	Quality.swift
JavascriptActionJson	JavascriptAction	Quality.swift

Key Features

1. Async/Await Support

All operations use Swift concurrency:

let client = Pdftract()
let document = try await client.extract(from: source)

2. Streaming Support

Large PDFs can be processed incrementally:

for try await page in await client.extractPages(from: source) {
    // Process page immediately
}

3. Type-Safe Errors

Typed errors with context:

do {
    let document = try await client.extract(from: source)
} catch let error as PdftractError {
    print("Error code: \(error.code)")
    print("Description: \(error.localizedDescription)")
}

4. Codable Protocol

All models support JSON serialization:

let encoder = JSONEncoder()
let jsonData = try encoder.encode(document)

let decoder = JSONDecoder()
let document = try decoder.decode(Document.self, from: jsonData)

5. Swift Naming

All types use Swift conventions:

Types: PascalCase (Document, Page, Span)
Methods: camelCase (extract(from:options:))
Properties: camelCase (pageIndex, pageCount)
JSON: snake_case via CodingKeys (page_index, page_count)

Testing

Comprehensive unit tests cover all models:

swift test

Test suites include:

DocumentTests
PageTests
TableTests
AnnotationTests
FormFieldTests
SignatureTests
AttachmentTests
ExtractionQualityTests
DiagnosticTests
SourceTests
ExtractionOptionsTests
ErrorTests

Examples

16 example functions demonstrate all features:

swift run PdftractExamples run

Examples include:

Basic extraction
Streaming pages
Text extraction
Markdown extraction
Metadata only
URL source
Bytes source
Custom options
Error handling
Working with tables
Working with spans
Working with blocks
Working with form fields
Working with signatures
Working with attachments
Working with outline

Verification

Run the verification script to validate the package:

./verify.sh

This checks:

Package structure
File existence
Model count
Method signatures
Error types
Source cases
Build status
Test passing

Integration Notes

Placeholder Implementation

The ExtractorBridge actor in Pdftract.swift is a placeholder. For production, replace with:

Option A: C FFI

// Call into compiled Rust library
private let pdftractCore = PdftractCore()

Option B: HTTP Client

// Call pdftract server API
private let client = HttpClient(baseURL: "http://localhost:8080")

Option C: CLI Wrapper

// Execute pdftract binary
let output = Process.execute("pdftract", args: [source])

Cross-Platform Support

Conditional imports ensure Linux compatibility:

#if canImport(FoundationNetworking)
import FoundationNetworking
#endif

Platform Support

macOS: 13.0+ (Ventura and later)
Linux: All distributions with Swift 5.9+

File Locations

All files are in /home/coding/pdftract/swift-sdk/:

/home/coding/pdftract/swift-sdk/
├── Package.swift
├── README.md
├── STRUCTURE.md
├── LICENSE
├── .gitignore
├── verify.sh
├── Sources/Pdftract/...
├── Tests/PdftractTests/...
└── Examples/...

Next Steps

Implement ExtractorBridge: Choose integration approach (FFI/HTTP/CLI)
Add Integration Tests: Test against real PDFs
Performance Testing: Benchmark large PDF handling
Documentation Generation: Run DocC to generate API docs
CI/CD: Add GitHub Actions for automated testing
Binary Distribution: Create .xcframework for non-SPM use

References

JSON Schema: /home/coding/pdftract/docs/schema/v1.0/pdftract.schema.json
Rust Models: /home/coding/pdftract/crates/pdftract-core/src/schema/mod.rs
Plan: /home/coding/pdftract/docs/plan/plan.md
Swift Concurrency: https://docs.swift.org/swift-book/LanguageGuide/Concurrency.html
SPM: https://www.swift.org/package-manager/

License

MIT License - see LICENSE file for details.

Status: Complete package structure designed and implemented. Ready for ExtractorBridge integration and testing against real PDFs.

8.9 KiB Raw Permalink Blame History