- Create tests/fixtures/classifier/ with 200 synthetic PDFs:
- 50 invoices with bill-to/ship-to, item tables, totals
- 50 scientific papers with abstracts, sections, references
- 50 contracts with clauses, legal terminology, signatures
- 50 misc documents (8 receipts, 8 forms, 7 bank statements,
7 slide decks, 7 legal filings, 6 book excerpts, 7 magazines)
- Add MANIFEST.tsv mapping each document to its expected type
with source URL and license (all MIT-0 synthetic data)
- Add scripts/generate_test_corpus.py to regenerate the corpus
using reportlab for PDF generation
- Add tests/test_classifier_corpus.rs with validation harness:
- test_corpus_manifest_validity: verifies manifest structure
and file existence (PASSES)
- test_classifier_corpus_accuracy: will validate precision/
recall/F1 when classifier is implemented (SKIP for now)
- test_classifier_reproducibility: will verify deterministic
classification (SKIP for now)
- Add tests/fixtures/classifier/README.md documenting corpus
structure, generation process, and acceptance criteria
Total corpus size: ~0.4 MB (each PDF < 5 KB)
Acceptance criteria (from plan.md Phase 5.6):
- Per-class precision and recall >= 0.85
- Macro-F1 >= 0.88
- Reproducibility: identical output for same document
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
19 KiB
19 KiB
| 1 | path | expected_document_type | source_url | license |
|---|---|---|---|---|
| 2 | invoice/01.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 3 | invoice/02.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 4 | invoice/03.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 5 | invoice/04.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 6 | invoice/05.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 7 | invoice/06.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 8 | invoice/07.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 9 | invoice/08.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 10 | invoice/09.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 11 | invoice/10.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 12 | invoice/11.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 13 | invoice/12.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 14 | invoice/13.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 15 | invoice/14.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 16 | invoice/15.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 17 | invoice/16.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 18 | invoice/17.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 19 | invoice/18.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 20 | invoice/19.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 21 | invoice/20.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 22 | invoice/21.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 23 | invoice/22.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 24 | invoice/23.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 25 | invoice/24.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 26 | invoice/25.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 27 | invoice/26.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 28 | invoice/27.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 29 | invoice/28.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 30 | invoice/29.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 31 | invoice/30.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 32 | invoice/31.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 33 | invoice/32.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 34 | invoice/33.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 35 | invoice/34.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 36 | invoice/35.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 37 | invoice/36.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 38 | invoice/37.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 39 | invoice/38.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 40 | invoice/39.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 41 | invoice/40.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 42 | invoice/41.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 43 | invoice/42.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 44 | invoice/43.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 45 | invoice/44.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 46 | invoice/45.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 47 | invoice/46.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 48 | invoice/47.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 49 | invoice/48.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 50 | invoice/49.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 51 | invoice/50.pdf | invoice | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 52 | scientific_paper/01.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 53 | scientific_paper/02.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 54 | scientific_paper/03.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 55 | scientific_paper/04.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 56 | scientific_paper/05.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 57 | scientific_paper/06.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 58 | scientific_paper/07.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 59 | scientific_paper/08.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 60 | scientific_paper/09.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 61 | scientific_paper/10.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 62 | scientific_paper/11.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 63 | scientific_paper/12.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 64 | scientific_paper/13.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 65 | scientific_paper/14.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 66 | scientific_paper/15.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 67 | scientific_paper/16.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 68 | scientific_paper/17.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 69 | scientific_paper/18.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 70 | scientific_paper/19.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 71 | scientific_paper/20.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 72 | scientific_paper/21.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 73 | scientific_paper/22.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 74 | scientific_paper/23.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 75 | scientific_paper/24.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 76 | scientific_paper/25.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 77 | scientific_paper/26.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 78 | scientific_paper/27.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 79 | scientific_paper/28.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 80 | scientific_paper/29.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 81 | scientific_paper/30.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 82 | scientific_paper/31.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 83 | scientific_paper/32.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 84 | scientific_paper/33.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 85 | scientific_paper/34.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 86 | scientific_paper/35.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 87 | scientific_paper/36.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 88 | scientific_paper/37.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 89 | scientific_paper/38.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 90 | scientific_paper/39.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 91 | scientific_paper/40.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 92 | scientific_paper/41.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 93 | scientific_paper/42.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 94 | scientific_paper/43.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 95 | scientific_paper/44.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 96 | scientific_paper/45.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 97 | scientific_paper/46.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 98 | scientific_paper/47.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 99 | scientific_paper/48.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 100 | scientific_paper/49.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 101 | scientific_paper/50.pdf | scientific_paper | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 102 | contract/01.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 103 | contract/02.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 104 | contract/03.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 105 | contract/04.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 106 | contract/05.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 107 | contract/06.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 108 | contract/07.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 109 | contract/08.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 110 | contract/09.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 111 | contract/10.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 112 | contract/11.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 113 | contract/12.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 114 | contract/13.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 115 | contract/14.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 116 | contract/15.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 117 | contract/16.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 118 | contract/17.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 119 | contract/18.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 120 | contract/19.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 121 | contract/20.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 122 | contract/21.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 123 | contract/22.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 124 | contract/23.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 125 | contract/24.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 126 | contract/25.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 127 | contract/26.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 128 | contract/27.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 129 | contract/28.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 130 | contract/29.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 131 | contract/30.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 132 | contract/31.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 133 | contract/32.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 134 | contract/33.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 135 | contract/34.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 136 | contract/35.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 137 | contract/36.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 138 | contract/37.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 139 | contract/38.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 140 | contract/39.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 141 | contract/40.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 142 | contract/41.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 143 | contract/42.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 144 | contract/43.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 145 | contract/44.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 146 | contract/45.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 147 | contract/46.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 148 | contract/47.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 149 | contract/48.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 150 | contract/49.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 151 | contract/50.pdf | contract | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 152 | misc/01.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 153 | misc/02.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 154 | misc/03.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 155 | misc/04.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 156 | misc/05.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 157 | misc/06.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 158 | misc/07.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 159 | misc/08.pdf | receipt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 160 | misc/09.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 161 | misc/10.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 162 | misc/11.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 163 | misc/12.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 164 | misc/13.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 165 | misc/14.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 166 | misc/15.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 167 | misc/16.pdf | form | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 168 | misc/17.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 169 | misc/18.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 170 | misc/19.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 171 | misc/20.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 172 | misc/21.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 173 | misc/22.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 174 | misc/23.pdf | bank_statement | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 175 | misc/24.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 176 | misc/25.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 177 | misc/26.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 178 | misc/27.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 179 | misc/28.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 180 | misc/29.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 181 | misc/30.pdf | slide_deck | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 182 | misc/31.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 183 | misc/32.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 184 | misc/33.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 185 | misc/34.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 186 | misc/35.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 187 | misc/36.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 188 | misc/37.pdf | legal_filing | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 189 | misc/38.pdf | book_excerpt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 190 | misc/39.pdf | book_excerpt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 191 | misc/40.pdf | book_excerpt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 192 | misc/41.pdf | book_excerpt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 193 | misc/42.pdf | book_excerpt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 194 | misc/43.pdf | book_excerpt | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 195 | misc/44.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 196 | misc/45.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 197 | misc/46.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 198 | misc/47.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 199 | misc/48.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 200 | misc/49.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |
| 201 | misc/50.pdf | magazine | Synthetic test data generated by scripts/generate_test_corpus.py | MIT-0 |