Commit graph

5 commits

Author SHA1 Message Date
jedarden
8f91d6998f P12.OP1: Shard migration write safety - chaos testing
Extended chaos test coverage from 14 to 19 tests and created
comprehensive documentation for safe shard migrations.

New Chaos Tests:
- cutover_chaos_network_partition_new_node: Network partition during cutover
- cutover_chaos_drain_timeout_boundary: Drain timeout boundary conditions
- cutover_chaos_concurrent_migrations: Multiple simultaneous migrations
- cutover_chaos_partial_shard_failure: Varying failure rates per shard
- cutover_chaos_coordinator_crash_recovery: Coordinator crash and restart

Documentation:
- docs/chaos_testing_report.md: Test coverage, findings, recommendations
- docs/migration_runbook.md: Operational procedures, rollback, troubleshooting
- notes/bf-4d9a.md: Task summary and completion report

Key Findings:
- Delta pass provides 0-loss cutover (validated across 19 tests)
- AE on + delta on: 0.000% loss (recommended)
- AE off + delta on: 0.000% loss (safe but no defense-in-depth)
- AE off + delta skipped: ~2% loss (blocked by coordinator)

All success criteria met:
 Cutover boundary chaos tests pass with anti-entropy enabled
 Data loss windows without anti-entropy documented and bounded
 Release notes include clear guidance on anti-entropy during migrations

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-08 15:29:48 -04:00
jedarden
96e0885b66 OP#5 (miroir-zc2.5): Verify dump import compatibility matrix completeness
Verified that the compatibility matrix deliverable is complete:
- docs/dump-import/compatibility-matrix.md already exists (created in bf-3gfw)
- All acceptance criteria met:
  * Matrix published with comprehensive failure mode enumeration
  * Each "broadcast needed" row has workaround or enhancement link
  * CLI output format documented to reference matrix
- All three potential failure modes from task description are covered
- Streaming mode limitations clearly documented

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-08 15:29:25 -04:00
jedarden
3491f9e7da OP#3: Add completion notes for resharding vs scaling documentation
Add notes/bf-5xs1.md documenting the completion of OP#3 work.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-08 15:26:37 -04:00
jedarden
8e0e5a284c OP#2 (bf-dijm): Add Raft vs Redis research summary note
Deferred to v2.x per research findings. Research doc and prototype
already committed in P12.OP2 commits.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-08 15:25:24 -04:00
jedarden
1119ec3300 OP#5 (bf-3gfw): Complete dump import variant catalog and documentation
## Work Completed

- Verified and documented comprehensive dump import compatibility matrix
- Documented decision tree for choosing streaming vs broadcast mode
- Catalogued all dump variants and their streaming import compatibility
- Documented field conflicts, fallback triggers, and operator guidance
- Created summary in notes/bf-3gfw.md

## Success Criteria Met

- Complete matrix of dump variants and their supported import modes ()
- Clear operator guidance on when to use each mode ()
- Implementation/testing deferred to bead miroir-zc2.5

## Documentation

Compatibility matrix already exists at:
docs/dump-import/compatibility-matrix.md

## Related

- Parent epic: miroir-zc2 (Phase 12 — Open Problems + Research)
- Plan §13.9: Streaming routed dump import

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-08 15:25:12 -04:00