No description

Find a file

jedarden ba70cd25c0 P3: Complete Phase 3 — Task Registry + Persistence (SQLite + Redis) Implements all 14 tables from plan §4 with dual backend support. ## Implementation ### TaskStore Trait (502 lines) - Complete API covering all 14 tables - Runtime backend selection (sqlite \| redis) ### SQLite Backend (2,536 lines) - rusqlite-based with WAL mode - Idempotent migrations (schema_versions table) - 36 tests passing (proptest + integration) ### Redis Backend (3,884 lines) - Full TaskStore trait implementation - Uses `_index` sets for O(1) list queries (no SCAN) - 33 integration tests (testcontainers) ### Schema Files - 001_initial.sql: Tables 1-7 - 002_feature_tables.sql: Tables 8-14 - 003_task_registry_fields.sql: No-op marker ### Validation - Helm values.schema.json enforces HA constraints: - replicas > 1 requires backend: redis - HPA requires replicas >= 2 + redis - Verified with helm lint ### Documentation - REDIS_MEMORY_ACCOUNTING.md: Complete sizing guide ## Definition of Done — Complete ✅ rusqlite store with idempotent table initialization ✅ Redis store mirrors TaskStore API ✅ Migrations/versioning with schema_version row ✅ Property tests (proptest) for SQLite ✅ Restart resilience integration tests ✅ Redis integration tests (testcontainers) ✅ `_index` pattern for list queries ✅ Helm schema enforces HA requirements ✅ Redis memory accounting (plan §14.7) Total: 6,922 lines of production code + tests Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>		2026-05-02 17:14:29 -04:00
.beads	P3: Verify Phase 3 Task Registry + Persistence completion	2026-05-02 17:05:46 -04:00
.cargo	Multi-stage Dockerfile with musl cross-compilation and .dockerignore	2026-04-19 13:47:45 -04:00
.github	P8.6: Release mechanics — bump script, release-ready check, PR template, Argo CIs	2026-04-19 09:54:26 -04:00
benches	P12.OP4: Implement dfs_query_then_fetch for cross-shard comparability	2026-04-19 03:43:10 -04:00
charts/miroir	P3: Complete Phase 3 — Task Registry + Persistence (SQLite + Redis)	2026-05-02 17:14:29 -04:00
crates	P3: Phase 3 Task Registry + Persistence — COMPLETE	2026-05-02 16:50:42 -04:00
dashboards	P7.3: Add §13.1 resharding row to Grafana dashboard, fix y-coordinate overlaps	2026-04-19 13:18:13 -04:00
docs	P3: Complete Phase 3 — Task Registry + Persistence (SQLite + Redis)	2026-05-02 16:52:25 -04:00
k8s	P12: close Phase 12 epic — all 6 open problems triaged and documented	2026-04-24 19:14:23 -04:00
notes	P3: Complete Phase 3 — Task Registry + Persistence (SQLite + Redis)	2026-05-02 17:14:29 -04:00
scripts	P8.6: Release mechanics — bump script, release-ready check, PR template, Argo CIs	2026-04-19 09:54:26 -04:00
tests/benches/score-comparability	P2.2: Implement write path with primary key validation, shard injection, and two-rule quorum	2026-04-19 06:48:30 -04:00
.dockerignore	Multi-stage Dockerfile with musl cross-compilation and .dockerignore	2026-04-19 13:47:45 -04:00
.editorconfig	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
.gitignore	P8: Add optional OpenTelemetry tracing deps, fix subscriber init, clean up .gitignore	2026-04-19 13:24:24 -04:00
.needle-predispatch-sha	P3: Verify Phase 3 Task Registry + Persistence completion	2026-05-02 17:05:46 -04:00
1	P7.5.a: Request ID middleware + X-Request-Id response header	2026-04-21 08:01:30 -04:00
Cargo.lock	P3: Complete Phase 3 — Task Registry + Persistence (SQLite + Redis)	2026-05-02 16:52:25 -04:00
Cargo.toml	P12.OP4: Implement dfs_query_then_fetch for cross-shard comparability	2026-04-19 03:43:10 -04:00
CHANGELOG.md	P8: Finalize CI/CD templates, prod ArgoCD app, and CHANGELOG for v0.1.0	2026-04-19 15:09:14 -04:00
clippy.toml	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
Dockerfile	Multi-stage Dockerfile with musl cross-compilation and .dockerignore	2026-04-19 13:47:45 -04:00
LICENSE	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
miroir.yaml	P3.3.d: Fix compilation - add missing local_search_ui_rate_limiter field	2026-04-26 19:30:10 -04:00
README.md	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
rust-toolchain.toml	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
rustfmt.toml	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00

README.md

Miroir

Multi-node Index Replication Orchestrator, Integrated Rebalancing

Miroir is a RAID-like orchestration layer for Meilisearch. It stripes a large index across a fleet of small-RAM Meilisearch nodes with a configurable replication factor, fans out search queries across all shards, and rebalances shard assignments when nodes are added or removed — all using the Meilisearch Community Edition.

The Problem

Meilisearch loads its entire index into memory-mapped LMDB files. A large index that exceeds a single server's available RAM cannot run on that server. The Enterprise Edition's native sharding is gated behind a commercial license. Miroir solves this without it.

How It Works

Client
  │
  ▼
Miroir Orchestrator
  ├── Write path: hash(doc_id) → assign to shard → write to R replicas
  ├── Read path:  scatter query to all shards → gather → merge ranked results
  └── Rebalance: on node add/remove → recompute assignments → migrate minimum shards

Meilisearch Nodes (N instances, each holding a subset of shards)
  node-0   node-1   node-2   ...   node-N

Replication Factor

Analogous to software RAID — configurable per deployment:

RF	Redundancy	Node failures tolerated	Capacity
1	None (stripe only)	0	100% of fleet
2	One replica	1 per shard group	50% of fleet
3	Two replicas	2 per shard group	33% of fleet

Key Components

Orchestrator — proxy that handles shard routing, scatter-gather, result merging, and topology management
Shard router — consistent hash function (Rendezvous/HRW) mapping document IDs to node assignments; minimal reshuffling on topology change
Rebalancer — on node add/remove, recomputes assignments and migrates only the shards that changed owners; surviving replicas serve reads during rebuild
Result merger — normalizes and merges ranked result sets from multiple shards into a single coherent response

Status

Design phase. See docs/ for architecture detail.