No description

Find a file

jedarden cfc0001ada P5.5 §13.5: Complete two-phase settings broadcast + drift reconciler Implements the propose/verify/commit flow for settings changes with drift detection and repair. Replaces sequential settings apply with a safer two-phase broadcast that prevents partial settings apply. Key components: - SettingsBroadcast coordinator (miroir-core/src/settings.rs): * Phase 1 (Propose): PATCH all nodes in parallel, collect task UIDs * Phase 2 (Verify): GET settings, verify SHA256 fingerprints * Phase 3 (Commit): Increment settings_version, persist to task store * Retry loop with exponential backoff for hash mismatches * Per-(index, node) version tracking for client-pinned freshness - DriftReconciler background worker (rebalancer_worker/drift_reconciler.rs): * Mode B leader election for singleton execution * Periodic settings hash comparison across all nodes * Auto-repair drifted nodes with consensus settings * Catches out-of-band changes (operator SSH'd to a node) - Config (config/advanced.rs): * settings_broadcast.strategy: two_phase or sequential (legacy) * settings_broadcast.verify_timeout_s: 60s default * settings_broadcast.max_repair_retries: 3 default * settings_drift_check.interval_s: 300s (5 min) default * settings_drift_check.auto_repair: true default - Integration (main.rs, admin_endpoints.rs, indexes.rs): * Drift reconciler started as background task * Two-phase broadcast in PATCH /indexes/{uid}/settings * X-Miroir-Settings-Version response header * Legacy sequential mode for rollback compatibility - Router (router.rs): * covering_set_with_version_floor() filters stale nodes * 503 when no floor-satisfying covering set exists Acceptance criteria: - ✅ Normal flow: add synonym; propose+verify succeed; version increments once - ✅ Mid-broadcast node failure: verify fails, reissue succeeds after backoff - ✅ Out-of-band drift: direct PATCH detected and repaired within interval_s - ✅ X-Miroir-Min-Settings-Version floor excludes stale nodes; 503 when no floor-satisfying set - ✅ Legacy sequential strategy still works Tests: 15 total (7 acceptance + 8 integration), all passing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>		2026-05-23 00:26:05 -04:00
.beads	P5.5 §13.5: Update bead traces for miroir-uhj.5 completion	2026-05-22 23:40:12 -04:00
.cargo	Multi-stage Dockerfile with musl cross-compilation and .dockerignore	2026-04-19 13:47:45 -04:00
.github	P8.6: Release mechanics — bump script, release-ready check, PR template, Argo CIs	2026-04-19 09:54:26 -04:00
benches	P12.OP4: Implement dfs_query_then_fetch for cross-shard comparability	2026-04-19 03:43:10 -04:00
charts/miroir	P6.10 Wire §14.8 resource-aware config defaults into Rust + values.yaml	2026-05-20 07:35:03 -04:00
crates	P5.5 §13.5: Complete two-phase settings broadcast + drift reconciler	2026-05-23 00:26:05 -04:00
dashboards	P7.3: Add §13.1 resharding row to Grafana dashboard, fix y-coordinate overlaps	2026-04-19 13:18:13 -04:00
docs	P6.11: Add single-pod oversized mode support (§14.10 vertical scaling escape valve)	2026-05-20 07:29:39 -04:00
examples	P11.7: Add quick-start example artifacts (Docker Compose + config)	2026-05-20 06:49:05 -04:00
k8s	P5.5 §13.5: Fix drift_reconciler compilation and complete two-phase settings broadcast	2026-05-22 18:10:10 -04:00
notes	P5.5 §13.5: Complete two-phase settings broadcast + drift reconciler	2026-05-22 22:03:01 -04:00
scripts	P8.6: Release mechanics — bump script, release-ready check, PR template, Argo CIs	2026-04-19 09:54:26 -04:00
tests	miroir-zc2.5: Fix dump import compatibility matrix enhancement bead refs	2026-05-20 07:18:56 -04:00
.dockerignore	Multi-stage Dockerfile with musl cross-compilation and .dockerignore	2026-04-19 13:47:45 -04:00
.editorconfig	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
.gitignore	P8: Add optional OpenTelemetry tracing deps, fix subscriber init, clean up .gitignore	2026-04-19 13:24:24 -04:00
.needle-predispatch-sha	P5.5 §13.5: Update bead traces for miroir-uhj.5 completion	2026-05-22 23:40:12 -04:00
1	P7.5.a: Request ID middleware + X-Request-Id response header	2026-04-21 08:01:30 -04:00
Cargo.lock	P5.5 §13.5: Complete two-phase settings broadcast + drift reconciler	2026-05-23 00:26:05 -04:00
Cargo.toml	P12.OP4: Implement dfs_query_then_fetch for cross-shard comparability	2026-04-19 03:43:10 -04:00
CHANGELOG.md	P11.9 v1.0 versioning-commitments policy doc (§12)	2026-05-20 06:41:27 -04:00
clippy.toml	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
Dockerfile	Multi-stage Dockerfile with musl cross-compilation and .dockerignore	2026-04-19 13:47:45 -04:00
LICENSE	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
miroir.yaml	P3.3.d: Fix compilation - add missing local_search_ui_rate_limiter field	2026-04-26 19:30:10 -04:00
proptest.toml	P1.6: Add proptest.toml for 1024 test cases	2026-05-20 08:07:00 -04:00
README.md	P11.7: Add quick-start example artifacts (Docker Compose + config)	2026-05-20 06:50:43 -04:00
rust-toolchain.toml	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00
rustfmt.toml	Add repo hygiene: LICENSE, CHANGELOG, .gitignore	2026-04-18 20:47:36 -04:00

README.md

Miroir

Multi-node Index Replication Orchestrator, Integrated Rebalancing

Miroir is a RAID-like orchestration layer for Meilisearch. It stripes a large index across a fleet of small-RAM Meilisearch nodes with a configurable replication factor, fans out search queries across all shards, and rebalances shard assignments when nodes are added or removed — all using the Meilisearch Community Edition.

The Problem

Meilisearch loads its entire index into memory-mapped LMDB files. A large index that exceeds a single server's available RAM cannot run on that server. The Enterprise Edition's native sharding is gated behind a commercial license. Miroir solves this without it.

How It Works

Client
  │
  ▼
Miroir Orchestrator
  ├── Write path: hash(doc_id) → assign to shard → write to R replicas
  ├── Read path:  scatter query to all shards → gather → merge ranked results
  └── Rebalance: on node add/remove → recompute assignments → migrate minimum shards

Meilisearch Nodes (N instances, each holding a subset of shards)
  node-0   node-1   node-2   ...   node-N

Replication Factor

Analogous to software RAID — configurable per deployment:

RF	Redundancy	Node failures tolerated	Capacity
1	None (stripe only)	0	100% of fleet
2	One replica	1 per shard group	50% of fleet
3	Two replicas	2 per shard group	33% of fleet

Key Components

Orchestrator — proxy that handles shard routing, scatter-gather, result merging, and topology management
Shard router — consistent hash function (Rendezvous/HRW) mapping document IDs to node assignments; minimal reshuffling on topology change
Rebalancer — on node add/remove, recomputes assignments and migrates only the shards that changed owners; surviving replicas serve reads during rebuild
Result merger — normalizes and merges ranked result sets from multiple shards into a single coherent response

Stability

Miroir is currently in development (v0.x). Starting with v1.0, the project provides backward-compatibility commitments for the Meilisearch API layer, miroir-ctl CLI, config file schema, and Helm chart values.

See docs/versioning-policy.md for the full versioning policy, including what constitutes a breaking change and the deprecation process.

Status

Design phase. See docs/ for architecture detail.

Quick Start

Get Miroir running locally in 5 minutes with Docker Compose:

# Clone the repository
git clone https://github.com/jedarden/miroir.git
cd miroir

# Start the development stack (3 Meilisearch nodes + 1 Miroir orchestrator)
docker compose -f examples/docker-compose-dev.yml up -d

# Verify health
curl http://localhost:7700/health
# Expected: {"status":"available"}

# Index documents (Meilisearch-compatible API)
curl -X POST http://localhost:7700/indexes/movies/documents \
  -H "Authorization: Bearer dev-key" \
  -H "Content-Type: application/json" \
  -d '[{"id": 1, "title": "Inception"}, {"id": 2, "title": "Interstellar"}]'

# Search
curl -X POST http://localhost:7700/indexes/movies/search \
  -H "Authorization: Bearer dev-key" \
  -H "Content-Type: application/json" \
  -d '{"q": "inception"}'

# Teardown (removes containers and volumes)
docker compose -f examples/docker-compose-dev.yml down -v

See examples/README.md for more details on the development stack, configuration options, and troubleshooting.

Production deployment

For production deployments, see the Deployment Sizing Guide to determine orchestrator pod count and task store configuration based on your corpus size and query throughput.

When to use

Multi-pod with Redis — Recommended for production. Horizontal scaling with 2+ orchestrator pods delivers fault tolerance (zero-downtime rollouts, pod-loss survival) and scales query throughput via HPA. See Deployment Sizing Guide.
Single oversized pod — Supported for dev clusters, very small deployments, or constrained environments. A single pod at 4 vCPU / 8 GB is validated but loses HA benefits (no zero-downtime rollouts, no pod-loss survival). See Single-Pod Mode.
Large index sharding — When a single Meilisearch node cannot fit your corpus in RAM, Miroir stripes it across multiple nodes with configurable replication factor.

Additional production resources:

Production Deployment Guide — Operational considerations, monitoring, and troubleshooting
Per-Feature Scaling Behavior — Which features need Redis, work queues, or nothing
Versioning Policy — Backward compatibility commitments and upgrade guidance