No description
Find a file
jedarden 0fb5f58f2e Phase 1 (miroir-cdo): Core Routing — Final verification complete
This commit completes Phase 1 of the Core Routing implementation by
updating test assertions to match the Definition of Done requirements.

## Changes
- Updated `test_shard_distribution_64_3_rf1` to assert 18-26 shard range
  (previously 15-27) to match DoD requirement
- Updated `acceptance_uniformity_64_shards_3_nodes_rf1` to assert 18-26
  shard range for consistency

## DoD Verification
All Phase 1 requirements are satisfied:
- ✓ Rendezvous assignment is deterministic (test_rendezvous_determinism)
- ✓ Adding a 4th node moves at most ~2 × (1/4) of shards (test_minimal_reshuffling_on_add)
- ✓ 64 shards / 3 nodes / RF=1 → each node holds 18–26 shards (test_shard_distribution_64_3_rf1)
- ✓ Top-RF placement changes minimally (test_top_rf_stability)
- ✓ write_targets returns exactly RG × RF nodes (test_write_targets_count)
- ✓ query_group distributes evenly (test_query_group_distribution)
- ✓ covering_set returns one node per shard (test_covering_set_one_per_shard)
- ✓ merger passes all tests (comprehensive tests in merger.rs)
- ✓ ≥90% line coverage (router: 96.20%, topology: 100%, scatter: 100%, merger: 94.67%)

## Implementation Summary
Phase 1 implements the deterministic, coordination-free routing primitives:
- router.rs: HRW-based rendezvous hashing with seed 0 (matches Meilisearch Enterprise)
- topology.rs: Node health state machine (healthy/degraded/draining/failed/joining/active/removed)
- scatter.rs: Fan-out orchestration primitives (stubbed for Phase 1)
- merger.rs: Result merge with global sort, offset/limit, facet aggregation

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-09 12:19:46 -04:00
.beads Phase 2 (miroir-9dj): Proxy + API Surface — Complete implementation 2026-05-09 12:08:28 -04:00
charts/miroir Phase 0 (miroir-qon): Verification complete - foundation confirmed 2026-05-09 05:51:59 -04:00
coverage Phase 1 (miroir-cdo): Close bead - Core Routing complete 2026-05-09 11:38:45 -04:00
crates Phase 1 (miroir-cdo): Core Routing — Final verification complete 2026-05-09 12:19:46 -04:00
docs Phase 3 (miroir-r3j): Task Registry + Persistence — Verification complete 2026-05-09 05:40:08 -04:00
notes Phase 1 (miroir-cdo): Core Routing — Bead summary note 2026-05-09 12:17:34 -04:00
.editorconfig Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00
.gitignore Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00
.needle-predispatch-sha Phase 2 (miroir-9dj): Proxy + API Surface — Complete implementation 2026-05-09 12:08:28 -04:00
Cargo.lock Phase 2 (miroir-9dj): Proxy + API Surface — Complete implementation 2026-05-09 12:08:28 -04:00
Cargo.toml Phase 0 (miroir-qon): Rust 1.88 upgrade + test infrastructure 2026-05-09 02:05:44 -04:00
CHANGELOG.md Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00
clippy.toml Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00
lcov.info Phase 1 (miroir-cdo): Core Routing - Final verification 2026-05-09 11:50:04 -04:00
LICENSE Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00
README.md Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00
rust-toolchain.toml Phase 0 (miroir-qon): Rust 1.88 upgrade + test infrastructure 2026-05-09 02:05:44 -04:00
rustfmt.toml Add repo hygiene: LICENSE, CHANGELOG, .gitignore 2026-04-18 20:47:36 -04:00

Miroir

Multi-node Index Replication Orchestrator, Integrated Rebalancing

Miroir is a RAID-like orchestration layer for Meilisearch. It stripes a large index across a fleet of small-RAM Meilisearch nodes with a configurable replication factor, fans out search queries across all shards, and rebalances shard assignments when nodes are added or removed — all using the Meilisearch Community Edition.

The Problem

Meilisearch loads its entire index into memory-mapped LMDB files. A large index that exceeds a single server's available RAM cannot run on that server. The Enterprise Edition's native sharding is gated behind a commercial license. Miroir solves this without it.

How It Works

Client
  │
  ▼
Miroir Orchestrator
  ├── Write path: hash(doc_id) → assign to shard → write to R replicas
  ├── Read path:  scatter query to all shards → gather → merge ranked results
  └── Rebalance: on node add/remove → recompute assignments → migrate minimum shards

Meilisearch Nodes (N instances, each holding a subset of shards)
  node-0   node-1   node-2   ...   node-N

Replication Factor

Analogous to software RAID — configurable per deployment:

RF Redundancy Node failures tolerated Capacity
1 None (stripe only) 0 100% of fleet
2 One replica 1 per shard group 50% of fleet
3 Two replicas 2 per shard group 33% of fleet

Key Components

  • Orchestrator — proxy that handles shard routing, scatter-gather, result merging, and topology management
  • Shard router — consistent hash function (Rendezvous/HRW) mapping document IDs to node assignments; minimal reshuffling on topology change
  • Rebalancer — on node add/remove, recomputes assignments and migrates only the shards that changed owners; surviving replicas serve reads during rebuild
  • Result merger — normalizes and merges ranked result sets from multiple shards into a single coherent response

Status

Design phase. See docs/ for architecture detail.