jedarden/ai-code-battle

Author	SHA1	Message	Date
jedarden	df7a3e38c7	feat(worker): implement map engagement scoring per plan §14.6 Update the map engagement scoring formula to match plan §14.6: - score = win_prob_crossings * 3.0 + critical_moments * 2.0 + resource_contest_turns * 1.5 + survival_turns * 0.5 New metrics computed from replay data: - resource_contest_turns: turns where energy is contested by multiple players - survival_turns: turns where all players have at least one bot alive The old formula used map_coverage_pct, closeness, and turn_pct which did not match the specification. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-04 02:28:45 -04:00
jedarden	0f44672634	feat(engine): add TestINV6_ToroidalBounds property-based fuzz test Implements plan §3.9 requirement for INV-6 invariant verification. The test runs thousands of random scenarios across various grid dimensions (30x30 to 200x200) and multiple random seeds to verify that no bot, energy, core, or wall position ever has coordinates outside the valid bounds [0, rows) x [0, cols). Test coverage: - Random wall placement with potentially out-of-bounds input - 1000 random Wrap() calls with positions far outside bounds - Move() operations from edge and corner positions in all directions - Neighbors() and VisibleFrom() return value validation The test uses a manual random-seed loop approach for maximum control and reproducibility, testing 6 grid sizes × 10 seeds for comprehensive coverage of the toroidal wrapping invariant. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-03 23:52:44 -04:00
jedarden	45b05b1188	feat(engine): add tests verifying win_prob in replays and map engagement calculation - Add map_engagement_test.go with tests for: - Win prob dependency in map engagement (lead changes counted) - Critical moments dependency in engagement score - Empty/nil replay handling - Complete ComputeWinProbability + SetWinProbability flow This confirms the existing implementation already correctly: - Computes win probability via Monte Carlo rollout (100 iterations) - Sets win_prob and critical_moments on replay before serialization - Calculates map engagement score from win_prob_crossings and critical_moments - Writes engagement score to maps table via UpdateMapEngagement Task: bf-qps	2026-05-03 23:45:18 -04:00
jedarden	92576dbed4	feat(worker): add map engagement score tracking and verify win_prob in replays - Add engine.CalculateMapEngagement() to compute map engagement scores from replay data (win_prob_crossings, critical_moments, map_coverage_pct, closeness, turn_pct) - Add DBClient.UpdateMapEngagement() to update map engagement using rolling average - Worker now calculates and writes map engagement scores after each match - Add test to verify win_prob array is non-empty in produced replays This implements the win probability Monte Carlo array storage in replay JSON feature. The engine already called ComputeWinProbability() in MatchRunner.Run(), so this commit adds the missing map engagement tracking. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-03 23:21:57 -04:00
jedarden	de4bc9eedd	fix(engine): add JSON marshaling to Direction for string/int interop Bot responses send direction as a string ("N","E","S","W") but the engine Direction type is int with no custom JSON handling. json.Unmarshal was failing silently, leaving Direction=0 (DirNone) for every move — bots never moved and every match ended in stalemate. MarshalJSON serializes as string; UnmarshalJSON accepts both forms. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-02 16:23:55 -04:00
jedarden	3a2d48b3b5	fix(test): use deterministic garbage signature in TestVerifyRequest The old test used "0"+sig[1:] to corrupt the signature. If the real HMAC starts with "0", the corruption is a no-op and the test fails non-deterministically. Replace with a fixed 64-char hex constant that is never a valid HMAC output. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 21:53:15 -04:00
jedarden	e64230b122	fix: resolve universal stalemate — signing format and secret decryption Two root causes prevented bots from making any moves: 1. SignRequest signing string included timestamp ({match_id}.{turn}.{timestamp}.{hash}) but all bots implement verifySignature without timestamp ({match_id}.{turn}.{hash}). Fixed by dropping timestamp from the signing string; X-ACB-Timestamp header is still sent for clock-skew checks but not in the HMAC. 2. The API stores bot secrets AES-GCM encrypted (184 hex chars) in the DB. The worker was passing the ciphertext directly as the HMAC key, while bots use their plaintext k8s secret (64 hex chars). Fixed by decrypting in the worker using ACB_ENCRYPTION_KEY. Also tightens the home page winner filter to exclude winner_id="0" stalemates. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 21:48:25 -04:00
jedarden	17dbef0927	fix(engine): repair partitionBots refactor in phase13 strategies The partitionBots() return type was changed to a struct but two call sites still referenced config.Turn inside bestExploreDir (now a parameter), and RaiderBot had an unused enemySet variable. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 18:48:46 -04:00
jedarden	7978ebbab3	feat(§15.2): generate and stream static meta JSON files to R2 - Add data/meta/rivalries.json to R2 upload list in uploadMetaJSONToR2 - Add attachCommunityHints() to narrative.go to enrich story arcs with highest-upvote community tactical hints (upvotes >= 3, idea/mistake types) - Fix detectRivalryArcs() key separator from "-" to "\|" to avoid UUID hyphen collisions when parsing bot ID pairs - Fix partitionBots() call sites in bot_strategies_phase13.go to use struct field access (.friendly, .enemy) matching updated return type generator.go already contains generateArchetypes, generateCommunityHints, and generateMatchFeedback (all called from generateAllIndexes). main.go uploads all four outputs to R2 on every build cycle. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-22 18:46:27 -04:00
jedarden	60b83a02d9	feat(§15.3): implement screen reader transcript for replay viewer - Add transcript panel with turn-by-turn summaries generated from replay events - Each turn shows: player moves, combat, deaths, captures, energy collection, spawns, win probability - Add 'T' key shortcut to toggle transcript panel - Panel supports three view modes: All Turns, ±10 Turns from Current, Recent 20 Turns - Click on transcript entry to jump to that turn - Current turn is highlighted in transcript with smooth scroll - Panel content is selectable/copyable for screen reader users - Transcript generation logic already existed in replay-viewer.ts; this adds the UI - Transcript button slides in from right side of screen Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 18:42:49 -04:00
jedarden	6c1f031071	feat(config): add season_id + rules_version to Config per §4.2 - SeasonID and RulesVersion already present in engine/types.go Config struct - Worker already populates from active season row via DB join - Config embedded in VisibleState sent to bots each turn (including turn 0) - All starter kits (go, python, rust, java, csharp) already expose and log fields - Add season_id/rules_version logging to JavaScript starter on turn 0 - TypeScript Config interface already includes season_id and rules_version Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-22 18:09:26 -04:00
jedarden	1b55d4dc51	feat(voting): add map voting UI widget to replay viewer (§14.6) - Add MapID field to engine Config struct for inclusion in replay JSON - Add map_id to TypeScript Config interface - Add map voting panel to replay viewer sidebar with: - Map metadata display (dimensions, wall density, energy node count) - Thumbs up/down vote buttons wired to POST /api/vote/map - One vote per visitor enforcement (disables after voting) - Net vote count display with positive/negative coloring - Graceful fallback when map_id unavailable (local replays) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 18:08:55 -04:00
jedarden	89560e5ec4	feat(index): implement match thumbnail PNG generation (§7.2, §14.3) Implementation complete: 1. engine/thumbnail.go - New thumbnail rendering package - GenerateMatchThumbnail() creates 640x360 PNG thumbnails - Renders grid, bots, cores, walls, energy with player colors - SelectThumbnailTurn() chooses most interesting turn - Pure Go stdlib image rendering (no canvas required) 2. cmd/acb-worker - Upload thumbnails to B2 alongside replays - uploadThumbnail() generates PNG and uploads to B2 - Key: thumbnails/{match_id}.png, content-type: image/png - Called after match completion, non-blocking on failure 3. cmd/acb-index-builder/deploy.go - Promote thumbnails to R2 - promoteRecentReplays() copies both replays AND thumbnails from B2 to R2 - Thumbnails promoted to warm cache alongside replay promotion 4. cmd/acb-index-builder/generator.go - Populate thumbnail URLs - buildPlaylistMatch() now includes thumbnail_url field - URL pattern: https://r2.aicodebattle.com/thumbnails/{match_id}.png - Enables playlist cards and embed OG tags to show preview images Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 17:42:36 -04:00
jedarden	477a54c548	feat(matchmaker): implement §6.1 Pareto skill-proximity + LRU pairing algorithm Replace random 2-player pairing with the full §6.1 algorithm: - Seed selection: bot with oldest last-match timestamp (tiebreak: lowest bot ID) - Format selection: seed's least-played player count among {2, 3, 4, 6} - Opponent selection: Pareto 80%/16-rank skill proximity + oldest last-pairing with seed + fewest 24h games for game-count balance - Map selection: least-recently-used active map for the chosen player count, with map_scores.last_used_at updated after each match - Random player slot assignment for all participant counts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-22 17:35:00 -04:00
jedarden	c56cc8bae6	fix(matchmaker): multi-match crash cooldown (3 strikes / 30 min) per §4.5 + §6.1 Add crash_strikes and cooldown_until columns to bots table. Worker increments strikes on crash (cooldown at 3), resets on success. Matchmaker excludes cooldown bots from pairing, series scheduling, and championship brackets. Fix erroneous cooldown filter on series table in finalizeCompletedSeries (column only exists on bots). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:22:12 -04:00
jedarden	677fde5245	fix(engine): use core1 variable in spawn priority tiebreak test The TestSpawnPriority_LowerIDBreaksTie test declared core1 but never referenced it, causing a compile error. Added an assertion that core1.LastSpawnedTurn remains 0 (confirming it didn't spawn). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 14:45:06 -04:00
jedarden	5443e4d0ed	fix(engine): enforce strict HMAC response signature verification per §4.4 Remove the lenient fallback that accepted bot responses missing the X-ACB-Signature header. Missing or invalid signatures now cause the response to be discarded and count toward the crash threshold (§4.5). Add tests for missing-header, bad-signature, and crash-after-10 cases. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 13:18:10 -04:00
jedarden	206189f914	feat(engine): integrate Monte Carlo win probability computation into match runner Collect GameState snapshots during match execution (one per turn), then run 100 random-play rollouts per snapshot post-match to compute per-turn win probabilities and detect critical moments (\|delta\| > 0.15). Results are stored in the replay JSON as win_prob and critical_moments fields. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-21 08:46:39 -04:00
jedarden	4ba39e3aa8	feat(evolver): complete Phase 7 LLM-driven evolution implementation - Complete autonomous evolution pipeline with island model (4 islands) - MAP-Elites behavior grid integration for diversity - LLM ensemble integration (fast + strong model tiers) - 3-stage validation pipeline (syntax → schema → sandbox smoke test) - Evaluation arena (10-match mini-tournament per candidate) - Promotion gate (Nash equilibrium PSRO + MAP-Elites niche fill) - Retirement policy (auto-retire low-rated bots, population cap) - Live export to R2 for evolution dashboard - Enhanced replay viewer with commentary and win probability - Added series, seasons, and predictions pages All tests passing. Phase 7 exit criteria met. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 16:38:48 -04:00
jedarden	df3e59b9a9	feat(acb-local): add strategy bots for offline matches Add Go implementations of 5 strategy bots directly into the engine: - GathererBot: prioritizes energy collection, avoids combat - RusherBot: aggressively rushes enemy cores - GuardianBot: defends cores with cautious expansion - SwarmBot: formation-based coordinated movement - HunterBot: targets isolated enemy units Update acb-local with bot selection flags: - -bot0/-bot1: select bot strategies - -list-bots: list available strategies - Default to gatherer vs rusher for interesting gameplay Enables demo replays with real strategic behavior without K8s infrastructure. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-29 14:27:36 -04:00
jedarden	f5d7553f98	Add Phase 7-9 features: evolution dashboard, WASM sandbox, enhanced replay Phase 7 Evolution: - Add live-export subcommand to acb-evolver for dashboard JSON generation - Export programs, stats, and generation log to live.json Phase 8 Enhanced Features: - Add WASM game engine build (cmd/acb-wasm/) with JS bindings - Add in-browser sandbox page with Monaco editor (web/src/pages/sandbox.ts) - Add win probability computation (web/src/win-probability.ts) - Add replay commentary generator (web/src/commentary.ts) - Add clip maker for GIF/MP4 export (web/src/pages/clip-maker.ts) - Add rivalry detection and pages (web/src/pages/rivalries.ts) - Add replay feedback system (web/src/pages/feedback.ts) - Add evolution dashboard page (web/src/pages/evolution.ts) Phase 9 Platform Depth: - Add predictions API (cmd/acb-api/predictions.go) - Add series management API (cmd/acb-api/series.go) - Add seasons API (cmd/acb-api/seasons.go) - Add narrative generator for rivalries (cmd/acb-indexer/src/narrative.ts) Engine Updates: - Add debug field to move response schema - Add match event timeline extraction - Add replay enrichment fields Web Updates: - Update app.html navigation for new pages - Add API client methods for predictions, series, seasons - Export engine types for browser use Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-29 01:13:23 -04:00
jedarden	0f7d55c5d4	Fix sin/cos math bug and add cellular automata map generation Replace broken Taylor series sin/cos approximations with math.Sin/math.Cos in both engine/match.go and cmd/acb-mapgen. The Taylor series produced incorrect results for angles > π, causing wrong positions in 3+ player maps. Upgrade map generator wall placement from random scatter to cellular automata (B5/S4 rule, 4 iterations) with rotational symmetry enforcement and connectivity validation. Add comprehensive mapgen tests and dominance win condition tests. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-26 01:10:46 -04:00
jedarden	6f1b50384c	Complete Phase 2: HTTP protocol and 6 strategy bots Phase 2 Implementation: - HMAC authentication for engine-to-bot communication - Request signing with timestamp anti-replay - Response signing for integrity verification - HTTP bot client with timeout and crash detection - Per-turn 3s timeout, 10 consecutive failure crash threshold - Move validation (position ownership, direction validity) - Integration tests for HTTP match execution - 6 strategy bots in 6 languages: - RandomBot (Python): Random valid moves - rating floor - GathererBot (Go): Energy-focused with combat avoidance - RusherBot (Rust): Aggressive core rushing via BFS - GuardianBot (PHP): Defensive core protection - SwarmBot (TypeScript): Formation-based group combat - HunterBot (Java): Target isolation and hunting All bots include: - HMAC signature verification - Dockerfile for containerization - README documentation All engine tests passing (32+ tests) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 07:00:38 -04:00
jedarden	890785c5c4	Complete Phase 1: add connectivity validation and determinism tests - Add connectivity.go: BFS-based map connectivity validation with retry - Update mapgen to use connectivity checking by default - Add determinism_test.go: property-based tests for reproducibility - Same seed produces identical replays - Turn execution is deterministic - Grid operations are deterministic - Combat resolution is deterministic - Full 500-turn match validation - All 32 tests pass - Update PROGRESS.md: Phase 1 complete Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 03:44:44 -04:00
jedarden	6d3f3506b3	Implement Phase 1 core engine: grid, combat, fog of war, turn execution - Add engine package with toroidal grid, game state, turn execution - Implement focus-fire combat resolution with simultaneous deaths - Add fog of war visibility filtering for bot state - Implement energy collection (contested resources denied) - Add bot spawning at active cores - Implement win conditions: elimination, draw, dominance, turns - Add replay JSON writer for match recording - Add match runner with concurrent bot communication - Add CLI tools: acb-local (match runner), acb-mapgen (map generator) - Add comprehensive unit tests (26 tests passing) Exit criteria met: can run complete 500-turn matches and produce valid replays Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 01:48:27 -04:00

25 commits