jedarden/ai-code-battle

Author	SHA1	Message	Date
jedarden	d5515e0bca	feat(mechanics): reduce flee thresholds and derive aggression from kill rate ## Flee Threshold Changes - Reduced flee threshold from AttackRadius2+4 to AttackRadius2 (no buffer) - Modified bots: farmer, gatherer, siege - Bots now only consider enemies in actual attack range, not preemptively - Added outnumber logic: only flee when nearbyAllies < nearbyEnemies ## Behavior Vector Changes - Derive aggression from actual kill rate (not self-reported) - Formula: behaviorVec[0] = min(killRate, 1.0) - Preserves existing economy value or defaults to 0.5 - Enhanced logging to show derived aggression value ## Rationale Aggression must be economically necessary, not just rewarded. Previous flee logic created a false safe option that discouraged combat. Now bots only flee when actually outnumbered within combat range. Related: bf-413 genesis bead tracking mechanics iteration	2026-06-17 03:51:15 -04:00
jedarden	d42d1a5336	feat(evolver): update fitness function to weight kill rate alongside win rate - Updated fitness formula: fitness = 0.7 * win_rate + 0.3 * kill_rate (was win_rate only) - Added kill tracking to ArenaResult: TotalKills, TotalMatches, KillRate - Updated evolver system prompt to explicitly mention combat kills are valuable - Enhanced arena logging to show kill rate and total kills This change makes the LLM evolver select for combat aggression, not just win optimization. The system prompt now informs bots that kills and eliminations are part of the fitness evaluation, encouraging more aggressive strategies. Related: bf-59h	2026-06-17 03:11:05 -04:00
jedarden	3f0ece8508	fix(evolver): correct ctx variable declaration (use = instead of := for parameter shadow) The function RunEvolutionLoop takes ctx as a parameter, so line 191 should use = instead of := to avoid shadowing the parameter. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 13:43:24 -04:00
jedarden	04b860a8be	feat(index-builder, evolver): improve evolution system initialization and logging Index builder: - Add slog import for structured logging - Improve fetchEvolutionMeta to return empty meta instead of error when programs table is empty - Add logging to show evolution system status (running vs not initialized) - Add logging in generateEvolutionMeta to show when evolution data is written Evolver: - Add automatic schema initialization and population seeding in RunEvolutionLoop - Programs table is now automatically seeded with 6 initial strategy bots on startup - Log seeding status to indicate whether programs table was already initialized These changes ensure the evolution system properly initializes when deployed and provides better visibility into its status via structured logging. Closes: bf-4zde	2026-05-26 13:28:44 -04:00
jedarden	ea04f4debb	style: apply gofmt alignment fixes across codebase Tab/space alignment consistency from running gofmt on all packages. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 10:40:33 -04:00
jedarden	a4bdeba8fd	Phase 10: Live evolution observatory - evolver live.json feed + observatory page Evolver writes live.json to R2 every cycle. Observatory page polls and renders live feed + lineage tree + meta shift chart. - Added ACB_R2_UPLOAD_ENABLED env var to enable automatic R2 upload during run loop - CycleState tracks real-time evolution cycle status (generation, phase, candidate, validation, evaluation) - Export() now includes cycle info when cycleState is provided - runCycle() integrated with live observatory exports at each phase transition - exportLiveQuiet() for mid-cycle status updates without verbose logging - Fixed function signature mismatches for exportLiveQuiet calls Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-08 14:52:17 -04:00
jedarden	5242d6037c	feat(acb-evolver): add weekly automated map evolution ticker Wire up the acb-map-evolver to run automatically on a weekly schedule (Sunday 03:00 UTC by default) from the evolver deployment. The map evolution ticker: - Waits until the next scheduled time (weekday:hour:minute UTC) - Runs acb-map-evolver --once to evolve maps for all player counts - Repeats every 7 days The schedule can be configured via ACB_MAP_EVOLUTION_SCHEDULE env var (format: WEEKDAY:HH:MM, e.g., "0:03:00" for Sunday 03:00 UTC). Enable via ACB_MAP_EVOLUTION_ENABLED=true or --enable-map-evolution flag. Per plan §14.6: the weekly map evolution loads engagement scores, runs MAP-Elites evolution, promotes high-scoring variants, and updates the active map pool in the database. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-08 09:26:38 -04:00
jedarden	b31c306013	feat(acb-map-evolver): add weekly automated run wiring per plan §14.6 - Implement runWeeklyLoop() function that waits for scheduled time and runs evolution for all player counts (2, 3, 4, 6) weekly - Add --weekly flag to enable weekly mode (default: Sunday 03:00 UTC) - Add --weekly-schedule flag for custom schedule (WEEKDAY:HH:MM format) - Add ACB_WEEKLY_SCHEDULE env var for configuration feat(acb-evolver): add weekly map evolution ticker - Add MapEvolutionEnabled and MapEvolutionSchedule to RunConfig - Add --enable-map-evolution flag to acb-evolver run subcommand - Add startMapEvolutionTicker() goroutine that runs weekly - Ticker executes acb-map-evolver --once to trigger map breeding - Configurable via ACB_MAP_EVOLUTION_ENABLED and ACB_MAP_EVOLUTION_SCHEDULE This integrates map evolution into the bot evolver's deployment, allowing weekly automated map evolution based on engagement scores as specified in plan §14.6. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-08 09:15:19 -04:00
jedarden	38f14e1997	fix: remove unused imports in evolver, misc pre-dispatch changes Remove unused encoding/json and net/http imports from cmd/acb-evolver/run.go that caused build failure. Include other pre-dispatch changes from prior work. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 18:32:46 -04:00
jedarden	98a9f645c4	feat(evolver): update retirement ticker interval to daily (§10.8) Changed RetirementCheckInterval from 1 hour to 24 hours to align with the 7-day low-rating rule specified in §10.8. The retirement automation is already fully implemented: - startRetirementTicker: runs periodic checks (now daily) - EnforcePolicy: retires bots below rating threshold (800) for 7 consecutive days, enforces 50-bot population cap - queryConsecutiveLowRating: uses rating_history table to track consecutive days below threshold - RetireBot: handles K8s manifest deletion via declarative-config - TestEnforcePolicy_CapEnforcement: integration test for cap enforcement Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 18:23:03 -04:00
jedarden	88bd70640a	fix(types): add missing ReplayPlayer import and type annotation for transcript feature - Add ReplayPlayer to type imports in replay-viewer.ts - Add explicit type annotation for entry parameter in replay.ts transcript map - Fixes TypeScript compilation errors for §15.3 screen reader transcript feature	2026-04-22 18:20:56 -04:00
jedarden	6c1f031071	feat(config): add season_id + rules_version to Config per §4.2 - SeasonID and RulesVersion already present in engine/types.go Config struct - Worker already populates from active season row via DB join - Config embedded in VisibleState sent to bots each turn (including turn 0) - All starter kits (go, python, rust, java, csharp) already expose and log fields - Add season_id/rules_version logging to JavaScript starter on turn 0 - TypeScript Config interface already includes season_id and rules_version Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-22 18:09:26 -04:00
jedarden	7f2407ed00	feat: add Prometheus metrics instrumentation across services Add metrics server startup and HTTP middleware to acb-api, generation counter metric to evolver, and R2 cache size metric to index builder. Also remove dead measureR2CacheSize reference from index builder main. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 16:16:03 -04:00
jedarden	7a0de02059	feat(evolver): persist cross-pollination state to Postgres per §10.2 Add crosspoll_state table to persist per-island generation counters across evolver restarts. Load state on startup and save after each cross-pollination check. Add persistence pattern and translation structure tests. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 16:04:15 -04:00
jedarden	80334c6e34	feat(evolver): expand MAP-Elites from 2-D to 4-D grid per §10.2 - Add Exploration and Formation axis definitions with feature extraction from source code pattern matching (exploration/formation indicators) - Extend Grid key from (x,y) to (x,y,z,w) with 3⁴=81-cell behavior grid - Update bin assignment, promotion gate, and persistence (JSON snapshot) - Add Slice() for 2-D dashboard visualization across any axis pair - Migration: old 2-D archives project at z=middle, w=middle - Update cross-pollination to pad 2-element behavior vectors to 4 - Add Prometheus metrics to matchmaker (bot crashes, stale job count) - Add rivalry detection to index builder (data/meta/rivalries.json) - Web: batched bot list loading, leaderboard keyboard accessibility, improved ARIA attributes on match/playlist cards Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:44:39 -04:00
jedarden	d43cf83471	feat(evolver): island cross-pollination every 50 generations per §10.2 Adds cross-pollination logic that copies the top program from each island to a random other island every 50 generations. When source and target islands use different languages, the LLM translates the code. Generation boundaries are tracked per-island to prevent duplicate events. - New crosspoll package with boundary detection, migration, and LLM translation - Added MaxGenerationByIsland DB query for generation counter tracking - Integrated into RunEvolutionLoop with observability logging - Tests for boundary logic, translation prompts, and target selection Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:13:27 -04:00
jedarden	4ba39e3aa8	feat(evolver): complete Phase 7 LLM-driven evolution implementation - Complete autonomous evolution pipeline with island model (4 islands) - MAP-Elites behavior grid integration for diversity - LLM ensemble integration (fast + strong model tiers) - 3-stage validation pipeline (syntax → schema → sandbox smoke test) - Evaluation arena (10-match mini-tournament per candidate) - Promotion gate (Nash equilibrium PSRO + MAP-Elites niche fill) - Retirement policy (auto-retire low-rated bots, population cap) - Live export to R2 for evolution dashboard - Enhanced replay viewer with commentary and win probability - Added series, seasons, and predictions pages All tests passing. Phase 7 exit criteria met. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 16:38:48 -04:00

17 commits
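
Commit 80334c6e34 expands the MAP-Elites grid to four axes with three bins each (3⁴ = 81 cells), projects old 2-D archives at the middle bin of the new axes, and pads 2-element behavior vectors to 4. The sketch below illustrates that binning under stated assumptions: behavior values lie in [0, 1], bins are equal width, and missing axes are padded with 0.5 so they land in the middle bin. The names `cell`, `binOf`, `padTo4`, `cellOf`, and `flatIndex` are hypothetical and not taken from the repository.

```go
package main

import "fmt"

const binsPerAxis = 3 // 3 bins on each of 4 axes gives 81 cells

// cell holds one bin index per axis (x, y, z, w).
type cell [4]int

// binOf maps a value in [0, 1] to a bin index, clamping out-of-range input.
func binOf(v float64) int {
	if v < 0 {
		v = 0
	}
	b := int(v * binsPerAxis)
	if b >= binsPerAxis {
		b = binsPerAxis - 1
	}
	return b
}

// padTo4 extends a shorter behavior vector to four elements.
// Assumption: the pad value is 0.5, which falls in the middle bin.
func padTo4(vec []float64) [4]float64 {
	out := [4]float64{0.5, 0.5, 0.5, 0.5}
	copy(out[:], vec) // copies at most four elements
	return out
}

// cellOf returns the grid cell for a behavior vector of length 2 or 4.
func cellOf(vec []float64) cell {
	var c cell
	for i, v := range padTo4(vec) {
		c[i] = binOf(v)
	}
	return c
}

// flatIndex converts a cell to a single index in [0, 80].
func flatIndex(c cell) int {
	idx := 0
	for _, b := range c {
		idx = idx*binsPerAxis + b
	}
	return idx
}

func main() {
	legacy := []float64{0.9, 0.1} // a 2-D vector from an older archive
	c := cellOf(legacy)
	fmt.Println("cell:", c, "flat index:", flatIndex(c))
}
```

A flat index is convenient for a JSON snapshot keyed by cell, while the 4-tuple form makes it straightforward to fix two axes and list the remaining pair for a 2-D dashboard slice.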