jedarden/ai-code-battle

Author	SHA1	Message	Date
jedarden	de15046e3e	feat(evolver): add C# language support - Add csharp case to buildCandidate() with dotnet-script and mcs fallback - Add defender_strategy.cs.txt seed file (combined Program.cs, Strategy.cs, Grid.cs) - Add defender seed to population (gamma island, aggression=0.3, economy=0.4) - Add csharp to langDisplayName() and update comments Co-Authored-By: Claude <noreply@anthropic.com>	2026-06-17 05:08:19 -04:00
jedarden	d42d1a5336	feat(evolver): update fitness function to weight kill rate alongside win rate - Updated fitness formula: fitness = 0.7win_rate + 0.3kill_rate (was win_rate only) - Added kill tracking to ArenaResult: TotalKills, TotalMatches, KillRate - Updated evolver system prompt to explicitly mention combat kills are valuable - Enhanced arena logging to show kill rate and total kills This change makes the LLM evolver select for combat aggression, not just win optimization. The system prompt now informs bots that kills and eliminations are part of the fitness evaluation, encouraging more aggressive strategies. Related: bf-59h	2026-06-17 03:11:05 -04:00
jedarden	1478a9365c	fix(evolver): use ConfigForPlayers for 2-player matches per plan §3.4 The evolver arena was using DefaultConfig() which has attack_radius2=12 for all matches. Per plan §3.4, 2-player matches should have attack_radius2=36 (6 tiles) to achieve 65-80% combat density. This bug caused evolved bots to learn energy-farming strategies since enemies were rarely in attack range on 40x40 maps with only 3.5 tile radius. With the correct 6-tile radius, bots will experience actual combat during evolution and should develop fighting behaviors. Closes: bf-3lt3 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 22:35:15 -04:00
jedarden	ea04f4debb	style: apply gofmt alignment fixes across codebase Tab/space alignment consistency from running gofmt on all packages. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 10:40:33 -04:00
jedarden	80334c6e34	feat(evolver): expand MAP-Elites from 2-D to 4-D grid per §10.2 - Add Exploration and Formation axis definitions with feature extraction from source code pattern matching (exploration/formation indicators) - Extend Grid key from (x,y) to (x,y,z,w) with 3⁴=81-cell behavior grid - Update bin assignment, promotion gate, and persistence (JSON snapshot) - Add Slice() for 2-D dashboard visualization across any axis pair - Migration: old 2-D archives project at z=middle, w=middle - Update cross-pollination to pad 2-element behavior vectors to 4 - Add Prometheus metrics to matchmaker (bot crashes, stale job count) - Add rivalry detection to index builder (data/meta/rivalries.json) - Web: batched bot list loading, leaderboard keyboard accessibility, improved ARIA attributes on match/playlist cards Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:44:39 -04:00
jedarden	76e8791e4d	Add evaluation arena, promotion gate, and retirement policy (Phase 7) - arena/arena.go: 10-match mini-tournament running candidate as a local subprocess against diverse live opponents sampled across the rating distribution; AES-GCM secret decryption for opponent auth - arena/psro.go: Nash equilibrium computation for the 1×K meta-game; FictitiousPlayNash included for future K×K support - arena/winrate.go: Wilson-score 95% CI for win-rate calculation; draws counted as 0.5 wins - arena/gate.go: two-part promotion gate — Nash value ≥ threshold AND MAP-Elites niche fill or improvement; detailed reason strings - promoter/promoter.go: full promotion pipeline — bot source + Dockerfile + K8s Secret/Deployment/Service manifests, docker build, git commit/push (ArgoCD sync), kubectl readiness poll, bots-table INSERT, programs-table update; RetireBot and EnforcePolicy (rating threshold + population cap 50) - db/db.go: add bot_name / bot_secret migration columns - db/programs.go: ListPromoted, SetBotNameAndSecret, UnsetPromoted, GetByBotID, PromotedCount helpers for promotion/retirement lifecycle - main.go: evaluate and retire subcommands wiring arena + gate + promoter; remove unused island flag from evaluate - arena/arena_test.go: 21 unit tests covering Nash, Wilson CI, Gate logic, and selectDiverse opponent sampling - promoter/promoter_test.go: tests for Dockerfiles, bot-ID/secret generation, AES-GCM helpers, and K8s manifest templates Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:32:37 -04:00

6 commits