Extracted from ardenone-cluster/containers/zai-proxy and ardenone-cluster/containers/zai-proxy-dashboard.
- proxy/: OpenAI-compatible ZAI reverse proxy (Go, v1.10.0)
  - Token counting, rate limiting, Prometheus metrics, canary support
- dashboard/: Metrics dashboard backend + React frontend (Go, v1.0.0)
  - Prometheus collector, SQLite storage, SSE live updates
- docs/: Operational notes, research, and plan subdirs
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
zai-proxy
LLM reverse proxy and metrics dashboard for the Z.AI API.
Components
proxy/
OpenAI-compatible reverse proxy that fronts the Z.AI API. Features:
- Request/response body parsing and token counting (tiktoken + GLM tokenizers)
- Rate limiting with configurable burst and steady-state limits
- Prometheus metrics export
- Blue/green and canary deployment support
- Translation layer for provider-specific request/response formats
See proxy/README.md for setup and configuration.
dashboard/
Go backend + React frontend for visualizing proxy metrics, token usage, and request history.
- Collects metrics from the proxy's Prometheus endpoint
- Stores aggregated data in SQLite
- Serves a Tailwind/Vite-built frontend and pushes live updates to it over Server-Sent Events (SSE)
See dashboard docs for deployment and monitoring.
Docs
- docs/plan/: architecture decisions and roadmaps
- docs/notes/: deployment, operations, monitoring, canary procedures
- docs/research/: tokenizer research, metrics references
Git remotes
- Forgejo (primary): https://git.ardenone.com/jedarden/zai-proxy.git