jedarden/claude-print

Author	SHA1	Message	Date
jedarden	e97a8413b5	feat(bf-2f5): watchdog timeout implementation complete. Implement a comprehensive watchdog timeout mechanism to prevent indefinite hangs when the child process wedges. All four timeout types are now enforced. Timeout types: PTY first-output timeout (default 90s), detects if the child produces no PTY output; stream-json first-output timeout (default 90s), detects if the child emits no stream-json events; overall timeout (default 3600s), maximum session duration; Stop hook timeout (default 120s), detects if the Stop hook doesn't fire after prompt injection. Timeout behavior: sends SIGTERM to the child process; signals the event loop via self-pipe to wake it up; calls kill_child(), which waits 2s then sends SIGKILL if needed; writes a clear diagnostic to stderr; tears down temp resources via CleanupGuard; exits with code 124 (GNU timeout convention). CLI arguments: --timeout <seconds> (overall timeout); --first-output-timeout <seconds> (PTY first-output timeout); --stream-json-timeout <seconds> (stream-json first-output timeout); --stop-hook-timeout <seconds> (Stop hook watchdog timeout). Testing: all 90 unit tests pass (6 watchdog-specific tests); integration tests verify end-to-end timeout behavior. This ensures the marathon loop/NEEDLE can retry cleanly instead of hanging indefinitely.	2026-06-25 13:39:29 -04:00
jedarden	a19e2b0aed	chore(bf-2w7): verify cleanup implementation is complete and remove unused imports. Confirm comprehensive cleanup on all exit paths: startup orphan sweep via cleanup_orphans(); RAII cleanup guard (CleanupGuard); process::exit cleanup via exit_with_cleanup(); signal safety via self-pipe pattern; watchdog timeout cleanup via self-pipe signaling; panic safety via catch_unwind. Remove unused imports from watchdog.rs and session.rs. All cleanup paths verified: ✓ Normal exit → CleanupGuard drop; ✓ Error return → CleanupGuard drop; ✓ Timeout → Self-pipe → Event loop exit → CleanupGuard drop; ✓ Signal → Handler writes to self-pipe → Event loop exit → CleanupGuard drop; ✓ Panic → catch_unwind → CleanupGuard drop.	2026-06-25 09:32:01 -04:00
jedarden	7d40c937fb	feat(bf-2f5): add comprehensive watchdog timeout mechanism. Implement a complete watchdog timeout system that ensures hung child processes are terminated cleanly with proper diagnostics and cleanup. Features: PTY first-output timeout (default 90s), detects if the child produces no PTY output; stream-json first-output timeout (default 90s), detects if the child produces no stream-json events; overall session timeout (default 3600s), prevents indefinite hangs; Stop hook watchdog timeout (default 120s), detects if the Stop hook doesn't fire after prompt injection. Timeout handling: sends SIGTERM to the child process when a timeout fires; kill_child() ensures the SIGTERM → SIGKILL sequence (2s grace period); writes a clear diagnostic to stderr indicating the timeout type; emits a stream-json error event for downstream consumers; CleanupGuard ensures temp dir/FIFO cleanup on all exit paths; returns Error::Timeout and exits non-zero (code 3) for the retry loop. Fixes: pass temp_dir_path to Watchdog so stream-json monitoring works correctly; remove unused constants (duplicates of watchdog module defaults); improve mock-claude binary path resolution for workspace builds. This prevents the indefinite hang that occurs when Claude Code wedges during session initialization or tool use, ensuring marathon loops and NEEDLE can retry cleanly instead of blocking forever. Bead-Id: bf-2f5	2026-06-25 07:42:17 -04:00
jedarden	54834e5070	feat(bf-2f5): add comprehensive watchdog timeout mechanism. Add a Watchdog module with 4 timeout types: PTY first-output timeout (90s default); stream-json first-output timeout (90s default); overall session timeout (3600s default); Stop hook watchdog timeout (120s default). The timeout thread monitors the child and sends SIGTERM on deadline. The main thread detects the timeout, kills the child (SIGTERM→SIGKILL), and exits non-zero (code 3). Clear diagnostics go to stderr with specific timeout descriptions. CleanupGuard ensures temp dir/FIFO removal on all exit paths. Add CLI flags: --timeout, --first-output-timeout, --stream-json-timeout, --stop-hook-timeout. Integration tests verify that the timeout fires and cleanup succeeds. This prevents indefinite hangs regardless of why the child wedges. Bead-Id: bf-2f5	2026-06-25 06:59:23 -04:00
jedarden	6d3841e67f	fix(bf-2w7): fix Session::run call sites after signature change. Update all call sites to include the new first_output_timeout_secs parameter: src/main.rs passes None for the default first-output timeout; tests/watchdog.rs passes None in both watchdog tests. The prior commit added the 5th parameter but missed updating the callers, causing compilation errors. Co-Authored-By: Claude <noreply@anthropic.com> Bead-Id: bf-2w7	2026-06-25 00:59:49 -04:00
jedarden	7176ef2939	Add bf-5nr validation notes: claude-print-ci WorkflowTemplate YAML is valid. The YAML parses cleanly and kubectl dry-run returns no errors. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-10 02:11:37 -04:00
jedarden	50b213285a	Add Phase 9: NEEDLE integration (install.sh, claude-print.yaml, --check subcommand). claude-print.yaml: NEEDLE agent config with stdin input_method, needle-transform-claude output_transform, and invoke_template for subscription-billed claude-print runs; install.sh: download release binary from GitHub, back up existing, install mock_claude, install NEEDLE config if present, run --check to verify, print --version; src/check.rs: --check doctor subcommand with openpty probe, mkfifo probe, and optional mock_claude PTY round-trip (skipped if mock_claude not in PATH); src/main.rs + src/lib.rs: wire up check::run() for --check flag; README.md: add Install, Usage, Flags table (matches --help exactly), Exit codes, and NEEDLE integration sections; test-fixtures/mock-claude: extend with all MOCK_* env var controls needed for integration tests (MOCK_SILENT, MOCK_EXIT_BEFORE_STOP, MOCK_TRUST_DIALOG, etc.); tests/cli.rs, tests/hooks.rs, tests/version_compat.rs: Phase 10 unit test stubs. claude-print --check passes: openpty PASS, mkfifo PASS, mock_claude PTY PASS. bash -n install.sh: syntax OK. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-10 01:36:28 -04:00
jedarden	bfb50da40c	Add Phase 8: Emitter (text/json/stream-json output formats). Adds emitter.rs with three output format handlers and stream-json reader thread, ClaudePrintError enum with exit codes and JSON subtypes to error.rs, and 13 unit tests in tests/emitter.rs covering all plan requirements. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-10 00:57:30 -04:00
jedarden	c6241e37b7	Add Phase 7: transcript reader with retry loop and dedup. Implements src/transcript.rs: lenient JSONL parsing, message.id dedup with usage-fingerprint fallback, text extraction from ContentBlock arrays, 40×50ms retry loop for Stop-before-JSONL races (PO-5), and last_assistant_message fallback. All 18 tests in tests/transcript.rs pass; AS-6 verified with MOCK_DELAY_JSONL=100. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-10 00:51:59 -04:00
jedarden	59e170ed03	Implement Phase 6: Stop Poller (bf-64s). Add src/poller.rs with FIFO O_NONBLOCK open (read-end + keeper write-end), Stop hook JSON payload parsing, transcript path derivation via cwd slug, and StopInfo resolution. Wire poller into EventLoop via add_fifo_fd(), which was already present in event_loop.rs from Phase 3. Update mock-claude to emit proper JSON Stop payloads (with and without transcript_path via MOCK_OMIT_TRANSCRIPT_PATH=1) and update the pty_integration assertion to match. Tests test_stop_hook_fires and test_missing_transcript_path_derived both pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-10 00:05:14 -04:00
jedarden	edd9470038	Add startup.rs: trust-keyword scanner with test_trust_dialog_* integration tests (Phase 5). Implements StartupSeq with scan_line() that detects 2+ trust keywords ("trust", "Allow", "continue", "folder", "permission", "proceed") on a PTY output line and returns CR to dismiss the dialog. Includes idle fallback (0.8 s after 200+ bytes) and hard timeout (45 s / <200 bytes → HardTimeout). Phase 2 injects the prompt via bracketed paste after a 2 s post-dismiss idle. 11 test_trust_dialog_* integration tests cover keyword match, threshold, case sensitivity, chunk-boundary assembly, one-shot dismiss, and CR- vs LF-terminated lines. 12 unit tests in startup::tests cover scan_line and feed() in isolation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-08 10:05:27 -04:00
jedarden	7b64f5b340	Add terminal.rs: probe scanner with response table, dedup bitmask, unknown-probe passthrough (Phase 4) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-08 09:25:54 -04:00
jedarden	17c35f40a7	Add mock-claude fixture, test_pty_spawns_tty integration test, and hook module export. test-fixtures/mock-claude: implement mock binary that writes 'stop' to FIFO then checks isatty(stdin), exiting 0 on TTY; tests/pty_integration.rs: test_pty_spawns_tty uses HookInstaller + PtySpawner to verify controlling TTY is established; src/lib.rs: expose hook module publicly; Cargo.lock: add libc dependency for mock-claude. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-08 08:56:36 -04:00

13 commits
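
The trust-dialog scanner described in the Phase 5 commit (edd9470038) can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the `TrustScanner` type, the `THRESHOLD` constant, and the choice of case-sensitive substring matching are assumptions. Only the keyword list, the "2+ keywords on one line" rule, the CR reply, and the one-shot dismiss come from the commit message.

```rust
/// Keywords quoted in the Phase 5 commit message.
const TRUST_KEYWORDS: [&str; 6] = [
    "trust", "Allow", "continue", "folder", "permission", "proceed",
];

/// Minimum number of distinct keywords on one line ("2+" in the commit message).
const THRESHOLD: usize = 2;

/// Hypothetical stand-in for the scanner state; the real StartupSeq
/// also tracks idle and hard timeouts, which are omitted here.
struct TrustScanner {
    dismissed: bool,
}

impl TrustScanner {
    fn new() -> Self {
        Self { dismissed: false }
    }

    /// Returns Some(b'\r') the first time a line contains at least
    /// THRESHOLD distinct keywords, and None otherwise.
    /// Matching is assumed to be case-sensitive substring search.
    fn scan_line(&mut self, line: &str) -> Option<u8> {
        if self.dismissed {
            // One-shot: never answer the dialog twice.
            return None;
        }
        let hits = TRUST_KEYWORDS
            .iter()
            .filter(|kw| line.contains(**kw))
            .count();
        if hits >= THRESHOLD {
            self.dismissed = true;
            Some(b'\r')
        } else {
            None
        }
    }
}
```

Under these assumptions, a line such as "Do you trust the files in this folder?" matches two keywords and yields a CR once, while a repeat of the same line yields nothing.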