claw-code

mirror of https://github.com/instructkr/claw-code.git synced 2026-04-05 23:54:50 +08:00

Author	SHA1	Message	Date
Yeachan-Heo	85c5b0e01d	Expand parity harness coverage before behavioral drift lands The landed mock Anthropic harness now covers multi-tool turns, bash flows, permission prompt approve/deny paths, and an external plugin tool path. A machine-readable scenario manifest plus a diff/checklist runner keep the new scenarios tied back to PARITY.md so future additions stay honest. Constraint: Must build on the deterministic mock service and clean-environment CLI harness Rejected: Add an MCP tool scenario now \| current MCP tool surface is still stubbed, so plugin coverage is the real executable path Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep rust/mock_parity_scenarios.json, mock_parity_harness.rs, and PARITY.md refs in lockstep Tested: cargo fmt --all Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: python3 rust/scripts/run_mock_parity_diff.py Not-tested: Real MCP lifecycle handshakes; remote plugin marketplace install flows	2026-04-03 04:00:33 +00:00
Yeachan-Heo	c2f1304a01	Lock down CLI-to-mock behavioral parity for Anthropic flows This adds a deterministic mock Anthropic-compatible /v1/messages service, a clean-environment CLI harness, and repo docs so the first parity milestone can be validated without live network dependencies. Constraint: First milestone must prove Rust claw can connect from a clean environment and cover streaming, tool assembly, and permission/tool flow Constraint: No new third-party dependencies; reuse the existing Rust workspace stack Rejected: Record/replay live Anthropic traffic \| nondeterministic and unsuitable for repeatable CI coverage Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep scenario markers and expected tool payload shapes synchronized between the mock service and the harness tests Tested: cargo fmt --all Tested: cargo clippy --workspace --all-targets -- -D warnings Tested: cargo test --workspace Tested: ./scripts/run_mock_parity_harness.sh Not-tested: Live Anthropic responses beyond the five scripted harness scenarios	2026-04-03 01:15:52 +00:00
Jobdori	1abd951e57	docs: add PARITY.md — honest behavioral gap assessment Catalog all 40 tools as real-impl vs stub, with specific behavioral gap notes per tool. Identify missing bash submodules (18 upstream vs 1 Rust), file validation gaps, MCP/plugin flow gaps, and runtime behavioral gaps. This replaces surface-count bragging with actionable gap tracking.	2026-04-03 08:27:02 +09:00
Yeachan-Heo	0e722fa013	auto: save WIP progress from rcc session	2026-04-01 04:01:37 +00:00
Yeachan-Heo	4fb2aceaf1	fix: critical parity bugs - enable tools, default permissions, tool input Tighten prompt-mode parity for the Rust CLI by enabling native tools in one-shot runs, defaulting fresh sessions to danger-full-access, and documenting the remaining TS-vs-Rust gaps. The JSON prompt path now runs through the full conversation loop so tool use and tool results are preserved without streaming terminal noise, while the tool-input accumulator keeps the streaming {} placeholder fix without corrupting legitimate non-stream empty objects. Constraint: Original TypeScript source was treated as read-only for parity analysis Constraint: No new dependencies; keep the fix localized to the Rust port Rejected: Leave JSON prompt mode on a direct non-tool API path \| preserved the one-shot parity bug Rejected: Keep workspace-write as the default permission mode \| contradicted requested parity target Confidence: high Scope-risk: moderate Reversibility: clean Directive: Keep prompt text and prompt JSON paths on the same tool-capable runtime semantics unless upstream behavior proves they must diverge Tested: cargo build --release; cargo test Not-tested: live remote prompt run against LayoffLabs endpoint in this session	2026-04-01 02:42:49 +00:00

5 Commits