claw-code

mirror of https://github.com/instructkr/claw-code.git synced 2026-07-18 21:08:28 +08:00

Files

T

Yeachan-Heo fa72cd665e Block oversized requests before providers hard-fail

The runtime already tracked rough token estimates for compaction, but provider-bound
requests still relied on naive model output limits and could be sent upstream even
when the selected model could not fit the estimated prompt plus requested output.

This adds a small model token/context registry in the API layer, estimates request
size from the serialized prompt payload, and fails locally with a dedicated
context-window error before Anthropic or xAI calls are made. Focused integration
coverage asserts the preflight fires before any HTTP request leaves the process.

Constraint: Keep the first pass minimal and reusable across both Anthropic and OpenAI-compatible providers
Rejected: Auto-compact-and-retry in the same patch | broader control-flow change than the requested minimal preflight
Confidence: medium
Scope-risk: narrow
Reversibility: clean
Directive: Expand the model registry before enabling preflight for additional providers or aliases
Tested: cargo build -p api -p tools -p rusty-claude-cli; cargo test -p api
Not-tested: End-to-end CLI auto-compaction or retry behavior after a local context_window_blocked failure

2026-04-05 16:39:58 +00:00

api

Block oversized requests before providers hard-fail

2026-04-05 16:39:58 +00:00

commands

feat(commands): reach upstream slash command parity — 135 → 141 specs

2026-04-03 19:55:12 +09:00

compat-harness

wip: plugins progress

2026-04-01 07:09:06 +00:00

mock-anthropic-service

feat(harness+usage): add auto_compact and token_cost parity scenarios