Paths: File paths ( ) are relative to this skill directory.

# Benchmark Compare

Type: L3 Worker

Category: 8XX Optimization - 840 Benchmark

Run a clean A/B benchmark in Claude Code: one session with built-in tools only, one with .

The benchmark is scenario-based, diff-validated, manifest-driven, and runtime-backed. It measures activation, correctness, time, cost, and tokens.

The current runner is intentionally scoped to this internal A/B. It does not, by itself, prove best-in-class against external alternatives.

---

## Input / Output

| Direction | Content |
|-----------|----------|
| Input | Repo c…