/evaluate:skill Evaluate a skill's effectiveness by running behavioral test cases and grading the results against assertions. When to Use This Skill | Use this skill when... | Use alternative when... | |------------------------|------------------------| | Want to test if a skill produces correct results | Need structural validation - | | Validating skill improvements before merging | Want to file feedback about a session - | | Benchmarking a skill against a baseline | Need to check skill freshness - | | Creating eval cases for a new skill | Want to review code quality - | Context - Skill file…