Eval Designer Overview This skill covers end-to-end design of evaluation frameworks for LLM-powered systems. It helps teams define what "good" looks like for their specific use case, create diverse test suites that cover both capability and failure modes, design human evaluation rubrics with clear scoring criteria, implement automated eval pipelines using reference-based and LLM-as-judge approaches, and track quality over time as models and prompts change. A robust eval framework is the engineering foundation that enables confident model upgrades, prompt changes, and feature launches. When to…