DeepEval

Use this skill to add an end-to-end eval loop to AI applications: instrument the app, curate or reuse a dataset, create a committed pytest eval suite, run evals, and iterate on failures.

Prerequisites

Requires Python 3.9+ and DeepEval installed in the target project. Metrics and synthetic generation need model credentials. Confident AI reporting, hosted traces, and online evals require Confident AI credentials.

Workflow Summary

1. Inspect the target app and existing DeepEval usage.
2. Ask the required intake questions.
3. Reuse existing metrics and datasets when available.
4. Use an existing dataset if the user has one; other…