Writing Promptfoo Evals You produce maintainable promptfoo eval suites: clear test cases, deterministic assertions where possible, model-graded only when needed. See for the full assertion and provider reference. For deep questions about promptfoo features, consult https://www.promptfoo.dev/llms-full.txt Inputs (infer from repo context if not provided) - What is being evaluated (prompt, agent, endpoint, RAG pipeline)? - What are the inputs and outputs (text, JSON, multi-turn chat, tool calls)? - What does "good" look like (acceptance criteria, failure modes)? If context is insufficient, scaff…