# Statistical Experimental Evaluation

## Overview

Use this skill after formulation, method proposal, and theory. Experiments should test specific claims and theoretical predictions.

## Experiment Plan

Define:

- Conditions or data-generating processes
- Real data source or synthetic data generator
- Sample sizes, folds, repetitions, seeds, or resamples
- Proposed method
- Baselines
- Ablations
- Diagnostics
- Metrics
- Failure accounting

## Required Artifacts

## Evidence Schema

Use a row-oriented metric format:

Claim verdicts should connect theory and experiments:

## Evidence Rules

- A metric must map to a for…
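As a rough illustration of the row-oriented metric format and claim verdicts mentioned under Evidence Schema, the sketch below stores one row per (claim, condition, method, metric, seed) and derives a verdict record from those rows. The column names, the `status` field used for failure accounting, the comparison rule, and the numeric values are all hypothetical placeholders chosen for this example. They are not the schema or results of this skill.

```python
import csv
import io
import statistics

# Hypothetical column names: an assumption for illustration only.
FIELDS = ["claim_id", "condition", "method", "metric", "seed", "value", "status"]

# Made-up placeholder rows, not real results. One row per run and metric.
rows = [
    {"claim_id": "C1", "condition": "n=200", "method": "proposed",
     "metric": "mse", "seed": 0, "value": 0.10, "status": "ok"},
    {"claim_id": "C1", "condition": "n=200", "method": "proposed",
     "metric": "mse", "seed": 1, "value": 0.14, "status": "ok"},
    {"claim_id": "C1", "condition": "n=200", "method": "baseline",
     "metric": "mse", "seed": 0, "value": 0.20, "status": "ok"},
    # A failed run is kept as a row so it is counted rather than silently dropped.
    {"claim_id": "C1", "condition": "n=200", "method": "baseline",
     "metric": "mse", "seed": 1, "value": None, "status": "failed"},
]


def to_csv(rows):
    """Serialize the rows as CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def summarize(rows, claim_id, metric):
    """Return (mean value per method over successful runs, failure count per method)."""
    values, failures = {}, {}
    for r in rows:
        if r["claim_id"] != claim_id or r["metric"] != metric:
            continue
        if r["status"] == "ok":
            values.setdefault(r["method"], []).append(r["value"])
        else:
            failures[r["method"]] = failures.get(r["method"], 0) + 1
    means = {m: statistics.mean(v) for m, v in values.items()}
    return means, failures


def claim_verdict(claim_id, prediction, means, failures):
    """Build a verdict record linking a stated prediction to the observed metrics.

    Hypothetical rule: the claim counts as supported when the proposed method
    has a lower mean than the baseline. Failures are reported alongside.
    """
    supported = means["proposed"] < means["baseline"]
    return {
        "claim_id": claim_id,
        "prediction": prediction,
        "observed": means,
        "failures": failures,
        "verdict": "supported" if supported else "not supported",
    }


means, failures = summarize(rows, "C1", "mse")
verdict = claim_verdict("C1", "proposed has lower MSE than baseline", means, failures)
print(to_csv(rows))
print(verdict)
```

Keeping failed runs as rows, rather than omitting them, is one simple way to make failure accounting visible in the same table that feeds the verdict.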