Prompt Engineering in Practice · Prompt Evaluation and Iteration
Test Sets and Golden Datasets
Introduction
To compare prompts objectively, you need a reproducible test set. This section covers how to build the dataset, how many examples to include, and which pitfalls to avoid.
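As a rough illustration of what "reproducible" means here, the sketch below scores two prompt variants against the same fixed examples. Everything in it is an assumption made for illustration: the sentiment task, the GOLDEN_SET examples, the accuracy helper, the two prompt templates, and fake_model, a deterministic stand-in used so the code runs without calling a real model.

```python
from typing import Callable

# Hypothetical golden dataset: fixed inputs paired with expected labels.
GOLDEN_SET = [
    {"input": "I love this product", "expected": "positive"},
    {"input": "Terrible, would not buy again", "expected": "negative"},
    {"input": "It arrived on Tuesday", "expected": "neutral"},
]


def accuracy(prompt_template: str, model: Callable[[str], str], dataset: list) -> float:
    """Return the fraction of examples whose output exactly matches the expected label."""
    correct = 0
    for example in dataset:
        prompt = prompt_template.format(text=example["input"])
        output = model(prompt).strip().lower()
        correct += output == example["expected"]
    return correct / len(dataset)


def fake_model(prompt: str) -> str:
    """Deterministic keyword-based stand-in for a real model call (assumption)."""
    text = prompt.lower()
    if "love" in text:
        return "positive"
    if "terrible" in text:
        return "negative"
    return "neutral"


# Two hypothetical prompt variants evaluated on the same data.
PROMPT_A = "Classify the sentiment of: {text}"
PROMPT_B = "Is this terrible or not? {text}"

score_a = accuracy(PROMPT_A, fake_model, GOLDEN_SET)
score_b = accuracy(PROMPT_B, fake_model, GOLDEN_SET)
print(f"Prompt A: {score_a:.2f}, Prompt B: {score_b:.2f}")
```

Because the dataset and the scoring rule are fixed, any difference between the two scores comes from the prompt wording alone. With a real model you would also need to pin the model version and sampling settings for the comparison to stay reproducible.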