Every example is evaluated against production criteria. Scores are derived from review checklists, state matrices, and test results.