Cost vs Quality

Find the cheapest AI model that meets your quality bar. Each point represents a model's average cost per call plotted against its quality score from Lab benchmarks. The dashed line marks the efficiency frontier — models that offer the best quality at each price point.

Task type:

AnthropicOpenaiEfficiency frontier

How to read this chart

Top-left = best value (high quality, low cost)
Dashed line = efficiency frontier — models on this line are Pareto-optimal (no other model is both cheaper and better)
Quality scores are based on human verdicts: Correct = 100%, Partial = 50%, Incorrect = 0%
Models with fewer than 3 verdicts are excluded to avoid misleading results