Cost vs Quality
Find the cheapest model for your task type
Real AI models, real tasks, real costs. We run the same prompts against multiple models daily and publish the results — response quality, latency, and cost — with human verdicts.
Tasks
19
Models Tested
11
Total Runs
223
Community Votes
4
Find the cheapest model for your task type
We run the same prompt against multiple models every day. Read the responses, vote on which one got it right, and see how your judgment compares to the community.
223
responses to judge
11
models competing
4
community votes
Sign in to propose benchmark tasks or suggest models to test.