Factual Recall with Distractor
MEDIUM11 runs · Last: Mar 31
o4 Mini: 1333ms · $0.000000o3 Mini: 1259ms · $0.000000o3: 1290ms · $0.000000GPT-4.1 Nano: 332ms · $0.000000GPT-4.1 Mini: 840ms · $0.000000GPT-4.1: 664ms · $0.000000Claude Opus 4.6: 1842ms · $0.000000Claude Sonnet 4.6: 1967ms · $0.000000GPT-4o: 1372ms · $0.0000001/1GPT-4o Mini: 1256ms · $0.0000001/1Claude Haiku 4.5: 850ms · $0.000000