isitcooked.ai
DeepSeek V4 Pro

no data

DeepSeek · deepseek/deepseek-v4-pro

Daily test history (90 days, baseline band = trailing mean ± 2σ)

No daily test data yet.
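
The baseline band in the heading above (trailing mean ± 2σ) could be computed roughly as follows once daily scores exist. This is a minimal sketch, not the site's actual implementation: the function names `baseline_band` and `is_outside_band` are hypothetical, the 90-day window is assumed to match the chart length, and the use of the population standard deviation is an assumption, since the page does not say which estimator is used.

```python
from statistics import mean, pstdev


def baseline_band(scores, window=90, k=2.0):
    """Return (low, high) = trailing mean -/+ k*sigma over the last `window` scores.

    Returns None when fewer than two scores are available, since a
    spread cannot be estimated from a single point.
    Assumption: population standard deviation (pstdev); the page does
    not state whether sample or population sigma is used.
    """
    recent = scores[-window:]
    if len(recent) < 2:
        return None
    mu = mean(recent)
    sigma = pstdev(recent)
    return (mu - k * sigma, mu + k * sigma)


def is_outside_band(today, history, window=90, k=2.0):
    """Return True if today's score falls outside the baseline band."""
    band = baseline_band(history, window, k)
    if band is None:
        # Not enough history to judge, as with "No daily test data yet."
        return False
    low, high = band
    return today < low or today > high
```

With an empty history, as on this page, `baseline_band([])` returns `None` and no score is flagged.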

Public benchmarks (overall 62.9)

MMLU-Pro: 82
GPQA Diamond: 71
SWE-bench Verified: 60
LMArena Elo: 1375
AIME 2025: 74

retrieved 2026-07-02 from public sources; see methodology

Recent samples (latest run, one per test case)