isitcooked.ai
← leaderboard

Gemini 3.5 Flash

CALIBRATING

Google · google/gemini-3.5-flash · verdict as of 2026-07-03

Daily test history (90 days, baseline band = trailing mean ± 2σ)

Structured Extraction

CALIBRATING

Math & Reasoning

CALIBRATING

Instruction Following

CALIBRATING

Code Generation

CALIBRATING

Report Analysis

CALIBRATING

Summarization

CALIBRATING

Customer Service

CALIBRATING

Creative Writing

CALIBRATING

Web Design

CALIBRATING

Game Design

CALIBRATING

Serving providers (last 30 days)

striped = multiple providers served this model that day (hover for detail)

Public benchmarks overall 58.5

MMLU-Pro

80

GPQA Diamond

70

SWE-bench Verified

55

LMArena Elo

1360

AIME 2025

72

retrieved 2026-07-02 from public sources — see methodology

Recent samples (latest run, one per test case)