Back to Leaderboard

GPT-5.4

OpenAI
OpenAI400K context2026-03-05
Overall Rank
#2
of 29 models
Overall Score
83.5
avg across benchmarks
Best Task
Text Extraction
91.1
Weakest Task
Visual QA
78.2

Benchmark Performance

OlmOCR Benchv1.0
4/29
OverallArXiv MathH&FLong/TinyMulti-ColOld ScansScans MathTables
81.083.120.182.683.743.982.391.1
OmniDocBenchv1.5
10/29
OverallText Edit↓CDM↑TEDS↑TEDS-S↑Read Order↓
85.30.08983.481.386.70.077
IDP Core Benchv1.0
2/29
OverallKIEOCRTableVQA
84.485.769.194.878.2

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

  • Text Extraction91.1
  • Table Understanding89.1

Weaknesses

  • Visual QA78.2
  • Formula83.4