zlacker

[parent] [thread] 2 comments
1. simonw+(OP)[view] [source] 2025-12-05 20:12:34
I was surprised at how poorly GPT-5 did in comparison to Opus 4.1 and Gemini 2.5 on a pretty simple OCR task a few months ago - I should run that again against the latest models and see how they do. https://simonwillison.net/2025/Aug/29/the-perils-of-vibe-cod...
replies(1): >>daemon+oE
2. daemon+oE[view] [source] 2025-12-06 00:16:35
>>simonw+(OP)
Agreed, GPT-5 and even 5.1 is noticeably bad at OCR. OCRArena backs this up: https://www.ocrarena.ai/leaderboard (I personally would rank 5.1 as even worse than it is there).

According to the calculator on the pricing page (it's inside a toggle at the bottom of the FAQs), GPT-5 is resizing images to have a minor dimension of at most 768: https://openai.com/api/pricing/ That's ~half the resolution I would normally use for OCR, so if that's happening even via the API then I guess it makes sense it performs so poorly.

replies(1): >>datadr+Op2
◧◩
3. datadr+Op2[view] [source] [discussion] 2025-12-06 19:27:07
>>daemon+oE
and GPT4 was pretty decent at OCR, so that's weird?
[go to top]