[return to "Gemini 3 Pro: the frontier of vision AI"]
1. hodder+PN 2025-12-05 19:51:29
>>xnx+(OP)
"Gemini 3 Pro represents a generational leap from simple recognition to true visual and spatial reasoning."

Prompt: "wine glass full to the brim"

Image generated: 2/3 full wine glass.

True visual and spatial reasoning denied.

2. zmmmmm+LG1 2025-12-06 01:54:01
>>hodder+PN
Do it the other way: give it images of wine glasses and ask it whether they are full to the brim. I suspect it's going to nail them all (mainly because Qwen-VL already nails things like that).