zlacker

[return to "Gemini 2.5 Pro Preview"]
1. andy12+F8[view] [source] 2025-05-06 15:55:14
>>meetpa+(OP)
Interestingly, when compering benchmarks of Experimental 03-25 [1] and Experimental 05-06 [2] it seems the new version scores slightly lower in everything except on LiveCodeBench.

[1] https://storage.googleapis.com/model-cards/documents/gemini-... [2] https://deepmind.google/technologies/gemini/

◧◩
2. nopins+Xm[view] [source] 2025-05-06 17:14:39
>>andy12+F8
Livebench.ai actually suggests the new version is better on most things.

https://livebench.ai/#/

[go to top]