zlacker

[return to "Gemini 2.5 Pro Preview"]
1. andy12+F8[view] [source] 2025-05-06 15:55:14
>>meetpa+(OP)
Interestingly, when compering benchmarks of Experimental 03-25 [1] and Experimental 05-06 [2] it seems the new version scores slightly lower in everything except on LiveCodeBench.

[1] https://storage.googleapis.com/model-cards/documents/gemini-... [2] https://deepmind.google/technologies/gemini/

◧◩
2. jjani+tf[view] [source] 2025-05-06 16:31:26
>>andy12+F8
Sounds like they were losing so much money on 2.5-Pro they came up with a forced update that made it cheaper to run. They can't come out with "we've made it worse across the board", nor do they want to be the first to actually raise prices, so instead they made a bit of a distill that's slightly better at coding so they can still spin it positively.
◧◩◪
3. sauwan+Bi[view] [source] 2025-05-06 16:51:22
>>jjani+tf
I'd be surprised if this was a new base model. It sounds like they just did some post-training RL tuning to make this version specifically stronger for coding, at the expense of other priorities.
◧◩◪◨
4. jjani+km[view] [source] 2025-05-06 17:10:28
>>sauwan+Bi
Every frontier model now is a distill of a larger unpublished model. This could be a slightly smaller distill, with potentially the extra tuning you're mentioning.
◧◩◪◨⬒
5. tangju+dD[view] [source] 2025-05-06 18:54:35
>>jjani+km
Any info on this?
[go to top]