zlacker

[return to "Gemini 2.5 Pro Preview"]
1. artdig+mf1[view] [source] 2025-05-06 23:44:49
>>meetpa+(OP)
Gemini 2.5 pro is great, but also VERY expensive with non opaque cost insights

Just recently a lot of people (me included) got hit with a surprise bill, with some racking up $500 in cost for normal use

I certainly got burnt and removed my API key from my tools to not accidentally use it again

Example: https://x.com/pashmerepat/status/1918084120514900395?s=46

◧◩
2. danpal+zw1[view] [source] 2025-05-07 03:09:49
>>artdig+mf1
From the linked tweet the author seems to be using Gemini through another layer called OpenRouter - it seems quite possible that the issue around lack of clarity of billing/caching could be from that extra layer of indirection.
◧◩◪
3. cma+cy1[view] [source] 2025-05-07 03:27:58
>>danpal+zw1
OpenRouter lets you fund a wallet and spend no more than that. Google will let it go out of control and they purposely delay the billing console by up to 24 hours so if you don't track it all yourself you can get hit big, especially if it is a coding error that uses up to the rate limits.
◧◩◪◨
4. danpal+cF1[view] [source] 2025-05-07 05:01:10
>>cma+cy1
Well OpenRouter is also facading the API calls, so you may not get the full details of the response back from the upstream LLM service. As far as I can tell the Gemini API returns the token counts in its response enabling you to estimate billing yourself if you want to.

> they purposely delay the billing console by up to 24 hours

This is about scalability and performance. Billing for as many requests per second as a cloud provider gets can't be done live, without significant performance and reliability degradation.

[go to top]