[return to "Tell HN: I cut Claude API costs from $70/month to pennies"]
1. 44za12+rC 2026-01-26 07:03:53
>>ok_orc+(OP)
This is the way. I actually mapped out the decision tree for this exact process and more here:

https://github.com/NehmeAILabs/llm-sanity-checks

2. andai+vHm 2026-02-01 17:05:40
>>44za12+rC
>Before you reach for a frontier model, ask yourself: does this actually need a trillion-parameter model?

>Most tasks don't. This repo helps you figure out which ones.

About a year ago I was testing Gemini 2.5 Pro and Gemini 2.5 Flash for agentic coding. I found they could both do the same task, but Gemini 2.5 Pro was way slower and more expensive.

This blew my mind because I'd previously been obsessed with the "best/smartest model", and I suddenly realized that what I actually wanted was the "fastest/dumbest/cheapest model that can handle my task"!
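
To make the "cheapest model that can handle my task" idea concrete, here is a minimal sketch of that selection loop. It tries candidates from cheapest to most expensive and returns the first whose output passes a task-specific check. Everything here is an assumption for illustration: the `Candidate` type, the relative costs, and the stub functions stand in for real model calls, and none of it comes from the linked repo.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    name: str                  # hypothetical label, not a real model ID
    cost_per_call: float       # made-up relative cost, for ordering only
    run: Callable[[str], str]  # stand-in for a real model call


def cheapest_passing(
    candidates: Sequence[Candidate],
    task: str,
    passes: Callable[[str], bool],
) -> Optional[Candidate]:
    """Return the cheapest candidate whose output passes the check, else None."""
    for candidate in sorted(candidates, key=lambda c: c.cost_per_call):
        if passes(candidate.run(task)):
            return candidate
    return None


# Stub "models" so the sketch runs without any network access.
def tiny(prompt: str) -> str:
    return "I am not sure"


def small(prompt: str) -> str:
    return "4"


def large(prompt: str) -> str:
    return "4"


CANDIDATES = [
    Candidate("large", 10.0, large),
    Candidate("tiny", 0.1, tiny),
    Candidate("small", 1.0, small),
]


def is_correct(output: str) -> bool:
    # Task-specific acceptance check; here the task is "What is 2 + 2?".
    return output.strip() == "4"


chosen = cheapest_passing(CANDIDATES, "What is 2 + 2?", is_correct)
print(chosen.name if chosen else "no candidate passed")
```

In practice the check would be run over a small set of representative tasks rather than a single prompt, since one passing answer says little about reliability.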
