zlacker

[return to "Tell HN: I cut Claude API costs from $70/month to pennies"]
1. 44za12+rC[view] [source] 2026-01-26 07:03:53
>>ok_orc+(OP)
This is the way. I actually mapped out the decision tree for this exact process and more here:

https://github.com/NehmeAILabs/llm-sanity-checks

◧◩
2. homeon+Bd1[view] [source] 2026-01-26 12:45:24
>>44za12+rC
That's interesting. Is there any kind of mapping to these respective models somewhere?
◧◩◪
3. 44za12+sj1[view] [source] 2026-01-26 13:24:03
>>homeon+Bd1
Yes, I included a 'Model Selection Cheat Sheet' in the README (scroll down a bit).

I map them by task type:

Tiny (<3B): Gemma 3 1B (could try 4B as well), Phi-4-mini (Good for classification). Small (8B-17B): Qwen 3 8B, Llama 4 Scout (Good for RAG/Extraction). Frontier: GPT-5, Llama 4 Maverick, GLM, Kimi

Is that what you meant?

[go to top]