zlacker

[return to "Claude Code: connect to a local model when your quota runs out"]
1. paxys+c7c[view] [source] 2026-02-04 21:59:44
>>fugu2+(OP)
> Reduce your expectations about speed and performance!

Wildly understating this part.

Even the best local models (ones you run on beefy 128GB+ RAM machines) get nowhere close to the sheer intelligence of Claude/Gemini/Codex. At worst these models will move you backwards and just increase the amount of work Claude has to do when your limits reset.

◧◩
2. zozbot+i8c[view] [source] 2026-02-04 22:05:11
>>paxys+c7c
The best open models such as Kimi 2.5 are about as smart today as the big proprietary models were one year ago. That's not "nothing" and is plenty good enough for everyday work.
◧◩◪
3. paxys+ecc[view] [source] 2026-02-04 22:24:43
>>zozbot+i8c
LOCAL models. No one is running Kimi 2.5 on their Macbook or RTX 4090.
◧◩◪◨
4. Dennis+6uc[view] [source] 2026-02-05 00:12:02
>>paxys+ecc
On Macbooks, no. But there are a few lunatics like this guy:

https://www.youtube.com/watch?v=bFgTxr5yst0

◧◩◪◨⬒
5. danw19+gSd[view] [source] 2026-02-05 13:02:48
>>Dennis+6uc
He must be mad, accepting $50k of free (probably loaned?) hardware from Apple !

Great demo video though. Nice to see some benchmarks of Exo with this cluster across various models.

[go to top]