zlacker

[parent] [thread] 2 comments
1. cubefo+(OP)[view] [source] 2026-02-03 20:58:45
> I experimented with the Q2 and Q4 quants.

Of course you get degraded performance with this.

replies(1): >>Aurorn+9j
2. Aurorn+9j[view] [source] 2026-02-03 22:40:15
>>cubefo+(OP)
Obviously. That's why I led with that statement.

Those are the quant thresholds where people with mid-high end hardware can run this locally at reasonable speed, though.

In my experience Q2 is flakey, but Q4 isn't dramatically worse.

replies(1): >>cubefo+cH1
◧◩
3. cubefo+cH1[view] [source] [discussion] 2026-02-04 09:48:31
>>Aurorn+9j
> Obviously. That's why I led with that statement.

Then why did you write this?

> It's always possible that there are some bugs in early implementations that need to be fixed later, but so far I don't see any reason to believe this is actually a Sonnet 4.5 level model.

[go to top]