zlacker

[parent] [thread] 0 comments
1. codedo+(OP)[view] [source] 2026-02-04 12:50:02
30-A3B model gives 13 t/s without GPU (I noticed that token/sec * # of params matches memory bandwidth).
[go to top]