zlacker
[parent]
[thread]
0 comments
1. codedo+(OP)
[view]
[source]
2026-02-04 12:50:02
30-A3B model gives 13 t/s without GPU (I noticed that token/sec * # of params matches memory bandwidth).
[go to top]