zlacker

[parent] [thread] 0 comments
1. TheDud+(OP)[view] [source] 2025-07-07 00:19:54
If "LLMs" includes reasoning models, then you're already wrong in your first paragraph:

"something that is just MatMul with interspersed nonlinearities."

[go to top]