zlacker
[parent]
[thread]
1 comments
1. andes3+(OP)
[view]
[source]
2026-02-04 15:56:21
Linear time attention doesn’t work, by principle. Dead end pursuit. Much great research on more efficient quadratic time inference
replies(1):
>>smokel+ce
◧
2. smokel+ce
[view]
[source]
2026-02-04 16:57:14
>>andes3+(OP)
What about n log n?
[go to top]