zlacker

[parent] [thread] 1 comments
1. andes3+(OP)[view] [source] 2026-02-04 15:56:21
Linear time attention doesn’t work, by principle. Dead end pursuit. Much great research on more efficient quadratic time inference
replies(1): >>smokel+ce
2. smokel+ce[view] [source] 2026-02-04 16:57:14
>>andes3+(OP)
What about n log n?
[go to top]