zlacker

[parent] [thread] 0 comments
1. astran+(OP)[view] [source] 2022-05-23 23:22:28
Bigger model = better because a lot of performance at this task is memorization or the “lottery ticket hypothesis”.

An impressive advance would be a small model that’s capable of working from an external memory rather than memorizing it.

[go to top]