zlacker

1. cft+(OP) 2023-11-22 12:18:35
Sutskever says there's a "phase transition" on the order of 9 bn neurons, after which LLMs start to become really useful. I don't know much about this, but wouldn't the monomodels overfit, since they don't have enough data for 9+ bn parameters?