zlacker
1. cft+(OP)
2023-11-22 12:18:35
Sutskever says there's a "phase transition" on the order of 9 bn neurons, after which LLMs begin to become really useful. I don't know much here, but wouldn't the monomodels overfit, since they wouldn't have enough data for 9+ bn parameters?