zlacker

[parent] [thread] 0 comments
1. oedemi+(OP)[view] [source] 2026-01-01 15:31:04
as architectures evolve, i think it can be that we learn more "side effects".. back in 2020 openai researchers said "GPT-3 is applied without any gradient updates or fine-tuning" the model emerges at a certain level of scale...
[go to top]