zlacker

[parent] [thread] 2 comments
1. onion2+(OP)[view] [source] 2026-02-03 21:29:48
The G in GPT stands for Generalized. You don't need that for specialist models, so the size can be much smaller. Even coding models are quite general as they don't focus on a language or a domain. I imagine a model specifically for something like React could be very effective with a couple of billion parameters, especially if it was a distill of a more general model.
replies(2): >>Mzxgck+U4 >>christ+Ok
2. Mzxgck+U4[view] [source] 2026-02-03 21:53:45
>>onion2+(OP)
I'll be that guy: the "G" in GPT stands for "Generative".
3. christ+Ok[view] [source] 2026-02-03 23:22:37
>>onion2+(OP)
Thats what i want and orchestrator model that operates with a small context and then very specialized small models for react etc
[go to top]