zlacker

[parent] [thread] 2 comments
1. m3kw9+(OP)[view] [source] 2023-12-21 01:43:12
This model is not close to even 3.5 from when I used it. It first of all does not follow instructions properly and it just runs on and on
replies(1): >>coder5+M1
2. coder5+M1[view] [source] 2023-12-21 02:05:25
>>m3kw9+(OP)
What you're describing is the behavior you get from any base model that has not been instruction-tuned. The article is clear that this model is not for "direct use". It needs tuning for a specific application.
replies(1): >>m3kw9+S8
◧◩
3. m3kw9+S8[view] [source] [discussion] 2023-12-21 03:24:46
>>coder5+M1
how does one fine tune it to follow instructions? I would have thought they have open source training set for these instruction-follow fine tunes?
[go to top]