zlacker

[parent] [thread] 3 comments
1. coder5+(OP)[view] [source] 2023-12-21 00:40:06
The article shows (fine-tuned) Mistral 7B outperforming GPT-4, never mind GPT-3.5.
replies(1): >>m3kw9+A6
2. m3kw9+A6[view] [source] 2023-12-21 01:43:12
>>coder5+(OP)
This model wasn't even close to GPT-3.5 when I used it. First of all, it doesn't follow instructions properly, and it just runs on and on.
replies(1): >>coder5+m8
3. coder5+m8[view] [source] [discussion] 2023-12-21 02:05:25
>>m3kw9+A6
What you're describing is the behavior you get from any base model that has not been instruction-tuned. The article is clear that this model is not for "direct use". It needs tuning for a specific application.
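Here's a quick way to see the difference, as a minimal sketch. It assumes the Hugging Face transformers library (>=4.34 for chat templates) and enough GPU memory to load Mistral 7B in float16; both checkpoints named are the public Mistral releases on the Hub:

    # Minimal sketch of base vs. instruction-tuned behavior. Assumes the
    # Hugging Face `transformers` library and `accelerate` installed, plus
    # enough GPU memory to load Mistral 7B in float16.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    prompt = "Summarize the plot of Hamlet in one sentence."

    # Base checkpoint: pure next-token prediction. It tends to *continue*
    # the instruction as if it were the start of a document, and keeps
    # going until max_new_tokens -- the "runs on and on" behavior.
    base_id = "mistralai/Mistral-7B-v0.1"
    tok = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_id, torch_dtype=torch.float16, device_map="auto")
    ids = tok(prompt, return_tensors="pt").to(base.device)
    print(tok.decode(base.generate(**ids, max_new_tokens=100)[0],
                     skip_special_tokens=True))

    # Instruction-tuned variant: the same prompt, wrapped in the chat
    # template it was fine-tuned on, yields an actual answer and stops
    # at its end-of-sequence token instead of running on.
    inst_id = "mistralai/Mistral-7B-Instruct-v0.1"
    tok_i = AutoTokenizer.from_pretrained(inst_id)
    inst = AutoModelForCausalLM.from_pretrained(
        inst_id, torch_dtype=torch.float16, device_map="auto")
    chat = tok_i.apply_chat_template(
        [{"role": "user", "content": prompt}],
        add_generation_prompt=True, return_tensors="pt").to(inst.device)
    print(tok_i.decode(inst.generate(chat, max_new_tokens=100)[0],
                       skip_special_tokens=True))

The base model was only ever trained to predict the next token, so it treats the instruction as text to be continued, not a request to be answered.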
replies(1): >>m3kw9+sf
4. m3kw9+sf[view] [source] [discussion] 2023-12-21 03:24:46
>>coder5+m8
How does one fine-tune it to follow instructions? I would have thought there are open-source training sets for these instruction-following fine-tunes?
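There are. Open instruction datasets exist on the Hugging Face Hub, e.g. tatsu-lab/alpaca, databricks/databricks-dolly-15k, and OpenAssistant/oasst1. One common recipe (not necessarily what the article's authors did) is supervised fine-tuning (SFT) on such a dataset via the TRL library's SFTTrainer. A minimal sketch follows; the dataset choice, LoRA config, and hyperparameters are illustrative, not a tuned recipe:

    # Minimal SFT sketch: teach the base model to follow instructions
    # using an open dataset. Assumes the `trl`, `peft`, `datasets`, and
    # `transformers` libraries; all hyperparameters are illustrative.
    from datasets import load_dataset
    from peft import LoraConfig
    from transformers import TrainingArguments
    from trl import SFTTrainer

    # Open instruction data: ~52k instruction/response pairs.
    train = load_dataset("tatsu-lab/alpaca", split="train")

    def to_text(batch):
        # SFTTrainer passes batches of records and expects a list of
        # plain training strings. The dataset's optional "input" field
        # is ignored here for brevity.
        return [
            f"### Instruction:\n{ins}\n\n### Response:\n{out}"
            for ins, out in zip(batch["instruction"], batch["output"])
        ]

    trainer = SFTTrainer(
        model="mistralai/Mistral-7B-v0.1",  # base checkpoint to tune
        train_dataset=train,
        formatting_func=to_text,
        max_seq_length=1024,
        # LoRA keeps the trainable parameter count (and VRAM) manageable.
        peft_config=LoraConfig(
            r=16, lora_alpha=32,
            target_modules=["q_proj", "v_proj"],
            task_type="CAUSAL_LM"),
        args=TrainingArguments(
            output_dir="mistral-7b-sft",
            per_device_train_batch_size=2,
            gradient_accumulation_steps=8,
            num_train_epochs=1,
            learning_rate=2e-5,
            bf16=True,
            logging_steps=10),
    )
    trainer.train()

Results depend heavily on the quality of the instruction data, which is a big part of why fine-tunes of the same base model vary so much.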