zlacker

[parent] [thread] 0 comments
1. cubefo+(OP)[view] [source] 2023-06-10 16:41:26
Well, temperature 0 means the completion is always the most "likely" (or "best", after fine-tuning) token, while temperature 1 means to choose the next tokens stochastically according to their probability (or "goodness" after fine-tuning). Usually some temperature in between is chosen, like 0.7. It's not a priori clear to me which is the best way to do it.
[go to top]