zlacker

[parent] [thread] 0 comments
1. lyu072+(OP)[view] [source] 2023-11-20 06:14:00
No it's reinforcement learning with human feedback, RLHF lots of labeling
[go to top]