zlacker
1. lyu072+(OP)
2023-11-20 06:14:00
No, it's reinforcement learning from human feedback (RLHF), which involves lots of labeling.