zlacker

[parent] [thread] 0 comments

1. lyu072+(OP)[view] [source] 2023-11-20 06:14:00

No it's reinforcement learning with human feedback, RLHF lots of labeling

[go to top]