zlacker

[parent] [thread] 1 comments
1. tintor+(OP)[view] [source] 2022-12-15 20:56:51
Improve itself through experimentation with reinforcement learning. This is how humans improve too. AlphaZero does it.
replies(1): >>lostms+16
2. lostms+16[view] [source] 2022-12-15 21:27:42
>>tintor+(OP)
The amount of work in that area of research is substantial. You will see world shattering results in a few years.

Current SOTA: https://openai.com/blog/vpt/

[go to top]