zlacker
[parent]
[thread]
0 comments
1. macrol+(OP)
[view]
[source]
2024-01-07 09:28:50
Any particular reason why that shouldn't work well with fine-tuning of an LLM using reinforcement learning?
[go to top]