zlacker

[parent] [thread] 0 comments
1. charle+(OP)[view] [source] 2026-02-05 02:14:05
Right. If the dynamics of training are governed by RG flow, then the best optimization path should remove redundant directions, as specified by the RG operator(s)
[go to top]