zlacker

[parent] [thread] 0 comments
1. lblume+(OP)[view] [source] 2025-04-04 16:15:49
Transformers already are very flexible. We know that we can basically strip blocks at will, reorder modules, transform their input in predictable ways, obstruct some features and they will after a very short period of re-training get back to basically the same capabilities they had before. Fascinating stuff.
[go to top]