zlacker
[parent]
[thread]
0 comments
1. valine+(OP)
[view]
[source]
2025-05-24 16:53:03
You’re thinking about this like the final layer of the model is all that exists. It’s highly likely reasoning is happening at a lower layer, in a different latent space that can’t natively be projected into logits.
[go to top]