zlacker
[return to "How does misalignment scale with model intelligence and task complexity?"]
◧
1. nayroc+b9
[view]
[source]
2026-02-03 01:28:59
>>salkah+(OP)
The models they tested are already way behind the current state-of-the-art. Would be interesting to see if their results hold up when repeated with the latest frontier models.
◧◩
2. Stiles+VA1
[view]
[source]
2026-02-03 13:20:43
>>nayroc+b9
I think we have all seen the latest models turn into a hot mess.
[go to top]