zlacker

1. steve_+(OP)[view] [source] 2026-02-05 04:09:44
I'd be interested, but they don't even list any Anthropic model on their code review benchmark, so I feel like they haven't really tested their benchmark on SOTA models.
replies(1): >>nomel+t1
2. nomel+t1[view] [source] 2026-02-05 04:26:29
>>steve_+(OP)
Whenever I see this, I make the (almost always correct) assumption that the SOTA models had an advantage. The alternative explanation is a complete lack of awareness of the state of AI, which is very rare for a tool like this.

With SOTA missing, it is also a strong indicator that someone like you is not the target audience.
