zlacker

1. al_bor+(OP) 2026-01-27 03:19:31
It gets it wrong 100% of the time. A script to validate would send it into an infinite loop of generating code and failing validation.
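
For readers unfamiliar with the pattern, here is a minimal sketch of a generate-and-validate loop with an attempt cap, so a model that keeps failing validation cannot loop forever. The names `generate_until_valid`, `generate`, `validate`, and `max_attempts` are hypothetical and not taken from any specific tool; `generate` stands in for whatever call produces code from a model.

```python
from typing import Callable, Optional


def generate_until_valid(
    generate: Callable[[Optional[str]], str],
    validate: Callable[[str], Optional[str]],
    max_attempts: int = 5,
) -> Optional[str]:
    """Return the first candidate that passes validation, or None if the cap is hit.

    generate: hypothetical callable that takes the previous validation error
              (or None on the first attempt) and returns a new candidate.
    validate: hypothetical callable that returns None on success, or an
              error message describing why the candidate failed.
    """
    feedback: Optional[str] = None
    for _ in range(max_attempts):
        candidate = generate(feedback)
        error = validate(candidate)
        if error is None:
            return candidate
        # Pass the failure back so the next attempt can try something else.
        feedback = error
    # Give up instead of looping forever.
    return None
```

Returning `None` after `max_attempts` failures leaves the decision about what to do next (escalate, switch approach, ask a human) to the caller.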
replies(1): >>simonw+F
2. simonw+F 2026-01-27 03:24:40
>>al_bor+(OP)
Are you sure about that?

I don't think I've ever seen Opus 4.5 or GPT-5.2 get stuck in a loop like that. They're both very good at spotting when something doesn't work and trying something else instead.

Might be a problem with older, weaker models I guess.

replies(1): >>al_bor+ia
3. al_bor+ia 2026-01-27 05:02:24
>>simonw+F
I’m limited in which tools and models I can use because of privacy restrictions at work.