zlacker

[parent] [thread] 3 comments
1. simonw+(OP)[view] [source] 2026-01-27 01:25:06
Have you tried telling it to run a script to verify that the YAML is valid? I imagine it could do that with Python.
replies(1): >>al_bor+bd
2. al_bor+bd[view] [source] 2026-01-27 03:19:31
>>simonw+(OP)
It gets it wrong 100% of the time. A script to validate would send it into an infinite loop of generating code and failing validation.
replies(1): >>simonw+Qd
◧◩
3. simonw+Qd[view] [source] [discussion] 2026-01-27 03:24:40
>>al_bor+bd
Are you sure about that?

I don't think I've ever seen Opus 4.5 or GPT-5.2 get stuck in a loop like that. They're both very good at spotting when something doesn't work and trying something else instead.

Might be a problem with older, weaker models I guess.

replies(1): >>al_bor+tn
◧◩◪
4. al_bor+tn[view] [source] [discussion] 2026-01-27 05:02:24
>>simonw+Qd
I’m limited on the tools and models I can use due to privacy restrictions at work.
[go to top]