zlacker

[return to "Two kinds of AI users are emerging"]
1. decima+lc[view] [source] 2026-02-02 01:34:22
>>martin+(OP)
> I helped one recently almost one-shot[3] converting a 30 sheet mind numbingly complicated Excel financial model to Python with Claude Code.

I'm sure Claude Code will happily one-shot that conversion. It's also virtually guaranteed to have messed up vital parts of the original logic in the process.

◧◩
2. linsom+4d[view] [source] 2026-02-02 01:40:53
>>decima+lc
It depends on how easily testable the Excel is. If Claude has the ability to run both the Excel and the Python with different inputs, and check the outputs, it's stunningly likely to be able to one-shot it.
◧◩◪
3. AlotOf+4e[view] [source] 2026-02-02 01:48:37
>>linsom+4d
Something being simultaneously described as a "30 sheet, mind-numbingly complex Excel model" and "testable" seems somewhat unlikely, even before we get into whether Claude will be able to test such a thing before it runs into context length issues. I've seen Claude hallucinate running test suites before.
◧◩◪◨
4. martin+Ye[view] [source] 2026-02-02 01:55:29
>>AlotOf+4e
It compacted at least twice but continued with no real issues.

Anyway, please try it if you find it unbelievable. I didn't expect it to work FWIW like it did. Opus 4.5 is pretty amazing at long running tasks like this.

◧◩◪◨⬒
5. moregr+Ag[view] [source] 2026-02-02 02:11:22
>>martin+Ye
I think the skepticism here is that without tests or a _lot_ of manual QA how would you know that it did it correctly?

Maybe you did one or the other , but “nearly one-shotted” doesn’t tend to mean that.

Claude Code more than occasionally likes to make weird assumptions, and it’s well known that it hallucinates quite a bit more near the context length, and that compaction only partially helps this issue.

[go to top]