zlacker

[return to "Advancing AI Benchmarking with Game Arena"]
1. 10xDev+td[view] [source] 2026-02-02 18:54:06
>>salkah+(OP)
If AI can program, why does it matter if it can play Chess using CoT when it can program a Chess Engine instead? This applies to other domains as well.
◧◩
2. Nitpic+xu[view] [source] 2026-02-02 20:11:40
>>10xDev+td
> If AI can program, why does it matter if it can play Chess using CoT when it can program a Chess Engine instead?

Heh, we really did come full circle on this! When chatgpt launched in dec22 one of the first things that people noticed is that it sucked at math. Like basic math 12 + 35 would trip it up. Then people "discovered" tool use, and added a calculator. And everyone was like "well, that's cheating, of course it can use a calculator, but look it can't do the simple addition logic"... And now here we are :)

[go to top]