zlacker

[return to "Advancing AI Benchmarking with Game Arena"]
1. 10xDev+td[view] [source] 2026-02-02 18:54:06
>>salkah+(OP)
If AI can program, why does it matter if it can play Chess using CoT when it can program a Chess Engine instead? This applies to other domains as well.
◧◩
2. Rivier+OE[view] [source] 2026-02-02 20:54:49
>>10xDev+td
It can write a chess engine because it has read the code of a thousand of chess engines. This benchmark measures a different aspect of intelligence.

And as a poker player, I can say that this game is much more challenging for computers than chess, writing a program that can play poker really well and efficiently is an unsolved problem.

◧◩◪
3. 10xDev+AW[view] [source] 2026-02-02 22:10:38
>>Rivier+OE
The program doesn't need to be a solver. It can be anything that helps it.

It doesn't even need to be one tool but a series of tools.

[go to top]