zlacker

[return to "Advancing AI Benchmarking with Game Arena"]
1. 10xDev+td[view] [source] 2026-02-02 18:54:06
>>salkah+(OP)
If AI can program, why does it matter if it can play Chess using CoT when it can program a Chess Engine instead? This applies to other domains as well.
◧◩
2. 10xDev+2Q[view] [source] 2026-02-02 21:46:56
>>10xDev+td
I'm not going to respond to everything but the key to my comment was "This applies to other domains as well." But people are limiting their imagination to the chess engine example given for chess. The tool or program (or even other neural networks that are available) can be literally anything for any task... Use your imagination.

Maybe we should just get rid of tedious benchmarks like chess altogether at this point that is leading people to think of how to limit AI as a way of keeping it a relevant benchmark rather than expanding on what is already there.

[go to top]