zlacker

[return to "We gave 5 LLMs $100K to trade stocks for 8 months"]
1. Neverm+57[view] [source] 2025-12-04 23:47:25
>>cheese+(OP)
Just one run per model? That isn't backtesting. I mean technically it is, but "testing" implies producing meaningful measures.

Also just one time interval? Something as trivial as "buy AI" could do well in one interval, and given models are going to be pumped about AI, ...

100 independent runs on each model over 10 very different market behavior time intervals would producing meaningful results. Like actually credible, meaningful means and standard deviations.

This experiment, as is, is a very expensive unbalanced uncharacterizable random number generator.

◧◩
2. hhutw+Om[view] [source] 2025-12-05 01:44:14
>>Neverm+57
Yeah...one run per model is just random walk in my opinion
[go to top]