zlacker

[parent] [thread] 2 comments
1. tovej+(OP)[view] [source] 2023-11-20 10:54:24
There was just a post on this website where GPT4 failed to perform basic reasoning tasks better than minimum paid mechanical turk "microworkers".
replies(1): >>andyba+rS2
2. andyba+rS2[view] [source] 2023-11-21 00:48:35
>>tovej+(OP)
And the comments section pointed out multiple flaws in that article.
replies(1): >>tovej+KN3
◧◩
3. tovej+KN3[view] [source] [discussion] 2023-11-21 08:07:24
>>andyba+rS2
it didn't really. There were no fundamental flaws that I could see.

Perhaps the only salient critique was the textual representation of the problem, but I think it was presented in a way where the model was given all the help it could get.

You forget the result of the paper was actually improving the model's performance and still failing to get anywhere near decent results.

[go to top]