
[return to "Elon Musk sues Sam Altman, Greg Brockman, and OpenAI [pdf]"]
1. 1vuio0+vn2 2024-03-02 01:59:41
>>modele+(OP)
"In March 2023, OpenAI released its most powerful language model yet, GPT-4. GPT-4 is not just capable of reasoning. It is better at reasoning than average humans. It scored in the 90th percentile on the Uniform Bar Exam for lawyers. It scored in the 99th percentile on the GRE Verbal Assessment. It even scored a 77% on the Advanced Sommelier examination."

One could argue a common characteristic of the above exams is that they each test memory, and, as such, one could argue that GPT-4's above-average performance is not necessarily evidence of "reasoning". That is, GPT-4 has no "understanding" but it has formidable reading speed and retention (memory).

While preparation for the above exams depends heavily on memorisation, other exams may focus more on reasoning and understanding.

Surely GPT-4 would fail some exams. But when it comes to GPT-4's exam performance, only the positive results are reported.

https://freeman.vc/notes/reasoning-vs-memorization-in-llms

2. bastaw+Qk3 2024-03-02 14:33:11
>>1vuio0+vn2
> Surely GPT-4 would fail some exams. But when it comes to GPT-4's exam performance, only the positive results are reported.

The default is failing the exams. I'd be no less impressed if they came right out and said "This is a short list of the only exams it passes" simply because (IMO) it's remarkable that a machine could pass any of those exams in the first place. Just a couple of years ago, it would have been outlandish for a machine to even have a double-digit score (at best!).

If we've already found ourselves in a position where passing grades on some exams that qualify people for their careers is unremarkable, I'll honestly be a bit disappointed. 99th percentile on the GRE Verbal would make an NLP researcher from 2010 have a damn aneurysm; if we're now saying that's "not reasoning" then we're surely moving the goalposts for what that means.
