1. Yes, GPT-4 Turbo is quantitatively getting lazier at coding. I benchmarked the last 2 updates to GPT-4 Turbo, and it got lazier each time.
2. For coding, asking GPT-4 Turbo to emit code changes as unified diffs causes a 3X reduction in lazy coding.
Here are some articles that discuss these topics in much more detail.
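For anyone who hasn't seen the format, here is a rough sketch of what a unified diff looks like, generated with Python's standard difflib. The file name and both versions of the code are made up for illustration; this is not the prompt or the harness used in the benchmark.

```python
import difflib

# Hypothetical before/after versions of a tiny file, invented for this example.
before = [
    "def greet(name):\n",
    "    print('Hello ' + name)\n",
]
after = [
    "def greet(name):\n",
    "    print(f'Hello, {name}!')\n",
]

# unified_diff yields the header lines, a hunk marker starting with "@@",
# then context lines (leading space), removals ("-") and additions ("+").
diff_lines = list(
    difflib.unified_diff(before, after, fromfile="a/greet.py", tofile="b/greet.py")
)
diff_text = "".join(diff_lines)
print(diff_text)
```

The point is that every changed line has to be written out explicitly with a "-" or "+" prefix, so there is no natural spot for a "rest of the code goes here" comment.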
public someComplexLogic() { // Complex logic goes here }
Another example, when the code is long (e.g. asking it to create a Vue component), is that it will just add a comment saying the rest of the code goes here.
So you could test for it by asking it to create long/complex code and then running the output against unit tests that you created.
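Unit tests catch the stubs indirectly. As a cheaper first pass you could also scan the output for placeholder comments. A minimal sketch, assuming the lazy output looks like the examples in this thread; the phrase list and the function name find_lazy_comments are my own guesses and would need tuning:

```python
import re

# Hypothetical placeholder phrases, based only on the examples in this thread.
PLACEHOLDER_PATTERNS = [
    r"goes here",
    r"add your implementation",
    r"rest of (the )?code",
]

# Matches the start of a //, # or /* comment and captures what follows.
_COMMENT = re.compile(r"(//|#|/\*)(.*)")


def find_lazy_comments(source: str) -> list[str]:
    """Return the lines whose comment text matches a placeholder phrase."""
    hits = []
    for line in source.splitlines():
        match = _COMMENT.search(line)
        if not match:
            continue  # no comment on this line
        comment = match.group(2).lower()
        if any(re.search(p, comment) for p in PLACEHOLDER_PATTERNS):
            hits.append(line.strip())
    return hits


# Usage: count how many generated files contain at least one placeholder.
samples = [
    "public someComplexLogic() { // Complex logic goes here }",
    "def add(a, b):\n    return a + b  # sum of both arguments",
]
lazy_count = sum(1 for s in samples if find_lazy_comments(s))
print(lazy_count)
```

This is only a heuristic: it will miss stubs without a comment and can flag honest comments, so it complements the unit tests rather than replacing them.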
- Can you do XXX (something complex)?
- Yes of course. To do XXX, you need to implement XXX, and then you are good. Here is how you can do it:
int main(int argc, char **argv) {
/* add your implementation here */
}