What you mean to claim here is that verification is 10x harder than authorship. That's true, but unhelpful to skeptics, because LLMs are extremely useful for verification.
Some answers were trivial to grade—either obviously correct or clearly wrong. The rest were painful and exhausting to evaluate.
Checking whether the code was correct and tracing it step by step in my head was so draining that I swore never to grade programming again.
Also, and I know this doesn't matter, but it's so weird to see this downvoted. That's not an "I disagree" button...
Sometimes, code is hard to review. It's not very helpful if the reviewer just kills it because it's hard.
A lot of code cannot and will not be straightforwardly reviewable, because it all depends on context. Using an LLM adds an additional layer of abstraction between you and the context, because now you have to untangle whether it actually did what the context you gave it required.
I am absolutely still an AI skeptic, but like: we do this at work. If a dev has produced an absolutely nonsensical, overcomplicated, impossible-to-understand PR, it gets rejected and sent back to the drawing board (and then I work with them to find out what happened, because that's a leadership failure more than a developer one IMO).