zlacker

[parent] [thread] 1 comments
1. lcnPyl+(OP)[view] [source] 2025-05-21 12:57:48
This is important context given that it would be absurd for the managers to have already drawn a definitive conclusion about the models’ capabilities. An explicit understanding that the purpose of the exercise is to get a better idea of the current strengths and weaknesses of the models in a “real world” context makes this actually very reasonable.
replies(1): >>mrguyo+iF
2. mrguyo+iF[view] [source] 2025-05-21 17:03:09
>>lcnPyl+(OP)
So why in public, and why in the most ham-fisted way, and why on important infrastructure, and why in such a terrible integration that it can't even verify that things compile before opening a PR!

In my org, we would have had to bypass precommit hooks to do this!

[go to top]