zlacker

1. esafak+(OP)[view] [source] 2026-02-03 17:03:31
For the tasks in SWE-Bench Pro they obtained a distribution of agent turns, summarized as the box plot. The box likely describes the interquartile range (25th to 75th percentile), while the whiskers describe some other range, commonly the min and max or the furthest points within 1.5x the IQR of the box. You'd have to read their report to be sure. https://en.wikipedia.org/wiki/Box_plot
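
To make the box/whisker distinction concrete, here is a minimal sketch of how those numbers are usually derived. It assumes the common Tukey convention (whiskers reach the furthest points within 1.5x the IQR of the box), which may not be what the report uses. The `turns` list is made-up illustrative data, not SWE-Bench Pro results, and `box_plot_summary` is a hypothetical helper name.

```python
import statistics


def box_plot_summary(values, whisker_factor=1.5):
    """Return the numbers a Tukey-style box plot would draw.

    Assumption: whiskers extend to the furthest data points within
    whisker_factor * IQR of the box. Other plots use min/max or
    fixed percentiles instead.
    """
    data = sorted(values)
    # Quartiles: the box spans q1..q3, with a line at the median.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - whisker_factor * iqr
    high_fence = q3 + whisker_factor * iqr
    inside = [v for v in data if low_fence <= v <= high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "iqr": iqr,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        # Points beyond the fences are usually drawn as individual dots.
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


# Hypothetical agent-turn counts, one per task (made up for illustration).
turns = [12, 15, 18, 20, 22, 25, 30, 34, 41, 90]
summary = box_plot_summary(turns)
print(summary)
```

With this convention the 90-turn task falls outside the upper whisker and would show up as a separate outlier dot, which is why the whiskers alone don't tell you the full range.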