Edit: URL since changed - see https://news.ycombinator.com/item?id=42639155
---
Edit: I found these past related threads, but not much discussion there:
Pplx and Dbrx founder giving $1M to first OSS AI that gets 90% on SWE-bench - https://news.ycombinator.com/item?id=42413392 - Dec 2024 (3 comments)
We might be overestimating coding agent performance on SWE-Bench - https://news.ycombinator.com/item?id=42054973 - Nov 2024 (1 comment)
SWE-Bench Verified - https://news.ycombinator.com/item?id=41237204 - Aug 2024 (10 comments)
Show HN: Public and Free SWE-bench-lite evaluations - https://news.ycombinator.com/item?id=40974181 - July 2024 (1 comment)
#1 agent on swe-bench wrote 7% of its own code - https://news.ycombinator.com/item?id=40627095 - June 2024 (1 comment)
Aider Is SOTA for Both SWE Bench and SWE Bench Lite - https://news.ycombinator.com/item?id=40562121 - June 2024 (1 comment)
How Aider Scored SOTA 26.3% on SWE Bench Lite - https://news.ycombinator.com/item?id=40477191 - May 2024 (1 comment)
Edit: URL since changed - see https://news.ycombinator.com/item?id=42639155
---
Edit: I found these past related threads, but not much discussion there:
Pplx and Dbrx founder giving $1M to first OSS AI that gets 90% on SWE-bench - https://news.ycombinator.com/item?id=42413392 - Dec 2024 (3 comments)
We might be overestimating coding agent performance on SWE-Bench - https://news.ycombinator.com/item?id=42054973 - Nov 2024 (1 comment)
SWE-Bench Verified - https://news.ycombinator.com/item?id=41237204 - Aug 2024 (10 comments)
Show HN: Public and Free SWE-bench-lite evaluations - https://news.ycombinator.com/item?id=40974181 - July 2024 (1 comment)
#1 agent on swe-bench wrote 7% of its own code - https://news.ycombinator.com/item?id=40627095 - June 2024 (1 comment)
Aider Is SOTA for Both SWE Bench and SWE Bench Lite - https://news.ycombinator.com/item?id=40562121 - June 2024 (1 comment)
How Aider Scored SOTA 26.3% on SWE Bench Lite - https://news.ycombinator.com/item?id=40477191 - May 2024 (1 comment)