
It can't be done in the LLM itself, of course, but the wrapper you're talking about already exists in multiple projects competing on SWE-bench. The simplest one is aider with --auto-test: https://aider.chat/docs/usage/lint-test.html

There are also large applications like https://devin.ai/ or https://github.com/AI-App/OpenDevin.OpenDevin
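
For a rough idea of what such a wrapper does, here is a minimal sketch of the run-tests-and-feed-failures-back loop. It is not how aider or any of the projects above actually implement it; `fix_until_green`, `propose_fix`, and `max_rounds` are hypothetical names, and `propose_fix` stands in for whatever sends the failing output to the LLM and applies its edit.

```python
import subprocess
from typing import Callable


def run_tests(test_cmd: list[str]) -> tuple[bool, str]:
    """Run the test command and return (passed, combined stdout + stderr)."""
    result = subprocess.run(test_cmd, capture_output=True, text=True)
    return result.returncode == 0, result.stdout + result.stderr


def fix_until_green(
    test_cmd: list[str],
    propose_fix: Callable[[str], None],  # hypothetical: asks the LLM and applies its edit
    max_rounds: int = 3,
) -> bool:
    """Re-run the tests, handing each failure's output to propose_fix.

    Returns True once the tests pass, or False if they still fail
    after max_rounds fix attempts.
    """
    for _ in range(max_rounds):
        passed, output = run_tests(test_cmd)
        if passed:
            return True
        # The wrapper, not the model, decides that the work is not done yet.
        propose_fix(output)
    # Check the result of the last fix attempt.
    passed, _ = run_tests(test_cmd)
    return passed
```

With aider itself you don't write any of this: you pass your test command via --test-cmd together with --auto-test, as described on the page linked above.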



