Counterpoint, though: what if it were trained on your specific codebase? Wouldn't it then be able to help with those nuances?
I would love to give some AI free rein over the codebase I have and let it learn the structure, since a lot of it is fairly repetitive; I know it would be easy to say "hey, add X just like Y" and have it do exactly that.
On the rare occasions when I have no good idea what to do, it is faster to code with an LLM. The rest of the time, when I know exactly what I want, it takes longer to formulate it, query the LLM, and then discuss and validate its output.
My more experienced/senior colleagues all say roughly the same. It’s a great help for our juniors, though: they learn a lot and are more capable on their own with AI assistance.
It’s improving all the time though, so I’m not writing it off at all. I’m developing an evaluation suite so I can keep tracking the progress in a systematic way.
Open benchmarks are vulnerable to saturation. I think benchmarks should have an embargo period during which only 3% of the question-answer pairs are released, plus an explicit warning not to rely on the benchmark 3 months after the full set is released.
I think there are types of problems that AI will be great at solving. If you can pass in a whole codebase, then we might have LLMs that can suggest refactorings etc. to improve code quality.
Codemods like upgrading from Python 2 -> 3 could also become possible.
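For a sense of what that kind of transform looks like, here is a minimal sketch using Python's standard ast module (assumes Python 3.9+ for ast.unparse; the example input is made up). It rewrites the Python 2 idiom `d.has_key(k)` into the Python 3 form `k in d`, the sort of mechanical change tools like 2to3 did and an LLM could plausibly do across a whole repo:

```python
import ast

class HasKeyToIn(ast.NodeTransformer):
    """Rewrite `obj.has_key(key)` calls into `key in obj` expressions."""

    def visit_Call(self, node):
        self.generic_visit(node)
        # Match calls of the form <something>.has_key(<single argument>)
        if (isinstance(node.func, ast.Attribute)
                and node.func.attr == "has_key"
                and len(node.args) == 1):
            # Replace with: <argument> in <something>
            return ast.Compare(
                left=node.args[0],
                ops=[ast.In()],
                comparators=[node.func.value],
            )
        return node

# Hypothetical Python 2-style snippet to migrate
source = "if settings.has_key('debug'): print(settings['debug'])"
tree = HasKeyToIn().visit(ast.parse(source))
ast.fix_missing_locations(tree)
print(ast.unparse(tree))
# -> if 'debug' in settings: print(settings['debug'])
```

A real migration has plenty of cases that aren't this mechanical (unicode/bytes handling, integer division), which is where an LLM that has read the surrounding code might help more than a pure rule-based tool.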