It is only generating based on its training data. In mature codebases there is a massive amount of interconnected state that is not present in any GitHub repository, and the new logic you'd want to add is likely something never done before. As other programmers have said, it seems to be improving at generating useful boilerplate and making simple websites and the like, the sort of thing that exists en masse on GitHub. But it can't make any meaningful changes in an extensively matured codebase. Even Claude Sonnet is absolutely hopeless at this. And the bar for a codebase to count as "matured" is not very high.
> The new logic you'd want to add is likely something never done before.
99% of software development jobs are not as groundbreaking as this. It's mostly companies doing exactly what their competitors are doing. Very few places are actually doing things that an LLM has truly never seen while crawling through GitHub. Even new innovative products generally boil down to the same database fetches and CRUD glue and JSON parsing and front end form filling code.
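To make that glue concrete, here's a rough sketch of the kind of endpoint I mean (Python/Flask; the route, table, and database file are made up for illustration, and the customers table is assumed to already exist):

    # Hypothetical example of the usual glue: parse JSON from a form post,
    # insert it into a database, echo it back to the client.
    from flask import Flask, request, jsonify
    import sqlite3

    app = Flask(__name__)

    @app.route("/customers", methods=["POST"])
    def create_customer():
        data = request.get_json()                 # JSON parsing
        conn = sqlite3.connect("app.db")          # database write
        conn.execute("INSERT INTO customers (name, email) VALUES (?, ?)",
                     (data["name"], data["email"]))
        conn.commit()
        conn.close()
        return jsonify(data), 201                 # front end gets its JSON back

Variations of this exact shape exist on GitHub in enormous volume, which is precisely why an LLM handles it well.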
Groundbreakingness is different from the kind of novelty that matters to an LLM. The script I was trying to write yesterday wasn't groundbreaking at all: it just needed to pull some code from a remote repository, edit a specific file to add a hash, then run a command. But it had to do that _within our custom build system_, and there are few examples of that, so our coding assistant couldn't figure out how to do it.
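For what it's worth, the generic version of that task is trivial. Here's a rough sketch (Python, with a placeholder repo URL, file path, and a plain `make` call standing in for the real steps); the part that mattered, doing each of these steps through our custom build system, is exactly what can't be shown here and exactly what the assistant had no examples of:

    # Hypothetical sketch of the *generic* task: clone, append a hash to a
    # file, run a build command. All names below are placeholders.
    import subprocess

    REPO_URL = "https://example.com/some/repo.git"   # placeholder remote
    TARGET_FILE = "config/hashes.txt"                # placeholder file
    NEW_HASH = "abc123"                              # placeholder hash

    subprocess.run(["git", "clone", REPO_URL, "workdir"], check=True)

    with open(f"workdir/{TARGET_FILE}", "a") as f:
        f.write(NEW_HASH + "\n")

    # In the real script this is a custom build-system invocation,
    # not a plain make call.
    subprocess.run(["make", "build"], cwd="workdir", check=True)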
> Even new innovative products generally boil down to the same database fetches and CRUD glue and JSON parsing and front end form filling code.
The simplest version of that is some CGI code or a PHP script, which is what everyone should be writing, according to your description. But then why have so many books been written about this seemingly simple task? So many frameworks, so many patterns, so many methodologies...
This is not the case anymore: current SOTA CoT models are not just parroting stuff from their training data. And as of today they are not even trained exclusively on publicly (and not so publicly) available material; they make massive use of synthetic data the model itself generated, or data distilled from other, smarter models.
I'm using AI in current "mature" codebases, and I know plenty of people doing the same, with great results. This doesn't mean it does the work while you sip a coffee (yet).
*NOTE: my evidence for this is that o3 could not have beaten ARC-AGI by parroting, because that benchmark was designed exactly to prevent it. Not a coding benchmark per se, but still transferable imo.
Try Devin or OpenHands. OpenHands isn't quite ready for production, but it's informative about where things are going, and it's something to watch the LLM go off and "do stuff", kinda on its own, from my prompt (while I drink coffee).