LLMs need significant optimization or we get significant improvement on computin...

killerstorm · 2025-07-15T06:55:45 1752562545

LLM can give you thousands of lines of perfectly working code for less than 1 dollar. How is that trivial or expensive?

sgt101 · 2025-07-15T08:18:59 1752567539

Looking up a project on github, downloading it and using it can give you 10000 lines of perfectly working code for free.

Also, when I use Cursor I have to watch it like a hawk or it deletes random bits of code that are needed or adds in extra code to repair imaginary issues. A good example was that I used it to write a function that inverted the axis on some data that I wanted to present differently, and then added that call into one of the functions generating the data I needed.

Of course, somewhere in the pipeline it added the call into every data generating function. Cue a very confused 20 minutes a week later when I was re-running some experiments.

brulard · 2025-07-15T09:31:20 1752571880

Are you seriously comparing downloading static code from github with bespoke code generated for your specific problem? LLMs don't keep you from coding, they assist it. Sometimes the output works, sometimes it doesn't (on first or multiple tries). Dismissing the entire approach because it's not perfect yet is shortsighted.

ozgrakkurt · 2025-07-15T10:56:25 1752576985

They didn’t dismiss it, they just said it is not really that useful which is correct?

brulard · 2025-07-15T20:48:48 1752612528

Obviously YMMV, but it is extremely useful for me and for many people out there.

Matticus_Rex · 2025-07-15T15:36:02 1752593762

Many obviously disagree that it's correct

fendy3002 · 2025-07-15T07:14:53 1752563693

well I presented the statement wrongly. What I mean is the use case for LLM are trivial things, it shouldn't be expensive to operate

and the 1 dollar cost for your case is heavily subsidized, that price won't hold up long assuming the computing power stays the same.

killerstorm · 2025-07-15T12:58:31 1752584311

Cheaper models might be around $0.01 per request, and it's not subsidized: we see a lot of different providers offering open source models, which offer quality similar to proprietary ones. On-device generation is also an option now.

For $1 I'm talking about Claude Opus 4. I doubt it's subsidized - it's already much more expensive than the open models.

zwnow · 2025-07-15T07:14:04 1752563644

Thousands of lines of perfectly working code? Did you verify that yourself? Last time I tried it produced slop, and I've been extremely detailed in my prompt.

killerstorm · 2025-07-15T20:56:32 1752612992

Yes. I verified it myself. Best results from Opus 4 so far, Gemini might be OK too.

DSingularity · 2025-07-15T17:13:25 1752599605

Try again.

mrbungie · 2025-07-15T20:38:53 1752611933

Any retries before nailing the prompt are still going to be billed, so this supports GP position about LLMs being expensive for trivial things.

jsnell · 2025-07-15T13:03:53 1752584633

But the thing is, LLMs are already incredibly cheap to operate compared to the alternatives. Both for trivial things and for complex things.

fendy3002 · 2025-07-15T13:22:19 1752585739

Well recently cursor got a heat for rising price and having opaque usage, while anthropic's claude reported to be worse due to optimization. IMO the current LLMs are not sustainable, and prices are expected to increase sooner or later.

Personally, until models comparable with sonnet 3.5 can be run locally on mid range setup, people need to wary that the price of LLM can skyrocket

awuji · 2025-07-16T14:54:48 1752677688

You can already run a large LLM (like sonnet 3.5) locally on CPU with 128GB of ram which is <300 USD, but can be offset by swap space. Obviously, response speed is going to be slower, but I can't imagine people will pay much more than 20 USD for waiting 30-60 seconds longer for a response.

And obviously consumer hardware is already being more optimized for running models locally.

lblume · 2025-07-15T06:11:26 1752559886

Imagine telling a person from five years ago that the programs that would basically solve NLP, perform better than experts at many tasks and are hard not to anthropomorphize accidentally are actually "trivial". Good luck with that.

jrflowers · 2025-07-15T06:46:52 1752562012

>programs that would basically solve NLP

There is a load-bearing “basically” in this statement about the chat bots that just told me that the number of dogs granted forklift certification in 2023 is 8,472.

lblume · 2025-07-15T07:26:15 1752564375

Sure, maybe solving NLP is too great a claim to make. It is still not at all ordinary that beforehand we could not solve referential questions algorithmically, that we could not extract information from plain text into custom schemas of structured data, and context-aware mechanical translation was really unheard of. Nowadays LLMs can do most of these tasks better than most humans in most scenarios. Many NLP questions at least I find interesting reduce to questions of the explanability of LLMs.

Applejinx · 2025-07-15T10:42:45 1752576165

"hard not to anthropomorphize accidentally' is a you problem.

I'm unhappy every time I look in my inbox, as it's a constant reminder there are people (increasingly, scripts and LLMs!) prepared to straight-up lie to me if it means they can take my money or get me to click on a link that's a trap.

Are you anthropomorphizing that, too? You're not gonna last a day.

lblume · 2025-07-15T12:12:41 1752581561

I didn't mean typical chatbot output, these are luckily still fairly recognizable due to stylistic preferences learned during fine-tuning. I mean actual base model output. Take a SOTA base model and give it the first two paragraphs of some longer text you wrote, and I would bet on many people being unable to distinguish your continuation from the model's autoregressive guesses.

clarinificator · 2025-07-15T07:01:39 1752562899

Yeah it solved NLP about 50% of the time, and also mangles data badly and in often hard-to-detect ways.

hyperbovine · 2025-07-15T20:19:56 1752610796

It still doesn't pass the Turing test, and is not close. Five years ago me would be impressed but still adamant that this is not AI, nor is it on the path to AI.

trashchomper · 2025-07-15T06:08:47 1752559727

Calling LLMs trivial is a new one. Yea just consume all of the information on the internet and encode it into a statistical model, trivial, child could do it /s

hammyhavoc · 2025-07-15T06:58:17 1752562697

> all of the information on the internet

Total exaggeration—especially given Cloudflare providing free tools to block AI and now tools to charge bots for access to information.

fendy3002 · 2025-07-15T07:13:27 1752563607

well I presented the statement wrongly. What I mean is the use case for LLM are trivial things, it shouldn't be expensive to operate