AI coding assistants are changing how we work, but they're not replacing the core skills that make a developer. What matters most is our ability to hold complex systems in our minds and think through their implications.
A big problem with current AI is a simple one - we don't document the thought processes that go into building solutions, at least not in enough detail.
Without the thousands of micro decisions that go into building even the simplest of solutions, it doesn't matter how large your context window is. It's not about holding a system in your mind - it's about what you do with it, what decisions you make to move towards your goal.
At least that's my take on current LLMs and their limitations.
Easily. I have a somewhat working understanding of the SAP version we run in my head, yet LLMs love hallucinating columns or endpoints that do not exist. I'm sure the full SAP documentation easily clears 2 million tokens, and that's not even touching our own codebase.
Enterprise software gets quite complex, has a ton of dependencies that need to be understood together, etc.
Just the data model can take up hundreds of thousands of tokens describing the tables and relationships in some of the code bases I've worked on.
These models degrade like crazy at those long token counts though. I have not found them useful if I need to just stuff everything in a giant context window. I'm mostly using Claude though, so slightly different context scale.
> Has your mind ever held a system so complex, it wouldn't fit in two million tokens? Mine hasn't.
This is one of those things which superficially seems like a slam-dunk gotcha, but isn't.
Yes, correct, I can't do that.
Unfortunately, my experience with LLMs is that they can't really pay attention to all the things in the context window either.
Even a mere 5,613 tokens[0] had it getting confused.
If any AI could really do two million tokens with perfect recall of the problem, that would indeed be wildly super-human. Even just having 6k tokens' worth of custom instructions that are applied consistently to an ongoing data stream — which I bet could be done with the right scaffolding on the API of better models, even if not with naïve use of chat UIs — is superhuman. That kind of ongoing focus and persistence would still be superhuman even when the quality of the result is "ok, not great, just ok", owing to how "human doing same thing for 4 hours" is much worse than "freshly rested human begins work for the day".
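The scaffolding I have in mind is roughly this: re-send the same fixed instructions with every chunk of the stream, so nothing depends on the model "remembering" them across a long context. A minimal sketch, where the model name, file names, and the stream itself are placeholders purely for illustration:

    # Re-apply the same fixed instructions to every chunk of an ongoing
    # stream, instead of hoping the model keeps attending to them.
    # Model name, file names, and the stream are placeholders.
    from openai import OpenAI

    client = OpenAI()
    CUSTOM_INSTRUCTIONS = open("instructions.txt").read()  # the ~6k-token ruleset

    def process_chunk(chunk: str) -> str:
        resp = client.chat.completions.create(
            model="gpt-4o",
            messages=[
                {"role": "system", "content": CUSTOM_INSTRUCTIONS},
                {"role": "user", "content": chunk},
            ],
        )
        return resp.choices[0].message.content

    # for chunk in incoming_stream():  # whatever produces the data stream
    #     handle(process_chunk(chunk))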
I don't know where the boundary really is, though, where it becomes superhuman on any axis[1]. The failure mode I'm describing here reminds me of art lessons towards the end of my time at school, where the teacher had to remind people that accurate still-life studies required you to keep looking again and again at the material, not just once when you started and then filling in the details entirely from your imagination.
[0] I tried using it to translate this Wikipedia page to English, and it was hallucinating plausible but false things by the time it got to the timeline: https://de.wikipedia.org/wiki/Döberitzer_Heide
Tried it again while writing this to see if current models are any better; this time the same prompt went to the canvas editor. It didn't complete the translation, and when I replied "continue" it replaced the attempted first half with a non-translated German Wikipedia article that was essentially unrelated: https://chatgpt.com/canvas/shared/67b091d021088191bd9e0ca7c3...
[1] and there are many different aspects of intelligence.
Consider the converse: if it were a fundamental requirement of the nature of intelligence that all aspects of human intelligence correlate well with each other, then chess AI could only have beaten world champions like Kasparov in the same year that Go AI beat those like Lee Sedol.
Obviously not going to reply to everybody, but many people here got confused, thinking that the LLM's two million input tokens are analogous to their brain's long-term memory. This is not true. The LLM's two million input tokens are more like your brain's working memory, and your brain's long-term memory is more like the LLM's training data.
I do agree with you that LLMs get "confused" and fail to follow constraints. This is also my experience. But the reason for this phenomenon is a lack of emphasis on your constraints.
For example, when working with Stable Diffusion, you can manipulate the weight of parts of your prompt. Say you wanted to generate an image and you really wanted there to be a dog: you could prompt "a clear sky under the moonlight, (dog:1.5)", and in this case the "dog" part would be 1.5x more important to the model than the rest of the prompt. Not sure why there is no such feature for LLMs (there may be; I'm just not aware of it).
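As I understand it, what that weighting does under the hood is scale the token embeddings before they condition the image model. A rough sketch (the renormalization details vary per front end, so treat this as illustrative only):

    # Sketch of "(dog:1.5)"-style weighting: scale each token's embedding by
    # its weight before it is used as conditioning. Real front ends also
    # renormalize afterwards; details vary per tool.
    import numpy as np

    def apply_prompt_weights(token_embeddings, weights):
        emb = np.asarray(token_embeddings, dtype=np.float32)   # (tokens, dim)
        w = np.asarray(weights, dtype=np.float32)[:, None]     # (tokens, 1)
        return emb * w

    # Example: 5 tokens, 4-dim embeddings, the "dog" token (index 3) at 1.5x.
    emb = np.random.randn(5, 4)
    weighted = apply_prompt_weights(emb, [1.0, 1.0, 1.0, 1.5, 1.0])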
I looked at your prompt history with the German article, and I can see that the reason it fails is that you prompt incorrectly. When you want to give certain information or context to the LLM, say, your codebase, or some documentation, or some article, you gotta put it first in your prompt, and at the very bottom you should put your instructions. This apparently makes it easier for the LLM to parse your request.
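Something like this ordering, for example (the file name is just a placeholder; the point is reference material first, instruction at the very bottom):

    # Context first, instruction last. Names are placeholders.
    article = open("doeberitzer_heide.de.txt").read()   # the reference material

    prompt = (
        "Here is a German Wikipedia article:\n\n"
        f"{article}\n\n"
        "---\n"
        "Translate the article above into English. Do not add or omit any facts."
    )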
Also, generally, LLMs will not give you a response longer than a few thousand tokens, so what you should do is ask it to translate the article section by section, and keep asking "Translate the next section" until it has translated all of them. I was able to translate your article this way using Gemini; not sure how accurate it is, though.
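Shown here with the openai Python client rather than Gemini, purely for illustration (the model name, file name, and DONE stop marker are assumptions), the loop looks something like this:

    # Keep one conversation going and ask for one section at a time.
    from openai import OpenAI

    client = OpenAI()
    article = open("doeberitzer_heide.de.txt").read()

    messages = [{
        "role": "user",
        "content": f"{article}\n\n---\n"
                   "Translate this article into English one section at a time. "
                   "Start with the first section. When every section is done, "
                   "reply with only the word DONE.",
    }]

    translated = []
    for _ in range(50):                            # hard cap on rounds
        resp = client.chat.completions.create(model="gpt-4o", messages=messages)
        chunk = resp.choices[0].message.content
        if chunk.strip() == "DONE":
            break
        translated.append(chunk)
        messages.append({"role": "assistant", "content": chunk})
        messages.append({"role": "user", "content": "Translate the next section."})

    english = "\n\n".join(translated)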
> I looked at your prompt history with the German article, and I can see that the reason it fails is that you prompt incorrectly. When you want to give certain information or context to the LLM, say, your codebase, or some documentation, or some article, you gotta put it first in your prompt, and at the very bottom you should put your instructions. This apparently makes it easier for the LLM to parse your request.
Good to know — I somehow failed to be aware of this before now despite playing with these models since the days of AI Dungeon (for open source models) and text-davinci-003 (from OpenAI).
> I was able to translate your article this way using Gemini, not sure how accurate it is though.
The second part is important — checking the translation was how I knew it was making things up in the "translated" timeline.