More

olmo23 · 2026-06-04T11:34:08 1780572848

Perhaps, but I'm not sure my boss would appreciate that though.

olmo23 · 2026-06-01T13:37:14 1780321034

> And perhaps the people who built and deployed the autocomplete and the connection as well.

I disagree. IMO it's the person who connects the LLM to the button who bears the responsibility of the workings of the resulting contraption.

tgv · 2026-06-01T14:22:30 1780323750

Shareholder meeting to CEO: you must connect the button.

CEO to CIO: you must connect the button.

CIO to VP AI: you must connect the button.

VP AI to team lead AI integration: you must connect the button.

Team lead AI integration to senior: you must connect the button.

Senior to medior: you must connect the button.

Medior to junior: Hey, Olmo. That button they were talking about. You know?

Olmo: Yeah.

Medior: You have to hook it up to the LLM output.

Olmo: Why?

Medior: The boss says so.

Olmo: Ok.

Shrugs and deploys.

runarberg · 2026-06-01T14:03:34 1780322614

I used to hear things like “if cigarettes/alcohol were invented now, they would never allow it”, indicating that consumer protection used to be a thing, as early as 10-20 years ago. Now when AI hit the market it was obvious how bad and dangerous it was, yet governments (even the supposedly good ones in Europe which still [pretend to] do consumer protection) did nothing to protect their citizens from the harms AI was causing.

If we still did (or ever did) consumer protection like that cigarette/alcohol myth above indicates, then the makers of that tool would indeed be responsible for when their products does dangerous things.

olmo23 · 2026-05-29T09:45:31 1780047931

Since my employer pays for it, I just select the latest and greatest.

olmo23 · 2026-05-27T12:58:22 1779886702

What really grinds my gears is how easy it is to get better designs out of LLMs. But if you don't ask, you get the default.

drdrey · 2026-05-27T13:45:35 1779889535

as someone who doesn't know how to get better design out of LLMs, can you elaborate?

embedding-shape · 2026-05-27T17:05:30 1779901530

Have an opinion on the design, imagine something, then tell it to do just that, then iterate. It's when you're unspecific you get the generic, bland and typical LLM design, you just have to be subjective and influence it in some (human) direction.

lbindreiter · 2026-05-28T07:03:32 1779951812

Also check out https://impeccable.style/, it's really good

agos · 2026-05-27T13:55:28 1779890128

what would you ask to get a better design?

hansmayer · 2026-05-27T13:45:25 1779889525

Here is a provocative thought - maybe these are the so-called "better designs" from LLMs? It's not like writing English sentences is some huge secret you are sitting on that no one else knows.

embedding-shape · 2026-05-27T16:03:25 1779897805

> It's not like writing English sentences is some huge secret you are sitting on that no one else knows.

I'd actually say what really makes an excellent engineer stick out among many great engineers, is their ability to communicate clearly and knowing what needs to be communicated vs not, basically being way better at language and communication in general, and they also understand the important of it.

hansmayer · 2026-05-27T18:50:04 1779907804

I agree. But I was talking about the "super secret" ability to write prompts, which pretty much anyone can do.

embedding-shape · 2026-05-27T22:03:52 1779919432

My point being that not everyone writes as good prompts as everyone else, the way you communicate, how clearly and how exact you are matters a lot, much more than you seemingly is under the impression of.

Same goes with the "LLM does web design" example from before, a web designer with great communication skills in web design, will (naturally) have a better prompt for something that'll potentially could look good, compared to a web designer that isn't at good at communicating what they actually want.

taintlord223 · 2026-05-27T14:09:02 1779890942

Outside design systems I rarely get good CSS from LLMs.

3D type stuff too, it's useless outside boilerplate.

Very little spatial reasoning training, no end-user subjective reasoning inference (Google is starting to though even in unrelated chats), so it's no surprise the LLM doesn't know what you want.

Since I don't even know what I want half the time until I saw it, the subjective reasoning piece is key - that is, being able to predict what I'll want to pretty good accuracy. Then you have your agents etc.

olmo23 · 2026-05-26T12:11:14 1779797474

> Funnily enough, people on HN often do not consider this an issue, like at all...

I didn't have a problem with it when it was Aaron Swartz, not sure why I should have a problem with it when others do it.

isityettime · 2026-05-26T12:20:24 1779798024

Aaron Swartz never did whatever it was he was going to do. He was caught and hounded to death before that.

But he was working with scientific papers— the outputs of public institutions— and his likely goal was releasing them to the public. What proprietary AI companies have done in training LLMs on every book in existence is nothing like that.

graemep · 2026-05-26T15:53:25 1779810805

A lot of what they have done is the reverse. They have used a lot of such publicly funded information (and a lot of other freely available information) to train LLMs that are proprietary.

ocschwar · 2026-05-26T15:02:53 1779807773

The strange thing is he picked a fight with a store of humanities papers rather than scientific ones.

isityettime · 2026-05-27T00:31:35 1779841895

JSTOR holds content from lots of journals including in the sciences. It's not only humanities papers.

insane_dreamer · 2026-05-27T01:56:47 1779847007

1) those were scientific papers; the authors weren't getting paid either way (unless book authors making a living from them)

2) more importantly, Swartz wasn't building a business empire on the pirated data, and charging access

I don't see how the two are even remotely similar

olmo23 · 2026-05-21T14:09:59 1779372599

Also check out https://en.wikipedia.org/wiki/Rice%27s_theorem

basically generalized the halting problem to arbitrary semantic properties.

tialaramex · 2026-05-21T15:31:33 1779377493

It's convenient that Henry Rice lived long before the age of language cults. I don't even think Rice wrote software, he's just a mathematician, he proved this nice property in mathematics. Stuff like FORTRAN and ALGOL happens later.

Also though, just as for the Halting Problem, we are always allowed a three-way split. Rice proves that "Has property" vs "Does not have property" can't be done, but "Has property" vs "Does not have property" vs "Shrug - I dunno, seems hard" is possible, and indeed easy if you're OK with lots of machines landing in the "Shrug" pile. You can expend as much work as you like to shrink that pile, Rice just proved it would need infinite work to empty it completely.

red75prime · 2026-05-21T15:50:51 1779378651

Another way around the Rice's theorem is the Curry-Howard correspondence. A constructive proof of existence of a program that has a property can be transformed into a program that has this property. Yet another way is to have a programming language where syntactic correctness implies a range of semantic properties.

olmo23 · 2026-05-21T06:41:53 1779345713

My go-to usecase for Gemini is summarizing Youtube tech-influencers.

ReptileMan · 2026-05-21T07:38:18 1779349098

How do you do it?

olmo23 · 2026-05-21T09:55:35 1779357335

I basically open up a new conversation, copy paste a link to Theo's latest video and ask it to summarize his yapping :p

renticulous · 2026-05-21T10:05:36 1779357936

I copy paste the transcript because sometimes youtube has blocked AIs from scrapping

sgerenser · 2026-05-21T11:33:01 1779363181

Would YouTube really block Gemini? I thought summarizing (or asking questions about) YouTube videos was one of their advertised features.

renticulous · 2026-05-21T12:17:26 1779365846

I don't like youtube gemini summaries. whenever I have asked gemini about detailed comprehensive report on a long hour podcasts, it always refuses. hence claude is my go report generation AI.

olmo23 · 2026-04-28T07:31:34 1777361494

In certain circumstances, they might be :-)

But you can't "hack a server" using just these techniques: they would be a (small) part of a chain of exploits.

olmo23 · 2026-04-28T07:29:04 1777361344

This is an active area of research. Demis Hassabis proposed training a model with a strict knowledge cutoff before 1915, and seeing whether it can independently arrive at general relativity.

olmo23 · 2026-04-22T13:43:59 1776865439

We are there. This is pretty much the reason why Mythos isn't being released publically.

pocksuppet · 2026-04-22T14:22:49 1776867769

The reason Mythos isn't being released publicly is to drive up Anthropic's valuation by making big promises.

dymk · 2026-04-22T14:32:22 1776868342

https://blog.mozilla.org/en/privacy-security/ai-security-zer...

> As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes for 271 vulnerabilities identified during this initial evaluation.

SleepyMyroslav · 2026-04-22T19:34:01 1776886441

I understand that they are trying to say that it is getting better... 271 vulnerabilities is a lot. I have been using FF for a long time. I am now considering if using it at all was a mistake or not. And I think it was.

warkdarrior · 2026-04-22T15:26:36 1776871596

So you're saying Mozilla is in on it, hyping up Anthropic. Are they getting a kickback?

bitwize · 2026-04-22T15:43:38 1776872618

What they're saying is that the capabilities of Mythos to find overlooked vulnerabilities in large code bases are real.

We're in a new era for security. You're either using AI to catch vulnerabilities in your code... or someone else is, and 0wning you.

dymk · 2026-04-22T16:40:04 1776876004

What I’m saying is the youths call this “smoking copium”

pocksuppet · 2026-04-22T17:54:08 1776880448

Both can be true at once. It can be good at finding vulnerabilities, and also overhyped to pump the stock price.