randyl's comments

randyl · 2026-01-13T23:05:24 1768345524

The SQLite team faced a similar problem last year, and Richard Hipp (the creator of SQLite) made almost the same comment:

"The malefactor behind this attack could just clone the whole SQLite source repository and search all the content on his own machine, at his leisure. But no: Being evil, the culprit feels compelled to ruin it for everyone else. This is why you don't get to keep nice things...."

https://sqlite.org/forum/forumpost/7d3eb059f81ff694

squigz · 2026-01-13T23:08:08 1768345688

[flagged]

PrairieFire · 2026-01-13T23:21:18 1768346478

I am with you in that this rhetoric is getting exhausting.

In this particular case though, I don't think "evil" is a moral claim, more shorthand for cost externalizing behavior. Hammering expensive dynamic endpoints with millions of unique requests isn't neutral automation; it's degrading a shared public resource. Call it evil, antisocial, or extractive, the outcome is the same.

consp · 2026-01-13T23:49:51 1768348191

> shorthand for cost externalizing behavior

I consider that evil: having no regard for the wellbeing of others for your own greed.

LastTrain · 2026-01-13T23:13:03 1768345983

OK. How about shitty and selfish then?

sensanaty · 2026-01-13T23:27:56 1768346876

What other word would you use? I find "evil" quite an apt description.

themafia · 2026-01-13T23:21:49 1768346509

You can be ignorantly evil.

sodapopcan · 2026-01-13T23:32:34 1768347154

Absolutely evil.

Forgeties79 · 2026-01-13T23:19:16 1768346356

When they routinely do things like take down public libraries, yes, I consider it evil too.

Afforess · 2026-01-13T23:13:41 1768346021

Sounds like you have zero empathy for the real costs AI is driving and feelings that this creates for website owners. How about you pony up and pay for your scraping?

immibis · 2026-01-14T12:00:24 1768392024

There's zero evidence any of this is related to AI btw

philipwhiuk · 2026-01-14T02:06:00 1768356360

Lawful Evil

lysace · 2026-01-13T23:17:56 1768346276

[flagged]

juliangmp · 2026-01-14T00:21:00 1768350060

"Why don't you just clone the repo?" Yes. Why don't you?

If you're gonna grab a repo to make a code theft machine then at least don't DDoS the servers while you're at it.

lysace · 2026-01-14T00:28:03 1768350483

[flagged]

ninkendo · 2026-01-14T02:18:28 1768357108

Why don’t you take a moment to explain to the class why you think web crawling means you can’t cache anything?

It seems to me that the very first thing I’d try to solve if I were writing a tool for an LLM to search the web would be caching.

An LLM should have to go through a proxy to fetch any URL. That proxy should be caching results. The cache should be stored on the LLM’s company’s servers. It should not be independently hitting the same endpoint repeatedly any time it wants to fetch the same URL for its users.

Is it expensive to cache everything the LLM fetches? You betcha. Can they afford to spend some of the billions they have for capex to buy some fucking hard drives? Absolutely. If archive.org can do it via funding from donations, a trillion dollar AI company should have no problem.
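
A minimal sketch of the caching-proxy idea described above, assuming an in-memory dictionary and a fixed time-to-live. `CachingFetcher`, `fetch_from_origin`, and the one-hour TTL are hypothetical names and values chosen for illustration, not any real crawler's implementation; a production proxy would presumably persist to disk and honor HTTP cache headers.

```python
import time
from typing import Callable, Dict, Tuple
from urllib.request import urlopen


def fetch_from_origin(url: str) -> bytes:
    """Default fetcher: performs one real HTTP request against the origin."""
    with urlopen(url, timeout=10) as resp:
        return resp.read()


class CachingFetcher:
    """Hypothetical shared cache that every outbound fetch goes through."""

    def __init__(
        self,
        fetch: Callable[[str], bytes] = fetch_from_origin,
        ttl_seconds: float = 3600.0,  # assumed freshness window
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._cache: Dict[str, Tuple[float, bytes]] = {}
        self.origin_hits = 0  # how many times the origin was actually contacted

    def get(self, url: str) -> bytes:
        now = self._clock()
        entry = self._cache.get(url)
        if entry is not None and now - entry[0] < self._ttl:
            # Fresh copy on hand: the origin server is not touched.
            return entry[1]
        body = self._fetch(url)
        self.origin_hits += 1
        self._cache[url] = (now, body)
        return body


# Usage with a stand-in origin, so nothing is fetched over the network.
def fake_origin(url: str) -> bytes:
    return b"page body for " + url.encode()


proxy = CachingFetcher(fetch=fake_origin, ttl_seconds=60)
proxy.get("https://example.org/page")
proxy.get("https://example.org/page")  # served from the cache
print(proxy.origin_hits)
```

With this shape, repeated requests for the same URL inside the TTL cost the origin a single hit, no matter how many users trigger the fetch.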

nitwit005 · 2026-01-14T01:04:24 1768352664

"The malefactor behind this attack" isn't a complaint about the web crawler.

Forgeties79 · 2026-01-13T23:36:16 1768347376

There are people behind the web crawler. If they’re so well funded they can exert a little effort to not so badly inconvenience people as they steal their training data.

lysace · 2026-01-13T23:39:07 1768347547

[flagged]

mikestew · 2026-01-14T00:11:02 1768349462

> It may come as shock to you ("Video/audio producer")

It’s one thing to ignore parent’s point entirely, but no reason to be an ass about it.

NegativeK · 2026-01-14T00:11:52 1768349512

I've downvoted you for being incredibly aggressive in your responses. I'm not sure why you're ad homineming the parent commenter, but it's not helping the discussion.

Forgeties79 · 2026-01-14T02:54:42 1768359282

I don’t even really get what they are saying. I am also saying that they are hostile, and with all of their money they can afford to not be hostile. So I feel like we agree?

randyl · on July 24, 2024

Have you experienced the corruption issues mentioned in the original article?

randyl · on June 17, 2024

I really enjoyed "Coders at Work". Joe Armstrong's interview really stuck out to me; it seemed like he just thought differently than everyone else in the book. In a way, he reminded me of Larry Wall.

randyl · on Nov 13, 2023

Any idea what the root cause of the SQLite corruption was? There's some discussion on the SQLite forums about corruption with wasm (I've encountered it myself on a personal project), but from what I understand no one has identified a cause yet.

randyl · on March 14, 2014

I misread the headline, and thought it was saying NewsBlur is shutting down one year after Google Reader.

I used NewsBlur for a while, but didn't like the UI. I switched to InoReader.com, and love it.

randyl · on March 7, 2013

Along the same lines, I had bad posture for a long time, and always wondered what to do about it. "Stand up straight" was too simplistic, and "do yoga" was too broad.

The solution that worked for me came from books by Pete Egoscue. "Health Through Motion", especially: http://www.amazon.com/Egoscue-Method-Health-Through-Motion/d...

He describes how to analyze posture (once you learn, you see signs of bad posture -- aka muscle imbalances -- in nearly everyone: people at work, at the store, even professional athletes), and then he lists dozens of simple exercises that work to restore your muscle balance. And at some point you'll do an exercise that engages a muscle you haven't used in years, and when that happens, your body feels FANTASTIC!!

The books are just the tip of the iceberg, too. There are Egoscue clinics around the country that can take you to advanced levels.