More

geocar · 2026-02-26T09:30:08 1772098208

I completely agree: If it is ugly-as-sin-but-useful I will learn it.

The aesthetic of mathematics as it appears in journals is I think questionable, but undeniably convenient for communication, so it is every language making the case that you (dear reader) can say something very complicated and useful in the ideal amount of space.

"Hello world" isn't that: That's the one program everyone should be able to write correctly, 100% of the time. That's how we can talk about brainfuck as exercise, but APL is serious.

Or put another way, even if seeing a new kind of "hello world" excites dear reader, it's probably not going to excite me, unless it's objectively disgusting.

What Om does here is exactly right for me: It tells me what it is, and makes it easy for me to drill down to each of those things to figure out what the author means by that, and decide if I am convinced.

I mean, that's the point right? I'm here trying to learn something new and that requires I allow myself to be convinced, and since "hello world" is table-stakes, seeing it can only slow my ability to be convinced.

geocar · 2026-02-24T06:18:28 1771913908

These "types" are hindley-milner types and have almost nothing to do with what C calls a type.

Your "feelings" may help you make snap judgements that can keep you alive, but they cannot help you code and they will conspire against you when you effort to learn new things. Nobody wants to feel wrong, and you will feel wrong many times when you learn something new, but it is the only way to actually learn the thing. Remember this the next time you have "feelings" about knowledge

throwaway27448 · 2026-02-24T15:00:44 1771945244

So what problem is this solving? No need to be a dick.

geocar · 2026-02-26T13:33:09 1772112789

> No need to be a dick.

But there was a need for you to characterise me so?

> So what problem is this solving?

What makes you ask me that instead of reading the website and papers for yourself? Do you think I could possibly know enough about the kinds of other problems you have from the example one that makes you call me names?

I mean, did you read even the first page of the paper I suggested? Were you confused by anything in the first paragraph? Do you know what System-F means in that context? Did you do an Internet search? Anything? Anything at all you could say you got stuck on that you didn't understand? Or did you somehow get the impression I should spoon-feed you?

Why do you waste anyones time with this?

leptons · 2026-02-24T06:28:33 1771914513

You don't know me.

geocar · 2026-02-24T06:14:07 1771913647

> If you're gonna make a website for your programming language, you NEED to put an example of the language front and center on the landing page.

Did you consider the possibility that this sort of thing was done to avoid wasting time with non-experts who think an "example" of a language they don't know is enough to make comments about?

> I still have not seen a line of TAL

My suggestion: Start with the "Papers" and then look at the paper that introduces TAL. It has an example program with analysis

makerofthings · 2026-02-24T06:24:56 1771914296

I’m an expert and I find it very frustrating when I don’t find some example code front and centre. It might not reveal the detail, but it sets the scene quickly and lets me know what sort of a thing I’m dealing with.

geocar · 2026-02-24T06:44:29 1771915469

> I’m an expert and I find it very frustrating

So you say, but I think _I'm_ an expert too, and I wasn't frustrated in the slightest. Maybe you're just not an expert in this space. Did you consider that?

Of course it would be nice if everyone communicated to us in our preferred way, but I think making the reader work a little bit before they have a conversation is a good way to figure out if you're dealing with an expert or not, because an expert actually worth talking to about your ideas will not find it to be too much work to understand them

Students can especially benefit from this advice, because they are still too new to be able to recognise experts from the substance of their words

nananana9 · 2026-02-24T07:10:57 1771917057

"I don't like having my time wasted" does not imply anything about one's skill in a field.

It's not 1995. Most of the internet is noise, and if you're showcasing something it's good form to immediately show your readers what actually is, and why they may or may not care about it.

Not showing the syntax of a programming language on the homepage of a programming language is poor communication. If you're OK with that - great, but not valuing your time and willingness to have it wasted in no way implies that you're an "expert".

geocar · 2026-02-24T07:19:35 1771917575

> "I don't like having my time wasted" does not imply anything about one's skill in a field.

I have no idea what you think you just said, but I did not say anything like that.

> Most of the internet is noise, and if you're showcasing something it's good form to immediately show your readers what actually is

So you say, but without responding to either of my suggestions for not doing this, and after saying something that doesn't sound relevant at all.

Of what exactly are you trying to convince me to do? I'm not the author of this page, I'm not confused by what TAL is, and I'm not going to agree that you don't deserve to have your time wasted when you're here wasting mine, so what is it?

geocar · 2026-02-24T05:41:41 1771911701

> We can't say that a function equals a set

Why not?

Can we not so easily speak of the set of all inputs and the set of all outputs? Why not exactly then is a function not a set of morphisms/arrows?

To me, x->x+1 and {(x,x+1)|x∈R} seem the same[1] but maybe it just seems useful to be able to make statements of the cardinality of that set: If there are a lot of rules, then that set is big, but if there are few rules (like x->x+1), that set is small. This is enough to permit some analysis.

It also preserves "plus" for sets, because a function plus a function is the sum of those rules being considered.

What is it do you think I am missing?

[1]: I understand I don't really mean big-R here because computers have limited precision for fadd/add circuits, so if you'd prefer I said something slightly differently there please imagine I did so.

herni · 2026-02-24T11:25:36 1771932336

You miss the point here. Just because functions happen to be sets ZF does not mean sets of functions are functions. O(...) denotes a set of functions.

geocar · 2026-02-24T13:42:26 1771940546

> Just because functions happen to be sets ZF does not mean sets of functions are functions. O(...) denotes a set of functions.

We can enumerate all programs up to a given length, so up to that limit all sets of functions are functions.

f(x)=O(g(x)) still makes sense in exactly this way: if g(x) is 1, then f(x) is a function that is O(1) right? How do we know g(x) is 1? Because all programs of some length that compute f(x) have that property. Of course there are longer programs that do it, and shorter programs that don't, and other programs still, but we're talking about these ones.

f(x)<O(g(x)) then says that the f(x) must be shorter than that; it isn't a member of the set.

What do you think I am missing?

geocar · 2026-02-22T05:37:42 1771738662

> When talking about permacomputing, for example, I don't know why one wouldn't encourage, in any way possible, commercial viability that would lead to the stated goal.

Because capitalism is what destroys the world. Fucking duh.

There's very little point in spending so much time thinking about C compilers in forth that run on scavenged z80s these days if capitalism is actually viable.

yellowapple · 2026-02-22T10:29:12 1771756152

> Because capitalism is what destroys the world. Fucking duh.

The issue is that “commercial” includes plenty of not-necessarily-capitalist entities as well, like sole proprietors and cooperatives (sole proprietors being single-member worker cooperatives).

Of course, a society in which worker cooperatives and individual craftspersons are the dominant forms of economic participation is probably (hopefully!) also a society which has done away with intellectual property and the enforcement thereof, rendering software license terms (including non-commercial use clauses) entirely moot.

geocar · 2026-02-22T12:29:51 1771763391

> The issue is that “commercial” includes plenty of not-necessarily-capitalist entities as well

I see no issue, and believe me, I have the deepest empathies for people who participate in capitalism under duress.

If you could explain why I or anyone else should need to help some people murder so that those "not-necessarily-capitalists" we are so worried about can use my software without legal threat, I would happily listen to it, but I think you will be unconvincing.

I mean, you have realised that someone could just ask, right? I could listen to them, and if they had a reason that I agreed was good, I could give them whatever they needed for themselves without accessorising myself to that murder that others would do with those things.

abetusk · 2026-02-23T00:29:04 1771806544

Digital artifacts are the modern form of the means of production. Making digital artifacts available under a libre/free is giving these modern means of production to the public by putting it into the commons.

By requiring an interaction and explicit permission, you're putting yourself as a gate keeper to the means of production. In the best case, the uses will depend on your temperament at the time of request. In the worst case, such as a state backed morality commission, this could lead to corruption and abuse.

Part of capitalism is the ownership of private property, including intellectual property. If your focus is on making sure you have control over your work for things you think are moral, then the modern copyright system is in place for you to keep that control.

My focus is on empowering people with tools and making sure those tools aren't restricted in their use by a small oligarchy.

geocar · 2026-02-23T10:54:23 1771844063

> you're putting yourself as a gate keeper to the means of production.

Liar.

I am only the gatekeeper of my own efforts.

If you had any clue how to code yourself you I could not prevent you from making your own code.

> If your focus is on making sure you have control over your work for things you think are moral, then the modern copyright system is in place for you to keep that control.

> My focus is [on preventing people from exercising moral judgement].

Good for you, and thank you for being a shining reminder of the reason I don't contribute to open source.

geocar · 2026-02-15T02:06:00 1771121160

Try searching for thunderbolt lto.

I know MagStor has one with usb-c presentation.

geocar · 2026-02-15T02:00:47 1771120847

LTO9 is like 45TB for <$100 (I got a bunch for €55 a piece), so 4.5TB for <$10 is being generous. And even if you didn't think they lasted 30-40 years and made copies every 3 years, it's still cheaper, not to mention you have fewer tapes to manage.

Also: I don't have a bd/dvd player in my house today, so even if there are the most tremendous gains in medical sciences I'm almost certainly not going to have one in 100+ years, so I'm not sure m disc even makes cost-sense for smaller volumes.

Maybe if you want to keep your data outside for sunshine like the author of the article, but that's not me...

ghshephard · 2026-02-27T00:05:33 1772150733

In 20 years - what is more likely for you to be able to trivially go find on Ebay for < $50 - LTO Tape Reader - Random BD/DVD Player.

Likewise for 100 years, or 200 years. etc...

ALso - Archival media that needs re-copying every 20-30 years is not Archival media by my defintion

Dylan16807 · 2026-02-15T03:15:26 1771125326

LTO-9 tapes are actually 18TB, but yes they are a lot cheaper than optical discs. If you can afford the drive.

derefr · 2026-02-15T05:59:47 1771135187

> so even if there are the most tremendous gains in medical sciences I'm almost certainly not going to have one in 100+ years

Never say never. People of today are building "90s entertainment center" setups for nostalgia, complete with VCRs. Given how many generations of game consoles had DVD drives (or BD drives that supported DVDs) in them, I would fully expect the "retro gaming" market of 100 years from now to be offering devices that can play a DVD.

somat · 2026-02-15T02:32:09 1771122729

> Also: I don't have a bd/dvd player in my house today

You have just stumbled on the inherent problem with any archival media.

You really think you will have a working tape drive after 40 years?

Hell, in my experience tape drives are mechanically complex and full of super thin plastic wear surfaces. Do you really expect to have a working tape drive in 10 years?

As far as I can tell there is no good way to do long term static digital archives, And in the absence of that you have to depend on dynamic archives, transfer to new media every 5 years.

I think to have realistic long term static archives the best method is to only depend on the mark 1 eyeball. find your 100 best pictures, and print them out. identify important data and print it out. Stuff you want to leave to future generations, make sure it is in a form they can read.

queenkjuul · 2026-02-15T03:43:20 1771127000

I do think LTO is a common enough format, and explicitly designed to be backwards-compatible, that it is very likely to be around in 10 years. The companies that rely on it wouldn't invest in it if they didn't think the hardware would be available. 40 years, harder to say, but as someone who owns a fair bit of working tape equipment (cassette, VHS, DV) that is almost all 25+ years old, i wouldn't think it'd be impossible.

That said, i imagine optical drives will be much the same.

somat · 2026-02-15T04:38:10 1771130290

It is only backwards compatible two generations, occasionally something slips at the LTO trust (or wherever those things are designed) and you get three generations. But if I have a basement full of LTO1 tapes no currently manufactured drive will read them. I would have to buy a used drive and the drives were never really made all that well. Better than the DAT drives one company I worked for used for some of their backups. But still mechanically very complex with many many small delicate plastic parts that wear out quickly. Those DAT drives were super delicate and also suffered from the same generational problems LTO does. We had a bunch of DAT1 tapes somebody wanted data from but had no working drives to do so. All our working drives were newer DAT3 and 4

That was always the hard part to justifying tape backup. the storage is cheap. but the drives are very expensive. And never seemed to last as long as their price would warrant.

hugo1789 · 2026-02-15T04:51:27 1771131087

That also changed somehow... LTO-10 drives are not backward compatible and can only read/write LTO-10 media.

UltraSane · 2026-02-15T07:00:23 1771138823

That is because LTO-10 had to make an incompatible change to go from 18TB to 30TB

UltraSane · 2026-02-15T02:58:56 1771124336

For LTO tapes? Yes they will be available since the format is so common.

nosuchthing · 2026-02-15T03:26:58 1771126018

LTO9 is only 18TB.

The LTO compression ratio is theoretical and most peoples data will be incompatible with native LTO compression method used.

UltraSane · 2026-02-15T03:01:03 1771124463

3 years is way overkill. 10 years is more reasonable.

geocar · 2026-02-14T09:09:12 1771060152

> I can’t think of a single time I’ve needed a sorted list of only numbers.

Gosh. Let me try to convince you.

I use permutation arrays all the time: lists of indexes that can be used across multiple vectors.

This is much faster than the pattern of scanning rows, constructing tuples of (thingToSort . thingIWantInThatOrder) and making a custom sort function, and destructuring those tuples...

And really, not having to write custom sort functions is really really nice.

> Especially in telemetry, where mean is easy and median is not.

Funny. Yes median is obvious with a permutation array, and maybe mean is less so.

When your data is really big and not very variable, mean of x is roughly the same as the mean of any sufficient sample of x, and that sample can be meaningfully represented as a permutation array!

You can get such an array with reservoir sampling and some maths, and (depending on what you know of your data and variance) sometimes even simpler tricks.

That's kindof actually how the "faster than dijkstra" trick referred-to in the article works: Data sets with small variance has this same property that the min of x is roughly the same as the min of a sufficient sample of x (where the size of sufficient has to do with the variance). And so on.

Another big use-case in my code is trees: Apter trees have a flat memory layout which is convenient for permutation arrays which can simultaneously represent index, rotation, tombstones, and all sorts of other things you might need to do with a tree.

Give it a dig. There's good stuff in there.

geocar · 2026-02-12T19:18:44 1770923924

> How is not having a message-id a security risk?

CVE classify a lot of things that have nothing to do with security.

Not having a Message-ID can cause problems for loop-detection (especially on busy netnews and mailing lists), and with reliable delivery status notification.

Dealing with these things for clients who can't read the RFC wastes memory and time which can potentially deny legitimate users access to services

> It seems that Gmail is being pedantic for no reason

Now you know that feeling is just ignorance.

atherton94027 · 2026-02-13T05:49:27 1770961767

Well, gmail does not manage usenet groups and mailing lists. Delivery status notifications are considered best effort so it wouldn't make sense to block messages for that case.

Additionally, Gmail adds its own message identifier on every message (g-msgid) because it knows that message ids can not be trusted to be unique.

Finally just calling me ignorant is the cherry on top – please try to keep things civil on here.

geocar · 2026-02-13T08:42:45 1770972165

> Well, [google] does not manage usenet groups and mailing lists.

They do. Sort of.

Google used to nntp, and manages the largest usenet archive; They still have one of the largest mailing list servers in the world, and they still perform distribution on those lists via SMTP email.

They still have all of the problems associated with it, as do lots of other mail/news/list sites still do that are a fraction of Google's size.

> Delivery status notifications are considered best effort so it wouldn't make sense to block messages for that case.

Sure it does.

You consider them best-effort, but that doesn't follow that I should consider them best-effort. For a simple example: Consider spam.

In any event, if you keep sending me the same message without any evidence you can handle them, I'm not going to accept your messages either, because I don't know what else you aren't doing. That's part of the subtext of "SHOULD".

Most big sites take this policy because it is internal nodes that will generate the delivery notification, but the edge nodes that are tasked with preventing loops. If the edge node adds a Message-ID based on the content, it'll waste CPU and possibly deny service; If the edge node naively adds a Message-ID like an MSA, the origin won't recognise it, and forwarded messages can loop or (if sent to a mailing list) be amplified. There also are other specific documented requirements related to Internet Mail that edge nodes not do this (e.g. RFC2821 § 6.3).

However you seem to be assuming Google is blocking messages "for this case" which is a little presumptuous. Google is presumably trying to save themselves a headache of handling errors for people who aren't prepared to do anything about it, the most common of which is spam. And the use of Message-ID in this application is documented at least as early as RFC2635.

> Additionally, Gmail adds its own message identifier on every message (g-msgid) because it knows that message ids can not be trusted to be unique.

Without knowing what Google does with the g-msgid header, you are making a mistake to assume it is equivalent to the Message-ID header just because it has a similar name. You have no reason to believe this is true.

> Finally just calling me ignorant is the cherry on top – please try to keep things civil on here.

I am sorry you are offended to not know things, but you do not know this thing, and your characterising my actions will make it very difficult for you to learn something new and not be so ignorant in the future.

Think hard exactly about what you want to happen here: Do you want Google (et al) to do something different? Do you want me to agree Google should? Who exactly are you trying to convince of what, and to what end?

I am trying to tell you how to interpret the documentation of the Internet, in this case to be successful sending email. That's it.

I am not likely to try and tell Google what to do in this case because of my own experiences running mail servers over the last 30 years, but listen: I am willing to be convinced. That's all I can do.

If it's something else, I'm sorry I just don't understand.

atherton94027 · 2026-02-13T14:27:06 1770992826

> If it's something else, I'm sorry I just don't understand.

I'm trying to explain to you that you're speaking very authoritatively about how things were done 30 years ago but things have changed since then, Gmail won't even send a message non-delivery notification to non-DKIM hosts.

Same thing with the g-msgid I'm telling you about – google documented it explicitly as a unique identifier, see here for example: https://developers.google.com/workspace/gmail/imap/imap-exte...

So yeah, things changed.

geocar · 2026-02-13T15:53:06 1770997986

[flagged]

atherton94027 · 2026-02-13T18:42:09 1771008129

sic transit gloria mundi..

hsbauauvhabzb · 2026-02-12T19:38:13 1770925093

So add a message id at the first stop, or hard ban the sender server version until they confirm. A midway point that involves a doom switch is not a good option.

geocar · 2026-02-12T22:06:09 1770933969

> So add a message id at the first stop

That should have already happened. Google is not the "first stop".

> hard ban the sender server version until they confirm

SMTP clients do not announce their version.

Also I don't work for you, stop telling me what to do.

> A midway point that involves a doom switch is not a good option.

No shit. That's almost certainly a big part of why Google blocks messages from being transited without a Message-ID.

geocar · 2026-02-12T19:07:50 1770923270

> isn't a valid excuse to reject a client either.

Yes it absolutely is: https://www.rfc-editor.org/rfc/rfc2119 is quite clear.

    3. SHOULD   This word, or the adjective "RECOMMENDED", mean that there
       may exist valid reasons in particular circumstances to ignore a
       particular item, but the full implications must be understood and
       carefully weighed before choosing a different course.

If the client SHOULD do something and doesn't, and your server does not know why, you SHOULD disconnect and move on.

If the server has considered fully the implications of not having a Message-ID header, then it MAY continue processing.

In general, you will find most of the Internet specifications are labelled MUST if they are required for the protocol's own state-processing (i.e. as documented), while specifications are labelled SHOULD if they are required for application state-processing in some circumstances (i.e. other users of the protocol).

Dylan16807 · 2026-02-12T21:20:00 1770931200

> If the client SHOULD do something and doesn't, and your server does not know why, you SHOULD disconnect and move on.

That is not a rule.

In this situation the server can reject any message if it wants to, and not doing a SHOULD tests the server's patience, but it's still ultimately in the "server wanted to" category, not the "RFC was violated" category.

geocar · 2026-02-12T22:19:57 1770934797

[flagged]

Dylan16807 · 2026-02-12T23:39:56 1770939596

You are confused about what I'm doing. I'm not telling anyone what to do. I'm saying what category their actions fall into.

And the line of yours I quoted is still not supported by anything.

geocar · 2026-02-13T08:55:13 1770972913

> You are confused about what I'm doing.

Absolutely.

And if you're not confused by what I said, that's not obvious in the slightest.

> I'm not telling anyone what to do.

So you say.

> I'm saying what category their actions fall into.

I think that's pretty weird that you think you get to decide that all by yourself.

But I'm not playing, because you're right: I don't know why you would be doing that.

> And the line of yours I quoted is still not supported by anything.

Yes it is. It's a description of the behaviour of other Internet hosts, and it's a description of exactly what is happening in the linked article.

Dylan16807 · 2026-02-13T09:24:10 1770974650

> But I'm not playing, because you're right: I don't know why you would be doing that.

This is the comment you originally replied to:

>> If you're implementing a server, "the client SHOULD but didn't" isn't a valid excuse to reject a client either.

>> You can do it anyway, you might even have good reasons for it, but then you sure don't get to point at the RFC and call the client broken.

They're talking about how to categorize actions, just like I am.

So I thought you were playing on that same topic.

But if you weren't on that topic, and given that you quoted only the first one of those sentences, I have a guess.

I think you didn't realize how the second sentence affects the meaning of the first one, and you misunderstood what they were saying as trying to tell servers they can't reject. They were not trying to tell servers they can't reject.

If that's the case, then this whole line of conversation is pointless, because you were rebutting an argument that nobody made.

> It's a description of the behaviour of other Internet hosts, and it's a description of exactly what is happening in the linked article.

Taking a description of behavior and throwing an RFC-style "SHOULD" in front is only going to be correct if you get lucky. "is" and "SHOULD" are different things!

drecked · 2026-02-12T21:40:42 1770932442

That clearly means it’s not required.

How does Google know whether or not the sender has a valid reason? They cannot know that so for them to reject an email for it means they would reject emails that have valid reasons as well.

conductr · 2026-02-12T22:55:45 1770936945

How would the sender know the consequences of sending without the header? You shouldn’t assume anything here. As a sender, you should include it unless you’ve already worked out what the recipient is expecting or how it will be handled. Doing this with email is silly because the client is sennding to so many different servers they know nothing about so it’s basically a requirement to include it.

geocar · 2026-02-12T22:16:32 1770934592

> That clearly means it’s not required.

You and I have different definitions of "clearly"

It is not required for the protocol of one SMTP client sending one message to one SMTP server, but it is required for many Internet Mail applications to function properly.

This one for example, is where if you want to send an email to some sites, you are going to need a Message-ID, so you SHOULD add one if you're the originating mail site.

> How does Google know whether or not the sender has a valid reason?

If the Sender has a valid reason, they would have responded to the RFC (Request For Comments) telling implementers what they SHOULD do, rather than do their own thing and hope for the best!

Google knows the meaning of the word SHOULD.

> it means they would reject emails that have valid reasons as well.

No shit! They reject spam for example. And there's more than a few RFC's about that. Here's one about spam that specifically talks about using Message-ID:

https://datatracker.ietf.org/doc/html/rfc2635

buran77 · 2026-02-12T21:53:55 1770933235

> If the server has considered fully the implications

The server "considers" nothing. The considerations are for the human implementers to make when building their software. And they can never presume to know why the software on the other side is working a certain way. Only that the RFC didn't make something mandatory.

The rejection isn't to be compliant with the RFC, it's a choice made by the server implementers.

hsbauauvhabzb · 2026-02-12T19:35:43 1770924943

Either the server must explicitly confirm to servers or the clients must accept everything. Otherwise message delivery is not guaranteed. In the context of an email protocol, this often is a silent failure which causes real-world problems.

I don’t care what the protocol rfc says, the client arbitrarily rejecting an email from the server for some missing unimportant header (for deduction detection?) is silly.

behringer · 2026-02-12T21:00:49 1770930049

If it was unimportant it would be MAY.

hsbauauvhabzb · 2026-02-12T21:57:55 1770933475

Is the server somehow unable to inject an ID if the sender did not send one? Stop hiding behind policy and think for yourself.

geocar · 2026-02-12T22:30:51 1770935451

> Is the server somehow unable to inject an ID if the sender did not send one?

Yes. https://www.rfc-editor.org/rfc/rfc2821#section-6.3 refers to servers that do this and says very clearly:

    These changes MUST NOT be applied by an SMTP server that
       provides an intermediate relay function.

That's Google in this situation.

> Stop hiding behind policy and think for yourself.

Sometimes you should think for yourself, but sometimes, and friend let me tell you this is one of those times, you should take some time to read all of the things that other people have thought about a subject, especially when that subject is as big and old as email.

There is no good reason viva couldn't make a Message-ID, but there's a good reason to believe they can't handle delivery status notifications, and if they can't do that, they are causing bigger problems than just this.

behringer · 2026-02-14T16:55:43 1771088143

You want me to think for myself when writing an email server that interoperates with other email servers? Are you just clueless?