Sometimes I feel I am the only person in the world who likes XML. It just followed the trajectory of every popular format: it ended up used in places it never should have been.
It is moderately readable and writable, and the tooling is great. Whenever I have to write it Emacs verifies the doctype for me and handles the structural part of it.
And, as the document shows, xslt makes it easy as hell to scan the contents of a file.
OPML is a good example in my opinion. I use it maybe once a year and it has never failed me.
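For what it's worth, here's a toy sketch of that kind of scan, using Python's stdlib ElementTree rather than XSLT, with made-up feed entries:

```python
import xml.etree.ElementTree as ET

# A tiny, invented OPML subscription list.
opml = """<opml version="2.0">
  <body>
    <outline text="HN" type="rss" xmlUrl="https://news.ycombinator.com/rss"/>
    <outline text="LWN" type="rss" xmlUrl="https://lwn.net/headlines/rss"/>
  </body>
</opml>"""

root = ET.fromstring(opml)
# ElementTree supports a limited XPath subset, enough for this kind of scan.
urls = [o.get("xmlUrl") for o in root.findall(".//outline[@type='rss']")]
print(urls)  # ['https://news.ycombinator.com/rss', 'https://lwn.net/headlines/rss']
```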
JSON can be a valid choice, as can XML, but I feel that the decision about which to use is too often based on fashion rather than choosing the best tool for the job. I wish this was different but there seems to be something structural in web development that favours the new over the proven, regardless of the circumstances.
I’ve never liked XML, per se, but I used it extensively, for decades, and even got fairly good with it.
These days, I mostly use JSON, but XML is pretty much an ironclad data definition and transfer protocol. You can define and transfer just about any type of data with it, albeit in a rather “prolix” manner.
> but XML is pretty much an ironclad data definition and transfer protocol.
I suspect that may be why you might not have liked it. XML is not a good serialization of data for network protocols. XML is a good serialization of documents. Ok, that's the received wisdom that I'm echoing, but it's also my experience and my opinion.
When it comes to serialization of data for network protocols there are, and have been, many other better-suited schemes. XML got used as a serialization protocol for the web because it's what existed at the time that was... close to HTML and textual, but it's got the disadvantage of being verbose.
Yes and no. You are correct about it being a document protocol, but it has long been set up as a big document protocol.
Most XML parsers are structured to parse and deliver XML in element-delimited packets, in asynchronous fashion. I don't know of many JSON parsers that can do the same. They do exist (I use one in the backend of one of my projects[0]), but they aren't as common. Packet-based/async XML parsing is built into OS SDKs, while JSON handling tends to be "The Whole Nine Yards": read the entire document, then parse it.
With Big Data/ML, I'm surprised that this is still a thing.
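To make the element-delimited, chunk-at-a-time style concrete, here's a minimal sketch using Python's stdlib pull parser (the sample chunks are invented; in practice they'd arrive from a socket):

```python
import xml.etree.ElementTree as ET

# Chunks as they might arrive off the wire -- no single chunk is a complete document.
chunks = [b"<feed><item>a</item>", b"<item>b</item>", b"</feed>"]

parser = ET.XMLPullParser(events=("end",))
items = []
for chunk in chunks:
    parser.feed(chunk)                       # feed whatever bytes are available
    for _event, elem in parser.read_events():
        if elem.tag == "item":
            items.append(elem.text)          # handle each element as it completes

print(items)  # ['a', 'b']
```

Each `<item>` is delivered as soon as its closing tag arrives, without waiting for the end of the document.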
> but it has long been set up as a big document protocol.
[Meaning stream parsing.] Yes, but typically one does not need that for small messages. I see that streaming decoders for Protocol Buffers are a thing now, but historically one did not bother with streaming for small messages; instead one streams lots of small messages.
> I don't know of many JSON parsers that can do the same.
libjq has one. With jsonlines or similar, if each text is small, there's no need for streamed decoding. Typically a DB query will produce a sequence of lots of small JSON texts, not one very large one.
If you're updating a DOM from XML then stream decoding makes sense, but in many cases streaming isn't ergonomic, just necessary when dealing with large documents (e.g., when there isn't enough memory to hold them without thrashing).
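The "lots of small texts" pattern is trivial to sketch in Python; each line of a jsonlines stream is a complete JSON value, so no streaming decoder is needed (the records here are invented):

```python
import io
import json

# Stand-in for a DB query result or log tail: one small JSON text per line.
stream = io.StringIO('{"id": 1}\n{"id": 2}\n{"id": 3}\n')

ids = []
for line in stream:
    record = json.loads(line)   # each line decodes independently, as it arrives
    ids.append(record["id"])

print(ids)  # [1, 2, 3]
```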
XML is such a versatile format, and I wish it were used much more. It doesn't have exactly the cleanest syntax, but it would be so much better than JSON in some of the cases I've seen, especially when you are transferring document-type data. Why use JSON to represent rich text, when XML is infinitely better?
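A toy illustration of the difference (my own example; the JSON node-tree convention shown is just one of many ad-hoc ones you'll meet in the wild): XML has mixed content, text interleaved with inline markup, built in, while JSON has to invent a scheme for it.

```python
import xml.etree.ElementTree as ET

# Mixed content is native to XML: text before, inside, and after the <em>.
p = ET.fromstring('<p>Say it with <em>feeling</em>, please.</p>')
print((p.text, p[0].text, p[0].tail))  # ('Say it with ', 'feeling', ', please.')

# JSON has no mixed content, so rich text needs an invented node-tree scheme:
rich = {
    "tag": "p",
    "children": [
        "Say it with ",
        {"tag": "em", "children": ["feeling"]},
        ", please.",
    ],
}
```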
XML was/is great. The issue was people abusing the shit out of CDATA and comments to do metaprogramming inside the XML, making it an absolute nightmare.
I believe that was an additional reason why comments were excluded from the JSON spec. I can't find the exact quote, but Crockford's comment about excluding them to keep people from smuggling in parser directives is pretty bang on, considering JSON's primary use as an interchange format (and its staying power is evidence it was a good decision).
It's not just you. I think web development would be in a much nicer place today if we had spent the last 20 years improving XML and XSLT rather than abandoning them for JSON and client-side JS.
People are starting to realize that most sites really boil down to parsing server state and rendering DOM. We never needed to do all this nonsense with serializing all state to JSON and shipping the entire rendering pipeline to the browser; that was just a heavy-handed solution for a very specific scaling issue at Facebook.
Just as a thought exercise, if XML had been the de facto interchange format, how much would that have added to the historical bandwidth transfer of the Internet? Even 1 KB added to every AJAX call would add up pretty significantly pretty fast, I'd imagine...
Obviously JSON isn't well optimized either but I wonder how much, if any, progress might have been slowed by XML syntax clogging the pipes even more.
Resource requirements expand until they hit a user-noticeable limit. Even ultra-compressed, every-bit-counts encodings[0] would be ignored and abused until they're bloated to a user-noticeable limit. Or the extra bandwidth would be used for more video ads.
> how much, if any, progress might have been slowed by XML syntax clogging the pipes even more.
Depends what you mean by "progress", and if you think Web development has been improving or devolving over time.
XML is definitely more verbose than JSON, though I'd be very surprised if an average content-heavy site would be smaller with something like JSON + react. I'd be surprised if server components tipped the scales either given that the server state would still be shipped as HTML and/or a virtual dom representation.
I liked XML 1.0. I gave up after getting tired of the standards community not prioritizing users - the thickets of interdependent specs, dearth of good documentation, and critical lack of work on quality implementations of the standards (e.g. no decent editors except $$$ oXygen, libxml2 and Xalan never implementing anything newer than 1990s-era XSLT 1.0, often missing or conflicting examples of anything non-trivial, etc.). I really wish there'd been an effort to focus on the basics so there wasn't such a gap between the vision of the standards committees and the lived experience of most users.
I love love love XML, but when I encounter an effort to use it to carry presentation like HTML alongside executable code -- such as Apache Jelly -- I regret the choices that brought me to that place in my life.
I wouldn't call OPML a good example of XML for reasons I detail elsewhere in the thread. But if you need a subscription list of feeds for import or export it's alright.