Microsoft, Google, Mozilla, and Apple Object to W3C Fork of DOM Spec (github.com)
549 points by tptacek 8 months ago | 374 comments




Thanks. The TL;DR seems to be that, instead of sticking to documenting what is, the W3C is (either deliberately or through incompetence) trying to push their own "vision" for DOM 4.1 without browser buy-in.


That does not mean it is a good thing to put all the bells and whistles from different browser vendors under the same umbrella.

Let's say you a) own and b) manage a project that three competing teams are working on in parallel.

If you do not curate the project, you will have one team adding <marquee> and another adding <blink>...

In the real world you would invite a dedicated architect, or a team of architects, to define the spec that all three teams will implement.

All this WHATWG vs. W3C flame is about bazaar vs. cathedral management styles, I think.

Origins of the mess: the W3C itself has no architects on board - they just try to moderate the votes of others, where each vote has an obvious weight (weight(Google) > weight(JohnButSmart)).

By contrast, the WHATWG has a professional architect at the top of the construct - Ian Hickson. In fact, the WHATWG was created by him. But Ian is associated with Google, and that makes the WHATWG's legitimacy a bit questionable.


This information about how the WHATWG works is somewhat outdated; please see https://whatwg.org/faq for more.


I'm pretty sure Ian Hickson is no longer involved with the WHATWG. I think he's working on Flutter now (https://news.ycombinator.com/threads?id=Hixie).

In fact, I would say that it's not quite like a cathedral and bazaar here.

The W3C is like a legislative committee. Lots of things done by committee vote. Formal process that must be gone through to advance beyond committee. Petty politics that screw with actually getting things done. The committee chair having the ability to block work by procedural means.

The WHATWG is more like an open forum. The work happens out in the open. Anyone can participate in the discussion (the W3C committees frequently have private meetings, private mailing lists, and so on). There are a few people who have commit rights to the repo, and so actually control what goes in and what doesn't, but they are generally willing to let in changes that have broad support and are implementable, rather than letting such features get held up by political processes.

Besides the difference in structure, there's just a difference in attitudes. The W3C tends to strongly favor certain principles, like accessibility and modularity, to the exclusion of compromise for technical or real-world reasons. They also seem to have a tendency to get very attached to particular ways of doing things. I think the biggest example of this was longdesc; it was never implemented properly in pretty much any browser, and very few people actually followed the spec and had it point to a URL (many just copied the alt text, or put a longer description in the attribute instead of a URL), so even if browsers or screen readers had implemented it, the content wouldn't have been useful. But people in the W3C made a big stink about removing it, and spent a lot of time and effort fighting and litigating over that, rather than trying to work on a different feature that could gain wider adoption.
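To make the longdesc situation concrete, here is a sketch (file names made up): the spec intended the attribute to hold a URL, while the common mistakes described above put text there instead.

```html
<!-- Intended usage: longdesc holds a URL pointing to a full description -->
<img src="sales-chart.png" alt="Q3 sales chart"
     longdesc="sales-chart-description.html">

<!-- Common mistakes: copying the alt text, or inlining prose instead of a URL -->
<img src="sales-chart.png" alt="Q3 sales chart" longdesc="Q3 sales chart">
<img src="sales-chart.png" alt="Q3 sales chart"
     longdesc="A bar chart showing quarterly sales rising">
```

A screen reader following the spec would try to fetch the longdesc value as a URL, so the last two variants produce nothing useful.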

The WHATWG tends to take a more pragmatic approach: pave the cowpaths; if there are differences between implementations, do it in the most sensible way that preserves compatibility.

Now, what the WHATWG has produced hasn't always been the best; there are times when it's made mistakes in its approach. The drag and drop spec, which had been reverse-engineered from IE's drag and drop support, was pretty bad; not sure if it's gotten better since.

But overall, the WHATWG has been a lot more productive in getting standards done that are actually used on the web, because they involve the implementers, and don't override them with tedious, drawn out, political battles over obscure features that no one uses.


the formal complaints of all browser people seem to support the WHATWG


The browser people are the WHATWG. There are no other stakeholders in that.


Not sure exactly what your comment is saying, but at least one reading is that "browsers are the only stakeholders in the WHATWG", which is not accurate. The WHATWG is a community organization open to participation by all; see https://whatwg.org/faq#process for more. We receive a lot of participation from users, web developers, and other companies.

There is a formal group, the Steering Group, which represents the browsers that implement WHATWG standards, and serves as the point of final appeal if the community doesn't come to a consensus on its own. But this is similar to the W3C appeals track where if all the paying member companies don't come to consensus, they appeal to "The Director" as the ultimate decider. ("The Director" is nominally Tim Berners-Lee, but recently all Director decisions have been made by W3C management "on behalf of" The Director.)


Are there any non-browser members as part of the editors or steering committee in the whatwg?

To me, this is giving browser makers even more power over HTML than they have otherwise.


The MIME Sniffing, Streams, Console, and Quirks Mode Standards are all edited by people who are not working for the browser-engine-developer companies. (Streams is co-edited by Googlers as well.) That's 4 out of the 15 standards currently developed at the WHATWG; not so bad, given how few companies are willing to pay people to work full time on web standards.

Of course, we have lots of work to do, and if you or anyone else are able to devote a good chunk of time to standards work, we'd love to have more editors---no matter what company they work for.

Browser engine developers comprise the steering group, since ultimately we want the specs to reflect what will be implemented in web browsers, so the best way to resolve disputes about what should be in the spec is by asking the people who will be spending their resources to implement it what they think. I think this is probably the right way to go, instead of "The Director" having the final voice.


> That's 4 out of the 15 standards currently developed at the WHATWG; not so bad, given how few companies are willing to pay people to work full time on web standards.

Thats actually nice to know, but specifically about HTML (which is one of the most important specs the whatwg works on) it's all browser makers. (That too, majority of them from one particular browser maker).

>I think this is probably the right way to go, instead of "The Director" having the final voice.

I think having a better mix of people in the decision making process than just browser makers would be the right way to go. I understand that browser makers have to implement the specs, but the web community as a whole has to use the specs to build the web of the future - and as such, people from non-browser companies should have a greater say in the final voice.


Oh, you're right, I forgot HTML! That is co-edited by someone from Bocoup too. So make it 5 out of 15.


Not only are there non-browser-maker WHATWG stakeholders, there are non-browser-maker formal objectors to the W3C DOM move to CR endorsing the same view as the four browser makers (Bloomberg and Disruptive Innovations; I'd never heard of the latter before, AFAIK).


I was curious so I googled a little bit:

Bloomberg is known for its Terminal. It seems they have a program where you can extend the terminal with web applications. Presumably Bloomberg didn't build a renderer themselves, but there are custom APIs. So they are something between a browser maker and a UI framework like Electron.

Disruptive Innovations is a firm run by Daniel Glazman, formerly of Netscape and of CSSWG fame. His main product seems to be a continuation of the editor part of the original Mozilla Suite, based on the Gecko layout engine. There was an editor called Nvu, and more recently one called BlueGriffon, which also does ePub and maybe his custom WebBook format.


Tldr is that corporations don't want a standards body with any public input so they created their own competing body, strangled w3c, and are now saying w3c is limited to being a rubber stamp for the standards they create.


This is not remotely the case. The W3C used to be the reference, up until they obstinately championed XHTML 2.0, which nobody wanted to write. Meanwhile, browser vendors wanted to implement all the fun stuff that web apps need but were being blocked by the W3C's glacial speed. So they founded the WHATWG for collaboration and standardization, and since then the W3C has been ripping off the WHATWG's standards, but always with inexplicable alterations.


This isn't accurate; notably, the WHATWG is the standards body that is actually open to public input (see https://whatwg.org/faq#process). Whereas to give input to the W3C, you have to pay membership fees (https://www.w3.org/Consortium/fees; between $2250 and $77K depending on company size for the US). This latter model is commonly referred to as "pay-to-play" standardization.


You can give input to the W3C without that, but yeah, it's the input from the paying members that counts the most.

Additionally, the W3C has some private mailing lists, does private working group meetings, and so on. So yeah, the WHATWG is a lot more open when it comes to input.


> Additionally, the W3C has some private mailing lists, does private working group meetings, and so on.

For all the web platform stuff (I don't know about the semweb and digital publishing side of things), you can basically ignore the existence of the private mailing lists (they get basically no traffic, and people like me start screaming whenever anyone tries to have a technical discussion there). I'd rather they didn't exist, but realistically they're not a barrier to participation.

The F2F meetings and telecons are more problematic (because both are inherently exclusionary, either in time or in travel), but those are at least publicly minuted and any resolution from them can be overturned.


Unfortunately that's an over-simplification of the problem.

For 10-15 years we were told that using <table>s for layout was terribly wrong - without any reliable alternative mechanism.

Here is my proposal to the W3C CSS WG to add a flow property and flex units: https://www.terrainformatica.com/w3/flex-layout/flex-layout.... It covers both flexbox and grid features under the same mechanism, and it establishes a robust framework for other layout methods.

Note it is dated April 5, 2009. It took us almost 10 years to get something matching that.

So neither the W3C process nor the WHATWG process is perfect - there is no reliable feedback from customers (web designers) to browser vendors.

What if browsers provided us just an abstract DOM with a minimal HTML/CSS implementation and some extensibility/plug-in mechanism, with something like Java?

In that case we, the community, could provide better implementations:

   main {
     flow: HolyGrailLayout(params) url(/layouts/HolyGrail.class);
   }

   <main>
     <header></header>
     <footer></footer>
     <aside></aside>
   </main>
So instead of waiting 10 years for the weather to change, we could do something literally tomorrow, when we need it.

In this case the W3C would perfectly fit its role - high-level management of all this.

No one can manage reality in full, down to the need for toilet paper. The USSR tried, and where did that go?


> What if browser would provide us just abstract DOM with minimal HTML/CSS implementation and some extensibility/plug-ins

That's what Houdini layout extensions are about, if that work ever comes to fruition that will indeed be a pretty major advance for web design.


It's interesting that a formal objection is done by creating an issue on github.


A sign of the times perhaps? It makes sense given the repository, but of course, it does make it challenging to verify the authenticity of the request when, for all we know by looking at the messages, they could be random users.


Any group discussion could be "random users" to an outsider. To people at W3C, or even to people who just follow web standards development, the names are pretty instantly recognizable, and presumably since these accounts have been added to the W3C's GitHub org, W3C feels confident that they are who they claim to be.


It does say that they are members of the W3C organization.


So could I.

I mean, I see what you mean, but to a casual observer that's not very clear.


Your objection is funny. Other methods of communication could be faked. Here you have Github as an authority to check membership and the organisation's repositories as an official place of discussion (perhaps equivalent to the website).

You do have to trust that the org is legitimate, but you could also fake a website or a whole organisation.


GP is referring to the "Member" badge next to the commenter's name, not to any claims made in the comments themselves.

You could claim to be a member of the W3C, but unless you actually are that badge won't show up next to your name in the issue tracker.


The GitHub org member list is not an authoritative list of all delegates from all member organisations, and not all of them have GH accounts. One of our W3C participants conveyed our objection, and he isn't in the GH w3c org member list.


We know that Apple, Google, Microsoft, and Mozilla consider the WHATWG to be the canonical version. We don't yet know whether the other 450+ W3C member organisations that represent the wider web platform agree with this position or not, though.

Honestly, do the other 450+ W3C member organizations matter? (https://www.w3.org/Consortium/Member/List)


> Honestly, do the other 450+ W3C member organizations matter?

Well, the four objectors are responsible for browser engines that cover somewhere between 95 and 99+% of browser usage, depending on which set of stats you use and whether you count other browsers that have Chromium or Firefox upstream, including the system browsers of every major mobile and desktop OS.

So, no, in practice if those four agree on something, it is the way the web works.


Remember that Apple just implemented the Canvas tag. If you implement something good perhaps almost none of the W3C member organisations matter in pushing something.


Can you explain what you mean? According to caniuse[1] it has been supported for a while.

[1]https://caniuse.com/#feat=canvas

*EDIT I misread the parent. I didn't put the context for that paragraph together and read 'just' as in 'right now' instead of 'just decided to'.


Apple was the originator and original implementer of the canvas tag, in support of a feature they added to their Desktop operating system in 2005.

Prior to their implementation, canvas was not a tag that existed, nor was it supported in any version of HTML at the time, but it was later incorporated into a new version of HTML. Apple was a minor browser developer, as well as a minor OS developer and system designer, at the time.


I was confused as well, as Apple literally invented <canvas>.

I think what he meant to say was "Remember that Apple just went ahead and implemented the Canvas tag. [without waiting on any standards organisation]"


Yeah, I agree w/ your reading of the paragraph.


(sorry)


The canvas tag (and canvas APIs) were first developed/shipped by Apple in Tiger (so more than a decade ago), so yes, it has existed for quite a while :)


And note that the parsing of the canvas element was changed in a backwards incompatible way compared with how Apple originally shipped it. (It was originally a void element, with no closing tag, like img. It was changed to not be, which made the rest of the page vanish into the canvas element.) Standardisation isn't always plain sailing.
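A sketch of the difference described above (ids made up): under the modern container parsing, markup inside canvas is fallback content, so a page written for the original void-element parsing (no closing tag) has the rest of the document swallowed as fallback.

```html
<!-- Modern parsing: canvas is a container; its contents are fallback -->
<canvas id="chart" width="300" height="150">
  Your browser does not support canvas.
</canvas>
<p>This paragraph renders normally.</p>

<!-- Written for the original void-element parsing (no closing tag):
     a modern parser treats everything after <canvas> as fallback
     content, so the paragraph below vanishes into the canvas element. -->
<canvas id="chart2" width="300" height="150">
<p>This paragraph disappears in browsers with the newer parsing.</p>
```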


Hurray for the oligarchy.


> Honestly, do the other 450+ W3C member organizations matter? (https://www.w3.org/Consortium/Member/List)

Given that the W3C charter requires actual implementations in order for a standard to move forward, I'd have to say that no - entities other than those who might produce a significant implementation probably don't matter in this case.


The W3C Process considers all implementations equivalent: if you were to implement DOM 4.1 in Python, that would be as significant as a browser implementing it. That said, each group has to define what they'll consider "sufficient implementation experience" for each spec when they publish a Candidate Recommendation: the DOM 4.1 spec does not do this, and this forms part of Apple, Google and Mozilla's objections to the spec advancing to CR.

The DOM 4 implementation report, http://w3c.github.io/test-results/dom/details.html, is based on the tests in web-platform-tests: however, the web-platform-tests policy is that we test what browsers implement, and hence the DOM tests there are based on the WHATWG spec: there's no evidence provided that anyone has implemented what the W3C spec says in any case where it differs.


No.

They could matter, if they built a browser with competing market share. It's not a static equilibrium.


Did you look at the list of members? Who on the list is going to be building their own browser and what would the business case be? Out of the big four, only two of them even thought it made sense to build a browser from scratch. Apple and Google started off with KHTML.

EDIT:

To clarify. Who is going to be building their own rendering engine instead of taking an existing one - 3 of the 4 are open source - and building a browser on top?


The lineage of the rendering engine doesn't matter. What gives Apple and Google and Microsoft command of the standard is the fact that they mediate access to web pages; they own the actual customers. That's what matters.


Firefox doesn't own the customer.

But creating a rendering engine from scratch is hard, and there is no business case for anyone doing it from scratch. Apple didn't (they tried with Cyberdog ages ago); they used KHTML to create WebKit. Google didn't either; they started with WebKit. Opera gave up on their own rendering engine years ago.


Why is "creating a rendering engine from scratch" the bar here? A new player could fork Blink or WebKit.


If you fork an existing rendering engine, you're rather implicitly using a WHATWG DOM. In order to use W3C DOM 4.1, you'd either have to modify a WHATWG DOM renderer into a W3C DOM 4.1 renderer -- which is a bit like starting with the emacs source to build a vim clone -- or build your own. The inertia of forking an existing project pushes you towards the WHATWG implementation, not the W3C's. That's the point of forking: you get to preserve what already works.

Further, the W3C's argument historically has been to make the DOM easier to implement from the ground up (XHTML Strict), compared to the overall rat's nest of HTML5, which, as far as I'm aware, is still not fully supported anywhere [1] and is very loose about document structure errors etc. So, if you're planning to implement the W3C's DOM, it makes sense that you agree at least somewhat with the W3C's historical philosophy about what the web should look like and how it should behave, and so you're more likely to be concerned about the implementation difficulty of HTML5.

1: https://html5test.com/results/desktop.html


I think you have the cart before the horse here. The WHATWG spec is just a codification of what is. If we didn't have the WHATWG, you wouldn't be freed from the burden of supporting current webpages; you just wouldn't know what that burden entails. At any rate, refactoring Blink to meet the W3C's spec has to be easier than writing a renderer from scratch, even if you don't care about supporting noncompliant webpages, or as they're usually known, webpages.


If that's the case:

1a) Why would it matter that Microsoft, Google, Mozilla, and/or Apple object to W3C DOM 4.1 if they don't implement it?

1b) Why would Microsoft, Google, Mozilla, and/or Apple care enough to object to W3C DOM 4.1 if they aren't implementing it? Why would they even give any effort to a competing specification and just allow it to die from inactivity?

2) Why does what is in W3C DOM 4.1 matter if the high 90s percentage of users are served by a browser in the WHATWG DOM camp? This could probably be condensed down to "Why do W3C's specifications matter at all" really.


> Why would it matter that Microsoft, Google, Mozilla, and/or Apple object to W3C DOM 4.1 if they don't implement it?

It matters to the utility of the W3C DOM spec that it doesn't represent either what browsers have implemented or what they will implement.

> Why would Microsoft, Google, Mozilla, and/or Apple care enough to object to W3C DOM 4.1 if they aren't implementing it?

They care enough because they want the W3C, if it is going to write purported web standards, to do something that won't confuse developers and lead to browser vendors fielding complaints from developers who mistake useless W3C documents for something meaningful.

> Why would they even give any effort to a competing specification and just allow it to die from inactivity?

They don't want to have a competing specification, though they do not seem opposed to a specification with a different focus that is consistent with the WHATWG's to the degree dictated by that purpose.

> Why does what is in W3C DOM 4.1 matter if the high 90s percentage of users are served by a browser in the WHATWG DOM camp?

The idea is not to have opposing camps, though if the W3C insists on making it an opposing-camps situation, that becomes a real issue.


Exactly. They don't want any competing products so preventing the development of a standard they don't control is obvious.


If this is about preventing competition in the browser space, then why doesn't one of the browser makers with lower market share defect from Google's position?


No, you start with a WHATWG DOM. What you do after that is up to you.


Creating a browser isn't the only part of mediating access to web pages. In different senses, Digicert, Comcast, Akamai, and Cisco do that as well.


Fair point, but I think it is somewhat orthogonal to the discussion. I don't imagine Cisco cares which DOM spec is used in rendering the application layer bytes of the packets it routes.


The browser is the only part of the web-access stack where the user has any choice. I suppose they can choose the ISP as well but that effectively only changes access speed, possibly.


Nonsense. Web standards are supposed to be de jure, not de facto.

Once upon a time Microsoft had 90% of the browser market. We created web standards in order to prevent monopolies, such as the former Internet Explorer, from holding the market hostage. That's the whole reason behind web standards.

And yes, they matter even with an Internet Explorer that has 90% market share, because governments can and do enforce adherence. That's also the reason why Microsoft came up with OOXML, ODF being a threat even with a tiny market share.


It doesn't matter what is supposed to happen. In reality, if none of the popular browsers support the standard, it doesn't matter.

Governments are not going to force every major browser manufacturer to support a standard.

That's why W3C lost relevance.


> Web standards are supposed to be de jure, not de facto.

HTML5, in large part, was created to do exactly the opposite -- formally set down in writing all the de facto quirks of HTML as actually used, parsed and rendered in the real world, instead of continuing to prescribe behaviors which didn't match observed reality.


You and I lived through a different history, then, because if what you're saying were true, ActiveX should have been standardized.

We've got no ActiveX, so your claim is false. Mozilla actually could have implemented ActiveX; they refused to do so.

Also, let's not forget that Internet Explorer 6 had incompatibilities with the standard, including with XMLHttpRequest, even though Microsoft invented it.


For the most part it was documenting the common subset of how the browsers actually worked. Only one browser implemented ActiveX or ever wanted to, so it isn't in the spec.


Nothing in your comment actually refutes anything I said.


"Who is going to be building their own rendering engine instead of taking an existing one"

Just for the record: I did - https://sciter.com

It was not meant to render all possible pages from the Wild World Web, but it renders HTML5/CSS3 (some subsets, but still).


Impressive! So what are your thoughts on the technical merits of the W3C's DOM approach? Do you agree with Google/Apple, or is this just a case of them using their power to lock down the market?


Awesome! How would you compare your product to Electron?



TL;DR: It's small and performant.

But the thread is an interesting read.


Holy crap, that is bloody impressive!


> if they built a browser with competing market share.

As if building a competitive browser isn't hard enough, you then have to convince people to use it. Considering the walled gardens 3 of the big 4 are erecting around the platforms they control, that seems nigh impossible.


Users should have a voice too.


We definitely believe users should have a voice in the WHATWG, and thus in guiding what browsers implement. We strive to maintain an open and welcoming community; this has brought a lot of good ideas to the table.

A few years ago I gave a talk on this: https://www.youtube.com/watch?v=hneN6aW-d9w . I hope it's not too embarrassingly outdated now :)

In particular, unlike the W3C, we do not require membership fees (https://www.w3.org/Consortium/fees?countryCode=US&quarter=04...) for participation.


Right, that's my point :) But thanks for making it clear for those reading.


They do: they pick which browser to run, and thus give power to.


That's not a free, expressive choice. One shouldn't expect the spectrum of browser makers' choices to align with users' preferences. Also, people might use a browser because a website requires it, or because it was built into their phone, not because it aligns with their preferences.


Hmmm... do you remember the IE6 days? It was the best browser at the time - at least in terms of user base.

Yes, they made a lot of innovations there that we all use now. Most notably, the whole AJAX idea was born there.
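The IE lineage is still visible in the feature detection pages had to do for years: IE5/6 exposed the original Microsoft invention as an ActiveX control, while other browsers (and later IE7+) provided the standard XMLHttpRequest. A sketch of that historical pattern:

```javascript
// Historical AJAX feature detection, roughly as pages did it circa 2005.
// Returns an XHR-like object, or null if the environment has neither API.
function createXHR() {
  if (typeof XMLHttpRequest !== "undefined") {
    // Standard path: Mozilla, Safari, Opera, IE7+
    return new XMLHttpRequest();
  }
  if (typeof ActiveXObject !== "undefined") {
    // IE5/6 path: the original Microsoft implementation
    return new ActiveXObject("Microsoft.XMLHTTP");
  }
  return null; // no AJAX support at all
}
```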


I was about to say no, but then I clicked the link and the first entry I saw is an organisation I happen to know (Access-for-All, Swiss Foundation), a foundation dealing with accessible technologies. They're doing a lot of good work with educating developers here in Switzerland.


There are lots of smart people doing important work that are also on the W3C, but that doesn't make them truly influential in de facto web standards.

From what I've read (which is not that much), the relationship between accessibility advocates and web standards has been particularly fraught, with advocates pushing for standards features that are received poorly in the marketplace. The argument here being: not every good idea about HTML is best expressed as a fundamental part of the HTML standard.


I came here to ask the same thing! If all the major browser vendors think one way, then it’s irrelevant how many people disagree with them.


Serious question: why are the W3C still publishing or trying to publish standards for DOM and HTML and probably a few others, when no one that matters cares about them? Why not rather throw in the towel on those particular standards and acknowledge that the WHATWG has won on them?


> Serious question: why are the W3C still publishing or trying to publish standards for DOM and HTML and probably a few others, when no one that matters cares about them?

There is a potentially legitimate role for the two-track approach: if the WHATWG represents a moving target of what browser vendors have agreed to implement, and is essentially the vehicle for documenting the agreed future common web platform, then the W3C could present a versioned publication of the stable, widely implemented, currently usable state at a particular point in time. The W3C version would then be the target for conservative app developers who need something that works everywhere today; the WHATWG standard would be what people making browsers and other user agents target, and what more ambitious developers willing to deal with "can I use...?" pitfalls would be guided by.


Why can't people just use an older copy of the WHATWG standard as the new "stable" documentation? This is basically how caniuse.com and browserslist work today, allowing developers to precisely describe their compatibility targets and even automate their builds.

It seems unnecessary to have an entirely separate organization to just copy/paste/publish new "versions" of existing archives.
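For reference, the browserslist mechanism mentioned above is driven by a small config file that build tools (e.g. Babel, Autoprefixer) read to decide which features need transpiling or prefixing. The queries below are standard browserslist syntax; the exact targets are just an example:

```
# .browserslistrc - each line is a query; the results are combined
defaults
not dead
last 2 versions
> 0.5%
```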


> Why cant people just use an older copy of the WHATWG standard as the new "stable" documentation?

Because the order of incorporation into the standard and the order of implementation and stabilization aren't the same, and some features may be implemented incompletely in some browsers, what is stable and usable is a subset of features (and sometimes a subset of functionality within a particular feature) that doesn't correspond to any particular version of the LS. So you'd need manual curation.

> It seems unnecessary to have an entirely separate organization

Perhaps, though the audience and thus interested parties for the implementor-focussed spec and the developer-focussed spec are different.


The WHATWG is actually the only organization I know of that publishes a developer-focused specification; see https://html.spec.whatwg.org/dev/. (We only do it for HTML currently.)

Anyway, I agree with the grandparent poster that caniuse.com is a much better approach to documenting the interoperable subset than copying and pasting someone else's spec, and trying to delete the parts that are not interoperable by some threshold. We actually have caniuse.com boxes in the margin of the HTML Standard: see for example https://html.spec.whatwg.org/multipage/scripting.html#attr-s...

Finally, it's worth noting that we only incorporate features into WHATWG Living Standards if they have multiple implementer interest; see https://whatwg.org/working-mode#additions


I don't really understand how this would make the two-track approach legit. Couldn't they just version the specification under WHATWG if that's what they're after?

I just can't see a reason for the W3C to be handling any of this anymore except for money reasons.


In fact, we already do publish commit snapshots for every change we make: https://dom.spec.whatwg.org/commit-snapshots/


Then what is the purpose of the W3C in this case?

Many years ago I tried to get a membership to the W3C as I wanted to provide a voice for a company I worked for (and for myself, honestly) but found out that the lowest level of membership was many thousands of dollars. How can anyone who isn't already very well established ever be properly represented there?

Then you check out WHATWG and, as far as I can tell, there are never fees associated with being a member and participating.


Yeah, we try to make the WHATWG a welcoming place for all, with no pay-to-play structure. Please feel free to provide your voice there! We've gotten a lot of good community contributions and ideas.


Thanks. I plan to, though :)


W3C process dictates that two implementations exist, not that all major browsers implement the whole standard. Thus, a W3C fork of the standard is of no practical use to “conservative app developers”.

If you need to target specific (probably legacy) browsers, you check caniuse.com. By the way, the WHATWG HTML standard integrates little boxes with caniuse.com data. That is indeed useful to developers.


It's especially amazing that there's a comment asking what the WHATWG will give up in return if W3C gives this up... as if this is some kind of battle truce.

These people are living in wonderland and need to wake up to the reality that there's already a winner and the war is long over.


There definitely needs to be some curation, at least.

As an example, CSS: Google came up with Flexbox, Microsoft came up with Grid.

Now we have two competing layout methods doing pretty much the same thing. Yet they conflict in the sense that they define the same flexibility entity by two different means: flexbox uses a CSS property (which by itself conflicts with the CSS 2.1 box model) and grid uses fr units to define the same flexibility concept - the portion of free space left in the container after the other, fixed elements in it.

Problem is that all browsers from now on shall follow this mess.


Flexbox and Grid coming from different companies isn't really relevant (and, to note, Flexbox comes from Mozilla originally, if I'm not mistaken, it is ultimately based on parts of XUL).

They're also not competing: flexbox makes many 1D layouts easier than grid does. They're complementary, not conflicting.


Yes, flexbox was an old Mozilla XUL feature (<vbox>/<hbox>) where flexes were defined by attributes. We all agreed at the time that having presentation attributes in markup was not that good an idea, and that CSS flex was a no-brainer port of that thing, replacing DOM attributes with a bunch of CSS properties.

Problem is that flexbox ruins the CSS box model, which mandates that the width CSS property is what defines the width of the inner box of the element. Now they have flex-basis which, if defined in a galaxy far, far away, overrides that width with something else.

The above is already recognized as a mistake: https://wiki.csswg.org/ideas/mistakes

As for that 1D point ...

grid-auto-flow: row | column;

makes flexbox obsolete to a great extent.


> Problem is that flexbox ruins CSS box model that mandates that width CSS property is what defines the width of inner box of the element. Now they have flex-basis that if defined in galaxy far, far away overrides that width by something else.

I'm not quite following; could you please explain this? Thanks.


Having this CSS:

    .flex-container {
       border: 1px solid #555;
       display : flex;
     }

     .flex-container > span {
       display:block;
       border: 1px solid #900;
       flex:1;
       width:100px;
     }
and this markup:

    <div class="flex-container">
      <span>Foo</span>
      <span>Bar</span>
    </div>
what would be the width of each <span> there?


Well, flexbox was much simpler and easier to standardize on and implement sooner. It has been available in all major browsers since 2015.

Grid is great, and now works in most places, but it was good to have flexbox while grid was still in process; it only became available in Edge, in the final form, late last year, making it now available on all major browsers.

You're always going to have cases like this; where there's something that's simple that works now, and something better that comes later. If you always wait for the better one, it will take forever for things to get done. Not to mention that flexbox is probably simpler and easier to understand and use for some of the simple use cases, so people will still probably continue to use it despite the fact that grid is available.


Problem is that flexbox is a subset of grid. Or, to be precise: flexbox and grid are just two forms in a set of layout methods that we already have and will have in the future.

In a normal architectural process we would first establish a common infrastructure for all layout methods.

I proposed something like that 9 years ago: https://www.terrainformatica.com/w3/flex-layout/flex-layout.... but it didn't go through (Who am I and who are the browser vendors?)

So we would have a single property that defines the layout method:

   display:block;
   flow: horizontal; // flexbox now
   flow: vertical; // ditto
   flow: grid(
           rows: ...,  
           columns: ...
         );          // grid 
   flow: multi-column( columns: ... ); // current multi-col
   flow: stack;
   flow: row(label,input); // variant of grid 
So flexbox and grid are just parts of a larger entity - the set of current and future layout methods. Yet flexes have to be units, as they a) were from the very beginning ( see "proportional" units here: http://www.w3.org/TR/html401/struct/tables.html#h-11.2.4.4 ) and b) can be used in other layouts and properties (why not margin-left:1fr ?)

Note that each layout method has its own parameters in their own namespaces.

Currently the set of CSS properties is about 400, in one flat namespace. Any junior architect will tell you that this is close to an unmanageable state. But we keep pushing new stuff onto that x'mas tree. It will fall down under its own weight at some moment, but who cares ...


I use both flexbox and grid layout together. They are not competing, merely complementing each other. Grid can do things Flexbox can't do. Flexbox can do things Grid can't do.


"Flexbox can do things Grid can't do"

For example?


flex-wrap. flexbox is one-dimensional, so it can "wrap" items that don't fit to the next line. The wrapped items don't have to line up with vertical grid lines, like they would in a grid, and they can be stretched or centred to fit across the full width of the parent.


How is that different from a sequence of display:inline-block's (horizontal wrap) and multi-col layout (vertical wrap)?


I assume you're talking about CSS columns? Again, they're not designed for layout - they're designed specifically for newspaper-style columns of text where it doesn't matter which piece of text flows into which column. To use them for layout requires hacks or luck to adhere closely to designs. Pre-grid and -flex I spent many hours of my life dealing with oddities around CSS columns - my life is in a much better place now, not having to resort to it!


Like I said, stretching and centering. The justify-content property works on each row of a flexbox, so you can have a flexbox full of items with different initial sizes, automatically wrap the items that don't fit to multiple rows, and distribute the space around each item evenly. You can't do this with flow layout.


inline-block isn't designed for layout - so you get side effects like the preservation of whitespace which you have to use hacks to resolve. Sure it works - but flex-wrap is a syntactically correct solution that doesn't require hacks.


It really is all about not losing face.

W3C/TBL had “owned” HTML and DOM for way too long to just acknowledge that they botched it and that their work on that has no practical relevance any more.

“We are the organization that provides infrastructure for the standardization of stylesheets, and also some awesome-in-theory semantic web standards that are too complicated for actual implementations, and also some XML standards that are actually relevant for DTP” doesn’t seem to be the mission statement of choice for the creator of the web.


Politics and pride, mostly. It's really rare and difficult for an organization to voluntarily admit that it has no purpose anymore and dissolve itself. That's what the W3C should do, but the realities of human psychology mean that's unlikely to happen.


Interesting discussions from a year ago: https://www.reddit.com/r/javascript/comments/5swe9b/what_is_...

The W3C is very well funded and they don't wanna see the money gone.


It's sad to see the relationship between WHATWG and W3C has deteriorated to this point. Trying to wrangle a standard from a "living" (i.e. constantly changing) specification was always going to be tough but I'd have hoped both WHATWG and W3C would be able to maintain a working relationship.


Is there an article with the background on this? Why do we have both the W3C and the WHATWG, and why do the W3C just copy and paste work from WHATWG, if that is indeed what happens?


I don't know of an article, sorry. A brief history from memory would be that during XHTML days the W3C essentially let the HTML spec languish and people weren't moving to XHTML (at best they were moving to XHTML-like HTML).

So the WHATWG came along (mainly organised by the major browser vendors) and started the HTML spec moving again. This became part of what's known as HTML5.

However WHATWG doesn't exactly make a "standard" it makes a "living standard", which is a constantly shifting document which aims to describe where browsers currently are and what they hope to implement. The W3C decided to keep publishing its own HTML specifications and, as the WHATWG does describe what browsers are trying to do, the W3C's spec has to build at least partly on that work. There are differences though. For example, the W3C requires at least two implementations of a feature for it to be included in their spec.

The WHATWG has always opposed the W3C's spec. They see it as confusing to have two "official" specifications.


To put a slightly different spin on the same story as perspective always colours the telling:

W3C decided to deprecate HTML in favour of XHTML. Most of the web quickly moved to XHTML. One individual (an employee at Opera, then Mozilla, finally and currently Google) wrote an oddly influential opinion piece saying that the move to XHTML had been somehow harmful, and pushed for the major browser vendors to form a rival, non-democratic standards body (the WHATWG) to the W3C, which forked and completely redefined HTML.

The W3C, which unlike the WHATWG has many voting members from many backgrounds, not all related to browser making, quite understandably was never fully on board with the new WHATWG HTML spec efforts. However, with the level of adoption and support it received (mainly from being the creation of the powerful browser vendors) W3C were eventually pressured into conceding to advocate for HTML. Which they've done by maintaining a copy, rather than blindly directing people to the work by what for all intents and purposes effectively amounts to a rival organisation, and an extremely undemocratic one at that.

As web developers, we should follow the WHATWG and ignore the W3C, because the W3C have lost the political battle for HTML and we need to get our stuff working on browsers, all of whom follow WHATWG. But that's an unfortunately pragmatic approach that shouldn't amount to acceptance.


> Most of the web quickly moved to XHTML.

This simply is not true. The web moved to an XHTML-like dialect of HTML which was still served as text/html and browsers interpreted it as "HTML soup" because actually serving pages as application/xhtml+xml would have broken the majority of the web because browsers would actually validate them and refuse to display a page at all if there was even a single missing close tag.


> This simply is not true. The web moved to an XHTML-like dialect of HTML

You're thinking of XHTML 1.1 or XHTML 2. That "XHTML-dialect" that everyone switched to was called "XHTML 1.0", which allowed serving as either content type.

If you're choosing to nitpick about the fact that most sites published would not have worked if served as application/xhtml+xml, I'd invite you to do a survey of sites currently being served as valid HTML5. It's not even that easy to verify as the Nu validator version in use varies so much depending on where it's hosted (or if it's a local jar), and which iteration of the living standard it conforms to is always ambiguous. Have you tried reading the WHATWG spec diffs?

For devs who might like to adhere to any kind of strict automated verification of spec conformance, that's now out of the question. With XHTML, even if you were serving non-well-formed XML with a text/html content-type, at least your markup could be trivially checked for conformance by almost any XML parser to see why it's not well-formed. It was actually conceivably viable to put that check into build steps or CI.

Serving application/xhtml+xml was a nice to have, but anyone believing that serving XHTML as text/html had no value completely missed the point. At least now, years later, the mess we're stuck with should make it a little easier to see though.


OK, so by lucideer's quirky definition of "XHTML", the vast majority of the web moved to XHTML. Based on the expansiveness of lucideer's definition, this appears to have encompassed web developers who probably weren't even aware they were writing "XHTML".

By the definition that most of us are using, which is that XHTML is compliant XHTML that could be rendered without error in browsers' XHTML modes, to a first approximation nobody ever did it. Even today XHTML-levels of precision in HTML requires an awful lot of API support and very careful usage; doing it ten years ago was above almost everybody's skill level.


> by lucideer's quirky definition

Which also happens to be the definition in the w3c xhtml 1.0 spec. You can choose to think that's quirky, please don't attribute it to me.

> definition that most of us are using, which is that XHTML is complaint XHTML that could be rendered without error in browser's XHTML modes

Which, again, is the definition used in the later w3c xhtml 1.1 & 2 specs, the former of which wasn't widely used, the latter of which was abandoned without being published at all.

If your issue with XHTML was that W3C were moving towards a direction you disagreed with, then you don't have an issue with the version of XHTML that was in popular use.

> XHTML-levels of precision in HTML requires an awful lot of API support and very careful usage

I'm not really sure where this view comes from. HTML validation is a lot more complex and difficult to achieve than XML well formedness, and HTML4/XHTML1 validation were both far simpler than modern HTML5 validation (the Nu validator is inordinately complex in comparison to the older DTD one). Furthermore, dev tools for ensuring XML well-formedness are far more readily available and integrated into most things even today, while HTML5 validation is such an obscure concept today I'm sure many devs don't even know it's a thing.


Which also happens to be the definition the w3c xhtml 1.0 spec. You can choose to think that's quirky, please don't attribute it to me.

Except that the number of people who actually implemented valid, well-formed, properly-served XHTML Strict -- of any version -- in compliance with all the relevant specifications is at best vanishingly tiny. XHTML Transitional was tag soup.

Your retort further up about many sites serving invalid HTML5 actually works against you, since HTML5 explicitly has a forgiving parsing model, while XHTML is explicitly "every error is a fatal error". If browsers had enforced the XHTML approach on every document using an XHTML DOCTYPE, we would have seen the death of XHTML much earlier.

This is why people say XHTML was never really adopted -- many people certainly put a "/>" to close their empty elements, and stuck an XML prolog and an XHTML DOCTYPE up at the top, but surveys like the infamous "XHTML 100" showed that next to nobody actually adopted XHTML in a manner compliant with the relevant standards.

And I say this as someone who, way back in the early 00's, was serving valid, well-formed XHTML as application/xhtml+xml. XHTML was a terrible approach, and the W3C process was dragging farther and farther from practicality at every revision (remember XHTML 2.0?).


You're talking about the ease of validation. Everyone else is talking about the ease of writing.


Oh, you mean like AMP, which now every major site supports, and which is even stricter than XHTML?


Turns out, when there's financial incentive to use strict syntax, people will... guess that's all XHTML lacked...


I mean... yeah? "there must be an actual benefit to do something that costs me development time = money". XHTML did not offer this.


> If you're choosing to nitpick about the fact that most sites published would not have worked if served as application/xhtml+xml, I'd invite you to do a survey of sites currently being served as valid HTML5.

Completely different thing. XML processing and all reasoning based on the premise of XML processing are fiction when XHTML is served as text/html. The HTML parsing algorithm and the rest of the processing requirements are not fiction when HTML is invalid.

(Why are we still talking about this in 2018. Sigh.)


Because someone asked and it explains some of the history quite nicely.

FWIW, maybe I'm the 1% but I wrote valid XHTML 1.0 for a while, but also soon gave up :P


I did server-side browser sniffing to give IE the version it understood (IIRC it couldn't handle well-formed XHTML served with the proper MIME type, not sure, it's been a while :D) while everything else got proper, fully compliant XHTML. I'm pretty sure I used a code snippet from Anne van Kesteren who is also posting here ;)


And the versions of IE in use didn’t support application/xhtml+xml anyway so you would have to switch to text/html based on the user agent string.

It was never clear what the technical benefit of this was supposed to be. I only ever saw one site whose pages served double duty as an API and UI by serving styled XML. It seemed like a challenging approach to pull off well.


I wrote an XSL stylesheet that turned an HTML page into a pretty-printed and syntax highlighted display of its source code.


The shoe web site skechers.com used to do this. With the removal of XSL support from browsers, though, it looks like it's now using some form of JS templating.


The Gentoo website / handbook does this.

Or rather, did this a few years ago when I was last messing around with Gentoo. It seems to be HTML now.


The Handbook was interesting in that it was one of the few sites that actually went with the XML + XSLT = XHTML route. Of course, nobody knows XSLT, and everyone hates XML, so it was dumped in favor of MediaWiki, which everyone still hates, but at least now mostly understands how to use. (although the same people that insisted we use XML+XSLT also insisted we use SMW, which is even worse... I gave up then, but I hear they're trying to undo SMW now.)


Interesting!

What's SMW? I'm not familiar with the term and searching for it isn't being particularly helpful.


Semantic MediaWiki


Oh I remember this period of time and damn this is true. I think about 80% of the pages on the web during a certain time period had that tramp-stamp of XHTML Validated button somewhere on the page.


Only the cool sites :)


And you couldn't use target="_blank" in anchors...


I remember the fierce battle in my mind trying to judge whether I wanted a proper strict xhtml page, or I wanted external links to open in a separate window... This was literally what drove me away from strict xhtml. Everything else I was on board for at the time.


I'm not certain that it's true to say most of the web quickly moved to XHTML. Sure, a number of sites advertised themselves as XHTML, but they were not strictly XHTML compliant. This could be due to third-party widgets or other included code, or it could be due to a mistake in template construction. Whatever the issue, fully compliant XHTML wasn't used much in practice outside of hand-crafted pages.

Also Internet Explorer, for example, never implemented XHTML which would have been a deal breaker for many sites.


> they were not strictly XHTML compliant

The vast majority were not strictly XHTML compliant, but whatever the figure was, I'd imagine it wasn't too different to all the many "strictly HTML compliant" sites now (compliant according to which commit?).

The point was they used XHTML, which means they could trivially choose to validate and test their XML well-formedness with built-in tools everyone had ready access to. Exposing your end-users to those conformance checks (i.e. the in-browser strict XML-parser) wasn't the only "value" offered.
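To make that concrete, here's a minimal sketch of the kind of well-formedness check meant above, using only the Python standard library (the function name is illustrative):

```python
# Well-formedness check suitable for a build step or CI:
# any XML parser can tell you whether XHTML markup is well-formed,
# regardless of what Content-Type you actually serve it with.
import xml.etree.ElementTree as ET

def is_well_formed(markup: str) -> bool:
    """Return True if the markup parses as well-formed XML."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False

print(is_well_formed('<p>Hello <b>world</b></p>'))  # True
print(is_well_formed('<p>Hello <b>world</p>'))      # False: unclosed <b>
```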


The point was they used XHTML, which means they could trivially choose to validate and test their XML well-formedness with built-in tools everyone had ready access to.

I'm honestly trying to figure out whether this is satire or not.


That seems to ignore the conventional wisdom that according-to-Hoyle XHTML was a DOA standard because it mandated error handling in ways that no browser implemented and most of the authoring community didn't want. Authors don't write well-formed XML, even today.


> it mandated error handling

XHTML 1.0 didn't mandate so-called "draconian error-handling", it just offered it as an optional feature.

XHTML 1.1 (which was released but no one used) and XHTML 2 (which was never finished nor released) did mandate it. I wasn't a big fan of that decision, I don't think it would've worked, but XHTML 1.1 was still very usable while ignoring that one requirement; throwing out the baby with the bath water was a massive overreaction on the WHATWG's part.


XHTML, other than Transitional, which nobody should count as "implementing XHTML", is an XML application. It inherits XML's parsing model. Every error is a fatal error.


>because it mandated error handling in ways that no browser implemented

IIRC Opera implemented XHTML error handling.


IIRC several major browsers implemented XHTML error handling, but only for documents with a Content-Type: application/xhtml+xml header, which was basically nothing because that would then trip up other browsers


Opera had the "draconical" approach, where upon an error you just got that, an error. Firefox, iirc, had a softer approach where you still got the page rendered, but you'd get the error reported too. Anyway, it all depended on the proper MIME type for the XHTML (as it should). However the whole MIME type and everything associated with it (some elements and APIs are treated differently) is a whole barrel of worms, so XHTML in any of its incarnations was never a good idea.


> "draconical"

That's xml error handling, following rules as written.


"Draconian" error handling is a term of art in HTML.


> not all related to browser making, quite understandably was never fully on board with the new WHATWG HTML spec efforts.

That's the other thing that pisses me off about the WHATWG, is how much they shit all over XML and other interoperability technologies. E.g. their URL standard (because, why not fuck the IETF as well) basically ignores anything non-HTTP for specious reasons.


The only reason there is a URL standard in the WHATWG is that the IETF URL RFC didn't define error handling and this led to interop problems. So there was a need for a URL standard that _would_ define error handling. The IETF refused to produce one (basically said "fuck you, we don't care about your use cases or interop problems" with slightly more polite wording), so the WHATWG ended up doing it...

I'm not saying this is a great situation. I'm not saying the WHATWG couldn't try to do better at considering non-HTTP or non-browser use cases here. But the representatives of those use cases in the IETF told browsers to take a hike. And then browsers did.


The first line of the WHATWG URL spec says that it deprecates and replaces all IETF URL standards.


For its target audience (browsers and web pages) it does.


A number of folks involved in WHATWG work bought into the XML vision initially, but reality has a strong text/html bias and we've been able to adjust views as experience has accumulated.

See https://annevankesteren.nl/2011/02/xml-tired

(Personally, the first time I managed to get funding to work on a Web engine was to make Gecko's XHTML-as-XML support better. At the time, I thought it was so important that I sought funding to get it done...)


What do you mean by non-HTTP? It handles URLs whose scheme is not http(s): just fine...


I forget the specifics, but there are several incompatibilities between the IETF URI and IRI spec and the WHATWG URL spec (see [1], EDIT: as I'm sure you're well aware, given your username). The WHATWG spec amounts to "what four popular web browsers do", explicitly without considering compatibility with the hundreds (thousands?) of other non-web-browser tools that make use of URIs.

What you've defined are effectively not URLs. Very similar, but different. If you wanted to call them "WHATWGRLs" or something I wouldn't care. But they're not URLs, and the WHATWG is choosing to muddy the waters rather than, say, specify an optional legacy compatibility layer on top of the IETF spec. It's one thing to say "in addition to IETF URIs, browsers should also accept these malformed URIs, but should not accept these valid but problematic URIs"… it's quite another to say "URIs aren't that anymore, now they're this".

[1] https://daniel.haxx.se/blog/2016/05/11/my-url-isnt-your-url/


I'm not sure I get the distinction. As for curl, it doesn't follow any standard which seems worse, but does at least helpfully demonstrate that the RFCs cannot be implemented by major clients.


> Most of the web quickly moved to XHTML

No, it didn't.

At best a large share of new, greenfield development moved to XHTML, but I'm not convinced it was a majority of even that.


> Most of the web quickly moved to XHTML

Ridiculous and absurd. A small proportion of the web moved to invalid XHTML that rendered as tag soup because it was sent as text/html. Virtually no websites actually served XHTML as XHTML, because: 1. there were and still are no compelling technical benefits of XHTML, 2. it broke Internet Explorer, 3. most webmasters were and still are incompetent and have no clue what Content-Type is.


>W3C decided to deprecate HTML in favour of XHTML. Most of the web quickly moved to XHTML.

In some parallel universes, yes.

Even if so, there's also the fact that XHTML wasn't updated itself with features people needed.


What features?

If you're referring to "features" in the HTML5 spec., like canvas, webgl, geolocation, DOM etc. they were separate specs, which WHATWG lumped into one monolith (though they're mainly JavaScript APIs, and aren't directly related to HTML). They were being worked on separately to XHTML, and still work fine with XHTML to this day.


>they were separate specs, which WHATWG lumped into one monolith

For which I could not care less. Whether there's a big spec for HTML5+JS APIs, or 20 different specs, is a bureaucratic concern, not a concern to the developers or the end users.

W3C might have had them "neatly" separated, but it also hasn't moved them a notch towards completion and release for more than a decade.

I've used and worked for the web before the W3C, in its heyday, in its long decline days when we were waiting a decade+ for some progress, and after it became irrelevant. Now it's a way better situation.


> Most of the web quickly moved to XHTML.

Most of the web didn't move to XHTML.

A lot of people who were interested in being standards compliant moved to XHTML 1.0 Transitional, which was the HTML compatibility subset, but they only ever served it and validated it as HTML, not XHTML, because if you served it as XHTML, one single stray < that someone had forgotten to quote somewhere would break the parsing of the whole page.
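A small sketch of that parsing difference, using only the Python standard library (names are illustrative): an unquoted stray < is a fatal error to an XML parser, while an HTML parser just recovers and keeps going.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# One forgotten-to-quote "<" in otherwise fine markup:
page = '<html><body><p>2 < 3 is true</p></body></html>'

# XML mode: the stray "<" aborts parsing of the whole document.
try:
    ET.fromstring(page)
    xml_ok = True
except ET.ParseError:
    xml_ok = False

# Tag-soup mode: the parser treats the stray "<" as text and carries on.
class TagCollector(HTMLParser):
    def __init__(self):
        super().__init__()
        self.tags = []
    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)

collector = TagCollector()
collector.feed(page)

print(xml_ok)                  # False: every error is a fatal error
print('p' in collector.tags)   # True: HTML parsing recovered the page
```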

The piece written by Hixie was influential because it was a wake up call that the direction the standards bodies were going in was pretty much fruitless, and that there could be a much better way to do it which wouldn't involve breaking compatibility with all of the existing content and would give web developers and users features that they actually wanted.

> As web developers, we should follow the WHATWG and ignore the W3C, because the W3C have lost the political battle for HTML and we need to get our stuff working on browsers, all of whom follow WHATWG. But that's an unfortunately pragmatic approach that shouldn't amount to acceptance.

I fail to see how there is anything unfortunate about this. How would rewriting everything in XHTML 2.0 (https://www.w3.org/TR/2010/NOTE-xhtml2-20101216/), and having to be extremely conscious of any possible stray < that could sneak into a page without being quoted, have been preferable to:

1. Consistent parsing support for existing content, and content that might have slight problems like stray <, in all browsers

2. Standardization of things that people actually use to build web apps, like XMLHttpRequest and Canvas

3. Consistent handling of encodings between browsers, including encoding sniffing

4. Consistent handling of quirks mode vs. standards mode between browsers

5. Actually having browsers support compatibility with vendor-prefixed versions of features, because some browsers widely used introduced prefixed features that web developers actually started relying upon

And also, have you ever tried getting involved with the WHATWG process? I have, and I find that they are very receptive to intelligent discussion of issues.

What doesn't work well is to insist that you have a problem and that this particular solution must be used to address the problem; because a lot of times, it's easy to come up with some proposed solution but it then turns out that it's either a lot more complex in practice, your proposal doesn't fit in with the rest of the ecosystem well, or the problem can actually be solved just in tooling on top of HTML without having to change the spec at all and then wait for multiple browser vendors to all independently implement it.


> and having to be extremely conscious of any possible stray < that could sneak in to a page without being quoted, would have been preferable to:

Any system that publishes content that would let this kind of thing pass is incredibly insecure, and shouldn't be on the internet. Today it's a stray <. Tomorrow it's a stray <script>

It's no wonder software is where it is today with attitudes like these.


Not if that < had slipped in because it was in a piece of static text in a string somewhere in the source code.

You can apply mandatory quoting to untrusted input all you want, but there are going to be times when you have trusted strings that can still contain stray characters that will make the resulting markup invalid. And in many cases you don't want to have mandatory quoting for all of that, because these strings may have markup you want to include.

And yeah, you can argue that instead of generating content by appending strings, you should be building up a proper type-safe DOM structure that can be serialized. I'll wait while you go boil the ocean of converting every single web application framework that exists now outside of a couple of obscure type-safe functional programming frameworks, and in the meantime I'll be able to browse the real web without every other page giving me validation errors.


To be fair, I only use obscure type-safe functional programming frameworks. That's what I'm employed to do, and this obviously impacts my feelings on the matter. Personally, I think it's irresponsible to use anything that could be this unsafe. This doesn't mean everyone needs to use FP, just that frameworks and libraries should be chosen so as to guarantee safety. There are easy-to-use libraries for all these things in every language.

In no other world of engineering is this attitude okay. If you were a civil engineer and had to hold a license to practice due to the danger your designs could present to society, this attitude would eventually cause you to lose your ability to practice. It's becoming more and more clear that software can have similar levels of impact, and software engineers should practice as such.


I agree with you that we do need to do better about writing more robust software, and type safe languages are a good way to do that.

But what you're saying is as if to suggest that, since the metric system is more consistent and more widely used than the English one, I as a bolt distributor should start selling my bolts in metric sizes, despite the fact that the nuts that everyone has are in English sizes.

The browser vendors, at least, are working on implementing their browsers in more type-safe languages (https://github.com/servo/servo), but even still they have to work with the content that is produced by thousands of different languages, frameworks, and tools, and millions of hand-written HTML files, templates, and the like. Just turning on strict XML parsing doesn't make that go away, it just makes your browser fail on most websites.


A good first step in enforcing web standards would be if browsers would detect these rule violations, and -- instead of failing -- put a giant banner on the top of the page warning end users that the site may be compromised and could compromise data.

Soon, every business will be clamoring to fix their buggy software, and users will still be able to access the unsafe websites they so desire.


You can be unsafe even with typesafe builders. See

fn build(text_to_show: &str) -> HTML { HTML(Body(H1(text_to_show))) }

What if text_to_show wasn't sanitized? You've got yourself an XSS. And if you do sanitize it (and keep it in a StrSanitized type), what are the chances of an accidental XSS?

Really, what should have been done is a "user supplied tag", which automatically displays everything as plain text, like <user-supplied id="ahdjdh37736xhdhd"> Content </user-supplied id="ahdjdh37736xhdhd">
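To make the risk concrete, here's a minimal TypeScript sketch (the function names are mine, purely illustrative): naive string interpolation turns whatever is in the input into live markup, while escaping the HTML metacharacters neutralizes it.

```typescript
// Naive interpolation: whatever is in `text` becomes live markup.
function buildNaive(text: string): string {
  return `<html><body><h1>${text}</h1></body></html>`;
}

// Escape the characters that can change parsing context in HTML.
// (& must be replaced first, or the later entities would be double-escaped.)
function escapeHtml(text: string): string {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#39;");
}

function buildEscaped(text: string): string {
  return `<html><body><h1>${escapeHtml(text)}</h1></body></html>`;
}

// buildNaive('<script>...') emits an executable script tag; buildEscaped does not.
```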


You would generally want the general purpose string type in your language to always be escaped when serializing, and only allow avoiding that if you opt-in explicitly.

So, for instance you'd have an H1::new(contents: TextNode) constructor, and you'd have to build a TextNode; if you build TextNode::new(text: &str), then it would escape it. If you wanted to explicitly pass in raw HTML, then you'd need something like HTMLFragment::from_str(&str), and it would parse and return the fully parsed and appropriately typed fragment object that could then be used to build larger fragments.

There might be some way to unsafely opt out, like HTMLFragment::from_str_raw(&str), that would just give a node that when traversed would just be dumped raw into the output, but that would be warned against and only used if you wanted to avoid the cost of parsing and re-serializing some large, known-safe fragment; it wouldn't be what you would normally use.
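A rough TypeScript sketch of that escape-by-default design (the class names mirror the hypothetical API above, not any real library): untrusted text can only enter the tree as a TextNode, which escapes itself on serialization, while the raw opt-out is a separate, loudly named type.

```typescript
// The default, safe path: text escapes itself on serialization.
class TextNode {
  constructor(private text: string) {}
  serialize(): string {
    return this.text
      .replace(/&/g, "&amp;")
      .replace(/</g, "&lt;")
      .replace(/>/g, "&gt;");
  }
}

// The explicit, loudly named opt-out for known-safe markup.
class RawFragment {
  constructor(private html: string) {}
  serialize(): string {
    return this.html; // emitted verbatim: the caller vouches for safety
  }
}

type HtmlNode = TextNode | RawFragment | Element;

class Element {
  constructor(private tag: string, private children: HtmlNode[]) {}
  serialize(): string {
    const inner = this.children.map(c => c.serialize()).join("");
    return `<${this.tag}>${inner}</${this.tag}>`;
  }
}

// Untrusted input can only enter the tree as a TextNode, so it is always escaped.
const h1 = new Element("h1", [new TextNode("<script>x</script>")]);
```

The type system then does the enforcement: an Element's children must be HtmlNodes, so there is simply no constructor that splices an unescaped string into the output by accident.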


Your builder isn't really using types to guarantee safety. You can write untyped programs in a strongly typed language, by just coercing everything to strings, but this isn't what I mean when I say 'type-safety'.


> The WHATWG has always opposed the W3C's spec. They see it as confusing to have two "official" specifications.

The WHATWG has not always opposed the W3C's spec. The WHATWG explicitly agreed to work with the W3C to form an edited, snapshot spec based on the WHATWG spec. That's what HTML5 was supposed to be.

However, the W3C process then hijacked this, by dropping things from the WHATWG spec, adding things back that had been removed because they had never been implemented properly and implementing them wouldn't have been very useful, and so on. The WHATWG objected to this useless divergence.


> W3C process then hijacked this

While I agree the W3C's insistence on maintaining a parallel spec. is a silly idea they should absolutely abandon, I fail to see how any but the most biased perspective could conclude that they are "hijacking" a process of their own. W3C haven't dropped anything from the WHATWG spec.; that's separate and out of their control. They can drop what they like from their copy: it's their copy. Unless you're proposing that the WHATWG should be running the W3C, I'm not sure what you're getting at with the term "hijack". Surely you can't hijack your own thing?


Because there's no point in reconciling the specs if you don't actually reconcile them.

If the W3C spec is a snapshot, possibly of a subset, possibly with some editorial but not functional changes, then reconciling the specs is useful; it gives you what the W3C wants to provide: versioned, frozen specifications that can be used as the basis for other specs, that people can claim "full conformance" with, and so on.

Or, if the W3C process identifies real issues, then it should work with the WHATWG community to resolve those issues; since the WHATWG spec is being used as the upstream, evolving spec that these snapshots are being made from, it makes the most sense to get the changes into the upstream first, so you don't have to resolve the issues every time or maintain divergence forever.

However, the W3C instead just insisted on writing the spec the way it wanted, without regard to whether it would actually be implemented.

It makes no sense to publish a spec that will never be implemented by any of the projects that actually have real-world implementations, and differs from the spec that the implementers actually use. That just causes confusion.

So yes, they hijacked the process in the sense that the WHATWG agreed to work together with the W3C, but the W3C never really worked in good faith to resolve differences or provide technical arguments for their changes.


> there's no point in reconciling the specs if you don't actually reconcile them

completely agree

> yes, they hijacked the process in the sense that the WHATWG agreed to work together with the W3C, but the W3C never really worked in good faith to resolve differences or provide technical arguments for their changes.

I can't see either party working in good faith. In what way did the WHATWG's "agreement to work together" bear out as a positive contribution to the W3C's parallel spec.? We can both agree that spec. isn't a great idea, but I'm failing to see how the WHATWG is a positive actor here in any way; they've forced W3C into an impossible position through political bullying, and somehow W3C are vilified for hijacking something?


Ian Hickson, the editor of the WHATWG spec at the time, acted as editor of the HTML5 standard at the W3C for a while after the two groups agreed to work together. However, the committee had chairs who could override the editor.

Despite a couple of years of effort working together, the W3C process allowed a lot of people to raise objections that re-litigated things that had already been decided in the WHATWG process, or that just didn't have implementer support, or whatnot. This left the HTML5 draft specification stale, as these objections held up migrating the editor's draft (which was the WHATWG specification) to the TR on the W3C site.

So lots of people who still saw the W3C as the "official" source of HTML were directed to an outdated copy of the standard, because publishing more interim drafts was held up with all kinds of bureaucracy; and the W3C objected to linking to the WHATWG copy as a more up-to-date version with bug fixes, so there was a fight over this too.

The combination of the W3C's heavyweight process making it easy for lots of people to raise objections to slow things down, and the ability for those objections to be escalated above the editor, eventually made Hixie give up on editing HTML5 and just go back to editing the living specification.

The thing is, a specification only makes sense if it's actually implemented. Lots of non-implementers raising blocking issues on wishlist features, and then having to take the time to formally resolve all of those issues, does not make for a productive environment; and when the resolutions of those issues are escalated to chairs of the group or higher up in the W3C against the support of the implementers, it really hampers the process of coming up with a productive spec.

By the way, I haven't followed this drama in a few years, but taking a look at what's happening now, it looks like the W3C is essentially just plagiarizing the work of the WHATWG.

Features are generally discussed in the WHATWG, or implemented by browsers and then proposed, and the spec writing goes on there. After the spec is reasonably well worked out, the W3C is copying and editing some of the text into their standard.

Now, the WHATWG spec is under a Creative Commons attribution license, and the W3C does provide a small attribution in the acknowledgements section, so they are not violating copyright.

However, what they are doing essentially amounts to plagiarism, as they are presenting themselves as the source of the standard. The introduction doesn't indicate that the actual work is going on in another group; they invite people to make comments on the W3C's GitHub. This is confusing, it gives people an out-of-date view of the standard, and it looks like a move to make the W3C appear to still be the relevant authority, when it's basically just cloning the standard from the WHATWG with enough wording and formatting differences that the two could conflict, and it's hard to tell where they would.

Alternatively, they could fork the standard, but do so more in the way that Linux distributions package upstream software: take what's upstream, and keep a separate set of patches applied on top that make the differences clear. For instance, those patches might apply their layout, their disclaimers and the like, possibly disable some things that they think are underdeveloped or contentious and likely to change, and otherwise mostly just freeze the text. Any patches for meaningful fixes would be pushed to the upstream project. They could properly attribute the WHATWG spec as the original source at the very top of the document, and list the editors of the WHATWG spec as the primary editors and the people doing the W3C release as maintainers of that particular fork.

But instead, they are listing as editors people who are basically just doing light paraphrases of the WHATWG spec.


> By the way, I haven't followed this drama in a few years, but taking a look at what's happening now, it looks like the W3C is essentially just plagiarizing the work of the WHATWG.

Ditto, and it's why if anyone asks about HTML and spec. conformance, I don't even mention the W3C, except to dissuade them from paying any attention to them. Their current HTML work is irrelevant and misguided.

My issue here is more with the historical negationism around the relationship between the organisations. The W3C's current HTML is, frankly, wrong-headed. But the context around their current situation is the fact that they've been bullied, cajoled and even somewhat ridiculed reputationally into these quite irrational actions by the WHATWG's very existence. That fact is lost when they're accused of acting negatively toward the WHATWG (e.g. hi-jacking apparent agreements and processes), when the actual background was WHATWG originally hi-jacking the specification of the web's central language.

Your post here is supporting the idea that the W3C's current direction on HTML is irrational. That's fine, I agree. But what they're doing is no worse than what WHATWG did originally with HTML5; the only differentiator is that WHATWG was extremely powerful (being primary implementors) and could use that power to win hearts and minds of pragmatic developers. The W3C have no such power and as such their wrong-headed actions are fruitless. But the equivalence is still worth pointing out.


For anyone confused about how the WHATWG came to write a new HTML spec entirely from scratch, after the W3C blocked the work happening at the W3C, this is a good place to start: http://diveintohtml5.info/past.html#webapps-cdf

I'm going to assume your point that the WHATWG is "extremely powerful" compared to the W3C was meant to be satirical.


> The WHATWG has always opposed the W3C's spec. They see it as confusing to have two "official" specifications.

As the old joke goes... if it hurts, they should stop doing that!

The WHATWG spec is worse than useless to me as a developer. It's impossible to tell what is usable and what is just Google's wishlist (which is about half of it). The MDN has entirely replaced it for me, since they at least do a good job of documenting reality.

WHATWG should quit trying to bully the W3C out of the field, and instead clearly mark the WHATWG "spec" as what it is: a public notepad for browser developers. Leave the business of documenting what browsers actually conform to to the W3C.


> The WHATWG spec is worse than useless to me as a developer. It's impossible to tell what is usable and what is just Google's wishlist (which is about half of it). The MDN has entirely replaced it for me, since they at least do a good job of documenting reality.

The WHATWG living standard is largely where browser vendors (and other interested parties) work out what the web will be. W3C (with their implementation requirement), and, as you note, MDN serve to describe what the Web is. The latter is more useful to developers, but, as you suggest, MDN is doing a better job of it.

OTOH, to get to a place where things have interoperable implementations, a forum for implementors to collaborate on forward-looking specifications is necessary, and that’s what WHATWG does well, and W3C does not (which is why WHATWG exists.)


I agree with everything you said. What rubs me the wrong way about the WHATWG is that they give the perception (and it may be just that) that they are trying not just to serve as that forum for browser makers, but also as the standard reference for web developers (which is the role the W3C HTML specs, save XHTML 2, have historically served), and doing a poor job of the latter.


I don't think the WHATWG is trying to serve as the reference for web developers (though their HTML spec has notable and laudable features for that use); I think they are mostly fine with W3C trying to do that as long as they do it correctly (which requires alignment with what browsers do, otherwise developers will target a non-existent platform.)

I don't know if they (or developers, MDN is probably a more widely used reference than W3C) see a standards body as essential in that role, though, and I don't think it seems W3C really wants to accept being relegated to that role rather than driving the web platform, even though they haven't driven the platform for a long time.


Exactly what is the purpose of a standard reference for web developers that fails to track the documented behavior of browsers?


I'm not sure the intent of your comment; failure to track what browsers actually do is exactly the problem with the WHATWG "living standard" – it's very much a forward-looking spec at best, and too often a wishlist.


The ideal situation is for the WHATWG document to be a roadmap of what vendors have discussed and tentatively agreed on, and the W3C document to be a periodic snapshot of what's actually been implemented.

That wouldn't make either one of them "bad". The issue here seems to be W3C wanting to push forward things that the vendors haven't agreed on or implemented yet.


I don't think the W3C DOM document has anything the developers haven't agreed to. The problem is that it's an incomplete, intrinsically out of date, and often buggy subset of the WHATWG living standard.

I agree that the W3C value proposition COULD be to publish a snapshot that describes what's actually implemented. That might be a way forward here, but it requires a lot of work to define what "actually implemented" means in a useful way, and to check the test results and update the document (or build an automated way to harvest resources such as https://wpt.fyi/dom ).


Great, so if I build a website based purely on the WHATWG specs, it will work in all browsers, correctly?

No, it won't.

I can take the A4 paper spec and build a printer that takes that paper, and I know paper will comply with it. And the other way around.

You can't build a website just from WHATWG specs, and you can't, excluding the parts about backwards compatible parsing, easily build a new browser from scratch either.

A standard is a document, written a priori, that describes the entire API surface, so that people on both sides can develop against the standard without having to verify against actual implementations.

The WHATWG documents are useless for this purpose.


But how are the W3C specs any better than the WHATWG specs?

As far as I know the W3C take the WHATWG specs, and modify them with some of their own ideas so they're different from the what the browsers implement or are planning to implement.

What on earth is the point of that? Why design your own spec that nobody is implementing or planning to implement? What a waste of time!

And back to your point - why is it better than the WHATWG specs?


> Leave the business of documenting what browsers actually conform to to the W3C.

Documenting the prevailing conditions is very much not the purpose of a standard.


That's what the W3C has historically done, with HTML 2.0, 3.2, and 4.0. "Document, clean up, and nudge" is maybe a better description. The WHATWG today seems to take more of a "document, don't clean up, and add our wishlist" approach. (The "don't clean up" mentality is embodied in their "don't break the web" ethos; the "add our wishlist" mentality is a consequence of the "living standard" ethos… the "standard" never becomes reality because it is constantly changing.)


I'm not sure where you got this impression, but it's wrong. https://whatwg.org/working-mode stipulates the requirements on additions. That's quite a bit different from a set of wishes.

And there's a lot of cleanup of legacy APIs happening too. E.g., removal of the isindex tag and deprecation of AppCache.


That is the governing philosophy of the WHATWG, sadly.


That's a bit of a stretch. This is only relevant to legacy APIs and only when all implementations are in agreement, which is quite the rarity.


I think that's the rub - if w3c wants to "document" what a browser conforms to - I imagine it will be a copy-paste from what the browser vendors are doing in their separate meetings of the minds.


Urm, I thought MDN was based on WHATWG? (Not W3C)


MDN includes clear documentation about what is actually implemented in all major browsers (the compatibility tables), so I (as someone who wants my code to work everywhere now, not next year) can tell at a glance what pie-in-the-sky ideas I should ignore.

That's great that it's based on the WHATWG's work – it should be, since it should document what's in Firefox, and Firefox presumably is following their own work with the WHATWG. But the WHATWG shouldn't pretend that they're useful to me in any other way than a preview of what's coming down the pipeline. For that, I need clear documentation of what is, not what will be. W3C HTML specs prior to HTML 5 (with the exception of the abortion that was XHTML 2) have historically served that purpose well. It was easy to make the judgement that, once my target market primarily supported HTML 4, I could use anything in that spec. The WHATWG "spec" throws that idea out the window.

Ideally, with a "living standard", periodically there are snapshots of some form that document what all or most major browsers supported as of some point in time. So I as a developer can say, "well I know most of my target market have updated their browsers since date X, so I can just use anything in this standard snapshot". The W3C I think is trying to do this. They might not be doing a very good job (indeed that is the crux of the WHATWG's objections); like I said, I personally rely on MDN to fill this same role for me. But the WHATWG living standard itself cannot fill this role, short of including MDN-style compatibility tables, or making their own snapshots that are somehow "better" than what the W3C puts out.


FWIW, the HTML Standard (not the DOM Standard) does include CanIUse information in a sidebar, to help with this. I'd like to include this into other WHATWG standards, but it hasn't really happened yet. I'd expect most web developers to use MDN and StackOverflow though, as you say.


I appreciate the attempt to include compatibility tables, but they're nowhere near detailed enough for serious usage. Take the canvas element as an example. The WHATWG spec has one "CanIUse" sidebar for basically each section, if that. But compatibility issues exist at the level of individual methods. E.g. .filter and .resetTransform() both have very low cross-platform support ([1] and [2]), which I can tell at a glance from MDN, both in the sidebar listing them, and the compatibility tables on each page. Whereas the WHATWG spec doesn't even mention that these are experimental ([3] and [4]), and the CanIUse sidebar is totally absent for them.

StackOverflow is not a reference, and the answers for even popular queries are sometimes a decade out of date.

[1] https://developer.mozilla.org/en-US/docs/Web/API/CanvasRende...

[2] https://developer.mozilla.org/en-US/docs/Web/API/CanvasRende...

[3] https://html.spec.whatwg.org/dev/canvas.html#dom-context-2d-...

[4] https://html.spec.whatwg.org/dev/canvas.html#transformations



> Differences between the W3C HTML 5.2 spec and the WHATWG Living Spec: https://www.w3.org/wiki/HTML/W3C-WHATWG-Differences

FWIW, I'm pretty certain that is incomplete. It may well be the case that that is the set of deliberate changes from the WHATWG spec (at some revision), but we've had cases before where changes from the WHATWG spec have been copied only partially, such that the W3C spec, as published as a Recommendation (i.e., with two interoperable implementations), has been impossible to implement as written.


http://diveinto.html5doctor.com/past.html

A very entertaining read, IMO.


The W3C was the original standardization organization for the web.

They wanted to create standards that allow easy implementation by others, and were willing to make some tradeoffs with backwards compatibility for that (see XHTML).

The browser vendors obviously oppose this, and want standards that just formalize what they already implement. As result, the browser vendors created their own standards committee, which standardizes whatever the browsers already do (if existing). This is the WHATWG.

As result, the web standards situation has gone to insanity. The WHATWG URL spec contains 4 pages of pseudocode and algorithm definitions for how Chrome parses URLs, and how you should as well, and the W3C, still being relied on by the other actors on the web that aren’t the 4 largest browsers, has to copy the WHATWG spec as base for their own specs, because browsers will ignore whatever the W3C says anyway.

But remember that the WHATWG proposed to the W3C that the W3C should copy the WHATWG specs as base for their own specs: https://en.wikipedia.org/wiki/WHATWG#cite_note-9
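For a concrete taste of what those pages of pseudocode pin down: Node's global URL class implements the WHATWG URL parsing algorithm, so the normalization steps the spec mandates (lowercasing the scheme and host, eliding the scheme's default port, removing dot-segments from the path) are directly observable.

```typescript
// Node's global URL class implements the WHATWG URL parsing algorithm.
const u = new URL("HTTP://ExAmPlE.com:80/a/../b?q=1");

// The spec's pseudocode mandates lowercasing the scheme and host, dropping
// the default port for the scheme, and removing dot-segments from the path.
console.log(u.href); // → "http://example.com/b?q=1"
```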


To offer another perspective.

I'd say the W3C wasted a huge amount of time pursuing quests of purity (XHTML) over actually making the web better for users. I see the value in what they were trying to do, but it wasn't letting people do the things they wanted to do on the web.

As browsers started just implementing features outside of standards in completely disparate ways because everyone was desperate for them (leading to plugin hell, apps rather than websites, etc...), WHATWG was created to try and ensure that the web remained a single thing and not a mess of things that would only work in one browser.

The web platform sprang forward massively as a result of this, with browsers implementing much more consistently and with new features tending to be implemented in compatible ways, with real progress being made.

This led to the W3C specs becoming totally redundant, and the only way for W3C to keep up was to lamely copy from WHATWG into a "spec" at random intervals and claim it was something people could work towards, when in reality it offers no real advantages over working to the living standard, because no browser offers better coverage of that spec than of any other random point of the living spec.

People want features. We saw what happens with a very slow moving, rigid standard: plugins. Flash was popular because at the time you simply couldn't do good video, animation, games on the web platform. Likewise, mobile phones shifted to apps because websites couldn't do notifications or use location information. You can bemoan a living spec, but you get one anyway, because people will work around the web if they can't do what they want. If you want to use a subset of that living spec, use it, but at least keeping it together and agreeing on roughly how to do these things is better than plugins or abandoning the web entirely.


> This led to the W3C specs becoming totally redundant and the only way for W3C to keep up was to lamely try to copy from WHATWG into a "spec" at random intervals and claim it was something people could work towards

So why don't they just disband the W3C? It sounds like it's not needed any more if WHATWG are doing the work?


The W3C actually does do some good work in other working groups; the CSS working groups seem to be working smoothly.

The W3C also oversees a lot of other standardization processes that aren't directly related to web browsers, like RDF. There are people who find this useful.

I think a lot of it is a power play. The W3C wants to be relevant, and the most relevant things in the web world are HTML, DOM, and CSS (there's also ECMAScript, but that already has a different standards body that owns it).

There are a lot of other standards that use HTML, CSS, and the DOM, such as ePub. The W3C wants to be the normative reference for these core web standards; many times, one standard will have to refer to the other, so the W3C wants to be the one that defines the "official" HTML standard.

But the W3C's process and policies are just terrible. They let standards be taken over by people who have no intention of working with those most impacted by them: the people who develop the browsers that billions of people use daily to access tons of diverse content. So instead of just providing a lightly edited snapshot of what the WHATWG produces, possibly with some WIP features removed, they go in and meddle, making changes with insufficient justification, and you end up with two forked standards and a lot of confusion for everyone.


> The W3C actually does do some good work in other working groups; the CSS working groups seem to be working smoothly.

The W3C is also doing work in HTML at least too. If browser makers would actually participate as editors (like they do in CSS) then it would work just as well as the CSS working groups and others.


> If browser makers would actually participate as editors

Microsoft tried that, investing in easier to use GitHub tooling to allow a wide range of people to submit pull requests to update/fix bugs in the W3C HTML standard. "If you build the field of dreams, they will come...." Nope. "They" had all gone to WHATWG ballpark, and all the W3C editors do is cherrypick (that's the actual word in the HTML 5.2 Recommendation) WHATWG's specs. It made a LOT more sense to just join WHATWG for HTML (and DOM).

> > The W3C actually does do some good work in other working groups

Right, W3C as a whole does a lot of good work. CSS is a good example, Web Payments, Web Authentication, Web Assembly come to mind as groups where a broad group really does come together and build consensus on how to solve hard problems. The HTML and DOM communities, however, have moved to WHATWG for reasons that happened long ago and apparently can't be un-done, even if a company with Microsoft's resources tries.


>Microsoft tried that, investing in easier to use GitHub tooling to allow a wide range of people to submit pull requests to update/fix bugs in the W3C HTML standard. "If you build the field of dreams, they will come...." Nope. "They" had all gone to WHATWG ballpark, and all the W3C editors do is cherrypick (that's the actual word in the HTML 5.2 Recommendation) WHATWG's specs. It made a LOT more sense to just join WHATWG for HTML (and DOM).

It won't work if only one browser maker participates. If only Microsoft had participated and implemented things in the CSS WG, then nothing would really get done over there either.

If all the browser makers had editors on the W3C HTML spec (like they do for many other W3C specs) and agreed to implement stuff there, then that would also work.


How would one convince the others to re-invest in W3C HTML and DOM? Microsoft's rationale a few years ago was that WHATWG wasn't a real standards organization with a patent policy, dispute resolution system, etc., and that created various legal and business concerns.

It turned out to be much easier to add a legal framework to WHATWG than to convince the HTML and DOM standards community to move back to W3C. Basically, people work on specs (and code) together in the places where there is a critical mass of expertise and energy being productively engaged. The key variable is the people, not the organization.

I don't understand the dynamics of how these critical masses of expertise coalesce, break up, and move around. I have learned that it's much more efficient to go with the flow than try to redirect it.


What good work is it doing in HTML?

In HTML, as far as I can tell, it appears to be copying features from the WHATWG standard, paraphrasing them, and including them in their standard, with only a small notice on the acknowledgements page that the HTML standard contains parts derived from the WHATWG standard.

The browser makers did participate. They participated in the W3C working groups up until they were shot down when trying to propose to work on features that users actually wanted and would be backwards compatible rather than backwards-incompatible XHTML 2.0.

The browser makers then proceeded to do their work on rich web applications, with features like canvas and XMLHttpRequest, as well as actually putting together a spec for how to consistently parse HTML that would be compatible with real content, in the WHATWG.

When it was clear that the WHATWG standard was the one that actually mattered because it was what was actually implemented, the W3C invited them back in to start working on the standard together. That's what HTML5 was; the W3C agreed that they would start from the WHATWG standard, that they could have the same editor (Ian Hickson), and they wound down the XHTML 2.0 group.

However, various people involved in the W3C process proceeded to use bureaucratic moves to raise formal objections to things that had been changed, and escalated the issues above the editor. Eventually, he got fed up and left the process, and most of the browser vendors proceeded to continue working through the WHATWG. Microsoft was the last holdout, but eventually they too left the W3C process and moved over to the WHATWG as well.

So, the browser vendors have tried to work directly with the W3C on the HTML spec twice, once before the WHATWG split off and once as part of the attempted reconciliation. Both times, they were stymied by other people involved in the process who were more interested in purity and process than actually providing a forum for working out a good specification for real world implementation.


>What good work is it doing in HTML?

From my experience, it has generally done a better job of explaining things developers would want to know, especially in terms of accessibility and internationalisation.

The XHTML stuff was a long time back, and at that time it warranted having a WHATWG. But W3C is no longer insisting on XHTML (and hasn't for many years).

>When it was clear that the WHATWG standard was the one that actually mattered because it was what was actually implemented, the W3C invited them back in to start working on the standard together. That's what HTML5 was; the W3C agreed that they would start from the WHATWG standard, that they could have the same editor (Ian Hickson), and they wound down the XHTML 2.0 group.

This is the crux of it. The WHATWG is really useful for browser vendors because they can do essentially whatever they want in it without anyone having the power to formally object (unlike at the W3C). The WHATWG editors (and thus browser makers) can say that they will listen to community feedback, but that's pretty much a benign dictatorship over the most important spec of the web.


I mean, reading that GitHub thread, it seems to me that everyone involved is saying that's exactly what should happen, and the people invested in the W3C are trying to force through new work just to justify the organisation's continued existence (and presumably, if I were being cynical, their paycheck).

The bone of "you could set specs based on snapshots of the living standard" was thrown to them, but the reality is no one cares enough about it to actually do that well, so it's just being done in a bad way that will hurt everyone.

That thread reads, to me, as "we tried being nice, but now you are causing problems, just stop please".


W3C do a huge amount more than HTML and DOM. All of CSS is coordinated through there for example.


> I'd say the W3C wasted a huge amount of time pursuing quests of purity (XHTML) over actually making the web better for users. I see the value in what they were trying to do, but it wasn't letting people do the things they wanted to do on the web.

Okay, let’s make a deal:

We both write a crawler that can fully reliably parse websites.

I implement XHTML1.1, you implement HTML5. We both get 1 month time.

What do you think is going to happen?

XHTML was a worthy goal – with it, we wouldn’t have a need to run headless Chrome for tests. We could parse the web, and actually use the data. OpenGraph tags would never have been necessary. We wouldn’t need to throw DNNs at rendered output of a browser just to parse data.

HTML5 is amazing for existing browser vendors, developers, and in the short-term, users. But everyone else loses. Horribly.


And if we were all using XHTML1.1 now instead, yes, your parser would be easier, except all the richer content would be in flash, and all the web apps would be desktop applications, and you wouldn't be able to parse that at all, even with a full browser.

You are acting like everyone would just stop and wait for you to make your dream implementation that's ideal - that's not how the world works.

WHATWG was an admission that we can't stop it, so we might as well embrace it. Embracing it has resulted in browsers being far more consistent, and new features being a shared part of the web platform, and not siloed off in plugins and other platforms. That's why W3C is irrelevant now.


You’re assuming XHTML1.1 would never have evolved further, never have gotten more content.

And Flash, despite its flaws, would have been a much better starting point for rich content than the ecosystem we have today.

Many of the features Flash provided are only available in browsers today through babel.js transpilation. As a result, we’re stuck with a language without a stdlib and with broken syntax.

We’re stuck with a document model that’s impossible to work with or parse, and with impossible layout management.

If you want web applications, it’d make much more sense to port the Android layout XML format to the web than to attempt to use HTML5 for it, because HTML5 is insanity for building applications.

> and all the web apps would be desktop applications

I don’t see that as anything bad.

The web is for documents, and lightly interactive content. All the rich applications on the web are opaque to any crawler I could write anyway, as I just get "You need JS to view this React app". Desktop applications would be just as parseable, except they would also be less of a resource hog.


The idea that standards groups exist to push for top-down reconcepting of how the world should work is a common one, and is also a good reason why standards groups fail. Your idea of what the best outcome is won't be the same as every other stakeholders, and no one stakeholder will have exactly the same idea of the best outcome as the market will.

Ultimately, the market will win, no matter what your standards group says.

The point of standards is interoperability, not rationalization. When standards groups try to rationalize technologies real people work with, they cease to provide value, and instead become obstacles that real engineers end up laboriously working around.


When I was a child in elementary school, a standards committee decided to change my native language.

They replaced the spelling of most words, and many grammatical rules.

We were forced to obey these changes, any use of the old rules was counted as mistake in school.

Back then, many older books were still using the old rules.

By the time I left high school, almost no books with the old rules were left. All had been reprinted. All newspapers had switched. Autocorrect programs had been updated with the new rules as well.

In a matter of 8 years, an entire language had changed its orthography and parts of its grammar, top-down, and it worked out fine.

I’m sorry, if an entire human language with 120 million speakers can be updated top-down like that, a web spec can as well.


Apples and oranges - when you introduce a government mandate, you remove the market. @tptacek's argument is clearly about market forces in publicly defined standards, not government enforced ones.

I'm pretty sure the last thing anyone really wants is what we'd wind up with if web standards were left to government dictates...


There’s no need for a government to enforce standards on the web – there’s already an oligopoly that can do it on their own.

In fact, there’s a single company that can just outright dictate web standards, because they hold almost 70% of the browser market: Google.


What language is this? This sounds fascinating.


My example was the implementation of the German orthography and language reforms between 1996 and 2006 (I went to elementary school in 2002, when implementing it was still in progress, and most stuff was still using the old spelling, I left high school in 2014).

But French has their Academy, which has even more power over language, and afaik, Spain supposedly has similar governing bodies.


Meh- the French Academy has power over prescriptive grammar sure, but it has almost no power over what descriptive linguistics finds. Lots of Arabic and Verlan has made its way into everyday parlance.

I'm not really sure that the French Academy is really that much more effective than Strunk & White is for English speakers. It primarily seems to be ceremonial / an expression of French pride.


If you want a more tech-related example, Jobs said "no flash on iPhones", and in a few years... poof! no flash.


The counter to that is pretty obvious, because if you remember Jobs also said "no native apps on iPhones", and then in a few months... poof! an app store.

Flash was pretty much dead anyway, and the web platform had advanced enough to mostly replace it at that point. That wasn't true for native apps.

If you want to make a standard, it has to let people do the things they want to do. Otherwise, people will just use a different (or no) standard.


I'm not really getting into the standards thing here--just throwing some ammo to the underdog.

My only point is that there are only a handful of companies with the cash, the talent, and the inclination to tackle these things, and most of them are near if not total monopolies, so as long as what they put out there isn't a blatant kick-in-the-nuts, most of us will just accept it.

iPhone was a compelling product, didn't have flash, everyone migrated to Javascript ASAP. Google is practically a monopoly, and when webmaster tools tells people to jump, watch everyone piss away a weekend to add microformats and shave 5% off of a few 40k images.

Serfs. We are all serfs.


That sounds horrible.


It was amazing. The new orthography is much simpler, and has far fewer insane rules or exceptions. And most people who went to school during the transition, or were born after it, agree.

I know it can work on this scale, I’ve seen it IRL. Many languages do stuff like this, German has the council of German language, and French has their Academy.

You can do the same on the web. You just need to have all vendors working together to actually do it.


The idea that a bunch of standards group officials can decide for the world that web pages are simply lightweight content publishing mechanisms and that real applications should be build exclusively in Flash and that that worldview can be ratified and mandated by browser vendors does not seem amazing to me.

At any rate: the Internet is a market system, not a top-down autocracy.


The alternative (and current reality) is that the same things are decided by about four companies in an entirely opaque manner.

At least the W3C had processes and a wide array of members.


Isn't that just theater? None of them can tell Apple and Google what to put in their browsers; in fact, if they can't convince just one of the big 4 browser vendors to do something, their standards have no meaning at all.


It's even more work than that-- check out caniuse for SVG fonts:

https://caniuse.com/#feat=svg-fonts

They had support in both Safari and Chrome, but never in FF or IE (nor Edge). Chrome eventually dropped the support.

So I'd say if you can't get all four to implement the feature then you might as well call that part of your spec a "living standard." Those features are going to get way fewer eyeballs, fewer bugfixes, fewer reviews, fewer pieces of documentation, etc.


Uh, WHATWG is an open process - they have a similar level of control over things that W3C had.

If you want to try and claim W3C ever had the power to enforce people following their specs, IE6 would like to have a word.


The Internet is a network. The web is an oligopoly. Google, Google-by-proxy, and Apple fill the dog bowl, and the rest of us eat from it because it is there.

Tomte 8 months ago [flagged]

If you‘re talking about German, it was not amazing, but a cultural catastrophe, and an extra-legal totalitarian nightmare.


I assume you’re older than 22? There’s pretty much a strict split at around that age. People older seem to consistently hate it, people younger seem to consistently like it, because the new rules are much simpler.

Previously, Gruß and Kuß had no info about how long to pronounce the u – Gruß and Kuss do. And until 2017, capitalizing them into GRUSS and KUSS lost this information, now GRUẞ and KUSS keep it.

Previously, for many words, the rules when to split the word, when to write them together, when to use – was insanity. Now it’s all in a few easy rules.

And you have to remember, this wasn’t the first time German went through such changes – ever since the advent of the printing press, when a written German language was basically "invented" from the many dialects that existed, until today, there have been proponents of a prescriptive language evolution, and they’ve had lots of influence over time.

When you use Tarnen, Verfasser, or Absender, Abstand, Bücherei, Augenblick, Leidenschaft, Entwurf or Briefwechsel, Rechtschreibung or Tagebuch, Grundlage, Altertum, Erdgeschoss, tatsächlich or Hochschule, all these words were defined top-down. (All these words are just from Philipp von Zesen, Christian Wolff, and Joachim Heinrich Campe)

A massive amount of what we consider "German" today was defined and changed top-down, and without these changes, German wouldn’t be recognizable.


You‘re misinformed.

Yes, the German language has had several big changes, but until the reform we‘re talking about it was linguistically „proper“ in that the existing language was described and codified. It was bottom up.

In this reform, some non-elected people (who just a few years earlier had said themselves that their job wasn‘t to invent German, but to describe existing use and trends) invented a whole new orthography from scratch. The new rules have never been in use anywhere throughout the German-speaking lands.

They were and are pure fiction.

In linguistics that‘s how you tell a layperson: they think linguistics is prescriptive. Now it seems to be... :-(

And of course people under 22 don‘t care. They have never learned proper German.


You mean, just like in many other languages? According to Wikipedia, French, Icelandic, Spanish, Swedish, and a few more have had varying degrees of prescriptive language standardization.

> Yes, the German language has had several big changes, but until the reform we‘re talking about it was linguistically „proper“ in that the existing language was described and codified. It was bottom up.

I just explained why that wasn’t the case. Many linguists in the past have intentionally invented words (see the ones I mentioned) to make the language simpler, and stricter.

And the same continued until today – the drug store chain Rossmann has been a constant supporter of linguistic prescriptivism, has sponsored groups supporting it, and has been using these concepts in all their published material as well. Many other companies engaged in this as well.

The language has never been defined by the people speaking it, but always by the journalists writing it, the linguists describing it, and the companies influencing it.

And German as a whole was created, as pure fiction, by people trying to publish books across the whole of Germany at a time when everyone spoke local dialects.

At no time has German ever been a bottom-up language – and if we already let our language be influenced and shaped by companies, by media – why not at least use similar influence to make it simpler?

Having a language be simple to use is more important than some fake emotional value of being "natural".

Tomte 8 months ago [flagged]

You simply don't understand what I have written. I think we can leave it here.

I don't care about your opinion that it's "fake" and "emotional".

Language is a core part of my being, and a fascist power-grab killing my mother tongue is simply a crime against humanity. It's no different from how the Turks have been treating the Kurdish language.

I have only weak hope, but still hope, that we can someday reverse this. Violently or non-violently.


Do you believe languages are meant to live forever?


But it worked.


I am not sure if this is still the case today, but I remember that not too long after the new orthography / grammar rules were passed, two major news publishers announced that they would return to the old rules.

Also, my sister is a linguist, and I can trigger her going on a long rant just by mentioning the Rechtschreibreform. ;-)

(Personally, I think some of the new orthography rules are much simpler and consistent, so I use them. The rest I basically ignore, unless a spellchecker nags me about it.)


The HTML5 person's crawler will parse some significant fraction of real websites, and the XHTML one won't, because people write HTML5 and not XHTML, even if you as a tool vendor would greatly prefer otherwise.


And yet, forcing people to implement opengraph tags, forcing people to drop flash, forcing people to use HTTPS, forcing people to drop Symantec certs, forcing people to drop SHA1 certs – so often the actors behind the WHATWG have managed to get website authors to change what they use.

Hell, Google has AMP, which is far more intrusive than XHTML ever was, and yet, they’ve managed to get every major website to implement it. https://www.ampproject.org/docs/troubleshooting/validation_e...

And yet, somehow, implementing some stricter spec is supposedly impossible?


There's a big difference between "every major website" and "the web".

"every major website" means 100 companies with skilled developer who can and will react to changes in browsers quickly.

"The web" consists of millions of websites maintained by individuals and small organisations who have no resources to update the way their web pages are coded every year. It contains HTML generated 10, 20, soon 30 years ago. It contains that one app in your intranet with the table layout that you can't replace, and that IoT thing you connected to your home Wi-Fi 7 years ago that has no way of upgrading its web interface.

A browser that loses access to "the web" is worse than useless.


There's also a federal procurement picture: big governments making big purchases aren't fans of incompatibilities and favour standardised solutions.

For a company like MS losing access to "the web" could keep a lot of people from becoming VPs...


IMHO XHTML is pretty painful. Even if you say: okay, there is a server-side auto-generated markup tree and we can formally verify what happens. There is now a solution to that and it's called (server-side) React which is basically a (useful) alternative to XSLT. Except that it outputs HTML5.

You could also argue about spec size: just compare the sizes of books about XML and XSLT vs. HTML5, CSS, JS, and React. When I actually tried to do some useful work with XSLT (which needs to be mentioned here IMHO), I realized that the less painful 2.0 version is hardly implemented by anyone.

Regarding the parsing: X(HT)ML lexing is ridiculously easy; for HTML5 it's slightly more difficult, but not tough at all. You just need to keep a list of closing vs. self-closing tags. I'm not talking about building in fault tolerance; that would be tough, yes, and even tougher for XML!
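A rough sketch of what I mean, in Python (toy code: regex-based, so it ignores edge cases like attribute values containing ">"):

```python
import re

# HTML5 "void elements" never take a closing tag; an XML lexer needs no
# such list because XML requires an explicit <br/> style self-close.
VOID_ELEMENTS = {"area", "base", "br", "col", "embed", "hr", "img",
                 "input", "link", "meta", "param", "source", "track", "wbr"}

TAG_RE = re.compile(r"<(/?)([a-zA-Z][a-zA-Z0-9]*)[^>]*?(/?)>")

def lex_tags(html):
    """Yield (name, kind) events, where kind is 'open', 'close', or 'void'."""
    for closing, name, selfclose in TAG_RE.findall(html):
        name = name.lower()
        if closing:
            yield name, "close"
        elif selfclose or name in VOID_ELEMENTS:
            yield name, "void"
        else:
            yield name, "open"

events = list(lex_tags('<div><br><img src="x"><p>hi</p></div>'))
# <br> and <img> come out as void even without an explicit trailing slash
```

That one table of void elements is most of the extra work over an XML lexer; the real pain in HTML5 is the error-recovery tree construction, not the lexing.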

> XHTML was a worthy goal – with it, we wouldn’t have a need to run headless Chrome for tests. We could parse the web, and actually use the data. OpenGraph tags would never have been necessary. We wouldn’t need to throw DNNs at rendered output of a browser just to parse data.

Yes and no. If you use CSS for styling, the answer is no. If you require JS to render the initial page, the answer is no as well. But yeah, if you use the whole XML machinery with XSLT and possibly even XPath, then you would be kind of right. I mean, as long as we properly handle the schemas and DTDs - which almost no parser does AFAIK. So it's true, one can do pretty bad-ass stuff with all the X*. But tooling and library support is not good and has never been. XSLT 1.x is insanely difficult to use and XSLT 2.x hard to fully implement I guess.


> OpenGraph tags would never have been necessary.

Small correction: XHTML2 had the Metainformation Attributes Module [1]. That then became RDFa in (X)HTML, with practically the same syntax and processing model.

Facebook's Open Graph stuff is claimed to be RDFa. When I tested it, their parser did not really do RDFa processing: other CURIE prefixes for the same URI weren't recognized, if I remember correctly.

But in effect, Open Graph meta information would have looked the same in XHTML2 as in today's WHATTF HTML.

For your other argument I agree. WHATWG (and the dumb modern style of development) reduced the democratising aspect of the web. But of course the people of the WHATWG work for billion-dollar companies, which want to have a moat to centralize behind.

[1] https://www.w3.org/TR/xhtml2/mod-metaAttributes.html


The HTML5 crawler would be able to crawl the web, the XHTML crawler would be able to crawl 0.0001% of the web.

XHTML would only improve the web if all existing HTML went away or was changed to XHTML. Since this is never going to happen, XHTML does not simplify anything.


HTML5 is amazing for existing browser vendors, developers, and in the short-term, users. But everyone else loses. Horribly.

I don't understand who the 'everybody else' is in this case and what and when their horrible losing will be.


People trying to build new tools that parse the web.

Try building a crawler without reusing an existing browser engine.

Try running unit tests against your own web projects with Selenium without running a headless browser.

PhantomJS gave up because they couldn’t keep up with the complexity, and headless Chrome "just works".

Opera gave up on their own browser engine because of the complexity of parsing HTML5 accurately, when the spec is just "whatever Chrome does".

We’ve thrown away an entire ecosystem, just for more flashy graphics.


One of the most valuable aspects of HTML5 is that it defines a parsing model for "broken" HTML.

This means that, for the first time, it's possible to build a brand new HTML parser that has a high chance of working against all existing HTML without needing to first reverse-engineer existing browsers.

Remember, when HTML5 was first designed Internet Explorer was by far the most widely used browser. And IE was closed source. If you wanted to build a parser you needed to first reverse engineer IE and figure out how it handles invalid HTML.

The HTML5 spec fixed that. The thing you are complaining about here (HTML5 making it harder to build a new browser from scratch) is one of the things HTML5 actually solved!
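To illustrate the difference in failure modes with Python's stdlib (html.parser is not the HTML5 tree-construction algorithm, but the contrast is the point):

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

broken = "<p><b>mis-nested</p></b>"   # close tags in the wrong order

# A strict XML parser refuses the document outright.
try:
    ET.fromstring(broken)
    xml_ok = True
except ET.ParseError:
    xml_ok = False

# A lenient HTML tokenizer just reports what it sees and moves on; the
# HTML5 spec goes further and defines exactly what tree to build from this.
class TagCollector(HTMLParser):
    def __init__(self):
        super().__init__()
        self.tags = []
    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)

c = TagCollector()
c.feed(broken)   # no exception; start tags are still recoverable
```

Before HTML5, each browser improvised its own recovery for input like this; the spec made that recovery identical everywhere, which is exactly what a from-scratch parser needs.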


I very much doubt Opera gave up because of "parsing HTML5". Parsing HTML5 is well-defined, certainly better than what was there before (unless you are willing to say "if you don't write perfect XML you don't get to be on the web", but good luck with that – no browser ever was and never will be in the position to do that)

The vast majority of the difficulty of the web platform is in the layers above, which don't care if the DOM they are looking at came from HTML5 or XHTML: layouting/rendering, interactions of JS and DOM, ...


Indeed, having worked for Opera during the time where we implemented HTML5-compliant parsing, the net result was that we fixed a bunch of site compatibility issues. Implementing it made competing with other browsers easier. (And as you say, parsing HTML is complex, but the rest of the web platform is vastly more so.)


That's a pretty tiny 'everybody else' compared to users, web and browser developers. They seem to be doing mostly ok and I still don't follow your argument that their concerns should somehow reign supreme over those of, you know, the actual everybody else.


It’s a tiny "everybody else" because it never was given a chance to develop.

Maybe you’d also say that the number of people that want to do their online banking with a desktop program that isn’t provided by their bank is a tiny "everybody else".

Yet in places where OpenHBCI exists, many people use it – and there is an ecosystem around it, e.g. KMyMoney can integrate with it, and there’s a small widget for KDE to show your current account balance in your tray.

If we had a machine parseable web, a similar ecosystem would have developed. When embedding links today, we use OpenGraph tags. But why? If the web was directly machine readable, we could’ve directly embedded that content.

Google search shows you preview excerpts of tables on a page; with an easily machine-readable web, similar stuff could’ve happened here.

Maybe instead of only embedding YouTube videos, I would have been able to easily embed any part of any web page into any other. Maybe I would have been able to easily embed any part of any web page in a desktop application. Maybe I would have been able to build addons that easily search over content of web pages, in a structured way.
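For illustration, since OG tags are plain <meta> elements, scraping them takes only a shallow parse; here's a sketch with Python's stdlib against a made-up page:

```python
from html.parser import HTMLParser

class OGExtractor(HTMLParser):
    """Collect Open Graph <meta property="og:*" content="..."> pairs."""
    def __init__(self):
        super().__init__()
        self.og = {}
    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        d = dict(attrs)
        prop = d.get("property", "")
        if prop.startswith("og:") and "content" in d:
            self.og[prop] = d["content"]

# Hypothetical page for the example.
page = '''<html><head>
<meta property="og:title" content="Example Article">
<meta property="og:image" content="https://example.com/a.png">
</head><body></body></html>'''

p = OGExtractor()
p.feed(page)
```

The point being: this only works because sites bolt metadata on for us; a machine-readable web wouldn't need the bolt-on layer at all.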


This would be a more compelling counterfactual if the approach that won wasn't in many senses the most successful technology in the history of the computer industry.


It seems like the web is 10 times more parseable. I feel like almost all content is now delivered via JSON services, and the HTML around it is constructed on the fly and is totally pointless to look at. At least that's the direction I've seen. I don't know how "everyone else" comes to mean people that are writing parsers. Seems like that would be a tiny percentage of the population, not "everyone else".


> I implement XHTML1.1, you implement HTML5. We both get 1 month time.

Do you get to use libraries or not?

If you get to use libraries, there are plenty of libraries for both of these tasks, so it will be fairly equivalent. In fact, if you want to really support XHTML well, it will probably be more complex, because you have to take into account namespaces; a tag or attribute is not just a simple string, you have to consider the namespace it's in, so you would have to deal with that.

If you have to write it from scratch, I'd actually also bet on it being quicker for HTML5. The exact algorithm is specified in the standard; you just have to translate that from pseudocode into whatever language you're working in.

In XHTML, you have to look through several standards (XHTML 1.0, XHTML Modularization, XML 1.0 which includes the DTD, XML Namespaces, XML Schema), and then translate from the specification in a declarative style into an algorithm that can actually be used to parse the document. XML parsing is actually quite complicated.
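A quick illustration of the namespace overhead, using Python's stdlib ElementTree:

```python
import xml.etree.ElementTree as ET

xhtml = ('<html xmlns="http://www.w3.org/1999/xhtml">'
         '<body><p>hello</p></body></html>')

root = ET.fromstring(xhtml)

# The tag is not simply "p": ElementTree expands it into Clark notation,
# so every lookup has to carry the namespace URI along with it.
XHTML_NS = "{http://www.w3.org/1999/xhtml}"
para = root.find(f".//{XHTML_NS}p")

# A bare "p" without the namespace is a different name entirely,
# so root.find(".//p") finds nothing.
```

In HTML5 a tag name is just a string; in namespaced XML it's a (URI, local name) pair, and every comparison in the parser and every query on the tree has to account for that.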

> XHTML was a worthy goal – with it, we wouldn’t have a need to run headless Chrome for tests. We could parse the web, and actually use the data.

What? XHTML wouldn't have replaced the use of JavaScript to add features and load content. It wouldn't have replaced people's inappropriate use of tags. It wouldn't have replaced the fact that web pages are written for human consumption, and so don't generally try to include appropriate annotations on data for processing by other tools.

> HTML5 is amazing for existing browser vendors, developers, and in the short-term, users. But everyone else loses. Horribly.

Imagining that some magical standard is going to make everything better for users is a pipe dream. Providing good data formats and APIs for exporting data is hard, and is a different problem that providing user interfaces and interactivity. There's not some way in which XHTML could have been extended to serve both purposes; they are just too different.

This is why, when people care about providing programmatic access to data, they generally provide two endpoints; one serving HTML, CSS, and JavaScript for humans to interact with, and one for providing JSON for machines to parse. There have been endless attempts to try and make one standard that would work for both purposes, and they've failed because that's just not a good approach for solving the problem; instead, it's better to just have an API that both the UI code (whether on the client or server side) can use, and other developers can use if you want to expose it.


In the long run, we're all dead. We need short term solutions.


No, it wasn't simply a question of backwards compatibility. The W3C wanted (wants?) to pursue its own quixotic vision of a semantic, machine-readable web. One in which the needs of human users, browser user agents and use cases like web apps was of incidental importance at best.

The HTML5 spec effort that gave birth to WHATWG wasn't simply about documenting backwards compatible parsing, it was about vendors like Mozilla and Opera wanting to evolve HTML in a way that added actually useful new features, something that W3C had zero interest in at the time.

Nowadays, the W3C's behaviour seems to be driven entirely by an institutional desire to justify its own existence, and protect its revenue stream and its self-assumed position as the one-true source of web standards, by engaging in bad-faith practices like taking standards produced through the hard work of others, removing all citations, making breaking changes, and publishing it as a competing "standard".


Yes, and today we’re left with a web that you can only parse if you’ve got a few thousand developers and billions of dollars to throw at the issue.

As I wrote below, I offer $100 to anyone who can write a tool that can fully parse and render HTML5, the entire spec, and can render a real-life React app, within 4 weeks, without using any existing library or code for the parsing or rendering.

Doing the same for XHTML and a strict scripting language is easily possible in that time.


That was the situation prior to the writing of the HTML5 spec, not a situation it created. The majority of the web was not, and never would have been parsable as strict XHTML, whatever the desire of the W3C or anybody else. And even for new content, the idea that every hand-authored file would be well-formed, or that every half-baked and buggy CMS would always produce correct markup was a pipe-dream.

If strict parsing had been enforced, we'd have a web where a large proportion of the sites you visited each day would be broken. Or rather, we wouldn't, because complaints from users would have long ago forced browser vendors to implement graceful error handling, and so you'd end up with something akin to HTML5 anyway. Indeed, it was encountering precisely this problem that made vendors like Mozilla and Opera (who, incidentally, never had billions of dollars) to lose faith in XHTML in the first place, not some masochistic desire to keep their browsers' code as complicated as possible.


It would be awesome if the layers of the browser would allow for a more parseable, document-based web. That would be an effort towards standardization.

But are you really advocating for removing dynamic media from the browser? That seems like an incredible step backwards in most regards. In the absence of a desktop toolkit to rule them all, browser standardization is what we are left with right?


> In the absence of a desktop toolkit to rule them all

Qt. Runs everywhere, works everywhere, just fine.


If only some standards group somewhere could mandate that everyone use Qt.


Ah, this gets into gpl vs bsd though doesn't it? I'm not sure what to think about dual licensing. I have a fondness for Qt, but I also like tcl/tk so... its hard not for me to think of that xkcd about standards. https://xkcd.com/927/


React probably uses HTML5 APIs but could probably be re-written to do the same thing without them... I'm not sure anyone here is interested in $100 as they probably wasted at least that much "company" money reading this thread this happy Friday.


The $100 is for someone writing a fully-working parser that handles real-world HTML5 pages, without reusing any existing implementations.

Building an entire new parser is (obviously) much easier for strict languages (e.g. JSON) than for lenient languages (e.g. HTML5).

Which is the problem I have with HTML5, JS, and many similar technologies – they’re so lenient that almost everything is broken instead. We might as well write websites in english prose, it wouldn’t be much harder to parse.


I think you're downplaying the shit show that was XHTML.

It was the epitome of standards people chasing a perfect platonic ideal of a standard at the cost of actual usability. E.g., a single parse error in an XHTML document and it doesn't render at all. That all by itself was a deal breaker for many people.


That seems reasonable to me. Would you want your programming language to compile code that has a syntax error?


Yes, it seems reasonable, and the analogy with programming languages seems to make sense. But there's a big difference. With programming languages, you write a program, and when it's correct, you check it in and it's immutable until you check in a new version. Web pages are composited on-the-fly by programs that combine static files, database content, content from third parties, etc. The only way you guarantee a valid output document is if your compositor is bug free and defensively validates any third-party content you might be pulling in. Any bug in the compositor and your website is totally unavailable for users that hit the bug.

This is a great illustration of the concept: https://web.archive.org/web/20060613193727/http://diveintoma...
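The failure mode in that story is easy to reproduce with any strict XML parser; for example, with Python's stdlib:

```python
import xml.etree.ElementTree as ET

# One unescaped ampersand -- say, from a URL pasted into a blog comment --
# and the whole page is rejected under XML's draconian error handling.
page = '<div><a href="http://example.com/?a=1&b=2">link</a></div>'

try:
    ET.fromstring(page)
    well_formed = True
except ET.ParseError:
    well_formed = False

# Only the escaped version is well-formed XML.
fixed = page.replace("&", "&amp;")
ET.fromstring(fixed)   # parses without error
```

Serve that unfixed page as application/xhtml+xml and a conforming browser shows the reader an error screen instead of the content, which is exactly what happened in the linked story.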


Thanks for the link. I can certainly see how making the transition now is pretty much untenable, but I'm not 100% sold that it wouldn't be a good idea if everyone had adopted the policy from the start. I can certainly see how it would still be important for browsers to have a mode where they do their best to render the page, but it's less clear that it should be the default. Even if it was the default, then I would expect variations in different browsers ability to recover from errors would lead developers to being much more careful about allowing errors to creep in.


What makes that story (in my link) so compelling to me is that it happened to people who were XHTML advocates. They were the people arguing that browsers should be strict. They were the people writing blogging software The Way It Should Be Done, to ensure maximum XHTML compliance. It was a blog entry specifically arguing for strict parsing that became invalid XHTML due to a bug.

If true believers can make this kind of mistake, how often will it happen to people who are just trying to get work done? People will have bugs in their code sometimes, but what should the failure mode be?


Yes, but if browsers were strict by default (or at least by default in dev mode), the bug in his compositing software likely would have been found and fixed much earlier, and it wouldn't have persisted into production.

So again, I understand that it's nearly impossible to make the transition now, but that doesn't mean it wouldn't be preferable. And there's no reason the transition couldn't still be made, but much more slowly, and perhaps without ever making the big browsers reject by default.

I don't know if there are other disadvantages to XHTML, but if the strictness issue is the only one, then it seems like there would still be value in slowly transitioning over.


Web pages are information held inside containers, not code.

The analogy is a classic example of the "Everything must act like a compiler" fallacy.

There are many situations in which compilation is hopelessly inappropriate as a user model.

Not only is the web one of them, but the hypothetical semantic web is also one of them.

You can't force semantics into compilable tokens. The suggestion that you can - and should - is nonsensical.


I’ll remind you that the AMP spec has an extremely strict validation requirement: https://www.ampproject.org/docs/troubleshooting/validation_e...


Would you want any and all systems to have identical failure modes?


That’s great if you’re either an existing browser vendor, using a browser, or developing a broken website.

But if you actually try to write a new browser from scratch, or a tool to scrape websites, you’ll learn to love XHTML, and hate HTML5.

The same thing that applies to human languages applies here as well. Writing a vocaloid for Japanese is a high school programming class project. Writing a TTS for English takes thousands of Google developers years. Writing an XHTML1.1 parser and renderer takes a month. Writing an HTML5 browser takes thousands of developers years.

The WHATWG specs prevent the web from ever evolving further – we’re stuck with opaque websites and no way to build new technology on top of it. Building a crawler is a task of years, and semantic data tags are impossible to parse, because no one uses them right.

The only reason we can automate any parsing of websites is because either browser vendors spent billions on crawlers for their search engines, or if we run a full browser, or if we use the opengraph tags that Facebook forced on websites.

With XHTML1.1, Chrome and Firefox headless would never have been necessary. Imagine how much time and computational cost you could save.


"That’s great if you’re either an existing browser vendor, using a browser, or developing a broken website." - and that's the whole point, in the marketplace the convenience of these people matters much, much, much more than the convenience of those people who want to "try to write a new browser from scratch, or a tool to scrape websites."

If you want to scrape websites or show them in a browser, then you have to follow the needs of makers of these sites, because you need them and they don't need you. If you want to go to the right and they want to go to the left, you either follow them or go alone and become unable to scrape or browse their content. If there's a feature that they want to use that makes your parsing more complicated, tough luck, that feature is going in as long as somebody (e.g. major browser vendors) will agree to make it work.


The huge underlying assumption here is that people would use XHTML as a standard instead of using HTML4.01 with browser-specific extensions, which is what actually happened. XHTML didn't add any value to page authors. It added hugely indirect value to readers. The only people XHTML helped were tooling authors and browser vendors. It's very hard to market that as a value-add.


But the web already contains billions of pages which do not conform to XHTML. Any browser or scraper would still have to be able to parse those to be of any use. Adding XHTML would just be an additional parser frontend; it would not simplify anything.


That’s not true – the WHATWG has already deprecated some HTML specs, and older HTML pages already break today.

XHTML would have worked the same way – after a few years, you can deprecate the old parsers.


As far as I know, WHATWG have deprecated some elements not widely used (like "isindex", "font"), but the documents using these elements will still be readable even if not exactly as the author intended. Moving to XHTML, on the other hand, would make billions of pages totally inaccessible.

There are people who are now dead who have pages on the internet. These pages will never be updated.


font still works. isindex does not. It's extremely rare for a cross-browser HTML element to get removed but isindex is one of those.


blink also doesn’t work anymore, and neither does marquee. Several of the frame attributes are broken as well. noscript doesn’t always work reliably depending on browser.


Marquee works for me in Gecko and Blink. Didn't test the other two engines.


You keep repeating over and over that the big advantage to XHTML is "not having to run headless browsers", which I don't understand. I use Selenium all the time for my job, it's not great but it's... not horrible? It's fine, it's not some massive inconvenience, and it's definitely not worth getting rid of HTML5 to abandon it, when, as others have pointed out and you keep ignoring, the practical effect would be that the web would be useless for actual humans without plug-ins.

I guess I don't get your huge bone to pick with Selenium.


> I use Selenium all the time for my job, it's not great but it's... not horrible? It's fine, it's not some massive inconvenience,

Try running hundreds of tests at the same time, to actually get fast results when running a full test suite.

Right now, running a small test suite here takes over 2 hours; 99% of the time is spent in the browser processes.

And that’s not nearly close to 100% test coverage.

> not worth getting rid of HTML5 to abandon it

You don’t have to – XHTML isn’t the only strict spec out there, AMP is another, React also enforces strict syntax in JSX templates. AMP fails to render anything if there’s even a single mistake, React fails at build time.


Parsing isn't the difficult part, and above the parser, both XHTML and HTML involve the same complexity.

You can get an HTML parser off the shelf. A browser vendor (Mozilla) even funded one for non-browser purposes before adopting it for Firefox.


> Writing a vocaloid for japanese is a high school programming class project. Writing a TTS for english takes thousands of Google developers years.

It didn't take "thousands of Google developers years" to teach a computer English spelling rules. Indeed, even a programming class assignment could do that: you can cover most cases by just looking the pronunciations up in a dictionary. (The number of quirks and inconsistencies makes English spelling quite hard for humans to memorize, but computers are rather good at lookup tables.)

Even if you consider the actual Vocaloid software, which was developed by a team at a large corporation, there are two factors differentiating it from English TTS that make the latter much harder:

1. Japanese has much simpler phonetics than English, with a smaller set of phonemes and (somewhat oversimplifying) only using open syllables. So it's easier to consume and produce, for both computers and humans, but at the cost of being a less efficient encoding: Japanese tends to require a lot more syllables than English to express the same concept, and there are a lot of homophones.

2. Vocaloid sounds robotic. It's gotten a bit less so over time, but it still doesn't come close to passing as human. If you're okay with robotic, English TTS software has existed for a long time, starting many decades before Google was founded. The hard part, the part that requires neural networks and massive computational power and Google and still has yet to be perfected, is making it sound human.

By the way, although vocaloid software would be given phonetic input, normal Japanese writing uses kanji (i.e. Chinese characters), most of which have multiple unrelated possible pronunciations. Determining which pronunciation applies to each character in a given piece of text is nontrivial, and sometimes even depends on context or meaning.
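The lookup-table approach described above can be sketched in a few lines. Everything here is illustrative (a toy hand-made dictionary; real systems use something like the CMU Pronouncing Dictionary with roughly 130k entries plus fallback rules for unknown words). The "-ough" words show exactly the inconsistency that is hard for humans but trivial for a table:

```python
# Toy pronunciation table; phoneme symbols loosely follow ARPABET.
PRONUNCIATIONS = {
    "though":  "DH OW",
    "through": "TH R UW",
    "tough":   "T AH F",
    "cough":   "K AO F",
}

def to_phonemes(text):
    """Map each word to phonemes by table lookup; '?' marks out-of-vocabulary words."""
    return " | ".join(PRONUNCIATIONS.get(w, "?") for w in text.lower().split())

print(to_phonemes("tough though"))  # -> T AH F | DH OW
```

The hard parts of real TTS (prosody, naturalness, out-of-vocabulary words) all live outside this table, which is the point being made above.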


XHTML lost in the marketplace. This was due in part to lack of developer adoption and the fact that Internet Explorer ignored it. You could argue that the web community should have forced this through, but that simply wasn't working at the time.


I'm pretty sure javascript heavy pages would exist even in a strict parsing world.


This summary glosses over several events that alienated browser vendors and web developers from the W3C and makes the W3C sound like the good guys.

One that springs to mind is the complete debacle around XHTML 2. The link above ( http://diveinto.html5doctor.com/past.html ) is worth reading to understand things from the WHATWG's perspective.


> the W3C, still being relied on by the other actors on the web that aren’t the 4 largest browsers

Why do other actors need to rely on the W3C? Why can't they use the WHATWG specs as well? You say they're more complicated, but if that's what the browsers do then that's what they do.


Specification vs Documentation...


Because the XHTML spec would have made a lot of things easier.

I can write an XHTML1.1 parser and renderer in about one month.

Writing an HTML5 parser takes thousands of developers 5 years.


I can make a self-driving train far more easily than I can make a self-driving car.

You are comparing two things that aren't equivalent. The living standard provides a lot more value than the old specs, and that's why they won.


The living standards winning is something we should see as tragedy, not as something good.

We’ve made parsing the web something that only massive corporations can do. We’ve made building a browser so complicated that even Opera gave up after the complexity of HTML5, and we’re now left with only 4 major browser engines, of which 2 share a significant amount of code.

Do you see an innovative ecosystem there?

We’re left with a web that can’t be parsed, we’re left with a web where, to run Selenium tests, we have to run Chrome headless, because nothing else can even attempt to render websites anymore.


> Do you see an innovative ecosystem there?

Yes! Everyone says that the specification approach is what was severely limiting innovation, and when they took the living specification approach instead that's when web innovation took off again.


Oh? How many tools do you know that can parse a current HTML5 react app without importing any existing browser?

Every tool we have to use on the web relies on 4 existing tools that only major companies can build.

I’ll pay you $100 if you manage to write, in 4 weeks, a browser, from scratch, that renders a real life React app accurately, including all content, without importing a single bit of code or libraries from existing browsers or web tools.


> Oh? How many tools do you know that can parse a current HTML5 react app without importing any existing browser?

I don't recall XHTML deprecating JavaScript- pray tell, how would you render the XHTML version of a react app without a browser, or a JS engine as a bare minimum? (X)HTML is orthogonal to Js/react.

I could develop a self-driving car for $50 000 (instead of millions) if human drivers and pedestrians started behaving in well-defined patterns, following strict rules and stopped doing stupid, unexpected things. I'd really love that, but it's not going to happen even though there already are "standards" in the law books.


You asked about innovation, not how easy it is for new people to enter the market. Innovation in browsers has definitely gone up and as a user I feel like I'm winning from this with better websites and more powerful web apps.


How many new browsers do you see? Is that innovation in browsers? Browser competition and innovation is at its lowest ever. Chrome holds 2/3rds usage globally.

I don’t want more powerful ways to display ads, I want to get more content, better connected, without any of the fluff around it.


> Browser competition and innovation is at its lowest ever.

I don't care directly about competition, because that's just a means to an end, and the end I care about is innovation, and I don't agree that innovation is low - I think it's high. New HTML features are coming out in a fast continuous stream, unlike how it used to be.


What does it even mean to parse a React app? Crawlers can't resolve the halting problem, either.


> Crawlers can't resolve the halting problem, either.

And yet, that’s where we’re at today. Blogspot posts require JS. So many other pieces of content are similarly built.

And with PhantomJS gone, we’re now running entire headless browsers just to test if websites are rendered correctly. It’s insanity.


Yes, and before that there were maybe 1.5 viable ways to render flash content.

Then you need to take that `.exe` and try to pull out its content. Game, app, whatever.

We have a web that can be parsed, with difficulty, instead of a web without half of the content we want to parse.

The web was being replaced - this is what saved it, like it or not.


> Writing an HTML5 parser takes thousands of developers 5 years.

This is demonstrably untrue. Servo's HTML5 parser, html5ever, was largely written by a single developer within a year. (Yes, it's not a month, but it's also not 5 years.)


And wouldn’t it have been easier to write it without any of the leniency it has to expose? Wouldn’t it have been easier if all tags were either ending in /> or followed by a closing tag? Wouldn’t it have been easier if the syntax was formally defined as ABNF, and could be translated into code in a matter of days?

I believe it would have been. And I believe that making the web more easily machine-readable, and making it easier for people to develop tools working with the web, would be a valuable goal.


> Wouldn’t it have been easier if all tags were either ending in /> or followed by a closing tag?

Not really.

> Wouldn’t it have been easier if the syntax was formally defined as ABNF, and could be translated into code in a matter of days?

No, working from a spec written as an algorithm is easier than working from ABNF plus some inconvenient prose constraints.

(With the exception of template element support, I wrote the HTML parser used in Firefox and Validator.nu.)


> No, working from a spec written as an algorithm is easier than working from ABNF plus some inconvenient prose constraints.

That sounds very unlikely.

I’ve implemented my own parsers for countless specs – plaintext or binary – and the specs that are written as imperative algorithms are insanely complicated to turn into functional implementations.

I end up with horrifying code, while the specs written as ABNF are much easier to translate into pattern matching code.

The specs written as algorithm only work fine for a single type of implementation IME, while the ABNF specs work equally well for all types.
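To illustrate the kind of mechanical translation being claimed here (with an invented grammar, not taken from any real spec): an ABNF-style rule for a simplified self-closing tag maps almost directly onto pattern-matching code, and the strict version simply raises on invalid input instead of recovering.

```python
import re

# Invented ABNF for a simplified self-closing tag:
#   tag  = "<" name *(SP attr) [SP] "/>"
#   name = 1*ALPHA
#   attr = name "=" DQUOTE *qchar DQUOTE
NAME = r"[A-Za-z]+"
ATTR = rf'{NAME}="[^"]*"'
TAG = re.compile(rf"<({NAME})((?: {ATTR})*) ?/>")

def parse_tag(s):
    m = TAG.fullmatch(s)
    if m is None:
        raise ValueError("not a valid tag")  # strict: no error recovery
    name, attr_text = m.group(1), m.group(2)
    return name, dict(re.findall(rf'({NAME})="([^"]*)"', attr_text))

print(parse_tag('<img src="a.png" alt="x" />'))
```

The HTML5 parsing algorithm, by contrast, is specified as a tree-construction state machine precisely so that every input, valid or not, has one defined result; that's the trade-off both sides of this thread are arguing about.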


Way back in the day, HTML was implemented as an application of SGML. SGML was a quite complex markup format, that had lots of features that were kind of complex to implement, and so web browsers didn't actually implement all of SGML, just that which was necessary for HTML and the HTML found in the wild. However, the HTML found in the wild was frequently invalid, so browsers had to implement some clever rules to do something reasonable with invalid markup.

Eventually, people thought it would be nice to have a simpler, easier to parse markup format, with a proper specification, and developed XML. Of course, once you have a shiny new thing, people will come and start bolting things on to it, and so they added features like namespaces, and they included some of the worst features from SGML, like the DTD (it turns out you don't generally want a document to reference its own schema and entities: an application should know what schemas it can accept; having a URL for a schema in a centralized location means lots of poor implementers would actually download it; and entity expansion is just a nightmare).

However, XML wasn't compatible with HTML, and it certainly wasn't compatible with HTML in the wild. XML parsers are required to reject any invalid markup. The W3C developed XHTML as a way to have a subset that could work in either HTML mode or in XML mode; the idea was that people could start moving to XHTML in HTML mode, and then once everything was cleaned up they could switch to XML mode with strict parsing.

The problem was, strict parsing was never a benefit for either publishers or users. One little bug somewhere in a template substitution which allowed an unquoted < sign could cause the whole page not to parse, and users would just be left with an error message. Without completely changing how the majority of web apps worked, there was no way to ensure that all of your content would be strictly compliant XML without the chance of breaking.

In the browser world, XHTML was implemented, with strict parsing in XML mode, but almost no one used it.

At this time, the browser landscape was pretty bleak. Netscape had died. Mozilla was in the process of building their new browser on a completely new engine, but early versions were fairly bloated and slow; it required a rogue group of developers within Mozilla to fork just the browser portion without a lot of the other functionality to produce what is now Firefox. Opera existed but had a tiny sliver of the market share; it didn't help that the free version came with ads, or you could pay for it without ads, while all of the other browsers were just free. IE was dominant, and eventually captured a huge percentage of market share, and then Microsoft just rested on their laurels and pretty much stopped development.

Soon enough, Apple came around and forked KHTML to build WebKit and Safari. In doing so, they did a huge amount of reverse-engineering effort to make their browser compatible with all of the parsing and layout quirks in IE and Gecko; the standards are not at all sufficient for compatibility. This gave them a browser that could actually be used on the majority of web content. With the introduction of Firefox instead of the bloated Mozilla browser, there was actually competition in the browser landscape again.

Meanwhile, the W3C decided that the failure of XHTML to be adopted because it met no one's needs was not sufficient; they went on to start working on XHTML 2.0, a backwards-incompatible change with maybe a few nice features for document publishing but which didn't address any of the needs that web developers wanted, like rich interactive web apps.

So the WebKit, Mozilla, and Opera developers decided to get together and actually start providing a spec for what would actually work in the real world on real web content. They called this group the WHATWG. The W3C was not interested in working on this; the W3C group insisted on continuing work on the backwards-incompatible XHTML 2.0. Inspired by the clever algorithm the WebKit developers had come up with for parsing invalid content, they actually wrote up something a lot like that into a spec (with some improvements), and this spec eventually was implemented by all of the major browsers, providing much more robustness and consistency between browsers in how content was handled.

Other features that people wanted, like the ability to draw on a canvas, were prototyped in browsers and specced out in the WHATWG group. Some features were adapted from browsers that had already implemented them; for instance, Microsoft had implemented XMLHttpRequest, which turned out to be hugely useful for interactive web apps, so the WHATWG wrote up a spec for this and other browsers implemented it.

Google eventually wound up forking WebKit for Chrome, then merging back in, and eventually forking again into Blink, and joined this group as well.

Finally, the W3C realized that the work it was doing was irrelevant. No one was interested in implementing XHTML 2.0. What everyone wanted were the new features in the WHATWG HTML specification, new features like the canvas, and so on. So they agreed to take the current WHATWG spec and edit it into HTML5.

However, it didn't take long for this to break down again. There were people in the W3C who objected to some of the changes made in the spec for the purposes of matching up with the real world. For instance, there had been some accessibility features like the "longdesc" attribute which were specified as containing a URL pointing to a page with a longer, more detailed description of the item in question for accessibility purposes (something that could, say, contain markup, when "alt" wouldn't be sufficient for a description). However, no browsers had ever actually implemented any reasonable way to get to this, and a survey of web content found that even if you did try to implement it according to the spec, very little content actually used it, and a large amount of the content that used it used it incorrectly, pointing to broken URLs or just including plain text like the "alt" attribute. So the WHATWG spec dropped this, and recommended other ways to link to descriptions which would show up even without special accessibility tools. One of the problems with specialized accessibility attributes is that people who aren't using screen readers can't easily test them out, so it's easy to bit-rot, but providing a normal link and annotating it so that screen readers can link it to the image makes it possible to see in a normal browser.

Anyhow, some people in the W3C objected to this, and so rather than just providing an edited version of the WHATWG spec, they started tinkering with the spec themselves, adding things back in, removing some things that had been in the WHATWG spec. The W3C structure is very bureaucratic, and it has all kinds of members who are only peripherally involved with any actual tooling for web development, so it makes it very easy for various people with big egos but no real skin in the game to get involved in the process, while the actual browser developers who would be implementing the features can be shut out of the process.

So, eventually the cooperation between the WHATWG and W3C died down again, with the W3C publishing what it wanted, and the actual browser vendors continuing to work on their living standard document that is a much closer representation of the real world.

And this seems to be yet another cycle of attempted reconciliation and breakdown. The W3C agreed to start with the WHATWG DOM specification, then decided to make some incompatible changes without very much justification, and is now trying to publish a new version.

I think that in a lot of ways, there's an ego thing going on here. The W3C was originally started precisely to specify HTML, CSS, and things like that. While their CSS working groups have managed to stay pretty reasonable and are willing to work with those doing the actual implementation, their HTML and DOM groups keep on being hijacked by people with particular agendas, people who won't work in good faith to try to resolve differences reasonably, and people who think that because they're the W3C, they "own" the spec and so think that the WHATWG is just a rogue group, as opposed to a group of the people who actually have the most skin in the game because they have to actually implement the browsers that billions of people use without breaking a hugely complex and diverse amount of content out there.


> Way back in the day, HTML was implemented as an application of SGML.

[citation needed]

I'm unaware of basically any implementation of HTML treating it as an application of SGML; the only notable case I'm aware of is the old HTML validator.

Tim's original implementation of HTML didn't treat it as SGML.


> I'm unaware of basically any implementation of HTML treating it as an application of SGML; the only notable case I'm aware of is the old HTML validator.

Plugging my XML Prague 2017 paper on a SGML DTD for W3C HTML 5.1 here [1].

[1]: http://sgmljs.net/blog/blog1701.html


I was looking for that earlier, it's a fascinating paper, thanks! :)


Sorry, maybe I should have said "inspired by" or something.

You're right, I'm not aware of any actual implementation, outside of validators, that treated it as such.

I did actually say a little later that no web browsers actually implemented it as SGML, but the sentence you quote could cause confusion; but it's too late for me to edit now.


Why don't the browser companies just entirely ignore the W3C from now on?


In part, I think it's because there is still good work that goes on under the W3C umbrella in other areas; the CSS working group has managed to stay reasonable, learn from the mistakes of some over-engineered past standards, and continues to work with implementers.

Also, there are reasons to want some of what the W3C provides that the WHATWG doesn't. The W3C has many more member companies, and it can get them to sign off on patent rights so there's less likelihood of some one of Adobe's patents on page layout in InDesign suddenly being infringed by web browsers due to something in the spec.

And finally, I think that the W3C wants to stay relevant, and so it keeps trying to work from the WHATWG spec as a basis, and promising it will be good this time, and then it goes off and pulls this stuff again.


> The W3C has many more member companies, and it can get them to sign off on patent rights so there's less likelihood of some one of Adobe's patents on page layout in InDesign suddenly being infringed by web browsers due to something in the spec.

The patent policy only has commitments from members of the WG who developed the spec, so you only have coverage from Adobe patents if Adobe is a member of the WG. (As it happens, Adobe still has one representative within the CSS WG, who happens to be one of the chairs.)


> Inspired by the clever algorithm the WebKit developers had come up with for parsing invalid content,

What was the clever algorithm?



It's both true and false. W3C does copy/paste some stuff, but it also seems to add some stuff (especially related to accessibility and internationalisation) that the WHATWG hasn't historically cared for.


If I remember correctly, that relationship was always weak. WHATWG exists because the W3C failed, after all - it was created in opposition to a situation that some players saw as hopelessly broken.

Given the substantial overlap of players in both groups these days, I actually find it surprising that there is divergence. All major vendors are opposed, so who is actually trying to push for changes, and why? I guess I'm lacking some background here; I really don't follow "standard wars" anymore...
