moltensyntax's comments

No unit tests? :)


> Significands are fixed point, and fixed point adders, multipliers, and dividers on these are needed for arithmetic operations... Hardware multipliers and dividers are usually much more resource-intensive

It's been a number of years since I've implemented low-level arithmetic, but when you use fixed point, don't you usually choose a power of 2? I don't see why you'd need multiplication/division instead of bit shifters.


Multiplication or division by a power of 2 can be done with a bit shift, assuming the binary digits represent a base-2 number, i.e. not a beta-expansion (https://en.wikipedia.org/wiki/Non-integer_representation) where the digit string is interpreted in base 1.5 or base sqrt(2) or base (pi-2) or whatever (in which case a bit shift would multiply or divide by a power of 1.5 or sqrt(2) or (pi-2) instead).

But when multiplying two arbitrary floating point numbers, the typical case is multiplying significands that are not powers of 2, like 1.01110110 by 1.10010101, and that requires a real multiplier.

General floating point addition, multiplication and division thus require fixed-point adders, multipliers and dividers on the significands.
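
To make the difference concrete, here's a small Python sketch using the significands above (the variable names and the 8-bit fractional scale are mine):

    x = 0b101110110      # 1.01110110 stored as an integer, implied scale 2**-8
    y = 0b110010101      # 1.10010101, same scale

    x_times_8 = x << 3   # multiply by 2**3: a pure bit shift
    x_over_4  = x >> 2   # divide by 2**2 (truncating): a pure bit shift

    # Multiplying the two significands needs a genuine integer multiply;
    # the shift afterwards only moves the binary point back to 2**-8.
    product = (x * y) >> 8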


Fixed point does usually put the "binary point" at a power-of-2 position, but when multiplying two fixed-point numbers you still have to do at least a full integer multiply before the bit shift. Ditto division.
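
For instance, a minimal Python sketch of a hypothetical Q16.16 fixed-point format (16 integer bits, 16 fractional bits); the integer multiply and divide are still there, and the shifts only reposition the binary point:

    FRAC_BITS = 16
    ONE = 1 << FRAC_BITS          # 1.0 in Q16.16

    def fixed_mul(a, b):
        # full integer multiply first, then shift the binary point back
        return (a * b) >> FRAC_BITS

    def fixed_div(a, b):
        # pre-shift the dividend so the quotient keeps its scale;
        # the integer division itself is unavoidable
        return (a << FRAC_BITS) // b

    assert fixed_mul(3 * ONE, 2 * ONE) == 6 * ONE
    assert fixed_div(6 * ONE, 2 * ONE) == 3 * ONE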


We as humans have a lot of intuitive tools for knowing whether a source of data is trustworthy. AI could possibly approach this ability given enough training... we'd need to do something like add a "trust" score to every node in the graph.


Yes, this is possible, depending on the codec and container. I have done similar operations with h264+mp4.

It's good to be able to edit video without losing quality.

Are you sure you need sub-keyframe precision? In h264+aac+mp4, for example, if it's not keyframe aligned, the result is usually a stalled video frame for a split second, but since the audio continues smoothly, it's not that noticeable.

If you know the exact codec settings that were used to encode the video, you can create new pieces to be fit losslessly together. Otherwise, it is more difficult.
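
As a rough sketch (the file names and timestamps are placeholders, and exact seek behavior varies by ffmpeg version), a keyframe-aligned lossless cut is just a stream copy:

    import subprocess

    # -c copy re-muxes the h264+aac streams without re-encoding, so no
    # quality is lost; the cut can only start cleanly at a keyframe.
    subprocess.run([
        "ffmpeg",
        "-ss", "00:01:00",    # seek (lands on/near a keyframe when copying)
        "-i", "input.mp4",
        "-t", "90",           # keep 90 seconds
        "-c", "copy",
        "cut.mp4",
    ], check=True)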

Contact me on twitter at @downpoured and I can describe more.


Tessel (https://tessel.io/) also runs JavaScript on embedded hardware. It cleverly uses Lua as an intermediate language.


That makes it sound like they're running Lua on the embedded device. In that case, is the JS-to-Lua transpiling done on-device as well?


This article again? In my opinion, it's biased. The subtext is that the author is claiming his own "lzip" format is superior. But contrary to the article's claim, xz was not chosen "blindly".

To me, most of the claims are arguable.

To say 3 levels of headers is "unsafe complexity"... I don't agree. Indirection is fundamental to design.

To say padding is "useless"... I don't understand why padding and byte-alignment are given so much vitriol. Look at how much padding the tar format has. And tar is a good example of how "useless padding" was used to extend the format to support larger files. So this supposed "flaw" has been in tar for decades, with no disastrous effects at all.

The xz decision was not made "blindly". There was thought behind the decision.

And it's pure FUD to say "Xz implementations may choose what subset of the format they support. They may even choose to not support integrity checking at all. Safe interoperability among xz implementations is not guaranteed". You could say this about any software - "oh no, someone might make a bad implementation!" Format fragmentation is essentially a social problem more than a technical problem.

I'll leave it at this for now, but there's more I could write.


> To say 3 levels of headers is "unsafe complexity"... I don't agree. Indirection is fundamental to design.

3 individual headers for one file format is unnecessary complexity.

> To say padding is "useless"

Padding in general is not useless, but padding in a compression format is very counterproductive.

> And it's pure FUD to say "Xz implementations may choose what subset of the format they support. They may even choose to not support integrity checking at all. Safe interoperability among xz implementations is not guaranteed". You could say this about any software - "oh no, someone might make a bad implementation!" Format fragmentation is essentially a social problem more than a technical problem.

This isn't about "someone making a bad implementation!", it's about crucial features being optional. That is, completely compliant implementations may or may not be able to decompress a given XZ archive, and may or may not be able to validate the archive.

XZ may not have been chosen blindly, but it certainly does not seem like a sensible format. There is no benefit to this complexity. We do not need or benefit from a format that is flexible, as we can just swap format and tool if we want to swap algorithms, like we have done so many times before (a proper compression format is just a tiny algorithm-specific header + trailing checksum, so it is not worth generalizing away).
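
As a toy illustration of that "tiny header + trailing checksum" shape (the magic bytes are made up, and zlib stands in for the actual algorithm):

    import struct, zlib

    MAGIC = b"TOY1"  # made-up four-byte magic/version tag

    def pack(payload):
        body = zlib.compress(payload)  # stand-in for LZMA2 or anything else
        return MAGIC + body + struct.pack("<I", zlib.crc32(body))

    def unpack(blob):
        assert blob[:4] == MAGIC, "not our format"
        body, (crc,) = blob[4:-4], struct.unpack("<I", blob[-4:])
        assert zlib.crc32(body) == crc, "corrupt stream"
        return zlib.decompress(body)

    assert unpack(pack(b"hello")) == b"hello"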

Any and all benefits of XZ lie in LZMA2. We could have lzip2 and avoid all of these problems.

(I have no opinion as to whether LZIP should supersede GZIP/BZIP2, but XZ certainly seems like a poor choice.)


> 3 individual headers for one file format is unnecessary complexity.

So all these file formats are unnecessarily complex?

- all OpenDocument formats

- all MS office formats

- all multimedia container formats

- deb/rpm packages

etc?


It depends on how you count headers, but yes.

Multimedia containers, while too complicated, don't really qualify for a position on that list. These containers are basically just special purpose file containers, and thus the headers of the "files" within should not contribute to the header count.

deb/rpm are also good examples of old and quite obnoxious formats. Deb is an AR archive containing two gzip-compressed TAR archives (control and data) and a single file (debian-binary). TAR replaced AR for all but a few ancient tasks long ago, but for some reason Deb uses both. A tar.gz whose three files/folders were not separately tar'd or compressed would have been much simpler. I believe RPM roughly goes that route, but uses CPIO rather than TAR, and rather than embedding the metadata inside the archive, the RPM package has its own header.
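
For contrast, a sketch of that flat layout with Python's tarfile (assuming a debian-binary file and control/ and data/ directories exist on disk; the names just mirror deb's existing split):

    import tarfile

    # One archive, three top-level members, none of them separately
    # tar'd or compressed -- the whole thing is gzipped exactly once.
    with tarfile.open("example.tar.gz", "w:gz") as pkg:
        pkg.add("debian-binary")   # version marker file
        pkg.add("control")         # metadata directory
        pkg.add("data")            # payload directory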

Both RPM and DEB have added support for a bunch of compression formats, meaning that not only do the contents of a DEB/RPM package have dependencies, but each package can now basically end up having its own dependencies that need to be satisfied before you can even read the package in the first place. Oh, and one of the supported compression formats is now XZ, adding an extra dependency, as your version of XZ might not support the contained XZ archive at all.


Aren't MS office formats the poster child for overly complex file formats?


> rpm packages

I recall an article posted here detailing how incredibly bloated and crufty the RPM format was.


"Look at how much padding the tar format has. And tar is a good example of how "useless padding" was used to extend the format to support larger files. So this supposed "flaw" has been in tar for dozens of years, with no disastrous effects at all."

Just because it's in tar doesn't mean that the design is flawless. tar was created a long time ago, when a lot of things we are concerned with now weren't even thought of.

Deterministic, bit-reproducible archives are one thing that tar has recently struggled with [1], because the archive format was not originally designed with that in mind. With more foresight and a better archive format, this need not have been an issue at all.

[1] - https://lists.gnu.org/archive/html/help-tar/2015-05/msg00005...


The name tar comes from Tape ARchive. Lots of padding makes sense when you know that tar was originally used to write files to magnetic tape, which is highly block-oriented. The use of tar today as a bundling and distribution format is something of a misapplication, as it lacks features one might want from such a format.


Thanks for such an amazing rabbit-hole of a link.


I feel he has made a case for some inadequacies in Xz. Some of the claims seem exaggerated, such as (2.2) the optional integrity checking, assuming the decompressor at least logs the fact that it couldn't do the integrity check. Some others are clearly more significant issues, such as (2.5) not checksumming the length fields and (2.6) the variable-length integers being able to cause framing errors. Others still are petty, such as (2.3) too many possible filters.
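
On (2.6), the framing concern is easy to see: xz stores lengths as variable-length integers, 7 payload bits per byte with the high bit marking "more bytes follow", so one flipped continuation bit silently moves where the field ends. A small Python sketch of the decoding:

    def decode_varint(buf, pos):
        # xz-style multibyte integer: 7 bits per byte, little-endian,
        # high bit set on every byte except the last
        value, shift = 0, 0
        while True:
            b = buf[pos]
            pos += 1
            value |= (b & 0x7F) << shift
            if not b & 0x80:
                return value, pos
            shift += 7

    # 0x81 0x01 -> 129, consuming two bytes...
    assert decode_varint(bytes([0x81, 0x01]), 0) == (129, 2)
    # ...but flip the continuation bit and the field "ends" a byte early,
    # so every later offset in the stream is parsed out of frame.
    assert decode_varint(bytes([0x01, 0x01]), 0) == (1, 1)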

While I think he made a case, I somewhat doubt that the other formats are flawless, and the real answer would lie in a more open analysis of all of them.


This should now be resolved; Safari should work fine.


It looks like Safari implements regular expressions slightly differently, which leads to the parser behaving differently. I hope to have it addressed by the end of the week.


Irregular expressions


I liked Runtime Revolution. I wish the broader coding community would take environments like this more seriously; is the text editor sacred? I left the platform before they added more mobile+online support, so I'll have to see what they have added recently.


> is the text editor sacred?

No, it's the worst form of development, except for all those other forms that have been tried from time to time.


Livecode is great for creating tools on most platforms very fast: literally a few hours to create quite functional frontends to command-line applications, for instance. The batteries-included way of working is a breath of fresh air compared to other environments, but the reach of the platform seems limited, mostly because making applications actually release-ready for 'the world' takes insane amounts of effort. That doesn't diminish the power of the system for me, though; being able to whip up prototypes and 'good enough' helper/tool applications is something I otherwise only know from Tcl/Tk (especially with Visual Tcl). Delphi/VB are fast, but not as fast as these. I create mostly write-only software with it, but for what I need, it is good, and I don't know anything that matches that kind of productivity.


I'm still kind of awestruck by this. Hosting an OS inside a browser. It's great as a time capsule to perfectly see the past. But because it's an emulator, it would be difficult to add a more modern feature, like ViperCard's share-url-to-stack and save-stack-as-json.

