The files created by Lepton can't be displayed by any client. Brunsli is a lossless converter for JPEG <-> JPEG XL, and thanks to the improved JPEG XL algorithms it decreases the file size.
The interesting part is that you can therefore convert between these formats thousands of times without visual regressions. You could (in a few years) store only the JPEG XL file, and your webserver could transcode it to JPEG for legacy browsers.
They may still be comparable. If the point of Lepton is to save space, and if a round-trip through Brunsli costs less than Lepton encoding while saving similar amounts of space, then it could be a design alternative.
Their respective READMEs both claim a 22% size reduction, which sounds like an interesting coincidence. Have they identified a similar inefficiency in the format itself?
Another inefficiency of JPEG is that each block (8x8 pixels in size) is compressed independently[^]. This means that the correlations between pixels that are adjacent across a block boundary are not used. If I were to take a JPEG image and randomly flip its blocks (mirror them along the x and/or y axis), the resulting JPEG would have a very similar filesize, even though it's a much less likely image.
Brunsli and, IIUC, Lepton, make use of these correlations.
[^] The average colors of blocks are not compressed strictly independently, but the space used on those is small compared to the rest of the information about a block.
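The block-flipping claim can be checked directly: mirroring a block only flips the signs of its odd-frequency DCT coefficients, so the coefficient magnitudes (and hence roughly the entropy-coded size) are unchanged. A minimal sketch along one dimension, with a hand-rolled unnormalized DCT-II (no particular codec's implementation):

```python
import math

def dct2(x):
    """Unnormalized 1D DCT-II: X[k] = sum_n x[n] * cos(pi/N * (n + 0.5) * k)."""
    N = len(x)
    return [sum(x[n] * math.cos(math.pi / N * (n + 0.5) * k) for n in range(N))
            for k in range(N)]

row = [52, 55, 61, 66, 70, 61, 64, 73]   # one row of a hypothetical 8x8 block
fwd = dct2(row)
rev = dct2(row[::-1])                    # the same row, mirrored

# Mirroring negates exactly the odd-frequency coefficients:
for k, (a, b) in enumerate(zip(fwd, rev)):
    assert abs(a - (-1) ** k * b) < 1e-9
```

Since JPEG entropy-codes coefficient magnitudes and signs separately (signs are near-incompressible anyway), the flipped image costs about the same number of bits.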
Very interesting. This independence across blocks can presumably be leveraged at decode time for faster decoding, though. Surely there must be decoders out there parallelizing over the blocks on multi-core architectures/GPUs?
Do you know how Brunsli & Lepton fare when it comes to parallelizability?
I assume that you mean parallelizability of decoding and not of encoding.
JPEG's decoding is poorly parallelizable: the entropy decoding is necessarily serial; only the inverse DCTs can be parallelized.
Sharing the data about boundaries need not hamper parallelizability in its simplest sense: imagine a format where we first encode some data for each boundary, and then encode all the blocks, each of which can only be decoded when provided the data for all four of its boundaries.
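A toy sketch of that ordering, with made-up `decode_boundary`/`decode_block` stand-ins for real entropy decoding: pass 1 produces all boundary data serially, after which each block depends only on already-available inputs and can be decoded in parallel:

```python
from concurrent.futures import ThreadPoolExecutor

# Toy stand-ins: a real codec would entropy-decode these from the bitstream.
def decode_boundary(i):
    return i * 10                                  # pretend: data for boundary i

def decode_block(idx, top, bottom, left, right):
    return idx + top + bottom + left + right       # pretend: block reconstruction

W, H = 4, 3  # grid size in blocks, not pixels

# Pass 1 (serial): all horizontal and vertical boundary data.
hbound = [decode_boundary(i) for i in range((H + 1) * W)]
vbound = [decode_boundary(i) for i in range(H * (W + 1))]

def block_inputs(x, y):
    return (hbound[y * W + x], hbound[(y + 1) * W + x],
            vbound[y * (W + 1) + x], vbound[y * (W + 1) + x + 1])

# Pass 2 (parallel): every block reads only pass-1 results.
with ThreadPoolExecutor() as pool:
    blocks = list(pool.map(
        lambda xy: decode_block(xy[1] * W + xy[0], *block_inputs(*xy)),
        [(x, y) for y in range(H) for x in range(W)]))
```

The point is only the dependency structure: nothing in pass 2 reads another block's output, so the scheduler is free to run the blocks in any order or all at once.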
However, memory access time is a large cost during decoding, so what often matters is the total number of non-cacheable memory roundtrips that each pixel/block of the image has to take part in. If we assume that a single row of blocks across the whole image is larger than the cache, then any approach similar to the one described in the previous paragraph adds one roundtrip.
Another consideration is that a single block is often too small to be a unit of parallelization -- parallelizing entropy decoding usually has additional costs in filesize (e.g. for indices), and any parallelization has startup costs for each task.
The end result is that a reasonably useful approach to parallelization is to split the image into "large blocks" that are large enough to be units of parallelization on their own, and encode _those_ independently.
In JPEG XL, there's both the original sequential Brunsli and a tiled version of it (using groups of 256x256 pixels) which can be encoded/decoded in parallel. If you have a 4:4:4 JPEG (no chroma subsampling), you can also, instead of Brunsli, use the VarDCT mode of JPEG XL, where all the block sizes happen to be 8x8. Compression is similar to that of Brunsli, and it's even slightly faster (but it doesn't allow you to reconstruct the JPEG _file_ exactly; only the _image_ itself is lossless in that case).
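The group split itself is just a ceiling division of the image dimensions. A sketch of carving an image into 256x256-pixel groups that could each be handed to a worker (the group size is from the comment above; the helper is illustrative, not JPEG XL's actual API):

```python
GROUP = 256  # group edge in pixels, as in JPEG XL's tiled mode

def groups(width, height, size=GROUP):
    """Yield (x, y, w, h) rectangles covering the image, each at most size x size."""
    for y in range(0, height, size):
        for x in range(0, width, size):
            yield (x, y, min(size, width - x), min(size, height - y))

tiles = list(groups(1920, 1080))
# 1920 px -> 8 columns (ceil of 7.5), 1080 px -> 5 rows: 40 groups,
# with the right and bottom edges getting smaller partial groups.
```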
That might prove to be a good measure of image compression 'efficiency'.
Present to a user two images, one an image compressed by image compressor X, and one compressed by the same image compressor with a single bit of output flipped.
In an ideal image compression scenario, the decompressed images would not be the same, but a user could not tell which was the 'correct' image, since both would look equally realistic.
If a scheme had something like that property and satisfied some simpler conditions, I would wager that it necessarily is a good compression scheme. However, this is very much not required of a good compression scheme:
Imagine that a compression scheme used the first bit to indicate if the encoded image is an image of a cat or not. Changing that bit would then have very obvious and significant implications on the encoded image.
If that example seems too unrealistic, imagine a modification of a compression scheme that, before decoding, xors every non-first bit with the first bit. Then flipping the first bit in the modified scheme is equivalent to flipping a lot of bits in the unmodified scheme, but they are equivalently good at encoding images.
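That modification is easy to write down concretely. A sketch over a bitstring (a list of 0/1 ints; the function names are mine), showing that one flip in the modified stream corresponds to many flips in the unmodified one:

```python
def xor_with_first(bits):
    """XOR every non-first bit with the first bit. Self-inverse, so it is
    both the encoder-side and decoder-side transform."""
    return bits[:1] + [b ^ bits[0] for b in bits[1:]]

original = [0, 1, 1, 0, 1, 0, 0, 1]
modified = xor_with_first(original)     # first bit is 0, so unchanged here

# Flip only the first bit of the modified stream...
flipped = [1 - modified[0]] + modified[1:]
# ...and undo the transform: every bit now differs from the original.
decoded = xor_with_first(flipped)
diff = sum(a != b for a, b in zip(original, decoded))  # diff == 8
```

Both schemes map equal-length inputs to equal-length outputs bijectively, so they are equally good at encoding images; only the geometry of "nearby" bitstrings changed.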
Edit: To put it short, the important property is that equally-long encoded images are "equally plausible": it's not important how many bits differ between them, and it doesn't matter if they are similar to each other.
Ah, thank you. I misread the GP. It seems that he is saying nearly[^] exactly what I wanted to say in the edit.
[^] the property should hold not only for single-bit changes, but all length-preserving changes -- it's perfectly fine for all single bitflips to e.g. result in invalid codestreams.
Single-threaded Brunsli runs 2.5x faster than single-threaded Lepton, but Brunsli compresses 1% less. Brunsli is able to compress more JPEGs (including more exotic JPEG configurations) than Lepton.
Just to put numbers on this: on an Intel Skylake CPU, Brunsli compressed a 1 MB image in 0.16 s, while Lepton needed 0.5 s, more than triple the CPU time. Brunsli peaked at 100 MB of RAM while Lepton peaked at 44 MB.
Decoding with Lepton took 0.23 s and 8 MB of RAM, while Brunsli used 0.13 s and 42 MB.
Not comparable. Brunsli and Lepton are lossless compressors for JPEG files; HEIF is a completely different lossy image encoder.
To compare the size of a Brunsli/Lepton encoded JPEG file with an HEIF image, you'd need to define some sort of quality equivalence between the two, which gets complicated fast.
Indeed, HEIF/HEIC is basically a slightly dumbed-down HEVC (H.265) i-frame (full frame) (HEIC)[1] plus a new container format (HEIF)[2], similar to WEBP being VP8 i-frames in a RIFF container. So they are used as full-blown codecs in practice, usually not in a lossless mode, so shifting JPEG to HEIC or WEBP will lose some quality.
[1] Decoding HEIC in Windows (10) requires you to have installed a compatible HEVC decoder. That's 99 cents (plus the hassle of setting up a store account and payment processing with MS), or an alternative free one which will use the HEVC codec shipped with hardware such as newer Intel CPUs (QSV) or GPUs. Thank you, patent mess!
[2] HEIF, the container format, can contain JPEG data, but in practice it does not, or only as a supplementary image (previews, pre-renders, thumbnails, etc.)
https://github.com/dropbox/lepton
The goals and results appear similar. Is the primary difference that brunsli will likely actually be part of a standard (JPEG XL)?