Docker already fills up my dev machines yet they decided for this insane solution:
> The containerd image store uses more disk space than the legacy storage drivers for the same images. This is because containerd stores images in both compressed and uncompressed formats, while the legacy drivers stored only the uncompressed layers.
Why?
Sounds like a straightforward time-space tradeoff: if you have the compressed layers sitting around when you need them, you can avoid the expense and time of compressing them.
I'm not sure about the fastest MacBook disk access, but even with NVMe storage I've found lz4 to be faster than the disk. That is (it's hard to state this exactly right), compressed content gets read/written FASTER than uncompressed content, because fewer bytes need to transit the disk interface and the CPU can compress/decompress significantly faster than data moves over whatever disk bus you've got.
On my two-year-old ThinkPad laptop the SSD is faster than lz4. On a fat EC2 server lz4 is faster. So you really have to test your particular config.
Yeah, I'm not surprised the PCIe 5.0 transfer speeds matched with top tier SSD chips win that race.
It still bothers me that the fastest most performant computer I have access to is almost always my laptop, and that by a considerable margin.
Someone should do some lz4 vs. ssd benchmarks across hardware to make my argument more solid and the boundaries clear.
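Lacking a cross-hardware suite, here's a minimal stdlib sketch of the decompression side. zlib stands in for lz4 (lz4 isn't in the stdlib; the third-party `lz4.frame` module decompresses several times faster, so treat the printed number as a floor). Compare the MB/s against your disk's sequential read rate from `fio` or similar:

```python
import time
import zlib

# zlib stands in for lz4 here (lz4 is third-party; lz4.frame is
# several times faster, so real lz4 numbers would look even better).
payload = b"some fairly repetitive log line, level=INFO path=/api/v1\n" * 100_000
compressed = zlib.compress(payload, 1)  # level 1 ~ "fast" mode

t0 = time.perf_counter()
for _ in range(5):
    out = zlib.decompress(compressed)
elapsed = time.perf_counter() - t0

mb = len(payload) * 5 / 1e6
print(f"ratio: {len(payload) / len(compressed):.1f}x, "
      f"decompress: {mb / elapsed:.0f} MB/s of uncompressed data")
```

Whether compressed reads win then reduces to: (decompress MB/s) x (compression ratio) vs raw disk MB/s, on your machine.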
You can get AWS instances with very fast local NVMe drives.
If you have many cores and have the right optimizations in place the bottleneck for lz4 decompression is RAM throughput which is always going to beat whatever fancy disk setup you have.
But yes on the extreme end absolutely there's a point where lz4 stops making sense, but also most of us aren't trying to max out a 128 core postgres server or whatever.
Why would I need the compressed layers?
The OCI manifest references the hashes of these compressed layers, and re-compressing them does not guarantee obtaining the same hash.
Recompressing should be guaranteed deterministic. It’s the packing/unpacking of tar archives to/from directories on disk that leads to the non-determinism (such as timestamps and ownership metadata). If the tar is left intact, both zstd and gzip should produce byte for byte identical outputs given the same compression parameters.
That is not correct. You would have to use the same compression tool (and likely version) for this to match.
Old Docker discarded the compressed bits but kept some metadata about the tar so it could at least recreate it.
It also recreated the manifest on push.
Thanks for the correction. I did mean given the same tooling version/parameters, but (as you and others pointed out) preserving and recreating that state is not at all straightforward.
You are correct; I confused archiving with compression. However, even considering only the compression process, same compression parameters cannot be guaranteed, as it is unknown which compression parameters the image publisher used.
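A quick stdlib illustration of why "same parameters" is slippery (Python's gzip/zlib here; zstd has analogous caveats): the raw DEFLATE stream is deterministic for a fixed library version, input, and level, but the gzip container embeds a timestamp in its header, so identical inputs and settings can still yield different bytes unless every such field is pinned:

```python
import gzip
import zlib

data = b"layer contents " * 1000

# Raw DEFLATE output is deterministic for a fixed library version,
# input, and level:
assert zlib.compress(data, 6) == zlib.compress(data, 6)

# But the gzip *container* embeds a modification time in its header,
# so "same data, same level" can still produce different bytes:
a = gzip.compress(data, compresslevel=6, mtime=0)
b = gzip.compress(data, compresslevel=6, mtime=1)
assert a != b

# Pinning every header field restores byte-for-byte output, but only
# within one library version; a zlib upgrade may change the stream.
assert gzip.compress(data, compresslevel=6, mtime=0) == a
```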
That's true. And regardless of compressed vs. plain tar, I think the OCI format working with opaque archives is extremely limiting. I hope the industry will eventually redesign it to use content-addressable storage per file, with metadata describing the layer/disk layout. That would allow per-file deduplication, and we could use tar just for bulk transfer over the wire rather than for the data at rest.
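A toy sketch of that per-file content-addressable idea (this is NOT how OCI works today; all names here are made up): blobs keyed by the digest of their contents, with a "layer" reduced to path-to-digest metadata, so identical files dedupe across layers for free:

```python
import hashlib

# Hypothetical per-file content-addressable store: blob bytes keyed
# by sha256 of their contents.
store: dict[str, bytes] = {}

def put_blob(content: bytes) -> str:
    digest = hashlib.sha256(content).hexdigest()
    store[digest] = content  # identical files collapse to one entry
    return digest

# Layers become plain metadata mapping paths to blob digests.
layer_a = {"/app/run.sh": put_blob(b"#!/bin/sh\necho hi\n"),
           "/app/lib.so": put_blob(b"\x7fELF...")}
layer_b = {"/srv/run.sh": put_blob(b"#!/bin/sh\necho hi\n")}  # same file, new path

# Two layers, three path entries, but only two unique blobs stored:
print(len(store))  # -> 2
```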
containerd 2.3 has support for erofs which does a direct import of the layer. It can even convert the tar based layers to erofs, faster than extracting the tar normally.
Also looking at block-based content store so that blocks can be deduped across images.
If that's the purpose, couldn't you store the hash and throw away the compressed image?
(As others said, compression is deterministic for the same algorithm, parameters and input data)
Zstd for example only promises determinism on the same version of the library. I've personally seen the hashes mutate between pull and export. Things like tar padding also make a difference. Really, the thing to do is to hash on the _uncompressed_ data and let compression be a transport/registry detail. That's what I've done, at least.
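A minimal stdlib sketch of that hash-on-uncompressed scheme (gzip/zlib standing in for whatever a registry actually uses): the compressed bytes vary with container format and settings, but every variant resolves to the same canonical digest:

```python
import gzip
import hashlib
import zlib

layer = b"uncompressed layer tar bytes " * 1000

# The stable identity: digest of the *uncompressed* content.
canonical = hashlib.sha256(layer).hexdigest()

# Different compression containers/settings give different bytes,
# hence different compressed digests...
blob_gzip = gzip.compress(layer, mtime=0)
blob_zlib = zlib.compress(layer, 9)
assert blob_gzip != blob_zlib

# ...but both round-trip to content with the same canonical digest,
# so the backend is free to recompress however it likes:
for blob in (blob_gzip, blob_zlib):
    raw = zlib.decompress(blob, wbits=47)  # 47 = 32+15, auto-detects gzip/zlib header
    assert hashlib.sha256(raw).hexdigest() == canonical
```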
I didn't know that about zstd, that's a bit unfortunate.
Tar isn't related here though; we're talking about compression, not archival formats.
Yes, compression being part of the OCI image's digest was (in hindsight) a poor decision. _Technically_ OCI images allow uncompressed layers, and the layers could be included without compression (and transport compression to be used); this would allow layers to be fully reproducible. We explored some options to do this (and made some preparations; https://github.com/containerd/containerd/pull/8166), but also discovered that various implementations of registry clients didn't handle transport-compression correctly (https://github.com/distribution/distribution/pull/3754), which could result in client either pulling the full, uncompressed, content, or image validation failing.
For my registry fork/custom pull client I hash on the uncompressed content and store as compressed under the uncompressed digest. This lets me have my cake and eat it, too - compression free digests, smaller storage costs, be able to set consistent compression settings, have the ability to spend extra CPU to recompress on the backend without breaking hashes, etc. I control both pull client and registry, so it works.
The whole reason is that compression is not deterministic across tooling.
Pushing
What about pushing? Computers are fast enough to compress stuff as it's being transmitted, you don't need to store the compressed copy anywhere...
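Streaming compression during upload is easy enough to sketch with the stdlib (`send` here is a hypothetical stand-in for whatever writes to the registry connection); the catch, per the determinism discussion above, is that the manifest must reference the digest of exactly these compressed bytes:

```python
import zlib

def push_layer(chunks, send):
    """Compress an uncompressed layer on the fly while uploading;
    nothing compressed ever needs to sit on disk. `send` is a
    stand-in for whatever writes to the registry connection."""
    c = zlib.compressobj(level=6)
    for chunk in chunks:
        out = c.compress(chunk)
        if out:
            send(out)
    send(c.flush())

# Toy usage: "upload" into a buffer and check it round-trips.
wire = bytearray()
push_layer([b"layer part 1 ", b"layer part 2"], wire.extend)
assert zlib.decompress(bytes(wire)) == b"layer part 1 layer part 2"
```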
To save disk space /s
But if it stores the uncompressed layers, why store the compressed ones too? Why both at the same time?
did you mean the first "compressed" to be "uncompressed" ?
> https://docs.docker.com/reference/cli/docker/system/prune/
Just in case - I'm always amazed how many Docker users don't know about the prune command for cleaning up the caches and deleting unused container images and just slowly let their docker image cache eat their disk.
Prune is nice, but if you have a bunch of containers that run a short time for a build step or similar, prune would collect those too. A filter like "last used a few months ago" would be useful.
I think you can filter on last created, but agree last used would be helpful:
> https://docs.docker.com/reference/cli/docker/image/prune/#fi...

I do prune all the time, but working on a lot of different projects it fills up as quickly as it empties.
Any reason why those containers can't be run ephemerally?
Also this doesn't just mean more disk space usage, but also longer local build times... for the app I'm working on `exporting to image` takes 71.5 seconds with containerd, without containerd it's 4.3s (the rest of the build takes ~180 seconds). And that's just a 5.76GB image.
That's almost certainly nothing to do with the change. Please report this if you can with commands used.
Buildkit isn't changing behavior here. Internally in docker there is a shim to make the legacy storage behave like containerd snapshotters (as well as it can, anyway--not perfect due to hard to resolve issues in the old storage). But it still kept both the compressed and uncompressed versions of images.
[flagged]
What does Apple have to do with any of this?
> It is shameful for apple to hard solder their disks. There is no benefit to the user
Actually, it is. The speed and latency difference does matter, that is how even an 8GB RAM MacBook feels snappier than many a 32GB Windows machine - it can use the disk as swap.
I had to work on a Mac M3 for a year and it sucked; it did not feel snappier than any Windows or Linux machine (including this one) that I've ever used, and that goes back to the 1980s.
I suggest you judge based on benchmarks rather than vibes.
If you believe the latest M3 does not perform better than machines you’ve used in the 80s, I have no idea how to even start a reasonable discussion about this.
> If you believe the latest M3 does not perform better than machines you’ve used in the 80s
That wasn't what I was trying to say; I apologize, I should have been clearer. What I meant was that I've been using many different computers since the 1980s, so I have a wide and deep sampling of experience with them, and to that end the M3 did NOT feel to me like it performed better. Regardless of the benchmarks, I know how a machine should feel, and the M3 did not feel any better than any other machine I've used (and that is a lot of laptops).
Ok that is a much better point and a fair correction.
Well, no? That's literally saying "trust the synthetic process, we don't care about real-world usage"?? I don't care if it works better theoretically; if it feels bad in everyday usage it IS bad.
Well, grab an Apple from the 80s and try running a modern app on it and see whether the M3 performs better or worse.
Or, if the point is that software became very bloated, then sure but they also do a lot more nowadays so then you’re really just comparing apples with oranges.
This explanation for the soldered in SSD on some models has never fully made sense, because Apple make computers with removable fast SSDs right now: the M4 Mac Mini, and their range topping Mac Studios.
I absolutely agree Apple typically ship a fast SSD in their computers. I am not convinced they had to solder them to achieve the performance.