Show HN: ZeroFS – A log-structured filesystem for S3

94 points • by Eikon • today at 1:41 PM • 46 comments

Comments

The claim of sub-millisecond writes with data in S3 is false and impossible. If you look at the benchmark, the fsync is not timed, so this is just the latency of either the network or in-kernel file operations, depending on the mount settings.
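To illustrate the point about what a write benchmark actually measures, here is a minimal sketch (not the ZeroFS benchmark itself) that times the same write with and without `fsync` inside the timed region. The helper name `time_write` and the 4 KiB payload are assumptions made for illustration; the path should point at a file on the mount under test.

```python
import os
import tempfile
import time


def time_write(path, payload, sync):
    """Return the seconds taken to write payload, optionally including fsync.

    Hypothetical helper for illustration only. With sync=False the timer
    stops as soon as write() returns, which usually means the data is only
    in a cache. With sync=True the fsync() call is inside the timed region.
    """
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        start = time.perf_counter()
        os.write(fd, payload)
        if sync:
            os.fsync(fd)  # ask the OS to flush this file to stable storage
        return time.perf_counter() - start
    finally:
        os.close(fd)


if __name__ == "__main__":
    # A temp directory is used here; point this at the mounted filesystem
    # to compare the two numbers on the storage you care about.
    with tempfile.TemporaryDirectory() as tmp:
        target = os.path.join(tmp, "probe.bin")
        data = b"x" * 4096
        print("write only   :", time_write(target, data, sync=False))
        print("write + fsync:", time_write(target, data, sync=True))
```

If the two numbers differ by orders of magnitude on a given mount, the write-only figure is measuring a cache rather than durable storage.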

jbverschoor • today at 6:23 PM

I’m more interested in a generic (offline) caching layer, for example for SMB or NFS, preferably one that does conflict resolution like Dropbox.

coxley • today at 2:29 PM

From the docs:

> ZeroFS fetches object data in 128 KiB parts

Read/write operations in object storage are _far more_ expensive than stored bytes. I'm always afraid of anything that abstracts over S3/GCS access specifically for that reason.
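As a rough illustration of why part size matters for request counts, the sketch below computes how many GETs are needed to read a given number of bytes if every GET returns exactly one part. The function names and the per-1000-request price are placeholders, not quoted provider prices; plug in the numbers from your provider's current price list.

```python
KIB = 1024
MIB = 1024 * KIB
GIB = 1024 * MIB


def get_requests(bytes_read, part_size=128 * KIB):
    """Number of GETs needed when each GET returns one part (ceiling division)."""
    return -(-bytes_read // part_size)


def request_cost(requests, price_per_1000):
    """Cost of a number of requests at a given price per 1000 requests.

    price_per_1000 is an assumption supplied by the caller, not a real quote.
    """
    return requests / 1000 * price_per_1000


if __name__ == "__main__":
    hypothetical_price = 0.0004  # placeholder value, per 1000 GETs
    for part in (128 * KIB, 8 * MIB):
        n = get_requests(GIB, part)
        print(part // KIB, "KiB parts:", n, "GETs,",
              request_cost(n, hypothetical_price), "in request fees")
```

Reading 1 GiB in 128 KiB parts takes 8192 requests, versus 128 requests with 8 MiB parts, so without caching or coalescing the request count grows 64-fold for the same bytes.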

rapatel0 • today at 5:16 PM

I prototyped something like this for fun a long time ago. Treating S3 like a bucket of blocks seemed like an intuitive way to build a scalable filesystem. Arguably Ceph and Lustre are doing something similar, except with separate metadata servers to serve the hotter content.

I think the critical thing you will need to explain is durability and the loss window. Making some guarantees on failure modes would go a long way towards making me believe I can run operations on something like this.

With AI you should be able to do some exhaustive testing for load, power loss, server loss, etc. Anxious to see the potential results.

abtinf • today at 2:05 PM

Entrusting data storage to a vibe coded filesystem seems imprudent.

tribal808 • today at 2:40 PM

I’ve seen things like this before; your key differentiator needs to be efficiency and safety compared to other options.

dijksterhuis • today at 5:51 PM

previous Show HNs

- https://news.ycombinator.com/item?id=48496242 -- 11 points | 20 days ago | 7 comments

- https://news.ycombinator.com/item?id=45174724 -- 64 points | 9 months ago | 40 comments

blog post 3 days ago: https://news.ycombinator.com/item?id=48712122

nyc_pizzadev • today at 5:46 PM

Worth mentioning FiberFS: https://fiberfs.io/

I believe it just recently launched.

dan_sbl • today at 2:06 PM

> The test suites run in public CI.

> Each card links to the CI pipeline.

Thanks for being explicit, AI written marketing site. Wouldn't have been able to figure that out! Every currently maintained and reasonably popular open source project either runs CI in public or makes the tests extremely easy to run.

preetham_rangu • today at 2:36 PM

How does this compare to JuiceFS or SeaweedFS in terms of metadata latency? The LSM tree approach is interesting but compaction pauses on a remote-backed store seem like they could be painful.

ChocolateGod • today at 2:42 PM

I believe the first version of this required the metadata to be stored on the ZeroFS server, making HA kinda hard.

Has this changed now, so that if I stop the server and create a new instance with the same configuration file, it'll pick up the existing metadata from the bucket?

tmach32 • today at 2:22 PM

See also: JuiceFS, S3FS, and quite a few others.

We have done loads of research into using object storage wherever we can (given how cheap it is compared to SSDs), and so far it seems like making your application object store-aware is a far surer bet than abstracting S3 behind the file system. The behavior is just too different.

I'm more interested in applications that cleverly use object storage, e.g. AutoMQ, which is quite compatible with Kafka APIs but needs no HDDs.

amelius • today at 4:32 PM

That name makes it sound like your files end up in /dev/null.

sqquima • today at 4:39 PM

So, can I run this on top of RustFS? And RustFS on top of this?

chillfox • today at 3:28 PM

I don’t get why they went for NFSv3. v4 is quite old, and I can’t think of any reason why you would choose v3 over v4.

iamalizaidi • today at 2:17 PM

Seems purely vibecoded

abtinf • today at 2:09 PM

Why does this landing page load js from merklemap.com?

breckognize • today at 2:13 PM

Under the hood, S3's storage nodes are also built on a log-structured file system: https://cdn.amazon.science/77/5e/4a7c238f4ce890efdc325df8326...

(Not POSIX-compliant because it doesn't need to be.)

aniketsaini777 • today at 2:44 PM

The 128 KiB chunk size is an interesting tradeoff point — small enough to avoid wasting bandwidth on partial reads, but you're still paying per-request overhead on S3 (both cost and latency) for anything that reads across many chunks. Curious how ZeroFS handles read-ahead/prefetching for sequential access patterns, since that's usually where these abstractions either save you or quietly rack up request costs. Tools like JuiceFS and SeaweedFS handle this differently (local metadata cache + larger block coalescing) — would be interesting to see a head-to-head on request volume for the same workload.
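For readers unfamiliar with block coalescing, here is a generic sketch of the idea, not ZeroFS's, JuiceFS's, or SeaweedFS's actual logic. It maps a byte range onto fixed-size chunk indexes and then groups consecutive chunks into runs, each of which could be fetched with a single ranged request. The names `chunks_for_range`, `coalesce`, and `max_run` are hypothetical.

```python
CHUNK = 128 * 1024  # chunk size in bytes, taken from the docs quote in this thread


def chunks_for_range(offset, length, chunk_size=CHUNK):
    """Return the chunk indexes touched by a read of `length` bytes at `offset`."""
    if length <= 0:
        return []
    first = offset // chunk_size
    last = (offset + length - 1) // chunk_size
    return list(range(first, last + 1))


def coalesce(chunk_ids, max_run):
    """Group chunk ids into (first, last) runs of consecutive ids.

    Each run holds at most max_run chunks, so a run can be served by one
    larger request instead of one request per chunk.
    """
    runs = []
    for cid in sorted(set(chunk_ids)):
        if runs and cid == runs[-1][1] + 1 and runs[-1][1] - runs[-1][0] + 1 < max_run:
            runs[-1][1] = cid  # extend the current run
        else:
            runs.append([cid, cid])  # start a new run
    return [tuple(r) for r in runs]


if __name__ == "__main__":
    wanted = chunks_for_range(0, 1024 * 1024)  # a sequential 1 MiB read
    print(len(wanted), "chunks ->", len(coalesce(wanted, max_run=4)), "requests")
```

A sequential 1 MiB read touches 8 chunks; with `max_run=4` that becomes 2 requests instead of 8. The tradeoff is over-fetching when the access pattern turns out to be random.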
