logoalt Hacker News

dec0dedab0detoday at 2:58 PM2 repliesview on HN

This is really interesting, is there a way to provide downloadable caches for websites where that would be legal? I could Imagine just pre-downloading wikipedia, stack overflow, all kinds of documentation, etc. In a compressed and preorganized format instead of scraping it every time.


Replies

ploumtoday at 3:05 PM

The cache is pure and straight files-in-folders. This makes it trivial to browse the cache by hand:

cd .cache/offpunk/https/news.ycombinator.com/item cat "id=46943752"

So it could be trivially shared.

The "netcache" tool gives you the cached content or, with --path, returns the path were to find the contentd.

The only point is to preserve the file-modification attribute, which serves to know the age of a cached ressource.

iamnotheretoday at 3:14 PM

Kiwix basically provides this. You can use kiwix-serve to serve the downloaded zim files locally.

show 1 reply