logoalt Hacker News

winddudeyesterday at 9:15 PM0 repliesview on HN

if you read the article 2pb is available as flash storage in the data pipeline, used to dedupe, clean, normalize, etc, for training from 60pb of raw data.