logoalt Hacker News

djhnyesterday at 8:01 PM1 replyview on HN

Aren’t books massively outweighed by the crawled internet corpus?


Replies

r_leeyesterday at 9:18 PM

I would doubt that because books are probably weighed as higher quality and more trustworthy than random Reddit posts

Especially if it's unsupervised training