Hacker News

whycome yesterday at 3:28 PM

Delete? Wasn’t that material already used to train models?


Replies

musicale today at 3:17 AM

"Deleting" data they already ingested is meaningless.

rho_soul_kg_m3 yesterday at 3:43 PM

All AI companies should be forced to re-train their models without the offending materials, and this should also extend to all LLMs distilled from models exposed to copyrighted works. It should also cover code under licences such as the GPL, not to mention patents and designs. This whole LLM business is a giant IP laundromat.

saidnooneever yesterday at 3:45 PM

Well, I guess it's copyright, not distill-statistical-model-from-it rights.