logoalt Hacker News

hilariouslytoday at 3:51 PM2 repliesview on HN

Why? You stole my stuff and now are pretending I need to argue for you to stop stealing it. It's a joke.


Replies

marssaxmantoday at 5:34 PM

This is the very question under debate. Training LLMs on publicly available data is a novel situation, and neither law nor social opinion have settled a consensus on the subject.

Copyright maximalists like to borrow unearned moral weight for their position by conflating copyright infringement with "stealing", but this is not actually true in any legal sense. It's not clear that training an AI on publicly available data should even constitute copyright infringement, much less "stealing".

Gormotoday at 4:09 PM

What? What is being "stolen" from you?

Are you now layering the old and tired "copyright infringement = stealing" argument on top of the still unsubstantiated premise that all LLM training is copyright infringement?