Hacker News

munk-a yesterday at 11:01 PM, 2 replies

Yeah, at the end of the day a big part of this question comes down to whether that copying is fair use, and that is an open question, with the transformative nature being the primary point in favor of the LLM. But it is copying from some works into another: if it doesn't have some fair use exception, it is absolutely violating the licensing of most of the training data. It's a bit different from previously settled case law because it's copying so little from so many billions of different things. I think blocking reproduction is wise by LLM companies for PR purposes, but it doesn't guarantee that training is a license-exempt activity.


Replies

strogonoff today at 7:40 AM

Would it be fair to say that if you steal from enough people then it becomes OK? I can’t see it—especially considering this is IP law, expected to grant people confidence in their authorship rights and thus encourage innovation and creativity.

crazygringo yesterday at 11:03 PM

Yup. Of course it's copying. But all expectations are that courts will rule that fair use allows such copying, because of the nature of the transformation.