For all the people represented in the training data to receive royalties would be an incredible wealth transfer to the Extremely Online. My forum posts, StackOverflow answers etc are also contributing to the model outputs. The training data, by volume, mostly belongs to blog authors, redditors, Wikipedia editors, to us!
I object to calling people chatting online artists.
However, ultimately nobody is going to pay them more than the value of their posts to the AI company which puts a severe cap on what that’s actually worth. People who post a great deal of online content might be worth compensating a few thousand dollars, but it would be hard for them to then turn that down.
Hey finally my reddit and hn habit can be lucrative!
The people in that counting to infinity subreddit would get compensated a lot if this were fully automated - their posts were so overrepresented in the training set that many of their usernames became complete tokens (e.g. SolidGoldMagikarp).