I read a lot of C&D letters from celebrities here and on Reddit, and a lot of them are in the form of "I am important so I am requesting that you do not take advantage of your legal rights." I am not a fan. (If you don't want someone to track how often you fly your private jet, buy a new one for each trip. That is the legal option that is available to you. But I digress...)
Is there a name for this AI fallacy? The one where programmers make an inductive leap like, for example, if a human can read one book to learn something, then it’s ok to scan millions of books into a computer system because it’s just another kind of learning.
And if we wanted to replicate copyrighted text with a LLM, it would still be a bad idea, better to just find a copy online, faster and more precise, and usually free. We here are often posting paywalled articles in the comments, it's so easy to circumvent the paywalls we don't even blink twice at it.
Using LLMs to infringe is not even the intended purpose, and it only happens when the user makes a special effort to prompt the model with the first paragraph.
What I find offensive is restricting the circulation of ideas under the guise of copyright. In fact copyright should only protect expression not the underlying ideas and styles, those are free to learn, and AIs are just an extension of their human users.