are there copyrighted texts that have such distinctive patterns that they would be particularly easy to spot in an LLM’s output? say, would replacing every comment with a page from moby dick or wuthering heights be more or less infringing than using harry potter? hypothetically.
Well, I’m pretty sure Moby Dick is in the public domain by now. If I were you I’d go for something from Disney which is mathematically certain to get somebody sued although I can’t predict who.
are there copyrighted texts that have such distinctive patterns that they would be particularly easy to spot in an LLM’s output? say, would replacing every comment with a page from moby dick or wuthering heights be more or less infringing than using harry potter? hypothetically.
Well, I’m pretty sure Moby Dick is in the public domain by now. If I were you I’d go for something from Disney which is mathematically certain to get somebody sued although I can’t predict who.