OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

L4sBot · 11 months ago

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

@afraid_of_zombies@lemmy.world · 11 months ago

I am sure they have patched it by now but at one point I was able to get chatgpt to give me copyright text from books by asking for ever large quotations. It seemed more willing to do this with books out of print.

@stewsters@lemmy.world · 11 months ago

Yeah, it refuses to give you the first sentence from Harry Potter now.

Which is kinda lame, you can find that on thousands of webpages. Many of which the system indexed.

If someone was looking to pirate the book there are way easier ways than issuing thousands of queries to ChatGPT. Type “Harry Potter torrent” into Google and you will have them all in 30 seconds.

@BURN@lemmy.world · 11 months ago

ChatGPT has a ton of extra query qualifiers added behind the scenes to ensure that specific outputs can’t happen