Google just announced it’s giving website publishers a way to opt out of having their data used to train the company’s AI models while remaining accessible through Google Search.

many sites have moved to block the web crawler that OpenAI uses to scrape data and train ChatGPT. However, there have been concerns over how to block out Google. After all, websites can’t close off Google’s crawlers completely, or else they won’t get indexed in search.

  • picnicolas@slrpnk.net
    link
    fedilink
    arrow-up
    8
    ·
    1 year ago

    Nice of these AI companies to add opt out capability after they’ve trained their models on all available data.

  • SheeEttin@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 year ago

    This doesn't change anything. They're still violating copyright law if they scrape any data.

    For search results, nobody really cared, because it was useful to everyone. Now they're reprocessing it into output and competing with the original sites. That's a big no-no.

  • Nina@lemmy.ml
    link
    fedilink
    English
    arrow-up
    4
    ·
    1 year ago

    This overall bullshit of doing opt-out fucking sucks. Opt-IN! Pissing off people and the. going "oh well there's an opt-out" is so stupid. New experimental features that don't directly benefit the user should be that they must have to opt-in manually. Off by default.

    • Ubermeisters@lemmy.zip
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      1
      ·
      1 year ago

      When a company gives you a bunch of free services that work pretty well and everybody uses them, they get a little cocky