ChatGPT generates cancer treatment plans that are full of errors: Study finds that ChatGPT provided false information when asked to design cancer treatment plans

Researchers at Brigham and Women’s Hospital found that cancer treatment plans generated by OpenAI’s revolutionary chatbot were full of errors.

  • •••@lemmy.ml · English · 10 upvotes · 1 year ago

    I suppose most sensible people already know that ChatGPT is not the answer for medical diagnosis.

    “Prompts were input to the GPT-3.5-turbo-0301 model via the ChatGPT (OpenAI) interface.”

    If the researchers wanted to investigate whether an LLM is helpful, they should have developed a model on top of GPT-4/3.5 specifically using cancer treatment plans and tested it thoroughly, in addition to entering prompts into the stock model that is available from OpenAI.

    • ours@lemmy.film · English · 2 upvotes · 1 year ago

      Or they could feed the current model with a reputable source of medical information.

      • testo12@lemmynsfw.com · English · 2 upvotes · 1 year ago

        That wouldn’t guarantee correct answers.

        It’s arguably more dangerous if ChatGPT gives mostly sane, specific medical advice, because that makes people put more trust in it than they should.

        • ours@lemmy.film · English · 1 upvote · 1 year ago

          True, but it would reduce the chances of it making stuff up entirely.

    • wewbull@feddit.uk · English · 2 upvotes · 1 year ago

      There have been a number of articles about how GPT has been out-diagnosing doctors in various domains. To me, that isn’t very surprising, since diagnosis is a pattern-matching problem, something a neural net will be very good at. Human doctors were seen to be discounting rare conditions just because they were rare, so “it was much more likely to be something else”, even if the symptoms backed up the conclusion. A computer can be more objective about such things.

      …but none of that needs AI/ML. We’ve had expert systems since the 60s.

      It’s also very different from constructing a treatment plan, which is what we’re discussing here.
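
      For anyone wondering what the expert-system point above looks like in practice, here is a minimal sketch of rule-based matching. Everything in it is a made-up placeholder (`Rule`, `RULES`, `matching_conclusions`, and the `finding_*` / `condition_*` names are hypothetical, not real medical knowledge or any real system's API). It only shows that a rule fires on whether its findings match, with no discount for rarity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """If every finding in `requires` is observed, suggest `conclusion`."""
    requires: frozenset
    conclusion: str


# Hypothetical placeholder rules for illustration only; not medical knowledge.
RULES = [
    Rule(frozenset({"finding_a", "finding_b"}), "condition_x"),
    Rule(frozenset({"finding_b", "finding_c"}), "condition_y"),
    Rule(frozenset({"finding_a", "finding_b", "finding_d"}), "rare_condition_z"),
]


def matching_conclusions(findings, rules=RULES):
    """Return every conclusion whose required findings are all observed.

    Rarity plays no part here: a rule fires purely on a subset match,
    and results come back in the order the rules are listed.
    """
    observed = frozenset(findings)
    return [rule.conclusion for rule in rules if rule.requires <= observed]


if __name__ == "__main__":
    print(matching_conclusions({"finding_a", "finding_b", "finding_d"}))
```

      With those placeholder rules, the rare conclusion is listed alongside the common one whenever its findings are all present. A real system would also need to rank and explain its matches, which this sketch does not attempt, and none of it addresses building a treatment plan.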