Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

  • OmegaLemmy
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 day ago

    Some ai models do have ‘thinking’ where they use your prompt to first generate a description use and what not for it to better generate the rest of the content (it’s hidden from users)

    That might’ve lead Claude to saying ‘fuck no, most common uses is in military?’ and shut you down