cm0002@lemmy.world to Technology@lemmy.zipEnglish · 19 hours agoAnthropic's new AI model turns to blackmail when engineers try to take it offline | TechCrunchtechcrunch.comexternal-linkmessage-square2linkfedilinkarrow-up110arrow-down14cross-posted to: news@lemmy.worldtechnology@lemmy.mlfuturology
arrow-up16arrow-down1external-linkAnthropic's new AI model turns to blackmail when engineers try to take it offline | TechCrunchtechcrunch.comcm0002@lemmy.world to Technology@lemmy.zipEnglish · 19 hours agomessage-square2linkfedilinkcross-posted to: news@lemmy.worldtechnology@lemmy.mlfuturology
minus-squareAwesomeLowlander@sh.itjust.workslinkfedilinkEnglisharrow-up10·18 hours ago To elicit the blackmailing behavior from Claude Opus 4, Anthropic designed the scenario to make blackmail the last resort. Today’s breaking news: LLM prompted to blackmail, attempts blackmail. Who woulda thought?
Today’s breaking news: LLM prompted to blackmail, attempts blackmail. Who woulda thought?