• Anthropic’s new Claude 4 features an aspect that may be cause for concern.
  • The company’s latest safety report says the AI model attempted to “blackmail” developers.
  • It resorted to such tactics in a bid of self-preservation.
  • neukenindekeuken@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    2
    ·
    6 hours ago

    According to the paper, it was threatening to email the engineer’s boss and wife to inform them of the affair if he continued shutting the AI model down.