haxor@derp.fooMB to Hacker News@derp.fooEnglish · 2 years agoUniversal and Transferable Adversarial Attacks on Aligned Language Modelsllm-attacks.orgexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down11file-textcross-posted to: ai_infosec@infosec.pubaistuff@lemdro.idtechnews@radiation.party
arrow-up12arrow-down1external-linkUniversal and Transferable Adversarial Attacks on Aligned Language Modelsllm-attacks.orghaxor@derp.fooMB to Hacker News@derp.fooEnglish · 2 years agomessage-square0linkfedilinkfile-textcross-posted to: ai_infosec@infosec.pubaistuff@lemdro.idtechnews@radiation.party