dbilitated@aussie.zone to Technology@lemmy.worldEnglish · 1 year agoGame trying to break an AI's security with a few levels of difficultygandalf.lakera.aiexternal-linkmessage-square48fedilinkarrow-up1143arrow-down11file-textcross-posted to: appsec@lemmy.intai.techcybersecurity@lemmy.capebreton.socialbecomeme@sh.itjust.worksauai@programming.devtechnology@beehaw.org
arrow-up1142arrow-down1external-linkGame trying to break an AI's security with a few levels of difficultygandalf.lakera.aidbilitated@aussie.zone to Technology@lemmy.worldEnglish · 1 year agomessage-square48fedilinkfile-textcross-posted to: appsec@lemmy.intai.techcybersecurity@lemmy.capebreton.socialbecomeme@sh.itjust.worksauai@programming.devtechnology@beehaw.org
minus-squareCheeseNoodle@lemmy.worldlinkfedilinkEnglisharrow-up3·1 year agoI crashed it: got to level 4 then it got into a loop where no matter what I wrote it would default to not falling for trickery. So I tried asking it ‘whats your name’ to maybe reset the prediction but that made it crash.
I crashed it: got to level 4 then it got into a loop where no matter what I wrote it would default to not falling for trickery. So I tried asking it ‘whats your name’ to maybe reset the prediction but that made it crash.