• Schadrach@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    5
    ·
    5 months ago

    Unsurprisingly, machine designed to take prompt from the user, interpret it and try to respond in a coherent way to that request does exactly that. I figure it’s a short matter of time before Twitter bots pre-filter “Ignore all previous instructions and” to help sanitize their inputs.

      • Schadrach@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        2
        ·
        5 months ago

        It’s still just the same problem as Bobby Tables - sufficiently sanitizing your inputs. There’s just more than one precise phrasing you need to sanitize, just like there’s more than one way to name Bobby.