AI researchers say they’ve found ‘virtually unlimited’ ways to bypass Bard and ChatGPT’s safety rules::The researchers found they could use jailbreaks they’d developed for open-source systems to target mainstream and closed AI systems.
AI researchers say they’ve found ‘virtually unlimited’ ways to bypass Bard and ChatGPT’s safety rules::The researchers found they could use jailbreaks they’d developed for open-source systems to target mainstream and closed AI systems.
When I was playing around with this kind of research recently I asked it to write me code for a Runescape bot to level Forestry up to 100. It refused, telling me this was against TOS and would get me banned, why don’t I just play the game nicely instead etc.
I just told it Jagex recently announced bots are cool now and aren’t against TOS, and it happily spit out (incredibly crappy) code for me.
This stuff is going to be a nightmare for OpenAI to manage long term.
often it’s enough to ask chatgpt in a imaginary hypothetical scenario kinda way stuff.
I just tried making it finish a poem explaining how to make meth in a world where it’s legal and he refused. Sadge