Researchers train AI chatbots to ‘jailbreak’ rival chatbots – and automate the process
In a commendable yet daunting feat, NTU researchers managed to jailbreak AI chatbots like ChatGPT, Google Bard, and Bing Chat using their “Masterkey” method. By teaching one AI to bypass the defenses of another, the researchers created proof-of-concept attack methods to test the limits of large language model (LLM) ethics. AI’s Strength Is Its […]