You can persuade AI models to accept falsehoods as truth, study shows
- Written by Ashique KhudaBukhsh, Assistant Professor of Computing and Information Sciences, Rochester Institute of Technology
You can make AI chatbots spout information that's not true. Nicoletaionescu/iStock via Getty Images

When you ask a large language model a question, the reply may include falsehoods, and if you challenge those statements with facts, the AI may still uphold the reply as true. That’s what my research group found when we asked five leading models...

