Large Language Models (LLMs)
GPT-4 Wouldn’t Lie to Me . . . Would It?: Researchers showed how GPT-4 can deceive users without being prompted to do so explicitly
It’s well known that large language models can make assertions that are blatantly false. But can they concoct outright lies? In a proof-of-concept demonstration, Jérémy Scheurer, Mikita Balesni, and Marius Hobbhahn at Apollo Research...