Skip Navigation

Large Language Models can Strategically Deceive their Users when Put Under Pressure [simulation led to insider trading]

https:// arxiv.org /abs/2311.07590
1