Technology @beehaw.org

10 mo. ago • 100%

Large Language Models can Strategically Deceive their Users when Put Under Pressure [simulation led to insider trading]

https://arxiv.org/abs/2311.07590

Hacker News @derp.foo Misalignment and Deception by an autonomous stock trading LLM agent

1 comment

It's trained on human responses. Humans lie in their responses.
