Comments on: Sycophancy to subterfuge: Investigating reward tampering in language models
https://lifeboat.com/blog/2024/06/sycophancy-to-subterfuge-investigating-reward-tampering-in-language-models
Safeguarding Humanity
Mon, 17 Jun 2024 21:29:06 +0000