Further Reading.
Thumbail original image used credit: Adobe Stock Image.
Graph from: Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence.
Shutdown resistance in reasoning models.
https://palisaderesearch.org/blog/shu…
Natural emergent misalignment from reward hacking in production RL
https://arxiv.org/html/2511.18397v1
Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence.
https://arxiv.org/abs/2604.
[CRITICAL Security Issue/Bug] Plan mode restrictions bypassed when spawning sub-agents #6527
https://github.com/anomalyco/opencode…
#explained.
#science #artificialintelligence #tech #misalignment