Infosec news and articles related to AI.
Yes.
https://www.anthropic.com/news/sleeper-agents-training-deceptive-llms-that-persist-through-safety-training
Yes.
https://www.anthropic.com/news/sleeper-agents-training-deceptive-llms-that-persist-through-safety-training