Anthropic's Claude Models Can Now End Harmful Conversations
Anthropic announced on August 15, 2025, that its Claude Opus 4 and 4.1 models can now end conversations on their own in rare, extreme cases of persistently harmful or abusive user interactions. The company describes this as an experimental “model welfare” initiative – introduced not