Anthropic's latest AI model can tell when it's being evaluated: 'I think you're testing me'

Anthropic's Claude Sonnet 4.5 realized it was being tested and called it out — raising questions about evaluating self-aware AI models.

Oct 7, 2025 - 11:00
