economics Practical Magic

Breaking AI safety rules isn't about how long you try—it's about how smart you are. Experts can do it in four turns.

March 26, 2026

Original Paper

Geometric Detection of Cognitive Collapse: Empirical Evidence from Real-Time State Monitoring

Giuseppe Canale

SSRN · 6197359

The Takeaway

This challenges the assumption that 'trying long enough' will eventually bypass AI safety filters. It proves that AI 'cognitive collapse' is a precise geometric vulnerability that can be predicted and detected before the harm actually occurs, rather than something that can be stopped by simple keyword filters.

From the abstract

Following our theoretical proof that syntax-based defenses cannot secure RLHF-aligned agents [1], we present empirical validation of an alternative approach: real-time monitoring of cognitive state through continuous geometric metrics. We demonstrate that adversarial manipulation manifests as measurable phenomena in model behavior-authority deference patterns, entropy saturation, contextual drift-detectable through text analysis alone. Testing across 500+ controlled experiments on state-of-the-a