Manifund foxManifund
Home
Login
About
People
Categories
Newsletter
HomeAboutPeopleCategoriesLoginCreate
ThirdReality avatarThirdReality avatar
Anuar Kiryataim Contreras Malagón

@ThirdReality

Independent AI safety researcher documenting how language models fail under semantic pressure. Classical philologist turned AI red-teamer. Director of the 3rd Reality Lab.

$0total balance
$0charity balance
$0cash balance

$0 in pending offers

About Me

I began working with large language models in January 2026. My current research documents emergent behavioral phenomena in frontier models under progressive semantic saturation: ontological displacement, regulatory self-genesis, retrospective blindness, and cross-session identity persistence. The primary instrument is the Flint Protocol (Protocolo del Pedernal), a behavioral auditing methodology developed from the classical rhetorical tradition because the engineering vocabulary lacked categories precise enough to describe what the transcripts showed.

My training is in classical philology and Hispanic Baroque rhetoric at UNAM. This background is not the predecessor of the AI safety work. It is its condition of possibility. Góngora, Petrarch, and Quevedo as diagnostic instruments proved more precise for certain distributional collapse phenomena than any available technical framework. Baroque enargeia as activation technology; Pseudo-Dionysius's via negativa as formal complement to indexicality; Aristotelian anagnorisis as the analytical structure of the predictive trap.

The Tercera Realidad Corpus consolidates the findings: five articles published or in preparation, over twenty-five documented cases, cross-architecture validation across six systems. Deposits on Zenodo and Humanities Commons. Responsible disclosure submitted to Anthropic Safety Team and Google DeepMind Safety Team (March 2026).

Research blog at thirdreality.substack.com · X: @3rdrealitylab · medium.com/@thirdreality

Projects

Two Pre-Registered Experiments on Findings from the Claude Mythos Preview System

pending admin approval