Brian McCallion
A mechanistic, testable framework explaining LLM failure modes via boundary writes and attractor dynamics
Christopher Kuntz
A bounded protocol audit and implementation-ready mitigation for intent ambiguity and escalation in deployed LLM systems.
Jasraj Hari Krishna Budigam
Reusable, low-compute benchmarking that detects data leakage, outputs “contamination cards,” and improves calibration reporting.
Centre pour la Sécurité de l'IA
Leveraging 12 Nobel signatories to harmonize lab safety thresholds and secure an international agreement during the 2026 diplomatic window.
Mirco Giacobbe
Developing the software infrastructure to make AI systems safe, with formal guarantees
Gergő Gáspár
Help us solve the talent and funding bottleneck for EA and AIS.
Evžen Wybitul
Chris Canal
Enabling rapid deployment of specialized engineering teams for critical AI safety evaluation projects worldwide
David Rozado
An Integrative Framework for Auditing Political Preferences and Truth-Seeking in AI Systems
Orpheus Lummis
Seminars on quantitative/guaranteed AI safety (formal methods, verification, mech-interp), with recordings, debates, and the guaranteedsafe.ai community hub.
Rufo Guerreschi
Persuading a critical mass of key potential influencers of Trump's AI policy to champion a bold, timely and proper US-China-led global AI treaty
Justin Olive
Funding to cover our expenses for 3 months during unexpected shortfall
Sam Nadel
Experimental message testing and historical analysis of tech movements to identify how to effectively mobilize people around AI safety and governance
Leo Hyams
A 3-month fellowship in Cape Town, connecting a global cohort of talent to top mentors at MIT, Oxford, CMU, and Google DeepMind
Nicole Mutung'a
Funding research on how AI hype cycles can drive unsafe AI development
Armon Lotfi
Multi-agent AI security testing that reduces evaluation costs by 10-20x without sacrificing detection quality
Cillian Crosson
$200k in 1:1 matched funding to support reporting on AI.
Michaël Rubens Trazzi
Funding gap to pay for a video editor and scriptwriter
20 Weeks Salary to reach a neglected audience of 10M viewers
Pedro Bentancour Garin
Building the first external oversight and containment framework + high-rigor attack/defense benchmarks to reduce catastrophic AI risk.