Gergő Gáspár
Help us solve the talent and funding bottleneck for EA and AIS.
Miles Tidmarsh
Training AI to generalize compassion for all sentient beings using pretraining-style interventions as a more robust alternative to instruction tuning
Dr. Jacob Livingston Slosser
Help get the Sapien Institute off the ground
Martin Percy
An experimental AI-generated sci-fi film dramatising AI safety choices. Using YT interactivity to get ≈880 conscious AI safety decisions per 1k viewers.
Pedro Bentancour Garin
Building the first external oversight and containment framework, plus high-rigor attack/defense benchmarks, to reduce catastrophic AI risk.
Agwu Naomi Nneoma
Funding a Master's in AI, Ethics & Society to transition into AI governance, long-term risk mitigation, and safety-focused policy development.
Chris Canal
Enabling rapid deployment of specialized engineering teams for critical AI safety evaluation projects worldwide
Jade Master
Developing correct-by-construction world models for verification of frontier AI
David Rozado
An Integrative Framework for Auditing Political Preferences and Truth-Seeking in AI Systems
Adam Morris
Train LLMs to accurately & honestly report on their internal decision-making processes through real-time introspection
Orpheus Lummis
Seminars on quantitative/guaranteed AI safety (formal methods, verification, mech-interp), with recordings, debates, and the guaranteedsafe.ai community hub.
Armon Lotfi
Multi-agent AI security testing that reduces evaluation costs by 10-20x without sacrificing detection quality
Justin Olive
Funding to cover our expenses for 3 months during an unexpected shortfall
Adam "Poe" Wilson
Preventing misalignment caused by investors and nation-state backers
Michaël Rubens Trazzi
Funding gap to pay for a video editor and scriptwriter
Leo Hyams
A 3-month fellowship in Cape Town, connecting a global cohort of talent to top mentors at MIT, Oxford, CMU, and Google DeepMind
Chris Wendler
Help fund our student’s trip to NeurIPS to present his main conference paper on interpretable features in text-to-image diffusion models.
Thane Ruthenis
Research agenda aimed at developing methods for constructing powerful, easily interpretable world-models.
Aditya Arpitha Prasad
Practicing Embodied Protocols that work with Live Interfaces