Tom McGrath
Find the best settings we can for SAE training, then scale across models
Oliver Habryka
Funding for LessWrong.com, the AI Alignment Forum, Lighthaven and other Lightcone Projects
Dan Hendrycks
LAUREL
Organizing a global AI ethics think tank that provides ongoing AI research updates, a framework for implementing AI safety policies, and income support for humanity
Charbel-Raphael Segerie
Glen M. Taggart
Rapid iteration on possible alternative architectures and training techniques
Open Paws
Building ethical AI for all sentient beings, fostering safer AI practices for humans and non-human animals.
Thane Ruthenis
Remmelt Ellen
Apart Research
Incubate AI safety research and develop the next generation of global AI safety talent via research sprints and research fellowships
Kabir Kumar
Ryan Kidd
Help us support more research scholars!
Alexander Pan
Kunvar Thaman
Zhonghao He
Surveying neuroscience for tools to analyze and understand neural networks and building a natural science of deep learning
Dusan D Nesic
Free, subsidized, or cheap office space outside the EU, in good timezones, with favorable visa policies (especially for Chinese and Russian citizens, but also others).
Lawrence Chan
3 months
Luan Rafael Marques de Oliveira
Support to translate BlueDot Impact’s AI alignment curriculum into Brazilian Portuguese, for use in university study groups and an online course
Louis S. Berman
AI-Risk Education for Politicians
Karpov
I plan to investigate what realistic RL training conditions might lead to LLMs developing steganographic capabilities.