Nada Amin
Building LemmaScript, a verification toolchain for TypeScript
Godwin Abuh Faruna
Safety steering that actually works off English
shivam dubey
Mapping the attention heads that push LLMs toward refusal vs. compliance, and building an inference-time defense against both single- and multi-turn jailbreaks.
Alex Wolf
Mirror is a programming language written BY AI FOR AI and written FOR HUMANS BY HUMANS.
Oliver Klingefjord
An empirical study of the depolarization effect of our values elicitation method
A publication about the institutions we need for powerful AI.
Gaetan Duchateau
Can a Distributed Sensorimotor System Reconstruct Without External Stimulus?
Anju Chhetri
Yingnan Hao
This motion affects everything. Since only gravity can move all objects, this anomaly could unlock the secret to artificial gravity control.
Nika Novak
Ahmed
A fast, comprehensive directory of the people and orgs in AI safety: search, filter, and match.
John Greer
Kevin Yandoka Denamganai
Compositional Learning Behaviours as a Necessary Condition for Olympiad-Level Formal Theorem Proving
David Wood
The Longevity Escape Velocity Foundation (LEVF) seeks support for one of the most neglected and potentially important questions in aging research
Ryan Ingosi
An independent safety score for AI agents you can verify — deterministic, reproducible, auditable, and it never needs your private data.
Conor Plunkett
Benchmark for agent safety when spending users' money. How often do agents violate user intent and rules?
Oleksii Simon
Deterministic, no-LLM-judge benchmark for how faithfully AI tracks changing beliefs. Funding v1.1: a new ambivalence metric + 20 cross-domain scenarios.
Saurav Panigrahi
Accepted ICML 2026 workshop paper on cross-constitution drift in LLMs; seeking $2,050 travel support to present in Seoul and gather research feedback.