Manifund foxManifund
Home
Login
About
People
Categories
Newsletter
HomeAboutPeopleCategoriesLoginCreate
laurence_ai avatarlaurence_ai avatar
Laurence Aitchison

@laurence_ai

Probabilistic machine learning at the University of Bristol

https://scholar.google.com/citations?user=DF9khKUAAAAJ
$32,000total balance
$0charity balance
$32,000cash balance

$0 in pending offers

About Me

I am a Lecturer (equiv to US Assistant Professor) at the University of Bristol. I currently work on a variety of topics including LLM efficiency, mechanistic interpretability and LLM evals. In the past, I have worked in computational neuroscience, and Bayesian machine learning.

Projects

Bayesian modelling of LLM capabilities from evals

Comments

Bayesian modelling of LLM capabilities from evals
laurence_ai avatar

Laurence Aitchison

5 months ago

Thanks Neel! In response to your comments:

A method is only useful if people actually use it. Agreed. The nice thing about this approach is that there's a bunch of different applications, and we're pretty sure at least one will get traction. These applications are:

  • Uncertainty estimation for LLM evals.

  • Identifying and understanding LLM capabilities.

  • Forecasting capabilities.

  • Active learning (finding a smaller set of benchmarks that capture a lot of information about capabilities).

  • Finding signals of contamination / sandbagging.

Getting data is expensive. That's part of the reason we're asking for money for compute. But lots of people run extensive LLM benchmarking and we're trying hard to leverage all that work. At the moment, we're working with the Hugging Face Benchmarking Team, who have very extensive benchmarking results.

List of latent factors. We don't start by hand-labelling the capabilities. We're going to infer capabilities using e.g. a sparse prior. Then we post-hoc interpret the resulting inferred capabilities. The resulting workflow very much resembles that for VAEs.

Transactions

ForDateTypeAmount
Bayesian modelling of LLM capabilities from evalsabout 1 month agoproject donation+18500
Bayesian modelling of LLM capabilities from evalsabout 1 month agoproject donation+13500