Bayesian modelling of LLM capabilities from evals | Manifund