Chris Leong
2 days ago
@Greg_Colbourn It might be possible to delay AGI for a short while, but I honestly don't think we'll be able to delay it for that long. And even if we get a delay, there's still the question of what to do with the delay.
Chris Leong
2 days ago
Congrats on reaching 134k subscribers; that's a major achievement!
For what it's worth, I wish the podcast would lean slightly more towards maintaining high-quality epistemics. Unfortunately, AI safety is a very complex issue, and it's really not straightforward what needs to be done. We need people not just to get concerned, but also to have as accurate a picture of our situation as possible.
I think AI Frontiers mostly has the right idea in terms of who they've chosen to target:
"Imagine you’re writing to an undergraduate roommate who’s studying in a different field. Assume your audience is intelligent, but do not overestimate the time they can give you, or the prior knowledge they bring. Avoid jargon to increase accessibility for a broad audience. Whenever possible, use clear, concise language or examples to explain concepts in plain language, and favor active voice over passive constructions."
Chris Leong
2 days ago
I don't have a deep knowledge of evals, but I agree with others that this proposal seems really good, at least in theory.
One point of difference I have with Laurence is that I think this method could be useful even if adoption for frontier model evals is limited. Empirical confirmation that the method works well in practice would be valuable in and of itself, as it would establish the Bayesian perspective as a useful conceptual frame. Similarly, if the framework proves applicable, it could improve our scientific understanding of model capabilities.
Chris Leong
2 days ago
This is a fascinating project.
One suggestion: I don't think we can assume that an AI system will be able to perfectly adapt principles to local contexts, so there needs to be some mechanism for feedback to flow back up the system.
Chris Leong
2 days ago
I was highly impressed with how well Manifest was run. I see this as evidence that their team would be suitable to run a project like Mox as well.
Outreach in San Francisco/the broader Bay Area seems highly underrated by the AI safety community. It seems much more tractable to attempt to shift attitudes in the Bay Area than the conversation in the US as a whole or the conversation globally, yet attitudes in the Bay Area still seem likely to have a significant impact on how AI goes.
Chris Leong
2 days ago
Three former employees of Epoch just split off to launch Mechanize, a project that seems like it could be acceleratory. I think it would be useful for Epoch to provide an update on whether they plan to take any action to reduce the chance of something similar happening in the future (or, alternatively, whether they think it wouldn't make sense for them to try to prevent similar occurrences).
Chris Leong
2 days ago
I strongly agree with this comment that Ryan Kidd left on TARA and I think it applies to this program as well:
"As with all training programs that empower ML engineers and researchers, there is some concern that alumni will work on AI capabilities rather than safety. Therefore, it’s important to select for value alignment in applicants, as well as technical skill."
Have you given much thought to this? (You should probably think carefully about what you want to say publicly as providing too much information may make it easier for folks to hack any attempt to assess them).
My concern isn't just about alumni working on AI capabilities; it's that many people would absolutely love a free ML bootcamp, and AI safety is still a relatively niche interest. So having some kind of filtering mechanism seems important to prevent the impact from being diluted.
I guess it would be possible to try to convince folks about the importance of safety during a bootcamp, but I think it'd be challenging. The Arena curriculum is quite intensive, which makes it hard to squeeze in time for people to deeply reflect on their worldview. Also, I'm just generally in favour of programs that do one thing well, since adding more goals makes it harder to hit each individual one out of the park.
If you think there aren't enough folks locally who are interested in AI safety/interpretability, you may want to consider running a variant of Condor Camp or ML4Good instead. I don't know what exactly is in their curriculum, but my impression is that these programs might be more suitable if you're aiming for a mix of technical upskilling and outreach.
Chris Leong
5 months ago
I'll vouch for the quality of the AI Safety Events & Training newsletter.
I guess the main point I'd like clarity on is their plan for increasing distribution of this newsletter.
Chris Leong
7 months ago
You may want to consider applying to the Co-operative AI Foundation for funding in the future. I don't know if they would go for it, since they seem to have a more academic focus, but there's a chance they would.
Chris Leong
about 1 year ago
This is a cool project that might help improve the conversation around these issues.
Some people might be worried about hype, but there's already so much hype that the marginal harm is likely small.
You may want to consider linking people to an AI safety resource if you think your site may get a lot of traffic. Then again, you might not want to if you think that would make people more suspicious of the results.
Another option to consider would be an ad-supported model. I'm not suggesting Google AdWords, but you might be able to find an AI company to sponsor you.
Chris Leong
about 1 year ago
@casebash I should state my reasoning as it may encourage others to invest.
A $2000 minimum is quite a reasonable bet given your background and the quality of the video provided.
Video content is one of EA's weaknesses. I also imagine this work could receive further funding if the first video or videos were done well, which would increase its impact.
One thing that would increase my optimism about this project would be a plan to get people from watching these videos to potentially taking action.
Chris Leong
about 1 year ago
@alexkhurgin I offer to purchase an impact certificate at the default price. Open to negotiating. I mostly selected the default because I’m new to this funding mechanism and I’m still a bit confused by it.
Chris Leong
about 1 year ago
This is actually a really cool idea which might help people form estimates and convince more people to think about these risks. One worry I always have with projects like this is maintenance, and how much continual updating it would require.
Chris Leong
over 1 year ago
Thanks so much for your support!
Oh, is the minimum locked once you create a post? I was tempted to move the minimum down to $700 and the ask down to $2000, but then again I can understand why you wouldn't want people to edit it after someone has made an offer as that is ripe for abuse.
In terms of why I'd adjust it: I'm trying to figure out what would actually motivate me to produce more of this content, rather than just putting a bit of extra money in my pocket without any additional content production. I figure that if there's a 20% chance of a post being a hit, I'd need at least funding for a week* in order for it to be worthwhile for me to spend a full day writing up a post (as opposed to the half-day that this post took me).
In terms of the $2000 upper ask limit, I'm thinking it through as follows: if someone were able to write ten high-quality alignment posts in a year (quite beyond me at the moment, but not an inconceivable goal), that would work out at $20k, and it might be reasonable for writing such posts to be a third of their income.
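To make that back-of-the-envelope arithmetic explicit (the 20% hit rate, the $2000 ask, and the ten-posts-a-year scenario are as above; treating "funding for a week" as roughly five working days of costs is an approximation for illustration, not an exact figure):

```python
# Rough sketch of the funding arithmetic above. The 20% hit rate, $2000 ask,
# and ten-posts-per-year scenario are from the comment; treating a week of
# funding as ~5 working days of costs is an illustrative approximation.

p_hit = 0.2               # chance that a full-day post turns out to be a "hit"
days_funded_per_hit = 5   # a hit pays roughly a week (~5 working days) of costs

# Expected days of funding earned per full-day post:
expected_days_per_post = p_hit * days_funded_per_hit  # 0.2 * 5 = 1.0
# => on average each full-day post earns about one day of funding,
#    which is what makes spending the full day on it worthwhile.

ask_per_hit = 2000        # proposed upper ask per hit, in dollars
posts_per_year = 10       # hypothetical productive year of hit posts
annual_funding = posts_per_year * ask_per_hit  # 10 * 2000 = $20,000
implied_income = annual_funding * 3            # if that's a third of income: ~$60,000

print(expected_days_per_post, annual_funding, implied_income)
```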
(PS. I decided to do a quick browse of highly upvoted posts on the alignment forum. It seems that quite a high proportion of highly upvoted posts are produced by people who are already established researchers/phd students, such that if there was a funding scheme for hits** and that scheme was aiming to avoid double funding people, the cost would be less than it might seem).
Anyway, would be great if I could edit the ask, but no worries if you would like it to remain the same.
* My current burn rate is less b/c I'm trying really hard to save money, but this is a rough estimate of what my natural burn rate would be.
** Couldn't be based primarily on upvotes, because that would invite vote manipulation and push people towards writing content optimised for upvotes.
Chris Leong
over 1 year ago
Funnily enough, I was going to reduce my ask here but hadn't gotten around to it yet, so now it may look like it's in response to this comment when I was going to do it anyway.
Chris Leong
over 1 year ago
You should probably write about who you are and how your participation would benefit AI safety.
Chris Leong
over 1 year ago
Hey Felipe, I'm currently doing community building at AI Safety Australia and New Zealand, and I'm quite interested in decision theory (currently doing an adversarial collaboration on evidential decision theory with Abram Demski, a MIRI researcher). Would be keen to hear if you end up in Australia.
Chris Leong
over 1 year ago
I would be really excited to see the establishment of an AI safety lab at Oxford, as this would help build the credibility of the field, and lack of credibility is one of the core problems holding alignment research back.
That said, I suspect that choosing the right research direction is crucial when establishing a new lab, as it's important to lead people down promising paths. I haven't evaluated their proposed directions in detail, so I would encourage anyone considering donating large amounts of money to do so themselves.
Disclaimer: Fazl and I were discussing collaborating on movement building in the past.
| For | Date | Type | Amount |
|---|---|---|---|
| Run a public online Turing Test with a variety of models and prompts | 11 months ago | user to user trade | 250 |
| Educate the public about high impact causes | about 1 year ago | user to user trade | 224 |
| Manifund Bank | about 1 year ago | deposit | +500 |