@rubyaftermidnight Hi! That's great feedback, thanks so much for the detailed comment!!
To be clear, I think the version you were trying was one of the earliest iterations. What I wanted to show at that point was just the idea of "tagging" the reasons why the system thinks something is / is not AI — I didn't expect it to perform well, since it was a super early experiment. My bad for not being clearer about that. I've since added a disclaimer to the website that I think helps, would that work for you?
> This system is a very early experimental prototype. Do NOT trust the model's predictions for real-world decisions; we've trained it for very few steps. Also keep in mind that we iterate frequently, so any outputs you see here may change significantly as development continues. Our goal is to showcase how a real system could work — for example, by showing the annotated reasons behind each prediction — but you should not expect it to perform well, at least for now.
Since then, I've been working on better training — including longer runs — and I think the system is much better (and more nuanced) now. Would you have some time to test it out and let me know if you think this iteration is better? (I added some examples to the website if you want inspiration.) :)