Hacker News

But with LLMs is there really more to understand? They’re just large functions that take numerical input and transform it into numerical output based on trained weights. There is nothing behind the scenes doing things we don’t understand. The magic is in the weights, and we know how to create these based on training data.
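To make the "just a large function" view concrete, here is a minimal sketch of that idea: a toy next-token model as a pure function from a token id to a probability distribution, using made-up weights (all names and numbers here are illustrative, not from any real model):

```python
import math

# Hypothetical toy "LLM" as a pure function: token id in, probabilities out.
# The weights below are arbitrary; a real model just scales this idea up
# to billions of parameters and many stacked layers.

W_EMBED = [[0.1, 0.2], [0.3, 0.1], [0.2, 0.4]]   # 3-token vocab, 2-dim embeddings
W_OUT   = [[0.5, 0.1, 0.2], [0.3, 0.4, 0.1]]     # project embedding back to vocab logits

def softmax(xs):
    m = max(xs)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def next_token_probs(token_id):
    # embed -> linear -> softmax: deterministic arithmetic on fixed weights
    h = W_EMBED[token_id]
    logits = [sum(h[i] * W_OUT[i][j] for i in range(len(h)))
              for j in range(len(W_OUT[0]))]
    return softmax(logits)

probs = next_token_probs(0)  # a well-defined distribution summing to 1
```

Every step is ordinary arithmetic, which is the commenter's point; the interpretability question is what the learned weight values collectively compute, not the mechanics of the arithmetic.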

Regarding the car: if you know how to build a car, you understand how a car works. A driver is more like someone using an LLM, not a developer able to create an LLM.



I’ve never seen the word “just” take on more load than it did in that second sentence!

Sorry to inform everybody doing their Ph.D. on LLM interpretability that they’re just wasting their time.


> But with LLMs is there really more to understand?

Yes! loads! (: I want to be able to say statements like "this model will never ask the user to kill themselves" and be confident, but I can't do that today, and we don't know how. Note that we do know how to prove similar statements for regular software.
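A minimal sketch of what "proving" can mean for regular software (my own illustration, not from the thread): when a function is total over a finite input space, an output property can be verified exhaustively, which is a proof rather than a test sample. Nothing comparable exists today for a claim about all possible LLM outputs.

```python
# Toy example: prove "respond never returns anything outside ALLOWED"
# by checking every possible input. The function and allowlist are
# hypothetical, chosen only to keep the domain small and finite.

ALLOWED = {"ok", "retry", "error"}

def respond(code: int) -> str:
    # total function over the byte range 0..255
    if code == 0:
        return "ok"
    elif code < 100:
        return "retry"
    return "error"

# Exhaustive verification over the entire input domain: this either
# passes for all 256 inputs (a proof of the property) or pinpoints
# a counterexample.
assert all(respond(c) in ALLOWED for c in range(256))
```

For infinite input domains the same guarantee comes from type systems or model checkers rather than enumeration, but the point stands: for ordinary code we have tools that rule out a behavior entirely, while for an LLM's output space we currently do not.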



