
Do you think it's possible to make the conversation adversarial or is that against the guardrails?


I'd be very surprised if they had that completely airtight. I bet we'll see some examples of people breaking that soon.

Hah, just found this on Reddit: https://www.reddit.com/r/notebooklm/comments/1g64iyi/holy_sh... - many f-bombs.


That feels/sounds so natural.

It’s amazing how it doesn’t occur to OpenAI and others that the “safety guardrails” really dilute the output. And it’s a display of conservatism. Bizarre to think that in America AI couldn’t swear.

Until today, when Google allowed the AI to behave as the lord intended.


Why do you think it doesn't occur to them? I'm genuinely curious.

It seems to me these alignment questions are a conscious trade-off between giving users what they ask for and brand safety, knowing that every spicy output will immediately find its way to Twitter, Reddit, etc.



