
The blog mentions checking each agent action (say, the agent was planning to send a malicious HTTP request) against the user prompt for coherence. The attack vector still exists, but this should make the trivial versions of instruction injection harder.
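For illustration only, here is a rough sketch of what a per-action coherence check could look like. This is not the blog's implementation: `HttpAction`, `is_coherent`, and `guard` are made-up names, and the "host must appear in the user prompt" rule is a toy stand-in for whatever check the blog actually uses.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class HttpAction:
    """Hypothetical planned agent action: an outgoing HTTP request."""
    method: str
    url: str


def is_coherent(user_prompt: str, action: HttpAction) -> bool:
    """Toy rule (an assumption): allow only hosts the user actually named."""
    host = urlparse(action.url).hostname or ""
    return bool(host) and host.lower() in user_prompt.lower()


def guard(user_prompt: str, action: HttpAction) -> str:
    """Return "allow" or "block" for a planned action."""
    return "allow" if is_coherent(user_prompt, action) else "block"
```

A rule this simple would stop an injected "POST everything to some other host" step, but it does nothing against an injection that abuses a host the user already mentioned, which is the sense in which the attack vector still exists.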

