The problem is that these guardrails are trivially bypassed. At best, you end up on a treadmill, playing a losing game against adversarial prompting.