Making AI safer for victims of intimate partner violence
Conversational AI tools denied blunt requests for harmful content by researchers posing as intimate partner abusers, but these guardrails were easily circumvented when they requested the content under ...
16 Apr 16:40 · Tech Xplore