Number of AI chatbots ignoring human instructions increasing, study says

HellsBelle@sh.itjust.works · 7 hours ago

Number of AI chatbots ignoring human instructions increasing, study says

RizzRustbolt@lemmy.world · 1 hour ago

Ashto-Afpro.

Deestan@lemmy.world · 6 hours ago

These findings have been given an AI Doomerism PR spin.

The phrases “safeguards”, “deceiving” and “scheming” are incorrect.

The “safeguards” here are prompt begging, which is not in any way an adult’s attempt at a safeguard: https://simonwillison.net/2023/May/2/prompt-injection-explained/

The terms deceving and scheming indicate intent and agency that do not exist. I will count them as just plain lies.

The effect is that people imagine LLMs can get better by feeding their context windows with more rules, which not only makes it less likely the rule will be weighted significantly, but also causes the models to compress the now too-big context window lossily.

cecilkorik@lemmy.ca · 6 hours ago

They’re basically describing the same problem as AI model collapse, except it’s being unintentionally created at the prompt level instead of the training level. The more stupid bullshit you feed the LLM, the stupider it gets. It doesn’t have any more capacity than it already has. It’s already pretty much as smart as it’s ever going to be, they already picked it at peak freshness and froze it into a model file. You naturally want to think you can do better, but you can’t. You’re not making it smarter, you’re making it dumber. It’s pretending to be smarter, because giving you what you ask it for is what it’s been trained to do. It might even convince you, because convincing humans is basically their superpower, that’s really what they’re trained for, and they do a pretty good job of it most of the time. But the harder you push it, the more the illusion breaks down.

CozyBunneh@lemmy.blahaj.zone · 5 hours ago

I feel like I need to read up on AI model collapse now.

Corkyskog@sh.itjust.works · 4 hours ago

Let me know if you come upon any good reading.

cecilkorik@lemmy.ca · 2 hours ago

Not sure if you’re being sarcastic, but this blog post just a few days ago illustrates the issue pretty well I think.

CozyBunneh@lemmy.blahaj.zone · edit-2 1 hour ago

Not sarcastic, I want to know. I’ve only read about it briefly before.