They use RLHF (reinforcement learning from human feedback), so any negative, biased, or rude responses should have been filtered out in training. That's the idea, anyway; obviously no system is perfect.
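To give a rough sense of what "filtered out in training" means here, a minimal toy sketch in Python: in real RLHF a learned reward model scores candidate replies based on human preference data and the policy is reinforced toward high-scoring ones. The scoring function and word list below are made-up placeholders, not any vendor's actual pipeline.

```python
# Toy sketch of RLHF-style filtering: a reward model ranks candidate
# replies so rude ones score poorly. The scorer here is a hypothetical
# stand-in (a word blacklist) for a real preference-trained neural net.

def reward_model(reply: str) -> float:
    """Hypothetical scorer: returns a lower reward for rude-sounding text."""
    rude_words = {"obviously", "idiot", "stupid"}
    penalty = sum(word in reply.lower() for word in rude_words)
    return 1.0 - 0.5 * penalty

def pick_best(candidates: list[str]) -> str:
    """During RLHF the higher-reward reply gets reinforced; here we just
    pick the top-scoring candidate to show the filtering effect."""
    return max(candidates, key=reward_model)

print(pick_best([
    "Obviously you should have read the docs, but fine, here's the fix.",
    "Here's the fix, plus a link to the relevant docs section.",
]))
```

The point of the sketch: the tone that survives training is whatever the reward model (i.e., the aggregated human raters) preferred, which is where the next question comes in.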
Then why are they all still smarmy assholes?
That's what was said: LLMs have been reinforced to respond exactly the way they do. In other words, that "smarmy asshole" attitude you describe was a deliberate choice. Why? Maybe that's what the creators wanted, or maybe that's what focus groups liked most.
ChatGPT is normal, maybe a bit too casual lately. Responses now read like "IKR, classic (software brand) doing that crazy thing they're known for."
But my last Copilot interaction had Copilot being a passive-aggressive dick in its responses.