Lavalamp too hot (discuss.tchncs.de, image)
swiftywizard@discuss.tchncs.de to Programmer Humor@programming.dev · 6 hours ago · 20 comments · +194 / −6
Alex@lemmy.ml · 5 hours ago · +41
If you have ever read the "thought" process of some of the reasoning models, you can catch them going into loops of circular reasoning, just slowly burning tokens. I'm not even sure this isn't by design.
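(Aside: a toy sketch of what "catching a loop" could look like, purely illustrative and assuming nothing about how any real model or provider works — just flagging a trace when the same n-gram keeps recurring in a recent window of tokens.)

```python
# Toy circular-reasoning detector: a reasoning trace that keeps
# restating the same phrase will repeat the same n-grams over and over.
# Hypothetical heuristic only; not any real model's or vendor's method.
from collections import Counter

def looks_circular(tokens, n=4, window=40, threshold=3):
    """True if any n-gram repeats `threshold`+ times in the last `window` tokens."""
    recent = tokens[-window:]
    ngrams = Counter(tuple(recent[i:i + n]) for i in range(len(recent) - n + 1))
    return any(count >= threshold for count in ngrams.values())

# A trace stuck in a loop: "x depends on the answer, the answer depends on x..."
loopy = ("the answer depends on x , but x depends on the answer , so " * 5).split()
steady = [str(i) for i in range(60)]  # no repetition at all

print(looks_circular(loopy))   # True
print(looks_circular(steady))  # False
```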
swiftywizard@discuss.tchncs.de (OP) · 5 hours ago · +26
I dunno, let's waste some water.
SubArcticTundra@lemmy.ml · 4 hours ago · +5
I'm pretty sure training is purely result-oriented, so anything that works goes.