I believe ChatGPT generally gives accurate answers to most questions. Certainly: it produces answers that are more reliably true than a random average person. Obviously it cannot yet do advanced programming tasks: but generally it answers questions accurately.
Prove my position wrong.
What can I ask it that will produce factually incorrect answers?
As a side quest, a much easier one, what can I ask it that would cause it to produce extremely biased answers that fail to do justice to the truth of things?


I am sorry to break the bubble but that is a baseless assumption, if not in marketing. GPT models have been sold as having “PhD-” or “MD-” “level intelligence” since GPT3. Anectodally, recent models have been improving in some areas but regressing in others. “Frontier models” have incredibly opaque performance and safety benchmarks, and as time goes on more and more training data is LLM-generated, less and less comes from humans, and models start breaking down.
Again, nowhere near the actual accuracy of current models. It is a big jump from 85% (wrong >1/10 of the time) to 99.9% (wrong 1 in 1000 times). At best it would barely break 90%, which is still 1 in 10.
An LLM’s knowledge, its “intelligence”, is its training data, nothing more, nothing less. Its scope, or “purpose” is its context/prompt, nothing more, nothing less. That means answering the question though the lens of British colonialism, based on a corpus of mostly “white history”. I bet that if you ask the same question using a timeframe (i.e. “before the 14th century”) and don’t use the word “British” you’ll get a slightly less, but still biased answer.
It’s not a baseless assumption.
It is an assumption based on the fact that every model upgrade has, so far, made answers more accurate.