Hallucination is Inevitable: An Innate Limitation of Large Language Models (arxiv preprint)

OmnipotentEntity@beehaw.org · 8 months ago

Hallucination is Inevitable: An Innate Limitation of Large Language Models (arxiv preprint)

jmp242@sopuli.xyz · 8 months ago

Well, LLMs can and do provide feedback about confidence intervals in colloquial terms. I would think one thing we could do is have some idea of how good the training data is in a given situation - LLMs already seem to know they aren’t up to date and only know stuff to a certain date. I don’t see why this could not be expanded so they’d say something much like many humans would - i.e. I think bla bla but I only know very little about this topic. Or I haven’t actually heard about this topic, my hunch would be bla bla.

Presumably like it was said, other models with different data might have a stronger sense of certainty if their data covers the topic better, and the multi cycle would be useful there.

zygo_histo_morpheus@programming.dev · 8 months ago

The problem isn’t just that llms can’t say “I don’t know”, it’s also that they don’t know if they know something or not. Confidence intervals can help prevent some low-hanging fruit hallucinations but you can’t eliminate hallucinations entirely since they will also hallucinate about how correct they are about a given topic.