Navigate Select ESC Close

OpenAI Just SOLVED Hallucinations...

2025-09-08 Education
45.8k
2.1k
572
Wes Roth
Wes Roth
320.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Shownotes: https://natural20.com/openai-solved-hallucinations/ The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI. ______________________________________________ My Links 🔗 ➡️ Twitter: https://x.com/WesRothMoney ➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe Want to work with me? Brand, sponsorship & business inquiries: [email protected] Check out my AI Podcast where me and Dylan interview AI experts: https://www.youtube.com/@Wes-Dylan ______________________________________________ #ai #openai #llm

Top Comments (10)

@TangentFuture41 2025-09-09

The fact that humans will literally make stuff up and we sit here and wonder why the models make things up is kind of wild to me

253 27 replies
@tjn0110 2025-09-09

I respect an honest "I don't know" far more than a guess.

115 7 replies
@Will-kt5jk 2025-09-08

It reminds me of incentives for academic papers: - positive results = more citations - more citations = better prestige, opportunities & funding That discourages publishing of negative and null results, which in aggregate, have a lot of value & for exploring fringe areas which may be more likely to hit negative results (but have high value in narrowing down search space & potentially very high value if a positive result is found). No to mention the incentives to reach for (potentially flawed) positive results via biases. P-hacking etc.

100 2 replies
@eSKAone- 2025-09-08

key points: Why Language Models Hallucinate * Language models are incentivized to guess when they don't know the answer, as training gives a "thumbs up" for correct answers and a "thumbs down" for everything else, including "I don't know" answers [02:00]. * The training and evaluation procedures create a "natural statistical pressure" for models to hallucinate [03:09]. * There's no penalty for wrong answers, and the models are always in "test-taking mode" [11:43]. Confidence and Self-Certainty * Language models have an internal sense of confidence in their answers. Consistency in answers to the same question indicates their certainty [05:38]. * Models can be trained using their own confidence as a reward signal, a method called "reinforcement learning from internal feedback" [05:54]. The Role of Pre-Training and Post-Training * Base models will always have hallucinations because they're not trained to eliminate them [11:51]. * Post-training helps reduce hallucinations but doesn't eliminate them completely [09:36]. Solutions to Hallucinations * The solution is to reward models for expressing uncertainty, similar to how humans get social credit for admitting they don't know something [13:51]. * Updating benchmarks to reward expressions of uncertainty could significantly reduce hallucinations, as most popular benchmarks use binary grading [14:51]. Conclusion * The video suggests we've been approaching the problem from the wrong angle. A minor tweak in the training process could lead to a significant improvement [18:14]. * This new approach could be a breakthrough, although it might mean we see more "I don't know" answers from language models [16:31]. For more details, you can watch the video here: https://youtu.be/uesNWFP40zw?feature=shared YouTube video views will be stored in your YouTube History, and your data will be stored and used by YouTube according to its Terms of Service

70 9 replies
@raul_jocson_ 2025-09-09

This "everything to gain from guessing" dynamic can also be seen in human social systems where a person is penalized for saying "I don't know" but not penalized for making something up. In politics and corporate hierarchies, for example, a person is much better off making ambiguous or unverifiable claims rather than saying they don't know what's going on. Maybe we need to fix that too.

21
@PoffinScientist 2025-09-09

In a court of justice you'd better say 'I don't know' when you don't know instead of inventing an answer, or you'll be punished hard

14
@BrokeTheInterweb 2025-09-09

So we programmatically encouraged the Dunning-Kruger effect to save time 😆

6
@JohnDlugosz 2025-09-08

I did UIL academic "sports" in high school. Testing _was_ penalized for wrong answers, to compensate for guessing. We had to understand the rule to decide how many I needed to eliminate before guessing would come out ahead of leaving it blank.

5
@YJxAI 2025-09-09

I think competitive exams are a very good example you get 1 point for getting correct and 0 for not solving but if you get the the quesiton wrong you loose 0.25 from your already accumulated score. This penalizes hallucination but also enoourages to get more score.

5
@HarrisonLongmore-th3bk 2025-09-09

AI informing users of their limitations would be so useful. I can’t tell you the number of times a simple, “I’m sorry, I’m not sure I can code that,” or “If I do this, there is a very good chance you may lose your current code as I will need to recode everything,” would have saved me so much time.

4

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot