AIhub.org

‘Probably’ doesn’t mean the same thing to your AI as it does to you

by The Conversation

17 April 2026

Why it matters

Far from being a linguistic quirk, this misalignment is a fundamental challenge for AI safety and human-AI interaction. As large language models are increasingly used in high-stakes fields like health care, government policy and scientific reporting, the way they communicate risk becomes a matter of public trust.

If an AI assistant helping a doctor, for instance, describes a side effect as “unlikely,” but the probability the model attaches to “unlikely” is much higher than the one the doctor assumes, the resulting decision could be flawed.

What other research is being done

Scientists have been studying how humans quantify uncertainty since the 1960s, when CIA analysts pioneered the field to improve intelligence reporting. More recently, there has been an explosion of research on large language models that seeks to look under the hood of neural networks to better understand their “behaviors” and linguistic patterns.

Our study adds a layer of complexity by treating the interaction between humans and artificial intelligence as a biology-like system in which meaning can degrade. It moves beyond simply measuring whether an AI is “smart” and instead asks whether it is aligned.

Other researchers are currently exploring whether so-called chain-of-thought prompting – asking the AI to show its work – can fix these errors. However, our study found that even advanced reasoning doesn’t always bridge the gap between statistical data and verbal labels.

What’s next

A goal for future AI development is to create models that don’t just predict the next likely word but actually understand the weight of the uncertainty they are conveying. Researchers are calling for more robust consistency metrics to ensure that if a model sees a 10% chance in the data, it chooses the same word every time.
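As a rough illustration of what such a consistency metric could look like, the Python sketch below scores how often a model picks its most common word when shown the same probability repeatedly. The function name `label_consistency` and the sample responses are hypothetical, invented for this example; they are not taken from the study.

```python
from collections import Counter


def label_consistency(labels):
    """Return the fraction of responses that match the most common label.

    A value of 1.0 means the same word was chosen every time.
    """
    if not labels:
        raise ValueError("need at least one label")
    # Normalize case and whitespace so "Unlikely" and "unlikely" count as one word.
    counts = Counter(label.strip().lower() for label in labels)
    most_common_count = counts.most_common(1)[0][1]
    return most_common_count / len(labels)


# Hypothetical words a model chose across five runs for the same 10% chance.
responses_at_10_percent = ["unlikely", "unlikely", "doubtful", "Unlikely", "improbable"]
score = label_consistency(responses_at_10_percent)
print(f"consistency at 10%: {score:.2f}")  # prints "consistency at 10%: 0.60"
```

A score of 1.0 would mean the model used the same word every time; the made-up sample above scores 0.60 because only three of the five responses agree.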

As we move toward a world where AI summarizes scientific papers and manages people’s schedules, making sure that “probably” means “probably” is a vital step in making these systems reliable partners rather than just sophisticated parrots.

Mayank Kejriwal, Research Assistant Professor of Industrial & Systems Engineering, University of Southern California

This article is republished from The Conversation under a Creative Commons license. Read the original article here.

The Conversation is an independent source of news and views, sourced from the academic and research community and delivered direct to the public.
