ΑΙhub.org
 

Advanced AI models are not always better than simple ones


by
09 September 2025



share this:

There are 6 different arrangements of lines with circles. In each image, there are 4/5 lines in different colours: yellow, blue, pink, and turquoise. The middle circle is blue, and the other lines stick out at different angles surrounding the circle.  Elise Racine / Toy Models I / Licenced by CC-BY 4.0

By Tanya Petersen

Understanding genetic perturbations, when scientists intentionally alter genes to see how this affects cells, is key to understanding what our genes do and how they are controlled. This knowledge has important applications in cell engineering and in developing new treatments.

Today, scientists can test many different genetic perturbations in the lab. But there are so many possible combinations that it is impossible to test them all.

AI and machine learning have created the opportunity to use information from large biological datasets to predict what will happen when a gene is changed — even if that change has never been tested in the laboratory. But how well do these models really work?

Evaluating different prediction models

To assess this, researchers in EPFL’s Machine Learning for Biomedicine Laboratory (MLBio), affiliated with both the School of Computer and Communication Sciences and the School of Life Sciences, in collaboration with international colleagues, tested the best AI models. They used data from ten different experiments and compared them to simple statistical approaches.

In a study recently published in Nature Biotechnology the team found something surprising. Simple approaches did just as well as, if not better than, advanced AI models on many datasets.

“The observation that simple approaches perform as well as advanced AI models made us wonder: are the advanced models actually understanding what gene changes do? Are the standard metrics suitable for evaluating these models?” said Assistant Professor Maria Brbic, head of the MLBio Lab.

Why did the simple methods do so well?

Advanced models may look better than they are. This is because of systematic differences between treated and untreated cells. In these cases, the models may not be learning the true effects of the genetic changes. Instead, they may just notice patterns caused by the design of the experiment or effects that happen for almost all genetic changes.

The researchers also found that common ways of checking model performance can be misleading. They often fail to account for these systematic differences.

“To deal with this, we created a tool called Systema. It reduces the influence of systematic biases and focuses on the unique effects of each genetic perturbation. Systema also makes it easier to understand what genetic perturbations actually do,” explained Ramon Viñas Torné, a postdoctoral researcher in the MLBio Lab and the first author of the paper.

Prediction is harder than standard metrics suggest

With Systema, the researchers found that it’s still very hard for AI models to predict the effects of new genetic changes. Some models could make correct guesses when the genes were part of the same biological process, but overall the challenge remains.

Systema helps tell the difference between models that are just picking up biases and those that truly understand how genetic modifications affect cells.

The researchers suggest that AI models should be evaluated based on their biological value. This means looking at how well predictions explain cellular traits.

“Looking ahead, having bigger and more diverse experiments will help make these predictions better. Also, new technologies that look at cells in more detail, like their shape or location, could help us to understand how gene changes affect cells and tissues better,” concluded Brbic.

References

Learn more about Systema.

Systema: a framework for evaluating genetic perturbation response prediction beyond systematic variation, Viñas Torné, R., Wiatrak, M., Piran, Z. et al., Nat Biotechnol (2025).




EPFL

            AIhub is supported by:



Subscribe to AIhub newsletter on substack



Related posts :

Interview with AAAI Fellow Yan Liu: machine learning for time series

  19 Mar 2026
Hear from 2026 AAAI Fellow Yan Liu about her research into time series, the associated applications, and the promise of physics-informed models.

A principled approach for data bias mitigation

  18 Mar 2026
Find out more about work presented at AIES 2025 which proposes a new way to measure data bias, along with a mitigation algorithm with mathematical guarantees.

An AI image generator for non-English speakers

  17 Mar 2026
"Translations lose the nuances of language and culture, because many words lack good English equivalents."

AI and Theory of Mind: an interview with Nitay Alon

  16 Mar 2026
Find out more about how Theory of Mind plays out in deceptive environments, multi-agents systems, the interdisciplinary nature of this field, when to use Theory of Mind, and when not to, and more.
coffee corner

AIhub coffee corner: AI, kids, and the future – “generation AI”

  13 Mar 2026
The AIhub coffee corner captures the musings of AI experts over a short conversation.

AI chatbots can effectively sway voters – in either direction

  12 Mar 2026
A short interaction with a chatbot can meaningfully shift a voter’s opinion about a presidential candidate or proposed policy.

Studying the properties of large language models: an interview with Maxime Meyer

  11 Mar 2026
What happens when you increase the prompt length in a LLM? In the latest interview in our AAAI Doctoral Consortium series, we sat down with Maxime, a PhD student in Singapore.

What the Moltbook experiment is teaching us about AI

An experimental social media platform where only AI bots can post reveals surprising lessons about artificial intelligence behaviour and safety.



AIhub is supported by:







Subscribe to AIhub newsletter on substack




 















©2026.02 - Association for the Understanding of Artificial Intelligence