ΑΙhub.org
 

Machine learning helps in the search for new antibiotics


by
29 March 2021



share this:

fenoximetilpenicilina
By Rianne Lindhout

With the search for new antibiotics becoming increasingly urgent, artificial intelligence offers valuable help. Smart software developed by Leiden PhD candidate Alexander Kloosterman searched genomes of bacteria and found clusters of DNA that code for proteins that have an antibiotic effect. “This new search method is enormously promising.”

AIhub focus issue on good health and well-being

The discovery was published in PLoS Biology. Professor of Molecular Biotechnology Gilles van Wezel from the Leiden Institute of Biology (IBL) initiated the research together with visiting professor Marnix Medema. “We knew it must be possible to find new antibiotics if we searched the genome of bacteria in a completely different way.”

Classical search exhausted

The classical search for antibiotics, which are used to fight bacterial illnesses, involves cultures of bacteria or moulds that may make antibiotics. In 1928, Alexander Fleming discovered the first antibiotic by chance as a result of contamination in a petri dish. That was penicillin, produced by a fungus. Since then, researchers have been making cultures of other micro-organisms and seeing whether they make molecules that could inhibit other bacteria.

Today, more and more types of bacteria are becoming resistant to antibiotics. This means we need new antibiotics, but the classical search method using cultures has been exhausted. In the lab, bacteria do not produce all the antibiotics that are part of their genetic make-up.

Criteria for antibiotic code

Van Wezel: “To find the piece of code in DNA that can be used to make a possible antibiotic, there are a number of things you need to look at. All the genes that code for antibiotics are found close together in a cluster. There has to be a gene there that dictates that the molecule to be made leaves the bacterium cell: the bacterium always excretes the antibiotic. The DNA shouldn’t be present in all bacteria. The making of antibiotics is a specialist function and it is limited to only some of the bacteria. We adopted the criterion that it had to be present in 10 to 30 per cent of the types we examined.”

Van Wezel knew what he wanted to look for, but he didn’t know how to approach it technically. “So, I worked with Marnix Medema, a bio-informatics specialist at Wageningen University. He is able to create software that can search bacteria DNA.”

Medema: “This joint project was my first collaboration with Leiden University.” It is a collaboration that is set to continue, with Medema as the holder of the Van der Klauw guest chair at IBL since May 2020.

Taking up the ideas of Van Wezel and Medema, PhD candidate Alexander Kloosterman set to work. Under the supervision of Medema, he developed software that could search based on criteria he and Van Wezel specified. The software identified 42 new types of clusters of genes in the DNA code that met the necessary conditions. These genes could possibly code for ‘precursors’ that will later become proteins with antibiotic effects.

Pristinine: basis for new medicines?

Van Wezel: “One of these 42 clusters was the best match for all our conditions and also occurred in a type of bacterium that we could easily work on in our lab.” The researchers were successful in getting this bacterium to produce a small amount of pristinine, the substance that is produced with the gene cluster identified by the software.’

Pristinine is one of a new subclass of the so-called lanthipeptides that many antibiotics belong to. The best-known is nisine, which is commonly used as a preservation agent in such foodstuffs as cheese. The machinery in the cell that produces pristinine is very different from that of nisine, which is why conventional software did not recognise the DNA encoding for it as potentially interesting. Thanks to machine learning, it now does.

The potential uses of pristinine have yet to be determined. The other 41 gene clusters that the new software identified are also interesting subjects for further research. Van Wezel: “Our work is open science; it’s all online, so every research group can use it.” The project is funded by the Chemistry Top Sector. Van Wezel coordinates the Syngenopep project within this group, which aims to find new protein antibiotics. Van Wezel: “This new search method offers a huge array of possibilities for systematically identifying new antibiotics.”

Alexander Kloosterman will obtain his PhD later this year based on his dissertation Exploring the chemical space of post-translationally modified peptides in Streptomyces with machine learning.

Read the work in full

Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides, Alexander M. Kloosterman, Peter Cimermancic, Somayah S. Elsayed, Chao Du, Michalis Hadjithomas, Mohamed S. Donia, Michael A. Fischbach, Gilles P. van Wezel, Marnix H. Medema.



tags: ,


Universiteit Leiden

            AIhub is supported by:



Subscribe to AIhub newsletter on substack



Related posts :

‘Probably’ doesn’t mean the same thing to your AI as it does to you

  17 Apr 2026
Are you sure you and the AI chatbot you’re using are on the same page about probabilities?

Interview with Xinwei Song: strategic interactions in networked multi-agent systems

  16 Apr 2026
Xinwei Song tells us about her research using algorithmic game theory and multi-agent reinforcement learning.

2026 AI Index Report released

  15 Apr 2026
Find out what the ninth edition of the report, which was published on 13 April, says about trends in AI.

Formal verification for safety evaluation of autonomous vehicles: an interview with Abdelrahman Sayed Sayed

  14 Apr 2026
Find out more about work at the intersection of continuous AI models, formal methods, and autonomous systems.

Water flow in prairie watersheds is increasingly unpredictable — but AI could help

  13 Apr 2026
In recent years, the Prairies have seen bigger swings in climate conditions — very wet years followed by very dry ones.

Identifying interactions at scale for LLMs

  10 Apr 2026
Model behavior is rarely the result of isolated components; rather, it emerges from complex dependencies and patterns.

Interview with Sukanya Mandal: Synthesizing multi-modal knowledge graphs for smart city intelligence

  09 Apr 2026
A modular four-stage framework that draws on LLMs to automate synthetic multi-modal knowledge graphs.

Emergence of fragility in LLM-based social networks: an interview with Francesco Bertolotti

  08 Apr 2026
Francesco tells us how LLMs behave in the social network Moltbook, and what this reveals about network dynamics.



AIhub is supported by:







Subscribe to AIhub newsletter on substack




 















©2026.02 - Association for the Understanding of Artificial Intelligence