ΑΙhub.org
 

#AAAI2021 invited talk – Regina Barzilay on deploying machine learning methods in cancer diagnosis and drug design


by
09 February 2021



share this:

AAAI2021 logo
In September 2020, Regina Barzilay was announced as the winner of the inaugural AAAI Squirrel AI award. Regina was formally presented with the prize during an award ceremony at the AAAI2021 conference, following which she delivered an invited talk. She spoke about two particular areas of medicine that she has been researching: drug discovery and cancer diagnosis.

AIhub focus issue on good health and well-being
Drug discovery

It is well-known that the development of drugs is slow and expensive. Currently, drug discovery is primarily experimentally driven, with properties of molecules investigated empirically. The problem is that the number of molecules that have the potential to be used as drugs is huge, and only a tiny fraction of these will actually be a good candidate. This is where machine learning comes in – it is an ideal tool for assisting in the search through this vast molecule space.

Regina talked about the research methodology that led to the discovery of Halicin for use as an antibiotic. She, and her co-authors, trained a graph neural network model on 2,500 molecules, which they had experimentally tested to see if they were effective against E.coli. For the purposes of the model this gave them a 2d representation of the molecule and a number. This number reflected the inhibitory capacity of the molecule in question. They used the trained model to computationally screen 107 molecules. From this huge number, they were left with 102 candidates to test empirically, then just one candidate to test on animals. You can read more about this work in the Cell paper “A Deep Learning Approach to Antibiotic Discovery”.

The team actually discovered that not only was the drug effective against E.coli, but it also worked on the drug-resistant strains of C.difficile and A.baumannii. The reason Halicin was so effective here was because it had a distinct mechanism of action, as compared to known antibiotics.

Drug discovery slides from Barzilay talkSlides from the talk. Chemprop is a tool the team developed to predict molecule properties.
Cancer diagnosis

Regina talked about breast cancer and began by explaining how classical risk models work. They take in several variables, such as age, family history, and prior medical breast procedures, then apply a simple statistical algorithm. The AUC (area under the curve) metric for these models is around 0.607, so not significantly better than chance (where AUC is 0.5).

One way to improve these models is to include images from mammograms. Regina and her team developed a machine learning model called “Mirai” to predict breast cancer risk based on traditional mammograms. To do this, they collected consecutive screening mammograms from 80,134 patients screened between 1 January 2009 and 31 December 2016. An examination was referred to as “positive” if it was followed by a pathology-confirmed cancer diagnosis within 5 years. The deep learning model was designed to predict risk at multiple timepoints.

You can read the research in full in this paper “Toward robust mammography-based models for breast cancer risk”.

Regina talk race performance for breast cancer slide
Slide showing the performance of the team’s model (MIRAI) versus the standard model used in the USA (Tyrer-Cuzick). She noted the poor performance of the Tyrer-Cuzick model for African American and Asian women.

Regina stressed the need to ensure that models are built and tested using data from all racial groups. She called for standards for researchers to follow when reporting their results. She gave an example of a published paper where there was no racial breakdown of the model performance; as a result we are left none the wiser as to how it performs for different populations. Reproducibility is an issue too; code needs to be made available so that research findings can be rigorously verified.

Personal journey

In the final part of her talk, Regina shed some light on her personal research journey. After being treated for breast cancer in 2014 she wanted to work on problems that could really make a difference to people’s lives, so she changed her focus from natural language processing to applying her machine learning expertise to the field of healthcare. This was not an easy transition and she encountered many challenges along the way, including a struggle to obtain funding, and lack of access to the required data for her research. It was refreshing to hear a researcher talk openly about the challenges they have faced and, if the comments in the chat following the talk were representative, it is something that the audience greatly valued.

AAAI plan to make the recordings of the talks publicly available in a month or two. When the videos are released we will add a link to this article.



tags: , , ,


Lucy Smith is Senior Managing Editor for AIhub.
Lucy Smith is Senior Managing Editor for AIhub.




            AIhub is supported by:



Related posts :



Forthcoming machine learning and AI seminars: November 2025 edition

  03 Nov 2025
A list of free-to-attend AI-related seminars that are scheduled to take place between 3 November and 31 December 2025.

#ECAI2025 – social media round up

  31 Oct 2025
Over the past week, researchers have gathered in Bologna for the 28th European Conference on Artificial Intelligence.
monthly digest

AIhub monthly digest: October 2025 – energy supply challenges, wearable sensors, and atomic-scale simulations

  29 Oct 2025
Welcome to our monthly digest, where you can catch up with AI research, events and news from the month past.

Winners of the #ECAI2025 outstanding paper awards announced

  28 Oct 2025
Find out which articless were selected as ECAI and PAIS outstanding papers.

The great wildebeest migration, seen from space: satellites and AI are helping count Africa’s wildlife

  27 Oct 2025
Researchers analysed satellite imagery of the Serengeti-Mara ecosystem from 2022 and 2023.

New AI tool helps match enzymes to substrates

  24 Oct 2025
A new machine learning-powered tool can help researchers determine how well an enzyme fits with a desired target.

#AIES2025 social media round-up

  24 Oct 2025
Find out what participants got up to at the Conference on Artificial Intelligence, Ethics, and Society.

Looking ahead to #ECAI2025

  23 Oct 2025
Find out what the programme has in store at the European Conference on AI.



 

AIhub is supported by:






 












©2025.05 - Association for the Understanding of Artificial Intelligence