ΑΙhub.org
 

#ICML2023 invited talk: Jennifer Doudna on machine learning for biological research


by
09 August 2023



share this:

Three DNA strands
The programme of the International Conference on Machine Learning (ICML) featured an invited talk by Jennifer Doudna entitled “The future of ML in biology: CRISPR for health and climate”.

Jennifer Doudna and Emmanuelle Charpentier won the 2020 Nobel Prize in Chemistry for “the development of a method for genome editing”. The method in question is often referred to as CRISPR/Cas9 genetic scissors. Using this technique, researchers can change the DNA of animals, plants and microorganisms with extremely high precision. This technology has already had a huge impact on the biological sciences.

In her talk, Jennifer gave an overview of the CRISPR/Cas9 method, touching on a successful case study, talked about protein structure and function prediction, and shared how she thinks machine learning will contribute to biological research.

CRISPR stands for clustered regularly interspaced short palindromic repeats, and is a family of DNA sequences found in the genomes of organisms such as bacteria. These sequences are derived from DNA fragments of viruses that had previously infected the organism. They are used to detect and destroy DNA from similar viruses during subsequent infections. Cas9 is an enzyme that uses CRISPR sequences as a guide to recognize and cut specific strands of DNA that are complementary to the CRISPR sequence. The Cas9 enzyme works in tandem with a piece of RNA (called guide RNA) about 20 bases long and located within a longer RNA scaffold. The guide RNA binds to the target sequence in the DNA and the Cas9 enzyme, in turn, binds to the guide RNA, before cutting the two DNA strands. This triggers repair of the DNA and at this point the required mutation is introduced.

Jennifer talked about a successful use of the technology, in the treatment of Sickle Cell Disease. This disease is a result of a genetic mutation in which a defective form of haemoglobin is produced. The CRISPR/Cas9 technique has been used to turn on the production of fetal haemoglobin. Once the cells are transplanted, they will continue to reproduce, so only one treatment is needed. The therapy should be approved by the FDA (USA food and drug administration) this year. Jennifer noted that we’re just at the beginning of what this technology has to offer. She also commented that getting to this point required a detailed knowledge of how proteins and RNA molecules work and how genes interact.

Jennifer moved on to talk about protein structure prediction, which she views as a huge win for machine learning in biology. She has been amazed to see how quickly researchers such as herself have adopted these tools (like AlphaFold2, RosettaFold and others). These models were all made possible by the open source Protein Data Bank (PDB), founded in 1971, which today contains over 200,000 structures. Despite the great success of these models, there are still challenges that remain. For example, determining function through structure, predicting conformational changes, and predicting RNA structures.

There are a number of biological questions that are going to be very difficult to solve without the help of machine learning. A key ingredient for machine learning models is good quality data, and this is where CRISPR can be utilised. Not only is CRISPR being used as a therapeutic and as a research tool, but also as a tool to generate very large datasets. These will be very exciting to mine with machine learning, provided that, firstly, the datasets are well-curated, and secondly, that researchers can define the questions they want to ask in a way that ensures that the models are trained appropriately.

Jennifer outlined some of the research challenges in which machine learning could play a key role. One of these is the study of gene function and interaction, where CRISPR could be used to provide the raw data. Another concerns the study of protein functions. Machine learning methods could also be deployed in the prediction of RNA structures and RNA-protein interactions. In short, there are many exciting avenues to pursue, and the combination of machine learning techniques and biological data promises to reveal a myriad of new insights in the field of the life sciences.



tags: ,


Lucy Smith is Senior Managing Editor for AIhub.
Lucy Smith is Senior Managing Editor for AIhub.




            AIhub is supported by:


Related posts :



2024 AAAI / ACM SIGAI Doctoral Consortium interviews compilation

  20 Dec 2024
We collate our interviews with the 2024 cohort of doctoral consortium participants.

Interview with Andrews Ata Kangah: Localising illegal mining sites using machine learning and geospatial data

  19 Dec 2024
We spoke to Andrews to find out more about his research, and attending the AfriClimate AI workshop at the Deep Learning Indaba.

#NeurIPS social media round-up part 2

  18 Dec 2024
We pick out some highlights from the second half of the conference.

The Good Robot podcast: Machine vision with Jill Walker Rettberg

  17 Dec 2024
Eleanor and Kerry talk to Jill about machine vision's origins in polished volcanic glass, whether or not we'll actually have self-driving cars, and a famous photo-shopped image.

Five ways you might already encounter AI in cities (and not realise it)

  13 Dec 2024
Researchers studied how residents and visitors experience the presence of AI in public spaces in the UK.

#NeurIPS2024 social media round-up part 1

  12 Dec 2024
Find out what participants have been getting up to at the Neural Information Processing Systems conference in Vancouver.

Congratulations to the #NeurIPS2024 award winners

  11 Dec 2024
Find out who has been recognised by the conference awards.

Multi-agent path finding in continuous environments

and   11 Dec 2024
How can a group of agents minimise their journey length whilst avoiding collisions?




AIhub is supported by:






©2024 - Association for the Understanding of Artificial Intelligence


 












©2021 - ROBOTS Association