ΑΙhub.org
 

DeepMind and EMBL release database of predicted protein structures


by
23 July 2021



share this:

AF-Q8I3H7-F1
T-cell immunomodulatory protein homolog, from the AlphaFold Protein Structure Database, reproduced under a CC-BY-4.0 license.

DeepMind and the European Molecular Biology Laboratory (EMBL) have partnered to produce a database of predicted protein structure models.

The first release covers all ~20,000 proteins expressed in the human proteome, and the proteomes of 20 other biologically significant organisms, totalling over 350k structures. In the coming months they plan to expand the database to cover a large proportion of all catalogued proteins (the over 100 million in UniRef90).

The data is freely and openly available to the scientific community. You can access the AlphaFold Protein Structure Database here.

Back in November, DeepMind reported on their AlphaFold system that was able to predict, with high accuracy, a protein’s 3D structure from its amino acid sequence. We wrote about it here. This database is the next step in the journey, and the collaborators hope that this will be a useful tool for researchers and open up new avenues for scientific discovery.


Another example protein structure from the AlphaFold Protein Structure Database, reproduced under a CC-BY-4.0 license. This is Striatin-interacting protein 1. It plays a role in the regulation of cell morphology and cytoskeletal organization, required in the cortical actin filament dynamics and cell shape. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. The parts of the protein with a pLDDT score of above 90 are shown in dark blue, between 70 and 90 in light blue, between 50 and 70 in yellow, and below 50 in red.

In a recently published Nature article, Highly accurate protein structure prediction with AlphaFold, you can find out more about the neural network-based model and methodology that the AlphaFold team used. In this second Nature article, Highly accurate protein structure prediction for the human proteome, published yesterday, you can read more about the application of AlphaFold to the human proteome.

Find out more

AlphaFold Protein Structure Database
DeepMind blog post
EMBL-EBI news article
Highly accurate protein structure prediction with AlphaFold, Nature article.
Highly accurate protein structure prediction for the human proteome, Nature article.
DeepMind open source code
AlphaFold Colab



tags:


Lucy Smith is Senior Managing Editor for AIhub.
Lucy Smith is Senior Managing Editor for AIhub.




            AIhub is supported by:



Related posts :



AIhub blog post highlights 2025

  16 Dec 2025
As the year draws to a close, we take a look back at some of our favourite blog posts.

Using machine learning to track greenhouse gas emissions

  15 Dec 2025
PhD candidate Julia Wąsala searches for greenhouse gas emissions in satellite data.

AAAI 2025 presidential panel on the future of AI research – video discussion on AGI

  12 Dec 2025
Watch the first in a series of video discussions from AAAI.

The Machine Ethics podcast: the AI bubble with Tim El-Sheikh

Ben chats to Tim about AI use cases, whether GenAI is even safe, the AI bubble, replacing human workers, data oligarchies and more.

Australia’s vast savannas are changing, and AI is showing us how

Improving decision-making for dynamic and rapidly changing environments.

AI language models show bias against regional German dialects

New study examines how artificial intelligence responds to dialect speech.

We asked teachers about their experiences with AI in the classroom — here’s what they said

  05 Dec 2025
Researchers interviewed teachers from across Canada and asked them about their experiences with GenAI in the classroom.



 

AIhub is supported by:






 












©2025.05 - Association for the Understanding of Artificial Intelligence