ΑΙhub.org
 

DeepMind and EMBL release database of predicted protein structures


by
23 July 2021



share this:

AF-Q8I3H7-F1
T-cell immunomodulatory protein homolog, from the AlphaFold Protein Structure Database, reproduced under a CC-BY-4.0 license.

DeepMind and the European Molecular Biology Laboratory (EMBL) have partnered to produce a database of predicted protein structure models.

The first release covers all ~20,000 proteins expressed in the human proteome, and the proteomes of 20 other biologically significant organisms, totalling over 350k structures. In the coming months they plan to expand the database to cover a large proportion of all catalogued proteins (the over 100 million in UniRef90).

The data is freely and openly available to the scientific community. You can access the AlphaFold Protein Structure Database here.

Back in November, DeepMind reported on their AlphaFold system that was able to predict, with high accuracy, a protein’s 3D structure from its amino acid sequence. We wrote about it here. This database is the next step in the journey, and the collaborators hope that this will be a useful tool for researchers and open up new avenues for scientific discovery.


Another example protein structure from the AlphaFold Protein Structure Database, reproduced under a CC-BY-4.0 license. This is Striatin-interacting protein 1. It plays a role in the regulation of cell morphology and cytoskeletal organization, required in the cortical actin filament dynamics and cell shape. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. The parts of the protein with a pLDDT score of above 90 are shown in dark blue, between 70 and 90 in light blue, between 50 and 70 in yellow, and below 50 in red.

In a recently published Nature article, Highly accurate protein structure prediction with AlphaFold, you can find out more about the neural network-based model and methodology that the AlphaFold team used. In this second Nature article, Highly accurate protein structure prediction for the human proteome, published yesterday, you can read more about the application of AlphaFold to the human proteome.

Find out more

AlphaFold Protein Structure Database
DeepMind blog post
EMBL-EBI news article
Highly accurate protein structure prediction with AlphaFold, Nature article.
Highly accurate protein structure prediction for the human proteome, Nature article.
DeepMind open source code
AlphaFold Colab



tags:


Lucy Smith is Senior Managing Editor for AIhub.
Lucy Smith is Senior Managing Editor for AIhub.

            AUAI is supported by:



Subscribe to AIhub newsletter on substack



Related posts :

coffee corner

AIhub coffee corner: World models

  22 May 2026
The AIhub coffee corner captures the musings of AI experts over a short conversation.

Why the world’s banks are so worried about Anthropic’s latest AI model

  21 May 2026
The finance world’s concern rests on the impressive cyber capabilities of a product called Mythos.

Embracing empiricism – from the lottery hypothesis to creating real-world impact: an interview with Jonathan Frankle

  20 May 2026
Jonathan Frankle discusses empiricism, making an impact, and the legacy of his lottery ticket hypothesis.

A faster way to estimate AI power consumption

  19 May 2026
The “EnergAIzer” method generates reliable results in seconds, enabling data center operators to efficiently allocate resources and reduce wasted energy.

Introducing ARFBench: A time series question-answering benchmark based on real incidents

  18 May 2026
To resolve system failures, engineers must troubleshoot outages quickly.

Does ‘federated unlearning’ in AI improve data privacy, or create a new cybersecurity risk?

  15 May 2026
As the capacity of AI systems increases apace, so do concerns about the privacy of user data.

Reflections from #AIES2025

and   14 May 2026
We reflect on AIES 2025, outlining a discussion session on LLMs for clinical usage and human rights.

Deep learning-powered biochip to detect genetic markers

System can detect extremely small amounts of microRNAs, genetic markers linked to diseases such as heart disease.



AUAI is supported by:







Subscribe to AIhub newsletter on substack




 















©2026.02 - Association for the Understanding of Artificial Intelligence