ΑΙhub.org
 

DeepMind and EMBL release database of predicted protein structures


by
23 July 2021



share this:

AF-Q8I3H7-F1
T-cell immunomodulatory protein homolog, from the AlphaFold Protein Structure Database, reproduced under a CC-BY-4.0 license.

DeepMind and the European Molecular Biology Laboratory (EMBL) have partnered to produce a database of predicted protein structure models.

The first release covers all ~20,000 proteins expressed in the human proteome, and the proteomes of 20 other biologically significant organisms, totalling over 350k structures. In the coming months they plan to expand the database to cover a large proportion of all catalogued proteins (the over 100 million in UniRef90).

The data is freely and openly available to the scientific community. You can access the AlphaFold Protein Structure Database here.

Back in November, DeepMind reported on their AlphaFold system that was able to predict, with high accuracy, a protein’s 3D structure from its amino acid sequence. We wrote about it here. This database is the next step in the journey, and the collaborators hope that this will be a useful tool for researchers and open up new avenues for scientific discovery.


Another example protein structure from the AlphaFold Protein Structure Database, reproduced under a CC-BY-4.0 license. This is Striatin-interacting protein 1. It plays a role in the regulation of cell morphology and cytoskeletal organization, required in the cortical actin filament dynamics and cell shape. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. The parts of the protein with a pLDDT score of above 90 are shown in dark blue, between 70 and 90 in light blue, between 50 and 70 in yellow, and below 50 in red.

In a recently published Nature article, Highly accurate protein structure prediction with AlphaFold, you can find out more about the neural network-based model and methodology that the AlphaFold team used. In this second Nature article, Highly accurate protein structure prediction for the human proteome, published yesterday, you can read more about the application of AlphaFold to the human proteome.

Find out more

AlphaFold Protein Structure Database
DeepMind blog post
EMBL-EBI news article
Highly accurate protein structure prediction with AlphaFold, Nature article.
Highly accurate protein structure prediction for the human proteome, Nature article.
DeepMind open source code
AlphaFold Colab



tags:


Lucy Smith is Senior Managing Editor for AIhub.
Lucy Smith is Senior Managing Editor for AIhub.




            AIhub is supported by:


Related posts :



Interview with Onur Boyar: Drug and material design using generative models and Bayesian optimization

  09 May 2025
Find out how Onur is applying machine learning techniques to bioinformatics-related problems.

2025 AI Index Report

  08 May 2025
Read the latest edition of the AI Index Report which tracks and visualises data related to AI.

Defending against prompt injection with structured queries (StruQ) and preference optimization (SecAlign)

  06 May 2025
Recent advances in LLMs enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them.

Forthcoming machine learning and AI seminars: May 2025 edition

  05 May 2025
A list of free-to-attend AI-related seminars that are scheduled to take place between 5 May and 30 June 2025.

Competition open for images of “digital transformation at work”

Digit and Better Images of AI have teamed up to launch a competition to create more realistic stock images of "digital transformation at work"
monthly digest

AIhub monthly digest: April 2025 – aligning GenAI with technical standards, ML applied to semiconductor manufacturing, and social choice problems

  30 Apr 2025
Welcome to our monthly digest, where you can catch up with AI research, events and news from the month past.



 

AIhub is supported by:






©2025.05 - Association for the Understanding of Artificial Intelligence


 












©2025.05 - Association for the Understanding of Artificial Intelligence