AIhub.org
 

COVID-19 Open Research Dataset (CORD-19) now available for researchers


17 March 2020



CORD-19 dataset

On 16 March 2020, the COVID-19 Open Research Dataset (CORD-19) was released. It is an open-source, machine-readable collection of scholarly literature covering COVID-19, SARS-CoV-2, and the coronavirus group. This free resource contains over 29,000 relevant scholarly articles, including over 13,000 with full text.

The release of the dataset is the result of a collaborative effort between the Allen Institute for AI, Chan Zuckerberg Initiative, Georgetown University, Microsoft, and the US National Library of Medicine (NLM). The resource is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease.

The CORD-19 dataset is available on the Allen Institute’s SemanticScholar.org website and will continue to be updated as new research is published in archival services and peer-reviewed publications.

Kaggle is hosting a challenge using this dataset, and at present there are 10 initial tasks for people to work on. These key scientific questions have been drawn from the National Academies of Sciences, Engineering, and Medicine’s research topics and the World Health Organization’s R&D Blueprint for COVID-19.

Links:

You can access the official webpage for CORD-19 here.
Find the Kaggle challenge page here.




Lucy Smith is Senior Managing Editor for AIhub.





 















©2026.05 - Association for the Understanding of Artificial Intelligence