COVID-19 Open Research Dataset (CORD-19) now available for researchers

17 March 2020



On 16 March, the COVID-19 Open Research Dataset (CORD-19) was released. It comprises an open-source, machine-readable collection of scholarly literature covering COVID-19, SARS-CoV-2, and the coronavirus group. This free resource contains over 29,000 relevant scholarly articles, including over 13,000 with full text.

The release of the dataset is the result of a collaborative effort between the Allen Institute for AI, the Chan Zuckerberg Initiative, Georgetown University, Microsoft, and the US National Library of Medicine (NLM). The resource is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease.

The CORD-19 dataset is available on the Allen Institute’s website and will continue to be updated as new research is published in archival services and peer-reviewed publications.

Kaggle is hosting a challenge using this dataset, and at present there are 10 initial tasks for people to work on. These key scientific questions have been drawn from the National Academies of Sciences, Engineering, and Medicine’s research topics and the World Health Organization’s R&D Blueprint for COVID-19.


You can access the official webpage for CORD-19 here.
Find the Kaggle challenge page here.

Lucy Smith, Managing Editor for AIhub.



©2021 - Association for the Understanding of Artificial Intelligence