COVID-19 Open Research Dataset (CORD-19) now available for researchers

by Lucy Smith

17 March 2020

share this:

On 16 March the COVID-19 Open Research Dataset (CORD-19) was released. This comprises an open-source, machine-readable collection of scholarly literature covering COVID-19, SARS-CoV-2, and the Coronavirus group. This free resource contains over 29,000 relevant scholarly articles, including over 13,000 with full text.

The release of the dataset is a result of a collaborate effort between the Allen Institute for AI, Chan Zuckerberg Initiative, Georgetown University, Microsoft, and the US National Library of Medicine (NLM). This resource is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease.

The CORD-19 dataset is available on the Allen Institute’s SemanticScholar.org website and will continue to be updated as new research is published in archival services and peer-reviewed publications.

Kaggle is hosting a challenge using this dataset and at present there are 10 initial tasks for people to work on. These key scientific questions have been drawn from the National Academies of Sciences, Engineering, and Medicine’s research topics and the World Health Organization’s R&D Blueprint for COVID-19.

Links:

You can access the official webpage for CORD-19 here .
Find the kaggle challenge page here.

Lucy Smith is Senior Managing Editor for AIhub.

AIhub is supported by:

Interview with Shaghayegh (Shirley) Shajarian: Applying generative AI to computer networks

Lucy Smith 05 Aug 2025

Read the latest interview in our series featuring the AAAI/SIGAI Doctoral Consortium participants.

How AI can help protect bees from dangerous parasites

The Conversation 04 Aug 2025

Tiny but mighty, honeybees play a crucial role in our ecosystems, pollinating various plants and crops.

The Machine Ethics podcast: AI Ethics, Risks and Safety Conference 2025

The Machine Ethics Podcast 01 Aug 2025

Listen to a special episode recorded at the AI Ethics, Risks and Safety Conference.

Interview with Aneesh Komanduri: Causality and generative modeling

Lucy Smith 31 Jul 2025

Read the latest interview in our series featuring the AAAI/SIGAI Doctoral Consortium participants.

monthly digest

AIhub monthly digest: July 2025 – RoboCup round-up, ICML in Vancouver, and leveraging feedback in human-robot interactions

Lucy Smith 30 Jul 2025

Welcome to our monthly digest, where you can catch up with AI research, events and news from the month past.

COVID-19 Open Research Dataset (CORD-19) now available for researchers

Links:

Related posts :

Interview with Shaghayegh (Shirley) Shajarian: Applying generative AI to computer networks

How AI can help protect bees from dangerous parasites

The Machine Ethics podcast: AI Ethics, Risks and Safety Conference 2025

Interview with Aneesh Komanduri: Causality and generative modeling

AIhub monthly digest: July 2025 – RoboCup round-up, ICML in Vancouver, and leveraging feedback in human-robot interactions

Interview with Yuki Mitsufuji: Text-to-sound generation

Open-source Swiss language model to be released this summer

Interview with Kate Candon: Leveraging explicit and implicit feedback in human-robot interactions

↑