ΑΙhub.org
 

COVID-19 Open Research Dataset (CORD-19) now available for researchers


by
17 March 2020



share this:
CORD-19 dataset

On 16 March the COVID-19 Open Research Dataset (CORD-19) was released. This comprises an open-source, machine-readable collection of scholarly literature covering COVID-19, SARS-CoV-2, and the Coronavirus group. This free resource contains over 29,000 relevant scholarly articles, including over 13,000 with full text.

The release of the dataset is a result of a collaborate effort between the Allen Institute for AI, Chan Zuckerberg Initiative, Georgetown University, Microsoft, and the US National Library of Medicine (NLM). This resource is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease.

The CORD-19 dataset is available on the Allen Institute’s SemanticScholar.org website and will continue to be updated as new research is published in archival services and peer-reviewed publications.

Kaggle is hosting a challenge using this dataset and at present there are 10 initial tasks for people to work on. These key scientific questions have been drawn from the National Academies of Sciences, Engineering, and Medicine’s research topics and the World Health Organization’s R&D Blueprint for COVID-19.

Links:

You can access the official webpage for CORD-19 here .
Find the kaggle challenge page here.




Lucy Smith is Senior Managing Editor for AIhub.
Lucy Smith is Senior Managing Editor for AIhub.

            AIhub is supported by:



Subscribe to AIhub newsletter on substack



Related posts :

What I’ve learned from 25 years of automated science, and what the future holds: an interview with Ross King

  30 Mar 2026
We launch our new series with a conversation with Ross King - a pioneer in the field of AI-enabled scientific discovery.

A multi-armed robot for assisting with agricultural tasks

and   27 Mar 2026
How can a robot safely manipulate branches to reveal hidden flowers while remaining aware of interaction forces and minimizing damage?

Resource-constrained image generation and visual understanding: an interview with Aniket Roy

  26 Mar 2026
Aniket tells us about his research exploring how modern generative models can be adapted to operate efficiently while maintaining strong performance.

RWDS Big Questions: how do we highlight the role of statistics in AI?

  25 Mar 2026
Next in our series, the panel explores the statistical underpinning of AI.

A history of RoboCup with Manuela Veloso

  24 Mar 2026
Find out how RoboCup got started and how the competition has evolved, from one of the co-founders.

Information-driven design of imaging systems

  23 Mar 2026
Framework that enables direct evaluation and optimization of imaging systems based on their information content.

Machine learning framework to predict global imperilment status of freshwater fish

  20 Mar 2026
“With our model, decision makers can deploy resources in advance before a species becomes imperiled.”

Interview with AAAI Fellow Yan Liu: machine learning for time series

  19 Mar 2026
Hear from 2026 AAAI Fellow Yan Liu about her research into time series, the associated applications, and the promise of physics-informed models.



AIhub is supported by:







Subscribe to AIhub newsletter on substack




 















©2026.02 - Association for the Understanding of Artificial Intelligence