AIhub.org
 

COVID-19 Open Research Dataset (CORD-19) now available for researchers


by Lucy Smith
17 March 2020



CORD-19 dataset

On 16 March, the COVID-19 Open Research Dataset (CORD-19) was released. It is an open-source, machine-readable collection of scholarly literature covering COVID-19, SARS-CoV-2, and the wider coronavirus group. This free resource contains over 29,000 relevant scholarly articles, including over 13,000 with full text.

The release of the dataset is the result of a collaborative effort between the Allen Institute for AI, Chan Zuckerberg Initiative, Georgetown University, Microsoft, and the US National Library of Medicine (NLM). The resource is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease.

The CORD-19 dataset is available on the Allen Institute’s SemanticScholar.org website and will continue to be updated as new research is published in archival services and peer-reviewed publications.

Kaggle is hosting a challenge using this dataset and at present there are 10 initial tasks for people to work on. These key scientific questions have been drawn from the National Academies of Sciences, Engineering, and Medicine’s research topics and the World Health Organization’s R&D Blueprint for COVID-19.

Links:

You can access the official webpage for CORD-19 here.
Find the Kaggle challenge page here.




Lucy Smith is Senior Managing Editor for AIhub.
