AIhub.org
 

History playground – finding patterns in historical newspapers


13 January 2020




Ever fancied finding out more about historical trends? Thanks to researchers at the University of Bristol and their History Playground tool, anyone can analyse the content of a collection of historical British and American newspapers.

Macroscopic patterns of continuity and change over the course of centuries can be detected by analysing time series extracted from massive textual corpora. Data-driven approaches of this kind have already revolutionised the natural sciences, and it is widely believed that they hold similar potential for the humanities and social sciences. Realising that potential requires new interactive tools for discovering and extracting macroscopic patterns from these vast quantities of data.

History Playground enables users to search for short sequences of words and retrieve their relative frequencies over the course of history. The tool uses scalable algorithms to extract trends from the textual corpora in advance, then makes them available for real-time search and discovery through an interface for exploring the data.
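To make the idea of a relative frequency over time concrete, here is a minimal sketch in Python. It is not the History Playground implementation: the function name `relative_frequency_by_year`, the whitespace tokenisation, and the choice to normalise by the total number of words published in each year are all assumptions made for illustration.

```python
from collections import Counter


def relative_frequency_by_year(documents, phrase):
    """Return {year: occurrences of phrase / total words seen that year}.

    `documents` is an iterable of (year, text) pairs (a hypothetical input
    format). Tokenisation is a naive lowercase whitespace split.
    """
    target = phrase.lower().split()
    n = len(target)
    hits = Counter()    # occurrences of the phrase per year
    totals = Counter()  # total word count per year
    for year, text in documents:
        words = text.lower().split()
        totals[year] += len(words)
        # Slide a window of length n over the words and count exact matches.
        hits[year] += sum(
            1 for i in range(len(words) - n + 1) if words[i:i + n] == target
        )
    # Skip years with no words to avoid dividing by zero.
    return {
        year: hits[year] / totals[year]
        for year in sorted(totals)
        if totals[year]
    }


# Toy corpus, invented for this example.
toy_corpus = [
    (1850, "the steam train left the station"),
    (1850, "a steam train arrived"),
    (1900, "the motor car passed the train"),
]
series = relative_frequency_by_year(toy_corpus, "steam train")
```

Dividing by the yearly word total is what makes years comparable when the volume of surviving text differs from one period to the next.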

At present there are two large sets of text available:

History Playground uses the concept of n-grams, defined as short sequences of words. It is these n-grams that users search for when they use the tool. N-gram models are also widely used in the fields of natural language processing, probability, communication theory and data compression.
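As an illustration of what an n-gram is, the sketch below extracts every sequence of n consecutive words from a piece of text. The helper names `ngrams` and `ngram_counts` and the simple regular-expression tokeniser are assumptions for this example, not part of the tool itself.

```python
import re
from collections import Counter


def ngrams(text, n):
    """Return all sequences of n consecutive words in text, as strings."""
    # Naive tokeniser (an assumption): lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def ngram_counts(text, max_n=3):
    """Count every n-gram of length 1 up to max_n in text."""
    counts = Counter()
    for n in range(1, max_n + 1):
        counts.update(ngrams(text, n))
    return counts


bigrams = ngrams("The quick brown fox", 2)
counts = ngram_counts("to be or not to be", max_n=2)
```

Here `bigrams` holds the three 2-grams of the sentence, and `counts["to be"]` is 2 because that pair of words appears twice.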

The team hope that, in the long term, as more large textual datasets are released and feedback from the community helps to improve the Playground, they will be able to incorporate more varied and interesting corpora into the tool. In addition, they are continuing to develop methods of analysis and additional views and visualisations. The tool also has the potential to incorporate text in languages other than English. For more contemporary sources of data, such as social media, the time resolution can be adjusted to study daily or even hourly changes.
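One way to picture an adjustable time resolution is to group timestamped texts into buckets of different widths before counting. The sketch below is only an illustration under stated assumptions: the `FORMATS` table, the `counts_by_resolution` function, and the sample posts are hypothetical and do not describe how the Playground stores its data.

```python
from collections import Counter
from datetime import datetime

# Bucket labels for each resolution, expressed as strftime patterns (assumed).
FORMATS = {"year": "%Y", "day": "%Y-%m-%d", "hour": "%Y-%m-%d %H:00"}


def counts_by_resolution(timestamped_texts, term, resolution="day"):
    """Count occurrences of a single word per time bucket.

    `timestamped_texts` is an iterable of (datetime, text) pairs.
    """
    fmt = FORMATS[resolution]
    counts = Counter()
    for timestamp, text in timestamped_texts:
        bucket = timestamp.strftime(fmt)
        # Naive lowercase whitespace tokenisation, for illustration only.
        counts[bucket] += text.lower().split().count(term.lower())
    return dict(counts)


# Invented sample posts.
posts = [
    (datetime(2020, 1, 13, 9, 15), "AI news today"),
    (datetime(2020, 1, 13, 9, 45), "more ai and AI"),
    (datetime(2020, 1, 13, 17, 0), "nothing here"),
    (datetime(2020, 1, 14, 8, 0), "ai again"),
]
daily = counts_by_resolution(posts, "ai", "day")
hourly = counts_by_resolution(posts, "ai", "hour")
```

The same posts yield two daily buckets or three hourly ones, so the choice of resolution decides how fine-grained a change can be observed.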

This work is part of the ERC ThinkBIG project (Principal Investigator: Nello Cristianini, University of Bristol).

Nello Cristianini is a Professor of Artificial Intelligence at the University of Bristol. His research interests include data science, artificial intelligence, machine learning, and applications to computational social sciences, digital humanities and news content analysis.

 

 





