ΑΙhub.org
 

AI weather models can now beat the best traditional forecasts


by
06 January 2025



share this:

NASA/GSFC, MODIS Rapid Response Team, Jacques Descloitres

By Vassili Kitsios, CSIRO

A new machine-learning weather prediction model called GenCast can outperform the best traditional forecasting systems in at least some situations, according to a paper by Google DeepMind researchers published last month in Nature.

Using a diffusion model approach similar to artificial intelligence (AI) image generators, the system generates multiple forecasts to capture the complex behaviour of the atmosphere. It does so with a fraction of the time and computing resources required for traditional approaches.

How weather forecasts work

The weather predictions we use in practice are produced by running multiple numerical simulations of the atmosphere.

Each simulation starts from a slightly different estimate of the current weather. This is because we don’t know exactly what the weather is at this instant everywhere in the world. To know that, we would need sensor measurements everywhere.

These numerical simulations use a model of the world’s atmosphere divided into a grid of three-dimensional blocks. By solving equations describing the fundamental physical laws of nature, the simulations predict what will happen in the atmosphere.

Known as general circulation models, these simulations need a lot of computing power. They are usually run at high-performance supercomputing facilities.

Machine-learning the weather

The past few years have seen an explosion in efforts to produce weather prediction models using machine learning. Typically, these approaches don’t incorporate our knowledge of the laws of nature the way general circulation models do.

Most of these models use some form of neural network to learn patterns in historical data and produce a single future forecast. However, this approach produces predictions that lose detail as they progress into the future, gradually becoming “smoother”. This smoothness is not what we see in real weather systems.

Researchers at Google’s DeepMind AI research lab have just published a paper in Nature describing their latest machine-learning model, GenCast.

GenCast mitigates this smoothing effect by generating an ensemble of multiple forecasts. Each individual forecast is less smooth, and better resembles the complexity observed in nature.

The best estimate of the actual future then comes from averaging the different forecasts. The size of the differences between the individual forecasts indicates how much uncertainty there is.

According to the GenCast paper, this probabilistic approach creates more accurate forecasts than the best numerical weather prediction system in the world – the one at the European Centre for Medium-Range Weather Forecasts.

Generative AI – for weather

GenCast is trained on what is called reanalysis data from the years 1979 to 2018. This data is produced by the kind of general circulation models we talked about earlier, which are additionally corrected to resemble actual historical weather observations to produce a more consistent picture of the world’s weather.

The GenCast model makes predictions of several variables such as temperature, pressure, humidity and wind speed at the surface and at 13 different heights, on a grid that divides the world up into 0.25-degree regions of latitude and longitude.

GenCast is what is called a “diffusion model”, similar to AI image generators. However, instead of taking text and producing an image, it takes the current state of the atmosphere and produces an estimate of what it will be like in 12 hours.

This works by first setting the values of the atmospheric variables 12 hours into the future as random noise. GenCast then uses a neural network to find structures in the noise that are compatible with the current and previous weather variables. An ensemble of multiple forecasts can be generated by starting with different random noise.

Forecasts are run out to 15 days, taking 8 minutes on a single processor called a tensor processor unit (TPU). This is significantly faster than a general circulation model. The training of the model took five days using 32 TPUs.

Machine-learning forecasts could become more widespread in the coming years as they become more efficient and reliable.

However, classical numerical weather prediction and reanalysed data will still be required. Not only are they needed to provide the initial conditions for the machine learning weather forecasts, they also produce the input data to continually fine-tune the machine learning models.

What about the climate?

Current machine learning weather forecasting systems are not appropriate for climate projections, for three reasons.

Firstly, to make weather predictions weeks into the future, you can assume that the ocean, land and sea ice won’t change. This is not the case for climate predictions over multiple decades.

Secondly, weather prediction is highly dependent on the details of the current weather. However, climate projections are concerned with the statistics of the climate decades into the future, for which today’s weather is irrelevant. Future carbon emissions are the greater determinant of the future state of the climate.

Thirdly, weather prediction is a “big data” problem. There are vast amounts of relevant observational data, which is what you need to train a complex machine learning model.

Climate projection is a “small data” problem, with relatively little available data. This is because the relevant physical phenomena (such as sea levels or climate drivers such as the El Niño–Southern Oscillation) evolve much more slowly than the weather.

There are ways to address these problems. One approach is to use our knowledge of physics to simplify our models, meaning they require less data for machine learning.

Another approach is to use physics-informed neural networks to try to fit the data and also satisfy the laws of nature. A third is to use physics to set “ground rules” for a system, then use machine learning to determine the specific model parameters.

Machine learning has a role to play in the future of both weather forecasting and climate projections. However, fundamental physics – fluid mechanics and thermodynamics – will continue to play a crucial role.The Conversation

Vassili Kitsios, Senior Research Scientist, Climate Forecasting, CSIRO

This article is republished from The Conversation under a Creative Commons license. Read the original article.


Read the work in full

Probabilistic weather forecasting with machine learning, Ilan Price, Alvaro Sanchez-Gonzalez, Ferran Alet, Tom R. Andersson, Andrew El-Kadi, Dominic Masters, Timo Ewalds, Jacklynn Stott, Shakir Mohamed, Peter Battaglia, Remi Lam & Matthew Willson (2024).




The Conversation is an independent source of news and views, sourced from the academic and research community and delivered direct to the public.
The Conversation is an independent source of news and views, sourced from the academic and research community and delivered direct to the public.

            AIhub is supported by:



Subscribe to AIhub newsletter on substack



Related posts :

Reinforcement learning applied to autonomous vehicles: an interview with Oliver Chang

  25 Feb 2026
In the third of our interviews with the 2026 AAAI Doctoral Consortium cohort, we hear from Oliver Chang.

The Machine Ethics podcast: moral agents with Jen Semler

In this episode, Ben and Jen Semler talk about what makes a moral agent, the point of moral agents, philosopher and engineer collaborations, and more.

Extending the reward structure in reinforcement learning: an interview with Tanmay Ambadkar

  23 Feb 2026
Find out more about Tanmay's research on RL frameworks, the latest in our series meeting the AAAI Doctoral Consortium participants.

The Good Robot podcast: what makes a drone “good”? with Beryl Pong

  20 Feb 2026
In this episode, Eleanor and Kerry talk to Beryl Pong about what it means to think about drones as “good” or “ethical” technologies.

Relational neurosymbolic Markov models

and   19 Feb 2026
Relational neurosymbolic Markov models make deep sequential models logically consistent, intervenable and generalisable

AI enables a Who’s Who of brown bears in Alaska

  18 Feb 2026
A team of scientists from EPFL and Alaska Pacific University has developed an AI program that can recognize individual bears in the wild, despite the substantial changes that occur in their appearance over the summer season.

Learning to see the physical world: an interview with Jiajun Wu

and   17 Feb 2026
Winner of the 2019 AAAI / ACM SIGAI dissertation award tells us about his current research.

3 Questions: Using AI to help Olympic skaters land a quint

  16 Feb 2026
Researchers are applying AI technologies to help figure skaters improve. They also have thoughts on whether five-rotation jumps are humanly possible.



AIhub is supported by:







Subscribe to AIhub newsletter on substack




 















©2026.02 - Association for the Understanding of Artificial Intelligence