about

resources

events

contribute

republishing

☰

ΑΙhub.org

Harnessing the power of AI to elucidate mobile traffic consumption at city scale

by Paul Patras

15 December 2021

What makes mobile traffic analysis difficult?

Gaining fine-grained knowledge from mobile network traffic is not straightforward. To begin with, this requires dedicated equipment (probes) for measurements collection, which runs e.g. on network gateways. Secondly, processing vast amounts of data in a scalable and timely fashion is non-trivial, as this involves substantial local storage capabilities, costly overhead associated with transferring detailed logs to central locations for analysis, data filtering by scope, etc. To simplify traffic analysis, mobile operators often make simple assumptions about the distribution of data traffic consumption across cells. For instance, it is frequently assumed that users and traffic are uniformly distributed, irrespective of the geographical layout of coverage areas (Lee et al., 2014). Unfortunately, such approximations are usually highly inaccurate, as traffic volumes may exhibit considerable differences between nearby locations (Wang et al., 2015), and therefore have limited value.

Mobile traffic super-resolution

To tackle this problem, our work published earlier in ACM CoNEXT’17 introduces the concept of mobile traffic super-resolution (MTSR), by which we infer narrowly localised traffic consumption from aggregate data recorded by a limited number of probes (thereby reducing deployment and measurement costs) that have arbitrary spatial granularity, which is reflective of practical coverage area sizes that depend on population density. We draw inspiration from image processing, and treat city-scale mobile traffic ‘snapshots’ similar to images, regarding a network measurement collection location similar to a pixel in an image, and the traffic volume at that location as the intensity of a pixel (see the similarities between the two problems as illustrated in Figure 1). We propose a novel Generative Adversarial neural Network (GAN) architecture tailored to MTSR, where high-resolution traffic maps are obtained through a generative model that receives coarse measurements collected by network probes and outputs approximations of the real traffic distribution. This is trained with a discriminative model that estimates the probability that a sample snapshot comes from a fine-grained ground truth measurements set, rather than being produced by the generator.

Figure 1: Illustration of the image super-resolution (SR) problem (above) and the underlying principle of the mobile traffic super-resolution (MTSR) technique (below).

The generator component of our GAN upgrades a ResNet model (He et al., 2016) with a set of additional “skipconnections”, without introducing extra parameters, while allowing gradients to backpropagate faster through the model in the training phrase. To train our architecture, which we named ZipNet, we rely on a data processing and augmentation approach that crops the original city-wide mobile data traffic snapshots to smaller size windows and repeats this process with different offsets to generate extra data points from the original ones, thereby maximising the usage of data available for training and preventing model overfitting.

Experiments we conduct with a real-world mobile traffic dataset, published by a major European operator, demonstrate the feasibility of using deep learning to infer fine-grained mobile traffic distributions with up orders of magnitude higher granularity as compared to standard probing, irrespective of the coverage and the location of the probes. Importantly, our ZipNet(-GAN) models achieve much lower reconstruction errors and higher fidelity of reconstructed traffic than existing interpolation techniques.

Mobile traffic sampling and reconstruction

Another challenge is deciding where to collect measurements prior to performing any interpolation. Selecting sampling locations at random can overlook important spatiotemporal correlations specific to mobile traffic, leading to inaccurate reconstruction. Further, an important question to clarify is how many samples are needed to achieve high interpolation fidelity with the lowest possible measurement overhead.

Figure 2: The Spider framework: sparse mobile traffic snapshots used by a policy network to select optimal cells where to collect measurements; a dedicated reconstruction neural model outputs complete network traffic snapshots.

In our recent work, which was published in IEEE GLOBECOM’21, we put forward a deep learning-driven mobile traffic measurement collection and reconstruction framework, called Spider, which we illustrate in Figure 2. Our approach relies on a dedicated neural network that we train to selectively sample small subsets of target mobile coverage areas, so as to minimise overhead while acquiring enough data to ensure high-quality traffic consumption interpolations at all non-sampled cells. We take a Deep Reinforcement Learning (DRL) approach to produce examples for training this policy network. Given the large action space, our DRL agent learns in a tractable manner by sampling small subsets of the action space based on the most likely action and its nearest neighbors. The highest valued action from such subsets is then selected, circumventing the need to evaluate the entire action space.

For action evaluation and interpolation, we introduce a purpose-built mobile traffic reconstruction neural model (MTRNet), which builds on our earlier ZipNet design and similarly exploits spatiotemporal correlations within historical data. MTRNet outperforms existing methods in terms of accuracy (up to 67% lower Mean Absolute Error) and reconstruction speed (over 800× runtime reduction). By means of experiments with the same real mobile network dataset, we also showed that Spider requires on average 48% fewer samples to reconstruct complete traffic matrices, even when applied to previously unseen traffic patterns, such as those observed during holidays.

References

He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 770–778.

Lee, D., Zhou, S., Zhong, X., Niu, Z., Zhou, X., and Zhang, H. (2014). Spatial modeling of the traffic density in cellular networks. IEEE Wireless Communications, 21(1):80–88.

Wang, H., Xu, F., Li, Y., Zhang, P., and Jin, D. (2015). Understanding mobile traffic patterns of large scale cellular towers in urban environment. In Proc. ACM IMC, pages 225–238.

Paul Patras is an Associate Professor in the School of Informatics at the University of Edinburgh, and a co-founder and CEO of Net AI.

AIhub is supported by:

Forthcoming machine learning and AI seminars: July 2025 edition

Lucy Smith 30 Jun 2025

A list of free-to-attend AI-related seminars that are scheduled to take place between 1 July and 31 August 2025.

monthly digest

AIhub monthly digest: June 2025 – gearing up for RoboCup 2025, privacy-preserving models, and mitigating biases in LLMs

Lucy Smith 26 Jun 2025

Welcome to our monthly digest, where you can catch up with AI research, events and news from the month past.

RoboCupRescue: an interview with Adam Jacoff

Lucy Smith 25 Jun 2025

Find out what's new in the RoboCupRescue League this year.

Making optimal decisions without having all the cards in hand

Nathanaël Fijalkow, Hugo Gimbert, Florian Horn, Guillermo Perez and Pierre Vandenhove 24 Jun 2025

Read about research which won an outstanding paper award at AAAI 2025.

Exploring counterfactuals in continuous-action reinforcement learning

Shuyang Dong 20 Jun 2025

Shuyang Dong writes about her work that will be presented at IJCAI 2025.

What is vibe coding? A computer scientist explains what it means to have AI write computer code − and what risks that can entail

The Conversation 19 Jun 2025

Until recently, most computer code was written, at least originally, by human beings. But with the advent of GenAI, that has begun to change.

Gearing up for RoboCupJunior: Interview with Ana Patrícia Magalhães

Lucy Smith 18 Jun 2025

We hear from the organiser of RoboCupJunior 2025 and find out how the preparations are going for the event.

Interview with Mahammed Kamruzzaman: Understanding and mitigating biases in large language models

Lucy Smith 17 Jun 2025

Find out how Mahammed is investigating multiple facets of biases in LLMs.