Identification of hazardous areas for priority landmine clearance: AI for humanitarian mine action

by ML@CMU

19 November 2024

RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization

RELand is a holistic pipeline to identify priority hazard areas to support non-technical surveys in humanitarian demining operations. Theses initial surveys are currently carried out by human experts who evaluate the possible presence of landmines based on available information and that provided by the residents. Since landmines are not used randomly but under war logic, Machine Learning can potentially help with these surveys by analyzing historical events and their correlation to relevant features. However, identifying landmine contamination has been scarcely studied in the literature, and poses three main challenges: noisy labels, geographic dependence, and sparse predicted risk scores. We address the challenges of landmine risk estimation by enhancing existing datasets with rich relevant features, constructing a novel, robust, and interpretable ML model that outperforms standard and new baselines, and identifying cohesive hazard clusters under geographic and budgetary constraints. Finally, the results are delivered through a web application developed with key mine action stakeholders. The major components of RELand are illustrated in Fig. 2. Notably, our approach is the first public pipeline of its kind that can be easily adapted for use in demining workflows globally.

Figure 2. Integration of RELand system into the humanitarian demining pipeline. Current non-technical surveys (grey) are based on the visual inspection of data in geospatial information systems and human expert analyses including local community surveys and domain knowledge. RELand (yellow dashed box) serves as an additional toolbox that contains three major components: dataset enhancement based on existing public geospatial datasets (red), risk modeling with machine learning methods (blue), and interactive web interface (green).

The first component of the system, Dataset Enhancement, integrates different sources of information to construct a dataset for landmine presence with rich relevant features based on geographic information, socio-demographic variables, remnants of war indicators, and historical landmine events. We introduce several new features which prove useful to identify hazard areas and to rule out false alarms. We also argue how labels should be assigned to predict the results of humanitarian demining operations, rectifying the definition of labels used in previous literature.

For the Risk Modeling component, we designed a novel interpretable deep learning tabular model extending TabNet. We propose to minimize the Invariant Risk Minimization (IRM), which enables the model to be robust to distribution shifts and invariant to diverse deployment environments. Intuitively, we define an “easy” environment as one where landmines are found close to past events or grid cells with no historical landmines nearby have indeed negative labels. In contrast, a “hard” environment is one where despite there being some historical events there are no new landmines (and resources are going to be used inefficiently) or new landmines found far away from previous events (and likely missed by baseline methods leading to a latent risk to humans). Formally, let us denote an environment by $e = (X^e, Y^e)$ and let $w$ be a dummy scalar classifier. Then the IRM loss is composed by an ERM cross-entropy term that encourages prediction accuracy, and a regularization term that forces $f_\theta$ to be simultaneously optimal across all environments $E$ . Our landmine risk estimator $f_{\theta}(X)$ is penalized for applying the distance-existence rule in “easy” environments to “hard” ones, and therefore generalizes well on both environments.

$IRM(\theta) = \min_{\theta} \sum\limits_{e \in E} \ell_{\text{CE}}(f_\theta(X^e), Y^e) + \lambda \cdot ||\nabla_{w|w=1} \ \ell_{\text{CE}}(w( f_\theta(X^e)), Y^e)||^2$

However, our partner demining organizations quickly emphasized the need for interpretable models, as they must explain to communities why certain areas are prioritized for clearance or not. Therefore, as the first step towards the interpretation of landmine risk estimators, we utilize SparseMax layers to generate global feature importance for our model. SparseMax (SM) is an activation function that normalizes the input vector to sparse probabilities (like a LASSO regularization), and is shown at the top of Fig. 3. Finally, we leverage the sequential design in TabNet to form decision blocks that are summed together and passed into an aggregation FC layer as the final prediction. This sequential design resembles additive modeling in Gradient Boosting Machines and ResNet skip connection mechanism. Initial blocks capture the main correlation in the dataset, and the following blocks can use the rest of the features to learn the residuals to fit the function better. Our final architecthure is show in Figure 3.

*Figure 3. RELand architecture with interpretation branch that generates sparse feature masks on the top, and decision blocks at the bottom aggregated before the final FC layer.*

To validate the proposed system, we simulate different scenarios in which the RELand system could be deployed in mine clearance operations using real data from Colombia. We use a block cross-validation approach, where the hold-out set corresponds to all cells in a municipality, to account for the geographical nature of current demining operations. In addition, since false negatives represent a higher cost in terms of human lives, we use the Height and Reverse Height (rHeight) metrics of how well a ranking is generated, in the sense that positive cells should be ranked higher than negative cells. Intuitively, models with better predictions for top-ranked regions can speed-up land clearing operations. Given a predicted risk score, Height refers to the number of positive cells ranked below a negative cell, and rHeight is the number of negative cells ranked above a positive one. An ideal classifier minimizes both of these metrics and perfectly rank positive cells above negative cells. Formally,

$Height(X_n) = \sum\limits_{i = 1}^{P}\mathbb{1}(\widehat{f}(X_\text{p})_i \leq \widehat{f}(X_\text{n})),$

$rHeight(X_p) = \sum\limits_{j = 1}^{N}\mathbb{1}(\widehat{f}(X_\text{p}) \leq \widehat{f}(X_\text{n})_j)$

where $P$ and $N$ are the total counts of positive and negative labels, respectively, and $\widehat{f}(X_\text{p}$ ) $(\widehat{f}(X_\text{n}))$ is the predicted probability when the ground truth of $X_i$ $(X_j)$ is positive (negative).

Table 1 presents the result of the experimental validation comparing the proposed methodology with current practices, focusing mainly on historical landmine reports, and two previous ML models proposed in the literature. RELand consistently outperforms the benchmark models on all relevant metrics. Furthermore, Table 1 shows that the proposed method reduces the mean-rHeight by almost half compared to previous approaches. Intuitively, if we were to sequentially clear a region according to the generated risk score ranking, this metric tells us the average number of negative cells we would need to visit before the region is completely cleared. This measures how efficiently we could demine a geographic region of interest: RELand reduces the false alarms and the time required for landmine clearance by half.

Model	ROC (↑)	PR (↑)	mean-Height (↓)	mean-rHeight (↓)
LR-single (current)	86.35 (11.54)	17.07 (10.76)	3.06 (3.19)	226.79 (211.23)
LR-geo (2019, 2016)	67.62 (18.58)	5.37 (8.00)	8.09 (6.93)	573.36 (440.71)
SVM-geo (2019)	48.61 (18.09)	1.73 (1.82)	15.26 (15.66)	821.26 (729.12)
RELand (ours)	92.90 (4.43)	29.03 (22.11)	2.17 (2.48)	132.03 (133.50)

Table1. Validation results in Colombia. Each entry is the mean (std) performance on validation folds following the block cross-validation rule. RELand is our interpretable IRM model. Full experimental results and ablation studies are available in our paper.

Hazard Cluster Identification as a Quadratic Knapsack Problem

Building a reliable prediction model to estimate landmine contamination risk is a crucial first step in data-driven prioritization of land clearance operations. However, integrating the risk maps generated by machine learning models into demining workflows requires considering the additional geographical and budgetary constraints that mine action organizations face in their ground operations. For instance, demining organizations often operate under limited budgets, allowing them to clear only a fraction of the total area under study while also covering the costs associated with mobilizing equipment and teams across the region (e.g., metal detectors, sniffing dogs, and human deminers). Moreover, if multiple regions are to be demined, there must be a secure path connecting these regions to ensure the safe movement of such demining teams. Humanitarian demining organizations need to maximize the land released back to local communities while navigating these challenges.

We propose to find which cells to prioritize for mine clearance by using a Quadratic Knapsack Problem (QKP), whose optimal solution naturally results in the identification of cohesive hazard clusters due to rewarding the program for prioritizing nearby grid cells. Formally, we use the risk scores $r_i$ estimated by our trained deep learning model to compute proxies for the benefit of demining candidate grid cell $i$ with centroid $(x_i,y_i)$ . Then, define the reward matrix $U$ that captures the (additional) benefit of prioritizing both grid cells $i$ and $j$ as

$u_{ij} = \sqrt{r_i r_j}\exp\left(-\lambda ||s_i - s_j||_{h}\right),$

where $||\cdot||_{h}$ is the standard Haversine distance, and $\lambda$ controls for the exponential decay of the spatial distance between two locations $s_i = (x_i, y_i)$ and $s_j = (x_j, y_j)$ . For example, selecting a grid cell $i$ for mine clearance results in a direct benefit of $u_{ii} = r_i$ . Note that, in our formulation, riskier cells yield greater rewards. This results in the following binary QKP with variables $z_i \in \{0,1\}$ , for $i\in [n]$ , which indicate if a grid cell $i$ is selected for demining. Then, the total reward is given by $z^{T}Uz$ , which is maximized subject to a given budget $C \in \mathbb{R}_{+}$ and demining costs $w_i$ :

$\max_{z \in \mathbb{R}^n} ~ z^{T}Uz$

$s.t. \quad \sum_{i=1}^n w_i z_i \leq C, \quad z_i \in \{0, 1\} \quad \forall i \in [n].$

Our approach rewards for geographic cohesion, ultimately finding more useful hazard clusters than a greedy solution that prioritizes the $C$ grid cells with the largest estimated risk scores (Fig. 4). Moreover, our approach also incorporates realistic budget constraints, unlike standard spatial statistical approaches for geographic clustering such as Moran Local I and LISA.

Figure 4. Hazardous areas identified by RELand in our field test in Colombia. (a) Estimated risk scores from our trained DL model , (b) greedy risk clusters subject to budget constraints, and (c) QKP cohesive risk clusters with geographic pairwise interactions. Three landmines (panel (c), in white) have been found so far in one of the prioritized areas.

Tangible Impact of RELand

We are currently conducting a field study in Colombia, in partnership with the United Nations Mine Action Service and the Colombian Campaign to Ban Landmines, in two municipalities recently selected for humanitarian demining that have not been previously surveyed. We applied RELand to these regions to (i) build the enhanced dataset with rich geographic features, (ii) generate landmine contamination risk estimates by using the trained DL model, and (iii) use the predicted risk scores to identify priority hazard clusters with the QKP formulation. We worked together with the field teams of our partner NGO in Colombia to validate the hazard clusters identified by the system and to create an initial demining plan in the assigned regions. Crucially, the proposed methodology (Fig. 4c) identifies useful cohesive hazard clusters under realistic budgetary constraints. These hazard regions are more useful for demining prioritization than the sparse raw risk scores (Fig. 4a) and the greedy risk clusters (Fig. 4b), which lead to excessive mobilization of demining teams and equipment. Overall, the risk maps generated are in line with what is expected by human experts in humanitarian demining in Colombia. To date, three landmines have been found in one priority area, saving human lives. Moreover, in collaboration with UNOPS and MAPA, we have tailored and deployed the system in Afghanistan, identifying 81 hazardous areas for prioritized demining interventions, positively impacting over 4 million people across the country.

We expect to have the full results of our demining field tests within 6 months to provide a real-world validation of RELand’s capabilities in ground operations. Based on the initial positive feedback, we believe the system can support critical parts of the initial planning of humanitarian mine action, making demining operations more efficient and safer. We are actively working with UNMAS, UNOPS, and local NGOs to refine the system in its three components and prepare it for deployment in war-torn territories globally.

Aknowledgments

RELand was developed in collaboration with Cindy Zeng (UIUC), Anna Wang (CMU), Didier Alvarado (UNMAS Colombia), Francisco Moreno (CCBL), Hoda Heidari (CMU), and Fei Fang (CMU). Special thanks to UNOPS and MAPA for their partnership in our Afghanistan field tests. All errors remain mine.

References

Dulce Rubio, M., Zeng, S., Wang, Q., Alvarado, D., Moreno Rivera, F., Heidari, H., & Fang, F. (2024). RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization. ACM Journal on Computing and Sustainable Societies, 2(2), pp. 1-29. https://doi.org/10.1145/3648437.
Dulce Rubio, M. (2024). Identification of Hazard Clusters for Priority Landmine Clearance as a Quadratic Knapsack Problem. Doing Good with Good OR Competition, INFORMS Annual Meeting.
Collins, R., Fragniere, L., & Dulce Rubio, M. (2024). Advancements In Mine Action: Enhancing Remote Reporting And Analysis Through Innovative Technologies. The Journal of Conventional Weapons Destruction, 28(3), 7.

This article was initially published on the ML@CMU blog and appears here with the author’s permission.