PhD: Detecting hotspots of water pollution in complex constrained domains and networks

University of Glasgow

Glasgow, UK 🇬🇧

Project institution: University of Glasgow

Project supervisor(s): Dr Mu Niu (University of Glasgow), Dr Craig Wilkie (University of Glasgow), Prof Cathy Yi-Hsuan Chen (University of Glasgow) and Dr Michael Tso (Lancaster University)

Overview and Background

Technological developments with smart sensors are changing the way that the environment is monitored.  Many such smart systems are under development, with small, energy efficient, mobile sensors being trialled.  Such systems offer opportunities to change how we monitor the environment, but this requires additional statistical development in the optimisation of the location of the sensors.

The aim of this project is to develop a mathematical and computational inferential framework to identify optimal sensor deployment locations within complex, constrained domains and networks for improved water contamination detection. Methods for estimating covariance functions in such domains rely on computationally intensive diffusion process simulations, limiting their application to relatively simple domains and small-scale datasets. To address this challenge, the project will employ accelerated computing paradigms with highly parallelized GPUs to enhance simulation efficiency. The framework will also address regression, classification, and optimization problems on latent manifolds embedded in high-dimensional spaces, such as image clouds (e.g., remote sensing satellite images), which are crucial for sensor deployment and performance evaluation. As the project progresses, particularly in the image cloud case, the computational demands will intensify, requiring advanced GPU resources or exascale computing to ensure scalability, efficiency, and performance.

Methodology and Objectives

The idea of on-site sensors to detect water contaminants has a rich history.  Since water flows at finite speeds, placing sensors strategically reduces time until detection. The mathematical analysis is often made difficult by the need to model the nonlinear dynamical systems of hydraulics within a non-Euclidean space such as constrained domains (lake or river, Wood et al., 2008) or networks (pipe network, Oluwaseye, et al., 2018). It requires solving large nonlinear systems of differential equations in the complex domain and is difficult to apply to even moderate-sized problems.

This proposed PhD project aims to develop new methods to improve environmental sampling, enabling improved estimation of water pollution and associated uncertainty that appropriately accounts for the geometry and topology of the water body.

Methods Used:

Intrinsic Bayesian Optimization (BO) on complex constrained domains and networks allows the use of the prediction and uncertainty quantification of intrinsic Gaussian processes (GPs) (Niu et al., 2019, 2023) to direct the search of the water pollution. Once new detection is observed, the search for a hotspot can be sequentially updated.

The key ingredients of BO are the Gaussian processes (GPs) prior that captures beliefs about the behaviour of the unknown black-box function in the complex domains. The student will develop intrinsic BO on non-Euclidean spaces such as complex constrained domains and networks with the state-of-the-art GPs on manifolds and GPs on graphs. Extending the idea of estimating covariance functions on manifolds, the project aims to estimate the heat kernel of the point cloud, allowing the incorporation of the intrinsic geometry of the data, and a potentially complex interior structure.

The application areas are water quality in lakes with complex domains (such as the Aral Sea) and pollution sources in a city’s sewage network. The methods would have the potential to inform about emergent water pollution events like algal blooms, providing an early warning system, and help to identify pollution sources.

Teaser Project 1:

This first teaser project will introduce the student to parallel computing using GPUs, with a focus on applying intrinsic Gaussian Processes to water quality data for efficient GPU implementation. The project will lay the foundation for the PhD by enabling the understanding of complex water quality patterns in non-Euclidean spaces, including continuous domains with complex boundaries and network domains, while developing efficient stochastic diffusion process simulations leveraging GPU power. The student will begin by applying existing methods to datasets ranging from small to large scales. This work has the potential to evolve into a full PhD project centred on the development of computationally intensive methods for modelling water quality and detecting contamination hotspots over complex domains. The use of GPU parallelization will enable scalable modelling across large spatial areas and handle the high data volumes characteristic of high-resolution water quality datasets.

Teaser Project 2:

In the second teaser project, the student will expand their work to the spatio-temporal (or manifold-temporal) setting, incorporating both complex spatial and temporal structures to fully explain the changing nature of the water quality patterns. Again, this teaser project will use involve applying existing methods to small-scale datasets. Due to the high computational complexity of spatio-temporal models, this project has the potential to evolve into a PhD with a focus on developing highly computationally efficient methods, with a focus on parallelisation on GPUs.

The student will benefit from the extensive expertise of the supervisory team. Dr Niu specializes in statistical inference in Non-Euclidean spaces, with application in ecology and environmental science.

Dr Wilkie has a background in developing spatiotemporal data fusion approaches for environmental data, focussing on satellite and in-lake water quality data. Prof Chen specializes in network modeling, statistical inference, data science, machine learning and economics. Dr Tso is an environmental data scientist with strong computational background and a portfolio of work on water quality monitoring, including adaptive sampling.

References and Further Reading

  1. Niu, et al., (2019), “Intrinsic Gaussian processes on complex constrained domain”, J. Roy Statist. Soc. Series B, Volume 81, Issue 3
  2. Niu, et al., (2023): Intrinsic Gaussian processes on unknown manifold with probabilistic geometry, Journal of Machine Learning Research; 24 (104)
  3. Oluwaseye, et al.,(2018) A state-of-the-art review of an optimal sensor placement for contaminant warning system in a water distribution network, Urban Water Journal, 15:10, 985–1000
  4. Giudicianni et al., (2020). Topological Placement of Quality Sensors in Water-Distribution Networks without the Recourse to Hydraulic Modeling. Journal of Water Resources Planning and Management, 146(6)
  5. Wood, S. N., Bravington, M. V. and Hedley, S. L. (2008) Soap film smoothing. J. Royal Stat. Soc. Series B, 70, 931–955

POSITION TYPE

ORGANIZATION TYPE

EXPERIENCE-LEVEL

DEGREE REQUIRED

You ad could be here!