Water and Sediment Analyse Using Predictive Models
Xiaoting Xu, Tin Lai, Sayka Jahan, Farnaz Farid

TL;DR
This paper presents a machine learning framework for assessing water quality and pollution levels using water and sediment samples, addressing data sparsity issues with imputation techniques, achieving 75% accuracy with incomplete data.
Contribution
It introduces an automated predictive modeling approach for water quality assessment that handles incomplete datasets, improving pollution monitoring efficiency.
Findings
Achieved 75% accuracy with 57% missing data
Evaluated various data imputation methods for water datasets
Demonstrated model's usefulness with real-world incomplete data
Abstract
The increasing prevalence of marine pollution during the past few decades motivated recent research to help ease the situation. Typical water quality assessment requires continuous monitoring of water and sediments at remote locations with labour intensive laboratory tests to determine the degree of pollution. We propose an automated framework where we formalise a predictive model using Machine Learning to infer the water quality and level of pollution using collected water and sediments samples. One commonly encountered difficulty performing statistical analysis with water and sediment is the limited amount of data samples and incomplete dataset due to the sparsity of sample collection location. To this end, we performed extensive investigation on various data imputation methods' performance in water and sediment datasets with various data missing rates. Empirically, we show that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWater Quality Monitoring Technologies · Hydrological Forecasting Using AI · Water Quality and Pollution Assessment
