River Water Salinity Prediction Using Hybrid Machine Learning Models

dc.contributor.authorMelesse, Assefa M.
dc.contributor.authorKhosravi, Khabat
dc.contributor.authorTiefenbacher, John
dc.contributor.authorHeddam, Salim
dc.contributor.authorKim, Sungwon
dc.contributor.authorMosavi, Amir
dc.contributor.authorPham, Binh Thai
dc.date.accessioned2021-07-26T17:53:32Z
dc.date.available2021-07-26T17:53:32Z
dc.date.issued2020-10-21
dc.description.abstractElectrical conductivity (EC), one of the most widely used indices for water quality assessment, has been applied to predict the salinity of the Babol-Rood River, the greatest source of irrigation water in northern Iran. This study uses two individual—M5 Prime (M5P) and random forest (RF)—and eight novel hybrid algorithms—bagging-M5P, bagging-RF, random subspace (RS)-M5P, RS-RF, random committee (RC)-M5P, RC-RF, additive regression (AR)-M5P, and AR-RF—to predict EC. Thirty-six years of observations collected by the Mazandaran Regional Water Authority were randomly divided into two sets: 70% from the period 1980 to 2008 was used as model-training data and 30% from 2009 to 2016 was used as testing data to validate the models. Several water quality variables—pH, HCO3‾, CI‾, SO₄²⁻, Na⁺, Mg²⁺, Ca²⁺, river discharge (Q), and total dissolved solids (TDS)—were modeling inputs. Using EC and the correlation coefficients (CC) of the water quality variables, a set of nine input combinations were established. TDS, the most effective input variable, had the highest EC-CC (r = 0.91), and it was also determined to be the most important input variable among the input combinations. All models were trained and each model’s prediction power was evaluated with the testing data. Several quantitative criteria and visual comparisons were used to evaluate modeling capabilities. Results indicate that, in most cases, hybrid algorithms enhance individual algorithms’ predictive powers. The AR algorithm enhanced both M5P and RF predictions better than bagging, RS, and RC. M5P performed better than RF. Further, AR-M5P outperformed all other algorithms (R² = 0.995, RMSE = 8.90 μs/cm, MAE = 6.20 μs/cm, NSE = 0.994 and PBIAS = -0.042). The hybridization of machine learning methods has significantly improved model performance to capture maximum salinity values, which is essential in water resource management.
dc.description.departmentGeography and Environmental Studies
dc.formatText
dc.format.extent21 pages
dc.format.medium1 file (.pdf)
dc.format.medium21 pages
dc.identifier.citationMelesse, A. M., Khosravi, K., Tiefenbacher, J. P., Heddam, S., Kim, S., Mosavi, A., & Pham, B. T. (2020). River Water Salinity Prediction Using Hybrid Machine Learning Models. Water, 12(10), 2951.
dc.identifier.doihttps://doi.org/10.3390/w12102951
dc.identifier.issn2073-4441
dc.identifier.urihttps://hdl.handle.net/10877/14079
dc.language.isoen
dc.publisherMultidisciplinary Digital Publishing Institute
dc.rights.holder© 2020 The Authors.
dc.rights.licenseThis work is licensed under a Creative Commons Attribution 4.0 International License.
dc.sourceWater, 2020, Vol. 12, No. 10, Article 2951.
dc.subjectwater salinity
dc.subjectmachine learning
dc.subjectbagging
dc.subjectrandom forest
dc.subjectrandom subspace
dc.subjectdata science
dc.subjecthydrological model
dc.subjecthydroinformatics
dc.subjectelectrical conductivity
dc.subjectGeography and Environmental Studies
dc.titleRiver Water Salinity Prediction Using Hybrid Machine Learning Models
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
water-12-02951-v2.pdf
Size:
5.36 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.54 KB
Format:
Item-specific license agreed upon to submission
Description: