Simulating soil salinity dynamics, cotton yield and evapotranspiration under drip irrigation by ensemble machine learning

Zewei Jiang, Shihong Yang, Shide Dong, Qingqing Pang, Pete Smith, Mohamed Abdalla, Jie Zhang, Guangmei Wang, Ying Xu

Research output: Contribution to journalArticlepeer-review


Cotton is widely used in textile, decoration, and industry, but it is also threatened by soil salinization. Drip irrigation plays an important role in improving water and fertilization utilization efficiency and ensuring crop production in arid areas. Accurate prediction of soil salinity and crop evapotranspiration under drip irrigation is essential to guide water management practices in arid and saline areas. However, traditional hydrological models such as Hydrus require more variety of input parameters and user expertise, which limits its application in practice, and machine learning (ML) provides a potential alternative. Based on a global dataset collected from 134 pieces of literature, we proposed a method to comprehensively simulate soil salinity, evapotranspiration (ET) and cotton yield. Results showed that it was recommended to predict soil salinity, crop evapotranspiration and cotton yield based on soil data (bulk density), meteorological factors, irrigation data and other data. Among them, meteorological factors include annual average temperature, total precipitation, year. Irrigation data include salinity in irrigation water, soil matric potential and irrigation water volume, while other data include soil depth, distance from dripper, days after sowing (for EC and soil salinity), fertilization rate (for yield and ET). The accuracy of the model has reached a satisfactory level, R2 in 0.78-0.99. The performance of stacking ensemble ML was better than that of a single model, i.e., gradient boosting decision tree (GBDT); random forest (RF); extreme gradient boosting regression (XGBR), with R2 increased by 0.02%-19.31%. In all input combinations, other data have a greater impact on the model accuracy, while the RMSE of the S1 scenario (input without meteorological factors) without meteorological data has little difference, which is -34.22%~19.20% higher than that of full input. Given the wide application of drip irrigation in cotton, we recommend the application of ensemble ML to predict soil salinity and crop evapotranspiration, thus serving as the basis for adjusting the irrigation schedule.
Original languageEnglish
JournalFrontiers in plant science
Publication statusAccepted/In press - 17 May 2023


Dive into the research topics of 'Simulating soil salinity dynamics, cotton yield and evapotranspiration under drip irrigation by ensemble machine learning'. Together they form a unique fingerprint.

Cite this