Soil organic carbon content mapping along the coast of northern Peru: an ensemble machine learning approach
| dc.contributor.author | Salazar Coronel, Wilian | |
| dc.contributor.author | Carbajal Llosa, Carlos Miguel | |
| dc.contributor.author | Chuchon Remon, Rodolfo Juan | |
| dc.date.accessioned | 2026-04-07T17:18:32Z | |
| dc.date.available | 2026-04-07T17:18:32Z | |
| dc.date.issued | 2026-03-26 | |
| dc.description.abstract | Introduction: Soil organic carbon (SOC) content plays a fundamental role in regulating the global carbon cycle and mitigating climate change. It is also a key marker of soil health and a vital plant component. Its distribution in space varies in dry ecosystems, where climate and land use affect it. This study aimed to estimate and map SOC in the Motupe River Basin, northern Peru, by applying machine learning algorithms and ensemble methods. Methods: Four predictive models were evaluated: Support Vector Regression (SVR), Random Forest (RF), Artificial Neural Network (ANN), and Extreme Gradient Boosting (XGBoost), together with two ensemble approaches—simple averaging and weighted — integrating topographic, climatic, edaphic, and vegetation indices variables. Spatial autocorrelation was minimized by spatial block cross-validation. Uncertainty was measured with bootstrapping and the Prediction Interval Ratio (PIR) derived from 90% prediction intervals. Results and discussion: Best performance was achieved by XGBoost (R² = 0.83), weighted ensemble (R² = 0.70), and RF (R² = 0.63). The most influential predictors were EVI, GNDVI, temperature, TRI, and pH. SOC contents showed relatively higher concentrations (>0.7%) in areas with greater vegetation density, within a semi-arid context where SOC levels are generally low. In contrast, lower areas exhibited reduced SOC contents (< 0.6%). The uncertainty analysis indicated that SOC predictions had high to moderate confidence (PIR < 0.2) in the middle-and upper zones of the basin, and moderate confidence (0.1–0.2) in the lower areas. The results suggest that machine learning and ensemble methods improve SOC prediction, benefiting the sustainable management of soil fertility and quality in arid and semi-arid ecosystems of northern Peru. | |
| dc.description.sponsorship | Funding: The author(s) declared that financial support was received for this work and/or its publication. This research was funded by the CUI 2487112 INIA project “Mejoramiento de los servicios de investigación y transferencia tecnológica en el manejo y recuperación de suelos Salazar-Coronel et al. 10.3389/fsoil.2026.1745154 Frontiers in Soil Science 15 frontiersin.org agrıcolas degradados y aguas para riego en la pequeña y mediana ́ agricultura en los departamentos de Lima, Á ncash, San Martın, ́ Cajamarca, Lambayeque, Junın, Ayacucho, Arequipa, Puno y Ucayali. Acknowledgments: The authors would like to acknowledge the support of Eng. Ivan Vilchez, Eng. Issac Castro, and Bach. Johan Rivas for their contribution during the soil sampling process. The authors also express their gratitude to the LABSAF staff at the Vista Florida Agricultural Experimental Station (INIA) and LABSAF Lima for their support in the analysis of the soil samples. | |
| dc.format | application/pdf | |
| dc.identifier.citation | Salazar-Coronel, W., Carbajal-Llosa, C., & Chuchon-Remon, R. (2026). Soil organic carbon content mapping along the coast of northern Peru: an ensemble machine learning approach. Frontiers in Soil Science, 6, 1745154. https://doi.org/10.3389/fsoil.2026.1745154 | |
| dc.identifier.doi | http//doi.org/10.3389/fsoil.2026.1745154 | |
| dc.identifier.issn | 2673-8619 | |
| dc.identifier.uri | http://hdl.handle.net/20.500.12955/3082 | |
| dc.language.iso | eng | |
| dc.publisher | Frontiers Media SA | |
| dc.publisher.country | CH | |
| dc.relation.ispartof | urn:issn:2673-8619 | |
| dc.relation.ispartofseries | Frontiers in Soil Science | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.source | Instituto Nacional de Innovación Agraria | |
| dc.source.uri | Repositorio Institucional - INIA | |
| dc.subject | Machine learning | |
| dc.subject | Aprendizaje automático | |
| dc.subject | Soil organic carbon | |
| dc.subject | Carbono orgánico del suelo | |
| dc.subject | Topographic indices | |
| dc.subject | Indices topográficos | |
| dc.subject | Vegetation indices | |
| dc.subject | Indices de vegetación | |
| dc.subject | Digital soil mapping | |
| dc.subject | Cartografía digital del suelo | |
| dc.subject | Ensemble modeling | |
| dc.subject | Modelado ensemble | |
| dc.subject.agrovoc | Fertilidad del suelo; Soil fertility; Zona árida; Arid zones; Cuencas hidrográficas; Watersheds | |
| dc.subject.ocde | https://purl.org/pe-repo/ocde/ford#4.01.04 | |
| dc.title | Soil organic carbon content mapping along the coast of northern Peru: an ensemble machine learning approach | |
| dc.type | info:eu-repo/semantics/article |
Archivos
Bloque original
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- Salazar-Coronel_et-al_2026_soil-organic-carbon_machine-learning_northern-Peru.pdf
- Tamaño:
- 6.38 MB
- Formato:
- Adobe Portable Document Format
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 1.75 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción:
