Soil organic carbon content mapping along the coast of northern Peru: an ensemble machine learning approach

dc.contributor.authorSalazar Coronel, Wilian
dc.contributor.authorCarbajal Llosa, Carlos Miguel
dc.contributor.authorChuchon Remon, Rodolfo Juan
dc.date.accessioned2026-04-07T17:18:32Z
dc.date.available2026-04-07T17:18:32Z
dc.date.issued2026-03-26
dc.description.abstractIntroduction: Soil organic carbon (SOC) content plays a fundamental role in regulating the global carbon cycle and mitigating climate change. It is also a key marker of soil health and a vital plant component. Its distribution in space varies in dry ecosystems, where climate and land use affect it. This study aimed to estimate and map SOC in the Motupe River Basin, northern Peru, by applying machine learning algorithms and ensemble methods. Methods: Four predictive models were evaluated: Support Vector Regression (SVR), Random Forest (RF), Artificial Neural Network (ANN), and Extreme Gradient Boosting (XGBoost), together with two ensemble approaches—simple averaging and weighted — integrating topographic, climatic, edaphic, and vegetation indices variables. Spatial autocorrelation was minimized by spatial block cross-validation. Uncertainty was measured with bootstrapping and the Prediction Interval Ratio (PIR) derived from 90% prediction intervals. Results and discussion: Best performance was achieved by XGBoost (R² = 0.83), weighted ensemble (R² = 0.70), and RF (R² = 0.63). The most influential predictors were EVI, GNDVI, temperature, TRI, and pH. SOC contents showed relatively higher concentrations (>0.7%) in areas with greater vegetation density, within a semi-arid context where SOC levels are generally low. In contrast, lower areas exhibited reduced SOC contents (< 0.6%). The uncertainty analysis indicated that SOC predictions had high to moderate confidence (PIR < 0.2) in the middle-and upper zones of the basin, and moderate confidence (0.1–0.2) in the lower areas. The results suggest that machine learning and ensemble methods improve SOC prediction, benefiting the sustainable management of soil fertility and quality in arid and semi-arid ecosystems of northern Peru.
dc.description.sponsorshipFunding: The author(s) declared that financial support was received for this work and/or its publication. This research was funded by the CUI 2487112 INIA project “Mejoramiento de los servicios de investigación y transferencia tecnológica en el manejo y recuperación de suelos Salazar-Coronel et al. 10.3389/fsoil.2026.1745154 Frontiers in Soil Science 15 frontiersin.org agrıcolas degradados y aguas para riego en la pequeña y mediana ́ agricultura en los departamentos de Lima, Á ncash, San Martın, ́ Cajamarca, Lambayeque, Junın, Ayacucho, Arequipa, Puno y Ucayali. Acknowledgments: The authors would like to acknowledge the support of Eng. Ivan Vilchez, Eng. Issac Castro, and Bach. Johan Rivas for their contribution during the soil sampling process. The authors also express their gratitude to the LABSAF staff at the Vista Florida Agricultural Experimental Station (INIA) and LABSAF Lima for their support in the analysis of the soil samples.
dc.formatapplication/pdf
dc.identifier.citationSalazar-Coronel, W., Carbajal-Llosa, C., & Chuchon-Remon, R. (2026). Soil organic carbon content mapping along the coast of northern Peru: an ensemble machine learning approach. Frontiers in Soil Science, 6, 1745154. https://doi.org/10.3389/fsoil.2026.1745154
dc.identifier.doihttp//doi.org/10.3389/fsoil.2026.1745154
dc.identifier.issn2673-8619
dc.identifier.urihttp://hdl.handle.net/20.500.12955/3082
dc.language.isoeng
dc.publisherFrontiers Media SA
dc.publisher.countryCH
dc.relation.ispartofurn:issn:2673-8619
dc.relation.ispartofseriesFrontiers in Soil Science
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourceInstituto Nacional de Innovación Agraria
dc.source.uriRepositorio Institucional - INIA
dc.subjectMachine learning
dc.subjectAprendizaje automático
dc.subjectSoil organic carbon
dc.subjectCarbono orgánico del suelo
dc.subjectTopographic indices
dc.subjectIndices topográficos
dc.subjectVegetation indices
dc.subjectIndices de vegetación
dc.subjectDigital soil mapping
dc.subjectCartografía digital del suelo
dc.subjectEnsemble modeling
dc.subjectModelado ensemble
dc.subject.agrovocFertilidad del suelo; Soil fertility; Zona árida; Arid zones; Cuencas hidrográficas; Watersheds
dc.subject.ocdehttps://purl.org/pe-repo/ocde/ford#4.01.04
dc.titleSoil organic carbon content mapping along the coast of northern Peru: an ensemble machine learning approach
dc.typeinfo:eu-repo/semantics/article

Archivos

Bloque original

Mostrando 1 - 1 de 1
No hay miniatura disponible
Nombre:
Salazar-Coronel_et-al_2026_soil-organic-carbon_machine-learning_northern-Peru.pdf
Tamaño:
6.38 MB
Formato:
Adobe Portable Document Format

Bloque de licencias

Mostrando 1 - 1 de 1
No hay miniatura disponible
Nombre:
license.txt
Tamaño:
1.75 KB
Formato:
Item-specific license agreed upon to submission
Descripción:

Sede Central: Av. La Molina 1981 - La Molina. Lima. Perú - 15024

Central telefónica (511) 240-2100 / 240-2350

FacebookLa ReferenciaEurocris
Correo: repositorio@inia.gob.pe

© Instituto Nacional de Innovación Agraria - INIA