Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform

ABSTRACT : Nitrogen is the most important nutritional element during the vegetative growth phase of the pineapple crop; however, its presence in the soil is insufficient to meet plant demands. In this doctoral research, nine machine learning techniques were validated to estimate total nitrogen (TN)...

Full description

Autores:
Chaparro Mesa, Jorge Enrique
Tipo de recurso:
Doctoral thesis
Fecha de publicación:
2024
Institución:
Universidad de Antioquia
Repositorio:
Repositorio UdeA
Idioma:
eng
OAI Identifier:
oai:bibliotecadigital.udea.edu.co:10495/44223
Acceso en línea:
https://hdl.handle.net/10495/44223
Palabra clave:
Técnicas de predicción
Forecasting techniques
Procesamiento de imágenes
Image processing
Internet de las cosas
Internet of things (IoT)
Multispectral Imaging
Unmanned Aerial Vehicle (UAV)
Sensors in the crop
http://aims.fao.org/aos/agrovoc/c_e4315b22
Rights
openAccess
License
https://creativecommons.org/licenses/by-nc-sa/4.0/
id UDEA2_eb0d87b14d26f5fbf0615c49ec16fce2
oai_identifier_str oai:bibliotecadigital.udea.edu.co:10495/44223
network_acronym_str UDEA2
network_name_str Repositorio UdeA
repository_id_str
dc.title.spa.fl_str_mv Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
title Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
spellingShingle Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
Técnicas de predicción
Forecasting techniques
Procesamiento de imágenes
Image processing
Internet de las cosas
Internet of things (IoT)
Multispectral Imaging
Unmanned Aerial Vehicle (UAV)
Sensors in the crop
http://aims.fao.org/aos/agrovoc/c_e4315b22
title_short Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
title_full Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
title_fullStr Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
title_full_unstemmed Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
title_sort Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platform
dc.creator.fl_str_mv Chaparro Mesa, Jorge Enrique
dc.contributor.advisor.none.fl_str_mv Aedo Cobo, José Edison
dc.contributor.author.none.fl_str_mv Chaparro Mesa, Jorge Enrique
dc.contributor.researchgroup.spa.fl_str_mv Sistemas Embebidos e Inteligencia Computacional (SISTEMIC)
dc.subject.lemb.none.fl_str_mv Técnicas de predicción
Forecasting techniques
Procesamiento de imágenes
Image processing
topic Técnicas de predicción
Forecasting techniques
Procesamiento de imágenes
Image processing
Internet de las cosas
Internet of things (IoT)
Multispectral Imaging
Unmanned Aerial Vehicle (UAV)
Sensors in the crop
http://aims.fao.org/aos/agrovoc/c_e4315b22
dc.subject.agrovoc.none.fl_str_mv Internet de las cosas
Internet of things (IoT)
dc.subject.proposal.spa.fl_str_mv Multispectral Imaging
Unmanned Aerial Vehicle (UAV)
Sensors in the crop
dc.subject.agrovocuri.none.fl_str_mv http://aims.fao.org/aos/agrovoc/c_e4315b22
description ABSTRACT : Nitrogen is the most important nutritional element during the vegetative growth phase of the pineapple crop; however, its presence in the soil is insufficient to meet plant demands. In this doctoral research, nine machine learning techniques were validated to estimate total nitrogen (TN) content in MD2 pineapple crops from data from multiple sources. These sources included multispectral images captured by an unmanned aerial vehicle (UAV) and in situ sensors that collected information on ecological and environmental factors, such as pH, temperature, solar radiation, relative humidity, soil moisture, and wind speed and direction. In addition, plant information was collected related to SPAD values, which indicate leaf chlorophyll content, and total nitrogen (TN) values, obtained from leaf tissue samples sent to a certified laboratory for analysis. To introduce nitrogen variability, a randomized complete block experimental design was implemented, applying five different treatments in five blocks, each with 12 replications, during a 6-month period in a pineapple crop located in the municipality of Tauramena, Casanare, Colombia. To address the inherent variability of the agricultural and environmental data, dimensionality was reduced using Principal Component Analysis (PCA). Regularization techniques were also applied, including cross-validation, feature selection, boost methods, L1 (Lasso) and L2 (Ridge) regularization, as well as hyperparameter optimization. These strategies generated more robust and accurate models, among which regression, multilayer perceptron (MLP regressor) and extreme gradient boosting (XGBoost) algorithms stood out. On the first sampling date, XGBoost achieved an R^2 of 86.98\%, which was the highest during the entire experiment. On subsequent dates, MLP achieved an R^2 of 59.11\% on the second date; XGBoost achieved an R^2 of 68.00\% on the third date, and on the last date, MLP achieved an R^2 of 69.4\%. These results indicate that the integration of data from multiple sources and the use of machine learning models enable nitrogen (N) diagnostics in pineapple crops, especially in real-time applications. These results highlight the promising potential of developing machine learning models that integrate multisensor data fusion for various applications in agriculture. In the implementation of the machine learning models, the total nitrogen content obtained in the laboratory was considered as the response variable. The predictor variables comprised sensor data, SPAD values, and statistical information derived from 16 vegetation indices calculated from the multispectral images. To reduce the dimensionality of the predictor variable dataset, Principal Component Analysis (PCA) was applied. Following this dimensionality reduction, nine regression algorithms were used to estimate leaf nitrogen content during each of the four study periods. This comprehensive approach yielded close estimates of leaf nitrogen content. The results of the study indicated that the MLP (Multilayer Perceptron) and XGB (XGBoost) regression algorithms stood out for their superior performance, evidenced by the best performance metrics.
publishDate 2024
dc.date.issued.none.fl_str_mv 2024
dc.date.accessioned.none.fl_str_mv 2025-01-16T15:51:19Z
dc.date.available.none.fl_str_mv 2025-01-16T15:51:19Z
dc.type.spa.fl_str_mv Tesis/Trabajo de grado - Monografía - Doctorado
dc.type.coar.spa.fl_str_mv http://purl.org/coar/resource_type/c_db06
dc.type.redcol.spa.fl_str_mv https://purl.org/redcol/resource_type/TD
dc.type.coarversion.spa.fl_str_mv http://purl.org/coar/version/c_b1a7d7d4d402bcce
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/doctoralThesis
dc.type.version.spa.fl_str_mv info:eu-repo/semantics/draft
format http://purl.org/coar/resource_type/c_db06
status_str draft
dc.identifier.uri.none.fl_str_mv https://hdl.handle.net/10495/44223
url https://hdl.handle.net/10495/44223
dc.language.iso.spa.fl_str_mv eng
language eng
dc.rights.uri.spa.fl_str_mv https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.uri.*.fl_str_mv http://creativecommons.org/publicdomain/zero/1.0/
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/4.0/
http://creativecommons.org/publicdomain/zero/1.0/
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.extent.spa.fl_str_mv 143 páginas
dc.format.mimetype.spa.fl_str_mv application/pdf
dc.publisher.spa.fl_str_mv Universidad de Antioquia
dc.publisher.place.spa.fl_str_mv Medellín, Colombia
dc.publisher.faculty.spa.fl_str_mv Facultad de Ingeniería. Ingeniería Electrónica
institution Universidad de Antioquia
bitstream.url.fl_str_mv https://bibliotecadigital.udea.edu.co/bitstreams/326d502d-d447-4642-9b03-351937e3c3c3/download
https://bibliotecadigital.udea.edu.co/bitstreams/f3396944-41eb-46be-8c59-2b4afc77cf3f/download
https://bibliotecadigital.udea.edu.co/bitstreams/1fde51eb-d284-45e3-9407-ede49435ab80/download
https://bibliotecadigital.udea.edu.co/bitstreams/795ca5d7-91b2-484d-a837-0bc9fa180f2d/download
https://bibliotecadigital.udea.edu.co/bitstreams/2d81e501-7a82-4f45-87c5-6028eda97b99/download
bitstream.checksum.fl_str_mv d199f44ff3863386a15140d479a80a80
fd0548b8694973befb689f3e7a707f1d
8a4605be74aa9ea9d79846c1fba20a33
1b15dc5affd63eb5b79771b6c9863592
660e415bb9ed6161e3748647e84674f0
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositorio Institucional de la Universidad de Antioquia
repository.mail.fl_str_mv aplicacionbibliotecadigitalbiblioteca@udea.edu.co
_version_ 1851052318785536000
spelling Aedo Cobo, José EdisonChaparro Mesa, Jorge EnriqueSistemas Embebidos e Inteligencia Computacional (SISTEMIC)2025-01-16T15:51:19Z2025-01-16T15:51:19Z2024https://hdl.handle.net/10495/44223ABSTRACT : Nitrogen is the most important nutritional element during the vegetative growth phase of the pineapple crop; however, its presence in the soil is insufficient to meet plant demands. In this doctoral research, nine machine learning techniques were validated to estimate total nitrogen (TN) content in MD2 pineapple crops from data from multiple sources. These sources included multispectral images captured by an unmanned aerial vehicle (UAV) and in situ sensors that collected information on ecological and environmental factors, such as pH, temperature, solar radiation, relative humidity, soil moisture, and wind speed and direction. In addition, plant information was collected related to SPAD values, which indicate leaf chlorophyll content, and total nitrogen (TN) values, obtained from leaf tissue samples sent to a certified laboratory for analysis. To introduce nitrogen variability, a randomized complete block experimental design was implemented, applying five different treatments in five blocks, each with 12 replications, during a 6-month period in a pineapple crop located in the municipality of Tauramena, Casanare, Colombia. To address the inherent variability of the agricultural and environmental data, dimensionality was reduced using Principal Component Analysis (PCA). Regularization techniques were also applied, including cross-validation, feature selection, boost methods, L1 (Lasso) and L2 (Ridge) regularization, as well as hyperparameter optimization. These strategies generated more robust and accurate models, among which regression, multilayer perceptron (MLP regressor) and extreme gradient boosting (XGBoost) algorithms stood out. On the first sampling date, XGBoost achieved an R^2 of 86.98\%, which was the highest during the entire experiment. On subsequent dates, MLP achieved an R^2 of 59.11\% on the second date; XGBoost achieved an R^2 of 68.00\% on the third date, and on the last date, MLP achieved an R^2 of 69.4\%. These results indicate that the integration of data from multiple sources and the use of machine learning models enable nitrogen (N) diagnostics in pineapple crops, especially in real-time applications. These results highlight the promising potential of developing machine learning models that integrate multisensor data fusion for various applications in agriculture. In the implementation of the machine learning models, the total nitrogen content obtained in the laboratory was considered as the response variable. The predictor variables comprised sensor data, SPAD values, and statistical information derived from 16 vegetation indices calculated from the multispectral images. To reduce the dimensionality of the predictor variable dataset, Principal Component Analysis (PCA) was applied. Following this dimensionality reduction, nine regression algorithms were used to estimate leaf nitrogen content during each of the four study periods. This comprehensive approach yielded close estimates of leaf nitrogen content. The results of the study indicated that the MLP (Multilayer Perceptron) and XGB (XGBoost) regression algorithms stood out for their superior performance, evidenced by the best performance metrics.RESUMEN : El nitrógeno es el elemento nutricional más importante durante la fase de crecimiento vegetativo del cultivo de piña; sin embargo, su presencia en el suelo es insuficiente para satisfacer las demandas de las plantas. En esta investigación doctoral, se validaron nueve técnicas de aprendizaje automático para estimar el contenido total de nitrógeno (TN) en cultivos de piña MD2 a partir de datos de múltiples fuentes. Estas fuentes incluyeron imágenes multiespectrales capturadas por un vehículo aéreo no tripulado (UAV) y sensores in situ que recopilaron información sobre factores ecológicos y ambientales, como el pH, la temperatura, la radiación solar, la humedad relativa, la humedad del suelo y la velocidad y dirección del viento. Además, se recopiló información de la planta relacionada con los valores de SPAD, que indican el contenido de clorofila en las hojas, y los valores de nitrógeno total (TN), obtenidos de muestras de tejido foliar enviadas a un laboratorio certificado para su análisis. Para introducir la variabilidad del nitrógeno, se implementó un diseño experimental de bloques completos al azar, aplicando cinco tratamientos diferentes en cinco bloques, cada uno con 12 repeticiones, durante un período de 6 meses en un cultivo de piña ubicado en el municipio de Tauramena, Casanare, Colombia. Para abordar la variabilidad inherente de los datos agrícolas y ambientales, la dimensionalidad se redujo utilizando el Análisis de Componentes Principales (PCA). También se aplicaron técnicas de regularización, incluyendo validación cruzada, selección de características, métodos de boost, regularización L1 (Lasso) y L2 (Ridge), así como optimización de hiperparámetros. Estas estrategias generaron modelos más robustos y precisos, entre los que se destacaron los algoritmos de regresión, perceptrón multicapa (MLP regressor) y aumento de gradiente extremo (XGBoost). En la primera fecha de muestreo, XGBoost alcanzó un R^2 de 86.98\%, que fue el más alto durante todo el experimento. En las fechas posteriores, MLP alcanzó un R^2 de 59.11\% en la segunda fecha; XGBoost logró un R^2 de 68.00\% en la tercera fecha, y en la última fecha, MLP logró un R^2 de 69.4\%. Estos resultados indican que la integración de datos de múltiples fuentes y el uso de modelos de aprendizaje automático permiten el diagnóstico de nitrógeno (N) en cultivos de piña, especialmente en aplicaciones en tiempo real. Estos resultados ponen de manifiesto el prometedor potencial del desarrollo de modelos de aprendizaje automático que integren la fusión de datos multisensor para diversas aplicaciones en la agricultura. En la implementación de los modelos de machine learning se consideró como variable de respuesta el contenido total de nitrógeno obtenido en el laboratorio. Las variables predictoras comprendieron datos de sensores, valores de SPAD e información estadística derivada de 16 índices de vegetación calculados a partir de las imágenes multiespectrales. Para reducir la dimensionalidad del conjunto de datos de variables predictoras, se aplicó el Análisis de Componentes Principales (PCA). Después de esta reducción de dimensionalidad, se utilizaron nueve algoritmos de regresión para estimar el contenido de nitrógeno foliar durante cada uno de los cuatro períodos de estudio. Este enfoque integral produjo estimaciones detalladas del contenido de nitrógeno de las hojas. Los resultados del estudio indicaron que los algoritmos de regresión MLP (Multilayer Perceptron) y XGB (XGBoost) destacaron por su rendimiento superior, evidenciado por las mejores métricas de rendimiento.COL0010717DoctoradoDoctor en Ingeniería Electrónica y de Computación143 páginasapplication/pdfengUniversidad de AntioquiaMedellín, ColombiaFacultad de Ingeniería. Ingeniería Electrónicahttps://creativecommons.org/licenses/by-nc-sa/4.0/http://creativecommons.org/publicdomain/zero/1.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Predictive model for estimating nitrogen density in MD2 pineapple crops from multispectral images and sensors integrated in an IoT platformTesis/Trabajo de grado - Monografía - Doctoradohttp://purl.org/coar/resource_type/c_db06https://purl.org/redcol/resource_type/TDhttp://purl.org/coar/version/c_b1a7d7d4d402bcceinfo:eu-repo/semantics/doctoralThesisinfo:eu-repo/semantics/draftTécnicas de predicciónForecasting techniquesProcesamiento de imágenesImage processingInternet de las cosasInternet of things (IoT)Multispectral ImagingUnmanned Aerial Vehicle (UAV)Sensors in the crophttp://aims.fao.org/aos/agrovoc/c_e4315b22PublicationORIGINALChaparroJorge_2025_PredectiveModelDensityChaparroJorge_2025_PredectiveModelDensityTesis doctoralapplication/pdf2893059https://bibliotecadigital.udea.edu.co/bitstreams/326d502d-d447-4642-9b03-351937e3c3c3/downloadd199f44ff3863386a15140d479a80a80MD51trueAnonymousREADCC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8712https://bibliotecadigital.udea.edu.co/bitstreams/f3396944-41eb-46be-8c59-2b4afc77cf3f/downloadfd0548b8694973befb689f3e7a707f1dMD52falseAnonymousREADLICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://bibliotecadigital.udea.edu.co/bitstreams/1fde51eb-d284-45e3-9407-ede49435ab80/download8a4605be74aa9ea9d79846c1fba20a33MD53falseAnonymousREADTEXTChaparroJorge_2025_PredectiveModelDensity.txtChaparroJorge_2025_PredectiveModelDensity.txtExtracted texttext/plain100110https://bibliotecadigital.udea.edu.co/bitstreams/795ca5d7-91b2-484d-a837-0bc9fa180f2d/download1b15dc5affd63eb5b79771b6c9863592MD54falseAnonymousREADTHUMBNAILChaparroJorge_2025_PredectiveModelDensity.jpgChaparroJorge_2025_PredectiveModelDensity.jpgGenerated Thumbnailimage/jpeg6109https://bibliotecadigital.udea.edu.co/bitstreams/2d81e501-7a82-4f45-87c5-6028eda97b99/download660e415bb9ed6161e3748647e84674f0MD55falseAnonymousREAD10495/44223oai:bibliotecadigital.udea.edu.co:10495/442232025-03-26 20:27:15.118https://creativecommons.org/licenses/by-nc-sa/4.0/open.accesshttps://bibliotecadigital.udea.edu.coRepositorio Institucional de la Universidad de Antioquiaaplicacionbibliotecadigitalbiblioteca@udea.edu.coTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=