Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
The reliable and efficient large-scale mapping of date palm trees from remotely sensed data is crucial for developing palm tree inventories, continuous monitoring, vulnerability assessments, environmental control, and long-term management. Given the increasing availability of UAV images with limited...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI
2023
|
Online Access: | http://psasir.upm.edu.my/id/eprint/110312/1/drones-07-00093-v2.pdf http://psasir.upm.edu.my/id/eprint/110312/ https://www.mdpi.com/2504-446X/7/2/93 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.upm.eprints.110312 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.1103122024-09-05T07:05:05Z http://psasir.upm.edu.my/id/eprint/110312/ Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers Gibril, Mohamed Barakat A. Mohd Shafri, Helmi Zulhaidi Al-Ruzouq, Rami Shanableh, Abdallah Nahas, Faten Al Mansoori, Saeed The reliable and efficient large-scale mapping of date palm trees from remotely sensed data is crucial for developing palm tree inventories, continuous monitoring, vulnerability assessments, environmental control, and long-term management. Given the increasing availability of UAV images with limited spectral information, the high intra-class variance of date palm trees, the variations in the spatial resolutions of the data, and the differences in image contexts and backgrounds, accurate mapping of date palm trees from very-high spatial resolution (VHSR) images can be challenging. This study aimed to investigate the reliability and the efficiency of various deep vision transformers in extracting date palm trees from multiscale and multisource VHSR images. Numerous vision transformers, including the Segformer, the Segmenter, the UperNet-Swin transformer, and the dense prediction transformer, with various levels of model complexity, were evaluated. The models were developed and evaluated using a set of comprehensive UAV-based and aerial images. The generalizability and the transferability of the deep vision transformers were evaluated and compared with various convolutional neural network-based (CNN) semantic segmentation models (including DeepLabV3+, PSPNet, FCN-ResNet-50, and DANet). The results of the examined deep vision transformers were generally comparable to several CNN-based models. The investigated deep vision transformers achieved satisfactory results in mapping date palm trees from the UAV images, with an mIoU ranging from 85% to 86.3% and an mF-score ranging from 91.62% to 92.44%. Among the evaluated models, the Segformer generated the highest segmentation results on the UAV-based and the multiscale testing datasets. The Segformer model, followed by the UperNet-Swin transformer, outperformed all of the evaluated CNN-based models in the multiscale testing dataset and in the additional unseen UAV testing dataset. In addition to delivering remarkable results in mapping date palm trees from versatile VHSR images, the Segformer model was among those with a small number of parameters and relatively low computing costs. Collectively, deep vision transformers could be used efficiently in developing and updating inventories of date palms and other tree species. MDPI 2023-01 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/110312/1/drones-07-00093-v2.pdf Gibril, Mohamed Barakat A. and Mohd Shafri, Helmi Zulhaidi and Al-Ruzouq, Rami and Shanableh, Abdallah and Nahas, Faten and Al Mansoori, Saeed (2023) Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers. Drones, 7 (2). pp. 93-25. ISSN 2504-446X https://www.mdpi.com/2504-446X/7/2/93 10.3390/drones7020093 |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
description |
The reliable and efficient large-scale mapping of date palm trees from remotely sensed data is crucial for developing palm tree inventories, continuous monitoring, vulnerability assessments, environmental control, and long-term management. Given the increasing availability of UAV images with limited spectral information, the high intra-class variance of date palm trees, the variations in the spatial resolutions of the data, and the differences in image contexts and backgrounds, accurate mapping of date palm trees from very-high spatial resolution (VHSR) images can be challenging. This study aimed to investigate the reliability and the efficiency of various deep vision transformers in extracting date palm trees from multiscale and multisource VHSR images. Numerous vision transformers, including the Segformer, the Segmenter, the UperNet-Swin transformer, and the dense prediction transformer, with various levels of model complexity, were evaluated. The models were developed and evaluated using a set of comprehensive UAV-based and aerial images. The generalizability and the transferability of the deep vision transformers were evaluated and compared with various convolutional neural network-based (CNN) semantic segmentation models (including DeepLabV3+, PSPNet, FCN-ResNet-50, and DANet). The results of the examined deep vision transformers were generally comparable to several CNN-based models. The investigated deep vision transformers achieved satisfactory results in mapping date palm trees from the UAV images, with an mIoU ranging from 85% to 86.3% and an mF-score ranging from 91.62% to 92.44%. Among the evaluated models, the Segformer generated the highest segmentation results on the UAV-based and the multiscale testing datasets. The Segformer model, followed by the UperNet-Swin transformer, outperformed all of the evaluated CNN-based models in the multiscale testing dataset and in the additional unseen UAV testing dataset. In addition to delivering remarkable results in mapping date palm trees from versatile VHSR images, the Segformer model was among those with a small number of parameters and relatively low computing costs. Collectively, deep vision transformers could be used efficiently in developing and updating inventories of date palms and other tree species. |
format |
Article |
author |
Gibril, Mohamed Barakat A. Mohd Shafri, Helmi Zulhaidi Al-Ruzouq, Rami Shanableh, Abdallah Nahas, Faten Al Mansoori, Saeed |
spellingShingle |
Gibril, Mohamed Barakat A. Mohd Shafri, Helmi Zulhaidi Al-Ruzouq, Rami Shanableh, Abdallah Nahas, Faten Al Mansoori, Saeed Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers |
author_facet |
Gibril, Mohamed Barakat A. Mohd Shafri, Helmi Zulhaidi Al-Ruzouq, Rami Shanableh, Abdallah Nahas, Faten Al Mansoori, Saeed |
author_sort |
Gibril, Mohamed Barakat A. |
title |
Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers |
title_short |
Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers |
title_full |
Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers |
title_fullStr |
Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers |
title_full_unstemmed |
Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers |
title_sort |
large-scale date palm tree segmentation from multiscale uav-based and aerial images using deep vision transformers |
publisher |
MDPI |
publishDate |
2023 |
url |
http://psasir.upm.edu.my/id/eprint/110312/1/drones-07-00093-v2.pdf http://psasir.upm.edu.my/id/eprint/110312/ https://www.mdpi.com/2504-446X/7/2/93 |
_version_ |
1811686063796649984 |
score |
13.211869 |