TY - JOUR
T1 - Progressive Cross Attention Network for Flood Segmentation using Multispectral Satellite Imagery
AU - Feliren, Vicky
AU - Khikmah, Fithrothul
AU - Bhaswara, Irfan Dwiki
AU - Nasution, Bahrul I.
AU - Lechner, Alex M.
AU - Saputra, Muhamad Risqi U.
N1 - Publisher Copyright:
©2004-2012 IEEE.
PY - 2024/11/11
Y1 - 2024/11/11
N2 - In recent years, the integration of deep learning techniques with remote sensing technology has revolutionized the way natural hazards, such as floods, are monitored and managed. However, existing methods for flood segmentation using remote sensing data often overlook the utility of multispectral satellite information. In this study, we introduce a progressive cross attention network (ProCANet), a deep learning model that progressively applies both self- and cross-attention mechanisms to multispectral features, generating optimal feature combinations for flood segmentation. The proposed model was compared with state-of-the-art approaches using Sen1Floods11 dataset and our bespoke flood data generated for the Citarum River basin, Indonesia. Our model demonstrated superior performance with the highest Intersection over Union (IoU) score of 0.815. Our results in this study, coupled with the ablation assessment comparing scenarios with and without attention across various modalities, opens a promising path for enhancing the accuracy of flood analysis using remote sensing technology.
AB - In recent years, the integration of deep learning techniques with remote sensing technology has revolutionized the way natural hazards, such as floods, are monitored and managed. However, existing methods for flood segmentation using remote sensing data often overlook the utility of multispectral satellite information. In this study, we introduce a progressive cross attention network (ProCANet), a deep learning model that progressively applies both self- and cross-attention mechanisms to multispectral features, generating optimal feature combinations for flood segmentation. The proposed model was compared with state-of-the-art approaches using Sen1Floods11 dataset and our bespoke flood data generated for the Citarum River basin, Indonesia. Our model demonstrated superior performance with the highest Intersection over Union (IoU) score of 0.815. Our results in this study, coupled with the ablation assessment comparing scenarios with and without attention across various modalities, opens a promising path for enhancing the accuracy of flood analysis using remote sensing technology.
KW - flood segmentation
KW - multimodal deep learning
KW - progressive cross attention
KW - semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85209764212&partnerID=8YFLogxK
U2 - 10.1109/LGRS.2024.3495974
DO - 10.1109/LGRS.2024.3495974
M3 - Article
AN - SCOPUS:85209764212
SN - 1545-598X
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
ER -