TY - JOUR
T1 - Progressive Cross-Attention Network for Flood Segmentation Using Multispectral Satellite Imagery
AU - Feliren, Vicky
AU - Khikmah, Fithrothul
AU - Dwiki Bhaswara, Irfan
AU - Nasution, Bahrul I.
AU - Lechner, Alex M.
AU - Saputra, Muhamad Risqi U.
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In recent years, the integration of deep learning techniques with remote sensing technology has revolutionized the way natural hazards, such as floods, are monitored and managed. However, existing methods for flood segmentation using remote sensing data often overlook the utility of correlative features among multispectral satellite information. In this study, we introduce a progressive cross-attention network (ProCANet), a deep learning model that progressively applies both self- and cross-attention mechanisms to multispectral features, generating optimal feature combinations for flood segmentation. The proposed model was compared with state-of-the-art approaches using the Sen1Floods11 dataset and our bespoke flood data generated for the Citarum River basin, Indonesia. Our model demonstrated superior performance with the highest intersection over union (IoU) score of 0.815. Our results in this study, coupled with the ablation assessment comparing scenarios with and without attention across various modalities, open a promising path for enhancing the accuracy of flood analysis using remote sensing technology.
AB - In recent years, the integration of deep learning techniques with remote sensing technology has revolutionized the way natural hazards, such as floods, are monitored and managed. However, existing methods for flood segmentation using remote sensing data often overlook the utility of correlative features among multispectral satellite information. In this study, we introduce a progressive cross-attention network (ProCANet), a deep learning model that progressively applies both self- and cross-attention mechanisms to multispectral features, generating optimal feature combinations for flood segmentation. The proposed model was compared with state-of-the-art approaches using the Sen1Floods11 dataset and our bespoke flood data generated for the Citarum River basin, Indonesia. Our model demonstrated superior performance with the highest intersection over union (IoU) score of 0.815. Our results in this study, coupled with the ablation assessment comparing scenarios with and without attention across various modalities, open a promising path for enhancing the accuracy of flood analysis using remote sensing technology.
KW - Flood segmentation
KW - multimodal deep learning
KW - progressive cross-attention
KW - semantic segmentation
UR - https://www.scopus.com/pages/publications/86000385489
U2 - 10.1109/LGRS.2024.3495974
DO - 10.1109/LGRS.2024.3495974
M3 - Article
AN - SCOPUS:86000385489
SN - 1545-598X
VL - 22
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
M1 - 1500105
ER -