Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/27120
Title: Multi-modality and multi-scale attention fusion network for land cover classification from vhr remote sensing images
Authors: Lei, T
Li, L
Lv, Z
Zhu, M
Du, X
Nandi, AK
Keywords: land cover classification;multi-modality data fusion;deep learning;multi-scale spatial contextual information
Issue Date: 20-Sep-2021
Publisher: MDPI
Citation: Lei, T. et al. (2021) 'Multi-modality and multi-scale attention fusion network for land cover classification from vhr remote sensing images', Remote Sensing, 13 (18), 3771, pp. 1 - 18. doi: 10.3390/rs13183771.
Abstract: Copyright © 2021 by the authors. Land cover classification from very high-resolution (VHR) remote sensing images is a challenging task due to the complexity of geography scenes and the varying shape and size of ground targets. It is difficult to utilize the spectral data directly, or to use traditional multi-scale feature extraction methods, to improve VHR remote sensing image classification results. To address the problem, we proposed a multi-modality and multi-scale attention fusion network for land cover classification from VHR remote sensing images. First, based on the encoding-decoding network, we designed a multi-modality fusion module that can simultaneously fuse more useful features and avoid redundant features. This addresses the problem of low classification accuracy for some objects caused by the weak ability of feature representation from single modality data. Second, a novel multi-scale spatial context enhancement module was introduced to improve feature fusion, which solves the problem of a large-scale variation of objects in remote sensing images, and captures long-range spatial relationships between objects. The proposed network and comparative networks were evaluated on two public datasets—the Vaihingen and the Potsdam datasets. It was observed that the proposed network achieves better classification results, with a mean F1-score of 88.6% for the Vaihingen dataset and 92.3% for the Potsdam dataset. Experimental results show that our model is superior to the state-of-the-art network models.
Description: Data Availability Statement: The datasets used in this study have been published, and their addresses are https://www2.isprs.org/commissions/comm2/wg4/benchmark/2d-sem-label-potsdam/ (accessed on 30 January 2021) and https://www2.isprs.org/commissions/comm2/wg4/benchmark/2d-sem-label-vaihingen/ (accessed on 30 January 2021).
URI: https://bura.brunel.ac.uk/handle/2438/27120
DOI: https://doi.org/10.3390/rs13183771
Other Identifiers: ORCID iDs: by Tao Lei https://orcid.org/0000-0002-2104-9298; Mingzhe Zhu https://orcid.org/0000-0002-7962-3344; Asoke K. Nandi https://orcid.org/0000-0001-6248-2875.
3771
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).56.88 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons