Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26272
Full metadata record
DC FieldValueLanguage
dc.contributor.authorCheng, H-
dc.contributor.authorWang, Z-
dc.contributor.authorMa, L-
dc.contributor.authorWei, Z-
dc.contributor.authorAlsaadi, FE-
dc.contributor.authorLiu, X-
dc.date.accessioned2023-04-16T08:39:21Z-
dc.date.available2023-04-16T08:39:21Z-
dc.date.issued2023-03-27-
dc.identifierORCID iDs: Zidong Wang https://orcid.org/0000-0002-9576-7401; Xiaohui Liu https://orcid.org/0000-0003-1589-1267.-
dc.identifier.citationCheng, H. et al. (2023) 'Differentiable channel pruning guided via attention mechanism: a novel neural network pruning approach', Complex & Intelligent Systems, 9 (5), pp. 5611 - 5624. doi: 10.1007/s40747-023-01022-6.en_US
dc.identifier.issn2199-4536-
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/26272-
dc.description.abstractCopyright © The Author(s) 2023. Neural network pruning offers great prospects for facilitating the deployment of deep neural networks on computational resource limited devices. Neural architecture search (NAS) provides an efficient way to automatically seek appropriate neural architecture design for compressed model. It is observed that, for existing NAS-based pruning methods, there is usually a lack of layer information when searching the optimal neural architecture. In this paper, we propose a new NAS approach, namely, differentiable channel pruning method guided via attention mechanism (DCP-A), where the adopted attention mechanism is able to provide layer information to guide the optimization of the pruning policy. The training process is differentiable with Gumbel-softmax sampling, while parameters are optimized under a two-stage training procedure. The neural network block with the shortcut is dedicatedly designed, which is of help to prune the network not only on its width but also on its depth. Extensive experiments are performed to verify the applicability and superiority of the proposed method. Detailed analysis with visualization of the pruned model architecture shows that our proposed DCP-A learns explainable pruning policies.en_US
dc.description.sponsorshipThe Deanship of Scientific Research (DSR) at King Abdulaziz University (KAU), Jeddah, Saudi Arabia has funded this project, under grant no. (RG-2-611-43). This work was supported in part by the European Union’s Horizon 2020 Research and Innovation Programme under Grant 820776 (INTEGRADDE), the Royal Society of the UK, and the Alexander von Humboldt Foundation of Germanyen_US
dc.format.extent5611 - 5624-
dc.format.mediumPrint-Electronic-
dc.language.isoen_USen_US
dc.publisherSpinger Natureen_US
dc.rightsCopyright © The Author(s) 2023. Rights and permissions: Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectartificial intelligenceen_US
dc.subjectnetwork pruningen_US
dc.subjectneural architecture searchen_US
dc.subjectGumbel-softmax samplingen_US
dc.subjectattention mechanismen_US
dc.titleDifferentiable channel pruning guided via attention mechanism: a novel neural network pruning approachen_US
dc.typeArticleen_US
dc.identifier.doihttps://doi.org/10.1007/s40747-023-01022-6-
dc.relation.isPartOfComplex & Intelligent Systems-
pubs.issue5-
pubs.publication-statusPublished online-
pubs.volume9-
dc.identifier.eissn2198-6053-
dc.rights.holderThe Author(s)-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © The Author(s) 2023. Rights and permissions: Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.1.29 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons