Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/24818
Full metadata record
DC FieldValueLanguage
dc.contributor.authorYang, W-
dc.contributor.authorChen, G-
dc.contributor.authorZhao, Z-
dc.contributor.authorSu, F-
dc.contributor.authorMeng, H-
dc.date.accessioned2022-07-07T10:30:24Z-
dc.date.available2022-06-
dc.date.available2022-07-07T10:30:24Z-
dc.date.issued2022-06-30-
dc.identifier.citationYang, W., Chen, G., Zhao, Z., Su, F., Meng, H. (2022) 'iCGPN: Interaction-Centric Graph Parsing Network for Human-Object Interaction Detection', Neurocomputing, 502, pp. 98 - 109. doi:10.1016/j.neucom.2022.06.100.en_US
dc.identifier.issn0925-2312-
dc.identifier.urihttp://bura.brunel.ac.uk/handle/2438/24818-
dc.description.abstractHuman-Object Interaction (HOI) detection aims to infer different interactions, which occur between humans and related objects of images. HOI is usually represented by a triplet and can be modeled as a graph. Thus, with global structural information of images, graph-based methods can detect interactions. However, in existing graph networks, although different fully-connected graphs are built, all detected bounding boxes are regarded as graph nodes equally or different types of nodes according to the category, thereby the dominant role of humans in HOI is ignored. In addition, object node representations mainly focus on appearance features, contributing little to HOI inference. To address these issues, a novel graph-based HOI detection model, named interaction-centric graph parsing network (iCGPN), models one human node as a central node, and other nodes as semantic nodes. Firstly, for each detected human instance, a human-centric fully-connected graph is constructed to learn related HOIs. Secondly, in order to reflect the difference between central nodes and semantic nodes, we design different feature representations and model different edge relationships. Through introducing the attention mechanism, global information related to human-object interaction is explored to enrich the semantic node representation, in which spatial layout, relative locations and object categories information are also combined. Finally, a multi-relation graph convolutional network is applied to update the node feature and infer the HOI. Furthermore, a multi-IOU random shift scheme is proposed to augment the data of the training set to fit the object detection deviation and enhance the generalization ability of our network. Extensive experimental results show that iCGPN achieves very competitive results in comparison with state-of-the-arraph-based methods on the V-COCO and HICO-DET datasets, which demonstrate the effectiveness of the proposed method.en_US
dc.languageen-
dc.language.isoen_USen_US
dc.publisherElsevier BVen_US
dc.rightsCopyright © 2022 Elsevier B.V. All rights reserved. This is the accepted manuscript version of an article which has been published in final form at http://dx.doi.org/10.1016/j.neucom.2022.06.100, archived on this repository under a Creative Commons CC BY-NC-ND attribution licence.-
dc.subjectHuman-object interaction detectionen_US
dc.subjectAttention mechanismen_US
dc.subjectMulti-relation graph convolutional networken_US
dc.subjectMulti-IOU random shiften_US
dc.titleiCGPN: Interaction-Centric Graph Parsing Network for Human-Object Interaction Detectionen_US
dc.typeArticleen_US
dc.identifier.doihttp://dx.doi.org/10.1016/j.neucom.2022.06.100-
dc.relation.isPartOfNeurocomputing-
pubs.publication-statusPublished-
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfEmbargo till published3.1 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.