Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26943
Title: Mode-Based Classifier: A Robust and Flexible Discriminant Analysis for High-Dimensional Data
Authors: Xiong, W
Härdle, WK
Wang, J
Yu, K
Tian, M
Keywords: componentwise modal distance;multimodal classifier;multimodality;quantile-mode;unimodal classifier
Issue Date: 7-Aug-2023
Publisher: Institute of Statistical Science, Academia Sinica & International Chinese Statistical Association
Citation: Xiong, W. et al. (2023) 'Mode-Based Classifier: A Robust and Flexible Discriminant Analysis for High-Dimensional Data', Statistica Sinica, 0 (ahead-of-print), pp. 1 - 43. doi: 10.5705/ss.202023.0014.
Abstract: High-dimensional classification is both challenging and of interest in numerous applications. Componentwise distance-based classifiers, which utilize partial information with known categories, such as mean, median and quantiles, provide a convenient way. However, when the input features are heavy-tailed or contain outliers, performance of the centroid classifier can be poor. Beyond that, it frequently occurs that a population consists of two or more subpopulations, the mean, median and quantiles in this scenario fail to capture such a structure that can be instead preserved by mode, which is an appealing measure of considerable significance but might be neglected. This paper thus introduces and investigates componentwise mode-based classifiers that can reveal important structures missed by existing distance-based classifiers. We explore several strategies for defining the family of mode-based classifiers, including the unimodal classifiers, the multimodal classifier and the quantilemode classifier. The unimodal classifiers are proposed based on componentwise unimodal distance and kernel mode estimation, and the multimodal classifier is constructed by identifying all the local modes of a distribution according to a novel introduced algorithm. We establish the asymptotic properties of these methods and demonstrate through simulation studies and three real datasets that the mode-based classifiers compare favorably to the current state-of-art methods.
Description: This file available on this institutional repository is a preprint. It has not been certified by peer review. It is freely available at http://www3.stat.sinica.edu.tw/ss_newpaper/SS-2023-0014_na.pdf.
Supplementary Materials: In the supplementary materials, we present additional results for simulation examples and real data analysis, and provide the technical results of Theorems 1-3.
URI: https://bura.brunel.ac.uk/handle/2438/26943
DOI: https://doi.org/10.5705/ss.202023.0014
ISSN: 1017-0405
Other Identifiers: ORCID iD: Keming Yu https://orcid.org/0000-0001-6341-8402
Appears in Collections:Dept of Mathematics Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright @ [2023] Institute of Statistical Science, Academia Sinica. All rights reserved. The authors' preprint is freely available at http://www3.stat.sinica.edu.tw/ss_newpaper/SS-2023-0014_na.pdf.3.36 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.