Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/27243
Title: Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease score
Authors: Chakrabarty, D
Wang, K
Roy, G
Bhojgaria, A
Zhang, C
Pavlu, J
Chakrabartty, J
Keywords: physiological parameters;covariance;cancer risk factors;machine learning;kernel functions;probability density;learning;physicians
Issue Date: 19-Oct-2023
Publisher: PLOS
Citation: Chakrabarty, D. et al. (2023) 'Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease score', PLoS One, 18 (10), e0292404, pp. 1 - 28. doi: 10.1371/journal.pone.0292404.
Abstract: Copyright © 2023 Chakrabarty et al. Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often arises after bone marrow transplants. Indeed, the probability of surviving VOD, is correlated with early intervention. In our work, the VOD-score of each patient in a retrospective cohort, is defined as the distance between the (posterior) probability of a random graph variable—given the inter-variable partial correlation matrix of the time series data on variables that represent different aspects of patient physiology—and that given such time series data of an arbitrarily-selected reference patient. Such time series data is recorded from a pre-transplant to a post-transplant time, for each patient in this cohort, though the data available for distinct patients bear differential temporal coverage, owing to differential patient longevities. Each graph is a Soft Random Geometric Graph drawn in a probabilistic metric space, and the computed inter-graph distance is oblivious to the length of the time series data. The VOD-score learnt in this way, and the corresponding pre-transplant parameter vector of each patient in this retrospective cohort, then results in the training data, using which we learn the function that takes VOD-score as its input, and outputs the vector of pre-transplant parameters. We model this function with a vector-variate Gaussian Process, the covariance structure of which is kernel parametrised. Such modelling is easier than if the score variable were the output. Then for any prospective patient, whose pre-transplant variables are known, we learn the VOD-score (and the hyperparameters of the covariance kernel), using Markov Chain Monte Carlo based inference.
Description: Data Availability: All relevant data are within the paper and its Supporting information files.
URI: https://bura.brunel.ac.uk/handle/2438/27243
DOI: https://doi.org/10.1371/journal.pone.0292404
Other Identifiers: ORCID iD: Dalia Chakrabarty https://orcid.org/0000-0003-1246-4235
e0292404
Appears in Collections:Dept of Mathematics Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright: © 2023 Chakrabarty et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.905.86 kBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons