UNCLES: Method for the identification of genes differentially consistently co-expressed in a specific subset of datasets

Abu-Jamous, B; Fa, R; Roberts, DJ; Nandi, AK

Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/11040

Full metadata record

DC Field	Value	Language
dc.contributor.author	Abu-Jamous, B	-
dc.contributor.author	Fa, R	-
dc.contributor.author	Roberts, DJ	-
dc.contributor.author	Nandi, AK	-
dc.date.accessioned	2015-06-22T13:20:47Z	-
dc.date.available	2015-06-04	-
dc.date.available	2015-06-22T13:20:47Z	-
dc.date.issued	2015	-
dc.identifier.citation	BMC Bioinformatics, 16:184, (June 2015)	en_US
dc.identifier.issn	1471-2105	-
dc.identifier.uri	http://www.biomedcentral.com/1471-2105/16/184	-
dc.identifier.uri	http://bura.brunel.ac.uk/handle/2438/11040	-
dc.description.abstract	Background: Collective analysis of the increasingly emerging gene expression datasets are required. The recently proposed binarisation of consensus partition matrices (Bi-CoPaM) method can combine clustering results from multiple datasets to identify the subsets of genes which are consistently co-expressed in all of the provided datasets in a tuneable manner. However, results validation and parameter setting are issues that complicate the design of such methods. Moreover, although it is a common practice to test methods by application to synthetic datasets, the mathematical models used to synthesise such datasets are usually based on approximations which may not always be sufficiently representative of real datasets. Results: Here, we propose an unsupervised method for the unification of clustering results from multiple datasets using external specifications (UNCLES). This method has the ability to identify the subsets of genes consistently co-expressed in a subset of datasets while being poorly co-expressed in another subset of datasets, and to identify the subsets of genes consistently co-expressed in all given datasets. We also propose the M-N scatter plots validation technique and adopt it to set the parameters of UNCLES, such as the number of clusters, automatically. Additionally, we propose an approach for the synthesis of gene expression datasets using real data profiles in a way which combines the ground-truth-knowledge of synthetic data and the realistic expression values of real data, and therefore overcomes the problem of faithfulness of synthetic expression data modelling. By application to those datasets, we validate UNCLES while comparing it with other conventional clustering methods, and of particular relevance, biclustering methods. We further validate UNCLES by application to a set of 14 real genome-wide yeast datasets as it produces focused clusters that conform well to known biological facts. Furthermore, in-silico-based hypotheses regarding the function of a few previously unknown genes in those focused clusters are drawn. Conclusions: The UNCLES method, the M-N scatter plots technique, and the expression data synthesis approach will have wide application for the comprehensive analysis of genomic and other sources of multiple complex biological datasets. Moreover, the derived in-silico-based biological hypotheses represent subjects for future functional studies.	en_US
dc.description.sponsorship	The National Institute for Health Research (NIHR) under its Programme Grants for Applied Research Programme (Grant Reference Number RP-PG-0310-1004).	en_US
dc.language	eng	-
dc.language.iso	en	en_US
dc.publisher	BioMed Central Ltd.	en_US
dc.subject	Bi-CoPaM	en_US
dc.subject	Consistent co-expression	en_US
dc.subject	Genome-wide analysis	en_US
dc.subject	Multiple datasets analysis	en_US
dc.subject	UNCLES	en_US
dc.title	UNCLES: Method for the identification of genes differentially consistently co-expressed in a specific subset of datasets	en_US
dc.type	Article	en_US
dc.identifier.doi	http://dx.doi.org/10.1186/s12859-015-0614-0	-
dc.relation.isPartOf	BMC Bioinformatics	-
pubs.publication-status	Accepted	-
pubs.publication-status	Accepted	-
Appears in Collections:	Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:

File	Description	Size	Format
Fulltext.pdf		2.47 MB	Adobe PDF	View/Open

Show simple item record