Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/4984
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLuo, P-
dc.contributor.authorLü, K-
dc.contributor.authorHe, Q-
dc.contributor.authorShi, Z-
dc.date.accessioned2011-04-08T13:21:21Z-
dc.date.available2011-04-08T13:21:21Z-
dc.date.issued2006-
dc.identifier.citationLecture Notes in Computer Science 4042: 177-189, 2006en_US
dc.identifier.issn0302-9743-
dc.identifier.urihttp://www.springerlink.com/content/u7837xvu21857m68/en
dc.identifier.urihttp://bura.brunel.ac.uk/handle/2438/4984-
dc.descriptionThis is an open access article that can be obtained from the links below - Copyright @ 2006 Springer Verlagen_US
dc.description.abstractThe computing-intensive Data Mining (DM) process calls for the support of a Heterogeneous Computing (HC) system, which consists of multiple computers with different configurations, connected by a high-speed LAN, for increased computational power and resources. DM process can be described as a multi-phase pipeline process, and in each phase there could be many optional methods. This makes the workflow of DM very complex and can be modelled only by a Directed Acyclic Graph (DAG). An HC system needs an effective and efficient scheduling framework, which orchestrates all the computing hardware to perform multiple competitive DM workflows. Motivated by the need of a practical solution of the scheduling problem for the DM workflow, this paper proposes a dynamic DAG scheduling algorithm according to the characteristics of execution time estimation model for DM jobs. Based on an approximate estimation of job execution time, this algorithm first maps DM jobs to machines in a decentralized and diligent (defined in this paper) manner. Then the performance of this initial mapping can be improved through job migrations when necessary. The scheduling heuristic used in it considers the factors of both the minimal completion time criterion and the critical path in a DAG. We implement this system in an established Multi-Agent System (MAS) environment, in which the reuse of existing DM algorithms is achieved by encapsulating them into agents. Practical classification problems are used to test and measure the system performance. The detailed experiment procedure and result analysis are also discussed in this paper.en_US
dc.language.isoenen_US
dc.publisherSpringer-Verlagen_US
dc.subjectData miningen_US
dc.subjectHeterogeneous computingen_US
dc.subjectDirected acyclic graphen_US
dc.subjectMulti-agent system environmenten_US
dc.titleA heterogeneous computing system for data mining workflowsen_US
dc.typeResearch Paperen_US
dc.identifier.doihttp://dx.doi.org/10.1007/11788911_15-
Appears in Collections:Business and Management
Brunel Business School Research Papers

Files in This Item:
There are no files associated with this item.


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.