Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/20185
Title: Accelerating supply chains with Ant Colony Optimization across a range of hardware solutions
Other Titles: Accelerating supply chains with Ant Colony Optimization across range of hardware solutions
Authors: Dzalbs, I
Kalganova, T
Keywords: transportation network optimization;ant colony optimization;parallel ACO on Xeon Phi/GPU
Issue Date: 29-Jun-2020
Publisher: Elsevier
Citation: Dzalbs, I. and Kalganova, T. (2020) 'Accelerating supply chains with Ant Colony Optimization across a range of hardware solutions', Computers and Industrial Engineering, 147, 106610, pp. 1-14. doi: 10.1016/j.cie.2020.106610.
Abstract: Ant Colony algorithm has been applied to various optimization problems, however most of the previous work on scaling and parallelism focuses on Travelling Salesman Problems (TSPs). Although, useful for benchmarks and new idea comparison, the algorithmic dynamics does not always transfer to complex real-life problems, where additional meta-data is required during solution construction. This paper looks at real-life outbound supply chain problem using Ant Colony Optimization (ACO) and its scaling dynamics with two parallel ACO architectures - Independent Ant Colonies (IAC) and Parallel Ants (PA). Results showed that PA was able to reach a higher solution quality in fewer iterations as the number of parallel instances increased. Furthermore, speed performance was measured across three different hardware solutions - 16 core CPU, 68 core Xeon Phi and up to 4 Geforce GPUs. State of the art, ACO vectorization techniques such as SS-Roulette were implemented using C++ and CUDA. Although excellent for TSP, it was concluded that for the given supply chain problem GPUs are not suitable due to meta-data access footprint required. Furthermore, compared to their sequential counterpart, vectorized CPU AVX2 implementation achieved 25.4x speedup on CPU while Xeon Phi with its AVX512 instruction set reached 148x on PA with Vectorized (PAwV). PAwV is therefore able to scale at least up to 1024 parallel instances on the supply chain network problem solved.
Description: This pre-print, arXiv:2001.08102v1 [cs.NE], was published subsequently by Elsevier in Computers and Industrial Engineering, vol. 147, 106610, pp. 1-14 on 29 Jun 2020 and is available at https://doi.org/10.1016/j.cie.2020.106610
URI: https://bura.brunel.ac.uk/handle/2438/20185
DOI: https://doi.org/10.1016/j.cie.2020.106610
ISSN: 0360-8352
Other Identifiers: 106610
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
Pre-print.pdf781.75 kBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.