Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26910
Title: Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification
Authors: Faizan, M
Intzes, I
Cretu, I
Meng, H
Keywords: SoC;FPGA;deep learning;classification;CNN;LSTM;DNN;Vitis;AI
Issue Date: 10-Jul-2023
Publisher: MDPI
Citation: Faizan, M. et al. (20323) 'Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification', Technologies, 11 (4), 91, pp. 1 - 18. doi: 10.3390/technologies11040091.
Abstract: Copyright © 2023 by the authors. Deep neutral networks (DNNs) are complex machine learning models designed for decision-making tasks with high accuracy. However, DNNs require high computational power and memory, which limits such models to fitting on edge devices, resulting in unnecessary processing delays and high energy consumption. Graphical processing units (GPUs) offer reliable hardware acceleration, but their bulky sizes prevent their utilization in portable equipment. System-on-chip field programmable gated arrays (SoC-FPGAs) provide considerable computational power with low energy consumption, making them ideal for edge computing applications, owing to their innovative, flexible, and small design. In this paper, we implement a deep-learning-based music genre classification system on a SoC-FPGA board, evaluate the model’s performance, and provide a comparative analysis across different platforms. Specifically, we compare the performance of long short-term memory (LSTM), convolutional neural networks (CNNs), and a hybrid model (CNN-LSTM) on an Intel Core i7-8550U by Intel Cooperation. The models are fed an acoustic feature called the Mel-frequency cepstral coefficient (MFCC) for training and testing (inference). Then, by using the advanced Vitis AI tool, a deployable version of the model is generated. The experimental results show that the execution speed is increased by 80%, and the throughput rises four times when the CNN-based music genre classification system is implemented on SoC-FPGA.
Description: Data Availability Statement: https://www.kaggle.com/datasets/andradaolteanu/gtzan-dataset-music-genre-classification (accessed on 30 June 2023).
URI: https://bura.brunel.ac.uk/handle/2438/26910
DOI: https://doi.org/10.3390/technologies11040091
Other Identifiers: ORCID iD: Muhammad Faizan https://orcid.org/0009-0005-8244-5349; Ioannis Intzes https://orcid.org/0000-0001-6033-3390; Hongying Meng https://orcid.org/0000-0002-8836-1382.
91
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).7.15 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons