Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26245
Full metadata record
DC FieldValueLanguage
dc.contributor.authorOmran, T-
dc.contributor.authorSharef, B-
dc.contributor.authorGrosan, C-
dc.contributor.authorLi, Y-
dc.date.accessioned2023-04-03T09:19:03Z-
dc.date.available2023-04-03T09:19:03Z-
dc.date.issued2023-03-30-
dc.identifierORCID iD: Thuraya Omran https://orcid.org/0000-0002-7000-650X; Crina Grosan https://orcid.org/0000-0003-1049-2136; Yongmin Li https://orcid.org/0000-0003-1668-2440.-
dc.identifier68-
dc.identifier.citationOmran, T. et al. (2023) 'Sentiment Analysis of Multilingual Dataset of Bahraini Dialects, Arabic, and English', Data, 8 (4), 68, pp. 1 - 13. doi: 10.3390/data8040068.en_US
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/26245-
dc.descriptionData Availability Statement: The dataset is openly available at: https://data.mendeley.com/datasets/5rhw2srzjj (accessed on 15 February 2023). Dataset: https://doi.org/10.17632/5rhw2srzjj.1 Dataset License: CC-BY-NC.en_US
dc.description.abstractCopyright © 2023 by the authors. Sentiment analysis is an application of natural language processing (NLP) that requires a machine learning algorithm and a dataset. In some cases, the dataset availability is scarce, particularly with Arabic dialects, precisely the Bahraini ones, which necessitates using an approach such as translation, where a rich source language is exploited to create the target language dataset. In this study, a dataset of Amazon product reviews in Bahraini dialects is presented. This dataset was generated using two cascading stages of translation—a machine translation followed by a manual one. Machine translation was applied using Google Translate to translate English Amazon product reviews into Standard Arabic. In contrast, the manual approach was applied to translate the resulting Arabic reviews into Bahraini ones by qualified native speakers utilizing constructed customized forms. The resulting parallel dataset of English, Standard Arabic, and Bahraini dialects is called English_Modern Standard Arabic_Bahraini Dialects product reviews for sentiment analysis “E_MSA_BDs-PR-SA”. The dataset is balanced, composed of 2500 positive and 2500 negative reviews. The sentiment analysis process was implemented using a stacked LSTM deep learning model. The Bahraini dialect product dataset can be utilized in the transfer learning process for sentimentally analyzing another dataset in Bahraini dialects.en_US
dc.description.sponsorshipThis research received no external funding.en_US
dc.format.extent1 - 13-
dc.format.mediumElectronic-
dc.language.isoen_USen_US
dc.publisherMDPIen_US
dc.rightsCopyright © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectBahraini dialects resourcesen_US
dc.subjectBahraini resources scarcityen_US
dc.subjectdeep learningen_US
dc.subjectproducts reviewsen_US
dc.titleSentiment Analysis of Multilingual Dataset of Bahraini Dialects, Arabic, and Englishen_US
dc.typeArticleen_US
dc.identifier.doihttps://doi.org/10.3390/data8040068-
dc.relation.isPartOfData-
pubs.issue4-
pubs.publication-statusPublished-
pubs.volume8-
dc.rights.holderThe authors-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).3.39 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons