Code-switching in automatic speech recognition: The issues and future directions

Code-switching (CS) in spoken language is where the speech has two or more languages within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research as ASR needs to recognise speech in bilingual and multilingual settings, where the accuracy of ASR systems declines with CS...

Full description

Saved in:
Bibliographic Details
Main Authors: Mustafa, Mumtaz Begum, Yusoof, Mansoor Ali, Khalaf, Hasan Kahtan, Abushariah, Ahmad Abdel Rahman Mahmoud, Mat Kiah, Miss Laiha, Ting, Hua Nong, Muthaiyah, Saravanan
Format: Article
Published: MDPI 2022
Subjects:
Online Access:http://eprints.um.edu.my/41070/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.41070
record_format eprints
spelling my.um.eprints.410702023-08-17T04:31:16Z http://eprints.um.edu.my/41070/ Code-switching in automatic speech recognition: The issues and future directions Mustafa, Mumtaz Begum Yusoof, Mansoor Ali Khalaf, Hasan Kahtan Abushariah, Ahmad Abdel Rahman Mahmoud Mat Kiah, Miss Laiha Ting, Hua Nong Muthaiyah, Saravanan QA75 Electronic computers. Computer science TA Engineering (General). Civil engineering (General) Code-switching (CS) in spoken language is where the speech has two or more languages within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research as ASR needs to recognise speech in bilingual and multilingual settings, where the accuracy of ASR systems declines with CS due to pronunciation variation. There are very few reviews carried out on CS, with none conducted on bilingual and multilingual CS ASR systems. This study investigates the importance of CS in bilingual and multilingual speech recognition systems. To meet the objective of this study, two research questions were formulated, which cover both the current issues and the direction of the research. Our review focuses on databases, acoustic and language modelling, and evaluation metrics. Using selected keywords, this research has identified 274 papers and selected 42 experimental papers for review, of which 24 (representing 57%) have discussed CS, while the rest look at multilingual ASR research. The selected papers cover many well-resourced and under-resourced languages, and novel techniques to manage CS in ASR systems, which are mapping, combining and merging the phone sets of the languages experimented with in the research. Our review also examines the performance of those methods. This review found a significant variation in the performance of CS in terms of word error rates, indicating an inconsistency in the ability of ASRs to handle CS. In the conclusion, we suggest several future directions that address the issues identified in this review. MDPI 2022-10 Article PeerReviewed Mustafa, Mumtaz Begum and Yusoof, Mansoor Ali and Khalaf, Hasan Kahtan and Abushariah, Ahmad Abdel Rahman Mahmoud and Mat Kiah, Miss Laiha and Ting, Hua Nong and Muthaiyah, Saravanan (2022) Code-switching in automatic speech recognition: The issues and future directions. Applied Sciences-Basel, 12 (19). ISSN 2076-3417, DOI https://doi.org/10.3390/app12199541 <https://doi.org/10.3390/app12199541>. 10.3390/app12199541
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
TA Engineering (General). Civil engineering (General)
spellingShingle QA75 Electronic computers. Computer science
TA Engineering (General). Civil engineering (General)
Mustafa, Mumtaz Begum
Yusoof, Mansoor Ali
Khalaf, Hasan Kahtan
Abushariah, Ahmad Abdel Rahman Mahmoud
Mat Kiah, Miss Laiha
Ting, Hua Nong
Muthaiyah, Saravanan
Code-switching in automatic speech recognition: The issues and future directions
description Code-switching (CS) in spoken language is where the speech has two or more languages within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research as ASR needs to recognise speech in bilingual and multilingual settings, where the accuracy of ASR systems declines with CS due to pronunciation variation. There are very few reviews carried out on CS, with none conducted on bilingual and multilingual CS ASR systems. This study investigates the importance of CS in bilingual and multilingual speech recognition systems. To meet the objective of this study, two research questions were formulated, which cover both the current issues and the direction of the research. Our review focuses on databases, acoustic and language modelling, and evaluation metrics. Using selected keywords, this research has identified 274 papers and selected 42 experimental papers for review, of which 24 (representing 57%) have discussed CS, while the rest look at multilingual ASR research. The selected papers cover many well-resourced and under-resourced languages, and novel techniques to manage CS in ASR systems, which are mapping, combining and merging the phone sets of the languages experimented with in the research. Our review also examines the performance of those methods. This review found a significant variation in the performance of CS in terms of word error rates, indicating an inconsistency in the ability of ASRs to handle CS. In the conclusion, we suggest several future directions that address the issues identified in this review.
format Article
author Mustafa, Mumtaz Begum
Yusoof, Mansoor Ali
Khalaf, Hasan Kahtan
Abushariah, Ahmad Abdel Rahman Mahmoud
Mat Kiah, Miss Laiha
Ting, Hua Nong
Muthaiyah, Saravanan
author_facet Mustafa, Mumtaz Begum
Yusoof, Mansoor Ali
Khalaf, Hasan Kahtan
Abushariah, Ahmad Abdel Rahman Mahmoud
Mat Kiah, Miss Laiha
Ting, Hua Nong
Muthaiyah, Saravanan
author_sort Mustafa, Mumtaz Begum
title Code-switching in automatic speech recognition: The issues and future directions
title_short Code-switching in automatic speech recognition: The issues and future directions
title_full Code-switching in automatic speech recognition: The issues and future directions
title_fullStr Code-switching in automatic speech recognition: The issues and future directions
title_full_unstemmed Code-switching in automatic speech recognition: The issues and future directions
title_sort code-switching in automatic speech recognition: the issues and future directions
publisher MDPI
publishDate 2022
url http://eprints.um.edu.my/41070/
_version_ 1775622746287374336
score 13.211869