Bilingual automatic speech recognition: A review, taxonomy and open challenges

In this technological era, smart and intelligent systems that are integrated with artificial intelligence (AI) techniques, algorithms, tools, and technologies, have impact on various aspects in our daily life. Communication and interaction between human and machine using speech become increasingly i...

Full description

Saved in:
Bibliographic Details
Main Authors: Abushariah, Ahmad A. M., Ting, Hua-Nong, Peer Mustafa, Mumtaz Begum, Mohd Khairuddin, Anis Salwa, Abushariah, Mohammad A. M., Tan, Tien-Ping
Format: Article
Published: Institute of Electrical and Electronics Engineers 2023
Subjects:
Online Access:http://eprints.um.edu.my/38912/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.38912
record_format eprints
spelling my.um.eprints.389122023-06-23T02:01:17Z http://eprints.um.edu.my/38912/ Bilingual automatic speech recognition: A review, taxonomy and open challenges Abushariah, Ahmad A. M. Ting, Hua-Nong Peer Mustafa, Mumtaz Begum Mohd Khairuddin, Anis Salwa Abushariah, Mohammad A. M. Tan, Tien-Ping QA75 Electronic computers. Computer science TK Electrical engineering. Electronics Nuclear engineering In this technological era, smart and intelligent systems that are integrated with artificial intelligence (AI) techniques, algorithms, tools, and technologies, have impact on various aspects in our daily life. Communication and interaction between human and machine using speech become increasingly important, since it is an obvious substitute for keyboards and screens in the communication process. Therefore, numerous technologies take advantage of speech such as Automatic Speech Recognition (ASR), where human natural speech for many languages is used as the means to interact with machines. Majority of the related works on ASR concentrate on the development and evaluation of ASR systems that serve a single language only, such as Arabic, English, Chinese, French, and many others. However, research attempts that combine multiple languages (bilingual and multilingual) during the development and evaluation of ASR systems are very limited. This paper aims to provide comprehensive research background and fundamentals of bilingual ASR, and related works that have combined two languages for ASR tasks from 2010 to 2021. It also formulates research taxonomy and discusses open challenges to the bilingual ASR research. Based on our literature investigation, it is clear that bilingual ASR using deep learning approach is highly demanded and is able to provide acceptable performance. In addition, many combinations of two languages such as Arabic-English, Arabic-Malay, and others, are still limited, which can open new research opportunities. Finally, it is clear that ASR research is moving towards not only bilingual ASR, but also multilingual ASR. Institute of Electrical and Electronics Engineers 2023 Article PeerReviewed Abushariah, Ahmad A. M. and Ting, Hua-Nong and Peer Mustafa, Mumtaz Begum and Mohd Khairuddin, Anis Salwa and Abushariah, Mohammad A. M. and Tan, Tien-Ping (2023) Bilingual automatic speech recognition: A review, taxonomy and open challenges. IEEE Access, 11. pp. 5944-5954. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2022.3218684 <https://doi.org/10.1109/ACCESS.2022.3218684>. 10.1109/ACCESS.2022.3218684
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
TK Electrical engineering. Electronics Nuclear engineering
spellingShingle QA75 Electronic computers. Computer science
TK Electrical engineering. Electronics Nuclear engineering
Abushariah, Ahmad A. M.
Ting, Hua-Nong
Peer Mustafa, Mumtaz Begum
Mohd Khairuddin, Anis Salwa
Abushariah, Mohammad A. M.
Tan, Tien-Ping
Bilingual automatic speech recognition: A review, taxonomy and open challenges
description In this technological era, smart and intelligent systems that are integrated with artificial intelligence (AI) techniques, algorithms, tools, and technologies, have impact on various aspects in our daily life. Communication and interaction between human and machine using speech become increasingly important, since it is an obvious substitute for keyboards and screens in the communication process. Therefore, numerous technologies take advantage of speech such as Automatic Speech Recognition (ASR), where human natural speech for many languages is used as the means to interact with machines. Majority of the related works on ASR concentrate on the development and evaluation of ASR systems that serve a single language only, such as Arabic, English, Chinese, French, and many others. However, research attempts that combine multiple languages (bilingual and multilingual) during the development and evaluation of ASR systems are very limited. This paper aims to provide comprehensive research background and fundamentals of bilingual ASR, and related works that have combined two languages for ASR tasks from 2010 to 2021. It also formulates research taxonomy and discusses open challenges to the bilingual ASR research. Based on our literature investigation, it is clear that bilingual ASR using deep learning approach is highly demanded and is able to provide acceptable performance. In addition, many combinations of two languages such as Arabic-English, Arabic-Malay, and others, are still limited, which can open new research opportunities. Finally, it is clear that ASR research is moving towards not only bilingual ASR, but also multilingual ASR.
format Article
author Abushariah, Ahmad A. M.
Ting, Hua-Nong
Peer Mustafa, Mumtaz Begum
Mohd Khairuddin, Anis Salwa
Abushariah, Mohammad A. M.
Tan, Tien-Ping
author_facet Abushariah, Ahmad A. M.
Ting, Hua-Nong
Peer Mustafa, Mumtaz Begum
Mohd Khairuddin, Anis Salwa
Abushariah, Mohammad A. M.
Tan, Tien-Ping
author_sort Abushariah, Ahmad A. M.
title Bilingual automatic speech recognition: A review, taxonomy and open challenges
title_short Bilingual automatic speech recognition: A review, taxonomy and open challenges
title_full Bilingual automatic speech recognition: A review, taxonomy and open challenges
title_fullStr Bilingual automatic speech recognition: A review, taxonomy and open challenges
title_full_unstemmed Bilingual automatic speech recognition: A review, taxonomy and open challenges
title_sort bilingual automatic speech recognition: a review, taxonomy and open challenges
publisher Institute of Electrical and Electronics Engineers
publishDate 2023
url http://eprints.um.edu.my/38912/
_version_ 1769842726502662144
score 13.211869