Audio visual tracking of a speaker based on FFT and Kalman filter
In this paper, a simple audio-visual speaker tracking technique is proposed for indoor environments. Specifically, a Kalman filter based image processing technique is used to extract visual information, and a Fast Fourier Transform (FFT) based approach is used to extract audio information for speaker tracking.
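The pipeline the abstract describes (a Kalman filter on the visual side, an FFT on the audio side, and a decision step that combines them) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the one-dimensional constant-velocity state, the noise values `q` and `r`, the 8 kHz sample rate, the 300 to 3400 Hz speech band, the 0.6 threshold, and the names `Kalman1D`, `band_energy_ratio` and `locate_speaker`. In particular, `locate_speaker` is a hand-written rule standing in for the paper's decision tree, whose features and thresholds this record does not give.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    if n % 2:
        raise ValueError("frame length must be a power of two")
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def band_energy_ratio(frame, sample_rate, lo_hz, hi_hz):
    """Fraction of non-DC spectral energy that falls inside [lo_hz, hi_hz]."""
    n = len(frame)
    power = [abs(c) ** 2 for c in fft(frame)[1 : n // 2]]  # positive bins, DC skipped
    total = sum(power)
    if total == 0.0:
        return 0.0
    in_band = sum(p for k, p in enumerate(power, start=1)
                  if lo_hz <= k * sample_rate / n <= hi_hz)
    return in_band / total


class Kalman1D:
    """Constant-velocity Kalman filter over one image coordinate (assumed model)."""

    def __init__(self, pos, q=1e-2, r=1.0):
        self.x = [pos, 0.0]                # state: position, velocity
        self.P = [[1.0, 0.0], [0.0, 1.0]]  # state covariance
        self.q, self.r = q, r              # process / measurement noise (assumed)

    def predict(self, dt=1.0):
        p, v = self.x
        P = self.P
        self.x = [p + dt * v, v]
        # P <- F P F^T + q*I with F = [[1, dt], [0, 1]]
        self.P = [
            [P[0][0] + dt * (P[0][1] + P[1][0]) + dt * dt * P[1][1] + self.q,
             P[0][1] + dt * P[1][1]],
            [P[1][0] + dt * P[1][1], P[1][1] + self.q],
        ]
        return self.x[0]

    def update(self, z):
        P = self.P
        s = P[0][0] + self.r               # innovation variance, H = [1, 0]
        k0, k1 = P[0][0] / s, P[1][0] / s  # Kalman gain
        y = z - self.x[0]                  # innovation
        self.x = [self.x[0] + k0 * y, self.x[1] + k1 * y]
        self.P = [
            [(1 - k0) * P[0][0], (1 - k0) * P[0][1]],
            [P[1][0] - k1 * P[0][0], P[1][1] - k1 * P[0][1]],
        ]
        return self.x[0]


def locate_speaker(kf, visual_x, speech_ratio, speech_threshold=0.6):
    """Hypothetical fusion rule, a stand-in for the paper's decision tree."""
    predicted = kf.predict()
    if visual_x is not None:               # speaker detected in the image
        return kf.update(visual_x)
    if speech_ratio >= speech_threshold:   # hidden but audible: coast on prediction
        return predicted
    return None                            # neither seen nor heard


if __name__ == "__main__":
    kf = Kalman1D(0.0)
    tone = [math.sin(2 * math.pi * 500 * t / 8000) for t in range(64)]
    ratio = band_energy_ratio(tone, 8000, 300, 3400)
    for step in range(1, 6):
        print(step, locate_speaker(kf, float(step), ratio))
    print("occluded:", locate_speaker(kf, None, ratio))
```

In this sketch the tracker keeps reporting a position while the speaker is occluded only as long as the audio frame still looks like speech. That is one plausible reading of the abstract's claim that tracking works when the speaker is not visible, not a description of the authors' implementation.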
Main Authors: | Muzammel, M.; Yusoff, M.Y.; Saad, M.N.M.; Malik, A.S. |
---|---|
Format: | Article |
Published: | Asian Research Publishing Network, 2016 |
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84979205098&partnerID=40&md5=96cfaf1dea309b9b5c1d19991c3747a4 http://eprints.utp.edu.my/25500/ |
id |
my.utp.eprints.25500 |
---|---|
record_format |
eprints |
spelling |
my.utp.eprints.25500 2021-08-27T13:03:09Z Muzammel, M. and Yusoff, M.Y. and Saad, M.N.M. and Malik, A.S. (2016) Audio visual tracking of a speaker based on FFT and Kalman filter. ARPN Journal of Engineering and Applied Sciences, 11 (14). pp. 8947-8951. Asian Research Publishing Network, 2016. Article, NonPeerReviewed. © 2006-2016 Asian Research Publishing Network (ARPN). https://www.scopus.com/inward/record.uri?eid=2-s2.0-84979205098&partnerID=40&md5=96cfaf1dea309b9b5c1d19991c3747a4 http://eprints.utp.edu.my/25500/ |
institution |
Universiti Teknologi Petronas |
building |
UTP Resource Centre |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Petronas |
content_source |
UTP Institutional Repository |
url_provider |
http://eprints.utp.edu.my/ |
description |
In this paper, a simple audio-visual speaker tracking technique is proposed for indoor environments. Specifically, a Kalman filter based image processing technique is used to extract visual information, and a Fast Fourier Transform (FFT) based approach is used to extract audio information for speaker tracking. Finally, a decision tree is used to estimate the location of the speaker from the audio and visual information. One of the main advantages of the proposed technique is that it uses the built-in microphone of the tracking camera, which makes it cost-effective and simple. We have evaluated our method with case studies from the online SPEVI database. The proposed technique shows the best detection and works properly even when the speaker is not visible. © 2006-2016 Asian Research Publishing Network (ARPN). |
format |
Article |
author |
Muzammel, M. Yusoff, M.Y. Saad, M.N.M. Malik, A.S. |
author_sort |
Muzammel, M. |
title |
Audio visual tracking of a speaker based on FFT and Kalman filter |
publisher |
Asian Research Publishing Network |
publishDate |
2016 |