The efficient discrete Tchebichef transform for spectrum analysis of speech recognition
Speech recognition remains a growing field, and continued growth in computing power is expected to unlock its full potential in the near future. Spectrum analysis is an elementary operation in speech recognition. Fast Fourier Transform (FFT) has been a traditional technique to analyze frequ...
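The abstract of this record proposes the Discrete Tchebichef Transform (DTT) as a real-valued alternative to the FFT. The following is a minimal sketch of that idea, not the authors' implementation. It assumes the three-term recurrence commonly given in the literature for orthonormal Tchebichef polynomials. The function names `tchebichef_basis`, `dtt` and `idtt`, the frame length of 16 and the cosine test signal are all illustrative choices rather than values taken from the paper.

```python
import math

def tchebichef_basis(N):
    """Return rows t_k(n), k, n = 0..N-1, of the orthonormal Tchebichef basis.

    Assumption: built with the commonly cited three-term recurrence in the
    order k; this is a sketch, not the paper's own code.
    """
    T = [[0.0] * N for _ in range(N)]
    T[0] = [1.0 / math.sqrt(N)] * N
    if N > 1:
        scale = math.sqrt(3.0 / (N * (N * N - 1.0)))
        T[1] = [(2.0 * n + 1.0 - N) * scale for n in range(N)]
    for k in range(2, N):
        common = math.sqrt((4.0 * k * k - 1.0) / (N * N - k * k))
        a1 = (2.0 / k) * common
        a2 = ((1.0 - N) / k) * common
        a3 = ((1.0 - k) / k) * math.sqrt((2.0 * k + 1.0) / (2.0 * k - 3.0)) \
            * math.sqrt((N * N - (k - 1.0) ** 2) / (N * N - k * k))
        T[k] = [(a1 * n + a2) * T[k - 1][n] + a3 * T[k - 2][n]
                for n in range(N)]
    return T

def dtt(x, T):
    """Forward transform: c_k = sum_n t_k(n) * x(n). Real input, real output."""
    return [sum(t * v for t, v in zip(row, x)) for row in T]

def idtt(c, T):
    """Inverse transform: x(n) = sum_k c_k * t_k(n), valid because T is orthogonal."""
    N = len(c)
    return [sum(c[k] * T[k][n] for k in range(N)) for n in range(N)]

# Illustrative frame: 16 samples of a cosine (hypothetical test signal).
N = 16
T = tchebichef_basis(N)
x = [math.cos(2.0 * math.pi * 3 * n / N) for n in range(N)]
c = dtt(x, T)            # real coefficients, no complex arithmetic involved
x_rec = idtt(c, T)
max_err = max(abs(a - b) for a, b in zip(x, x_rec))
print("max reconstruction error:", max_err)
```

Because the basis is real and orthonormal, the forward and inverse transforms are plain real matrix-vector products. For the frame sizes named in the abstract (256 and 1024), a recurrence of this kind may lose accuracy at high orders in floating point, so it is worth checking that the rows are still orthonormal before relying on the output.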
Main Authors: | Ernawan, Ferda; Abu, Nor Azman |
---|---|
Format: | Article |
Language: | English |
Published: | SPIE 2011 |
Subjects: | QA Mathematics |
Online Access: | http://eprints.utem.edu.my/id/eprint/336/1/01-C00146-R001.pdf http://eprints.utem.edu.my/id/eprint/336/ http://www.ijmlc.org/papers/01-C00146-R001.pdf |
id |
my.utem.eprints.336 |
---|---|
record_format |
eprints |
spelling |
my.utem.eprints.336 2023-07-20T12:18:57Z http://eprints.utem.edu.my/id/eprint/336/ Ernawan, Ferda and Abu, Nor Azman (2011) The efficient discrete Tchebichef transform for spectrum analysis of speech recognition. International Journal of Machine Learning and Computing, 1 (1). 01-06. ISSN 0277-786X. SPIE, 2011-04-01. Article, PeerReviewed, text, en. QA Mathematics. http://eprints.utem.edu.my/id/eprint/336/1/01-C00146-R001.pdf http://www.ijmlc.org/papers/01-C00146-R001.pdf 10.1117/12.2010642 |
institution |
Universiti Teknikal Malaysia Melaka |
building |
UTEM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknikal Malaysia Melaka |
content_source |
UTEM Institutional Repository |
url_provider |
http://eprints.utem.edu.my/ |
language |
English |
topic |
QA Mathematics |
description |
Speech recognition remains a growing field, and continued growth in computing power is expected to unlock its full potential in the near future. Spectrum analysis is an elementary operation in speech recognition. The Fast Fourier Transform (FFT) has been the traditional technique for analyzing the frequency spectrum of signals in speech recognition, but it is computationally complex, especially because it works with imaginary numbers. The Discrete Tchebichef Transform (DTT) is proposed instead of the popular FFT: the DTT has lower computational complexity and does not require a complex transform dealing with imaginary numbers. This paper proposes a novel approach based on 256 discrete orthonormal Tchebichef polynomials as an efficient technique for analyzing a vowel and a consonant in the spectral frequency of speech recognition. The 1024 and 256 discrete orthonormal Tchebichef transforms are compared. The preliminary experimental results show that the 256 DTT has the potential to transform the time domain into the frequency domain more efficiently for speech recognition. The 256 DTT produces a simpler frequency spectrum than the 1024 DTT, and it can still produce the four formants F1, F2, F3 and F4 concurrently. |
format |
Article |
author |
Ernawan, Ferda; Abu, Nor Azman |
author_sort |
Ernawan, Ferda |
title |
The efficient discrete Tchebichef transform for spectrum analysis of speech recognition |
publisher |
SPIE |
publishDate |
2011 |