International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 58 - Issue 10 |
Published: November 2012 |
Authors: Md. Mahfuzur Rahman, Sanjit Kumar Saha, Md. Zakir Hossain, Md. Babul Islam |
![]() |
Md. Mahfuzur Rahman, Sanjit Kumar Saha, Md. Zakir Hossain, Md. Babul Islam . Performance Evaluation of CMN for Mel-LPC based Speech Recognition in Different Noisy Environments. International Journal of Computer Applications. 58, 10 (November 2012), 6-10. DOI=10.5120/9316-3548
@article{ 10.5120/9316-3548, author = { Md. Mahfuzur Rahman,Sanjit Kumar Saha,Md. Zakir Hossain,Md. Babul Islam }, title = { Performance Evaluation of CMN for Mel-LPC based Speech Recognition in Different Noisy Environments }, journal = { International Journal of Computer Applications }, year = { 2012 }, volume = { 58 }, number = { 10 }, pages = { 6-10 }, doi = { 10.5120/9316-3548 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2012 %A Md. Mahfuzur Rahman %A Sanjit Kumar Saha %A Md. Zakir Hossain %A Md. Babul Islam %T Performance Evaluation of CMN for Mel-LPC based Speech Recognition in Different Noisy Environments%T %J International Journal of Computer Applications %V 58 %N 10 %P 6-10 %R 10.5120/9316-3548 %I Foundation of Computer Science (FCS), NY, USA
This study is intended to develop a noise robust distributed speech recognizer for real-world applications by employing Cepstral Mean Normalization (CMN) for robust feature extraction. The main focus of the work is to cope with different noisy environments. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by applying a first-order all-pass filter instead of a unit delay. Mismatch between training and test phases is reduced through robust feature extraction by applying CMN on Mel-LP cepstral coefficients as an effort to reduce additive noise and channel distortion. The performance of the proposed system has been evaluated on test set A of Aurora-2 database which is a subset of TIDigits database contaminated by additive noises and channel effects. The experiment is conducted on four different noisy environments and the baseline performance, that is, for Mel-LPC the average word accuracy has found to be 59. 05%. By applying the CMN on Mel-LP cepstral coefficients, the performance has been improved to 68. 02%. It is found that CMN performs significantly better for different noisy environments.