Real-Time Arabic Speech Recognition

Zaid Y. Mohammed; Abdul Sattar M. Khidhir

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 81 - Issue 4

Published: November 2013

10.5120/14003-2048

Zaid Y. Mohammed, Abdul Sattar M. Khidhir. Real-Time Arabic Speech Recognition. International Journal of Computer Applications. 81, 4 (November 2013), 43-45. DOI=10.5120/14003-2048

                        @article{ 10.5120/14003-2048,
                        author  = { Zaid Y. Mohammed and Abdul Sattar M. Khidhir },
                        title   = { Real-Time Arabic Speech Recognition },
                        journal = { International Journal of Computer Applications },
                        year    = { 2013 },
                        volume  = { 81 },
                        number  = { 4 },
                        pages   = { 43-45 },
                        doi     = { 10.5120/14003-2048 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }

                        %0 Journal Article
                        %D 2013
                        %A Zaid Y. Mohammed
                        %A Abdul Sattar M. Khidhir
                        %T Real-Time Arabic Speech Recognition
                        %J International Journal of Computer Applications
                        %V 81
                        %N 4
                        %P 43-45
                        %R 10.5120/14003-2048
                        %I Foundation of Computer Science (FCS), NY, USA

Abstract

A speech recognition system must perform highly complex calculations in a short time, which is a major challenge for real-time systems. However, a simple and fast algorithm can accomplish this task even on slow systems. The main objective of this paper is therefore to design and implement a real-time Arabic speech recognition system in the MATLAB environment. The system accurately identifies some letters while remaining simple and fast. It uses Mel-Frequency Cepstral Coefficients (MFCCs) for feature extraction and the Euclidean distance to compare the test sound against the database. A recognition rate of 89.6% was reached.
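
The matching step named in the abstract, comparing a test sound against a database by Euclidean distance, can be illustrated with a minimal sketch. This is not the authors' MATLAB implementation: it is written in Python, the function names `euclidean_distance` and `recognize` are hypothetical, and the three-value feature vectors and letter labels are made-up stand-ins for real MFCC vectors.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def recognize(test_features, templates):
    """Return (label, distance) of the stored template nearest to test_features.

    templates maps a label to one reference feature vector (an assumption;
    a real database could hold several vectors per label).
    """
    best_label, best_dist = None, math.inf
    for label, reference in templates.items():
        dist = euclidean_distance(test_features, reference)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label, best_dist


# Hypothetical database: made-up 3-coefficient vectors, not real MFCCs.
templates = {
    "alif": [1.0, 0.5, -0.2],
    "ba": [0.1, -0.4, 0.9],
}

label, dist = recognize([0.9, 0.6, -0.1], templates)
print(label, round(dist, 4))
```

A nearest-template search like this needs only one pass over the database per test sound, which is in keeping with the abstract's emphasis on a simple and fast method.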

Index Terms

Computer Science

Information Sciences

Keywords

Feature extraction, Mel-Frequency Cepstral Coefficients (MFCCs), Feature match