Research Article

Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain

by  Dr. H. B. Kekre, Ms. Vaishali Kulkarni
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 7 - Issue 1
Published: September 2010
Authors: Dr. H. B. Kekre, Ms. Vaishali Kulkarni
10.5120/1128-1479
PDF

Dr. H. B. Kekre, Ms. Vaishali Kulkarni . Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain. International Journal of Computer Applications. 7, 1 (September 2010), 37-41. DOI=10.5120/1128-1479

                        @article{ 10.5120/1128-1479,
                        author  = { Dr. H. B. Kekre,Ms. Vaishali Kulkarni },
                        title   = { Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain },
                        journal = { International Journal of Computer Applications },
                        year    = { 2010 },
                        volume  = { 7 },
                        number  = { 1 },
                        pages   = { 37-41 },
                        doi     = { 10.5120/1128-1479 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2010
                        %A Dr. H. B. Kekre
                        %A Ms. Vaishali Kulkarni
                        %T Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain%T 
                        %J International Journal of Computer Applications
                        %V 7
                        %N 1
                        %P 37-41
                        %R 10.5120/1128-1479
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, an approach based on Kekre’s fast code book Generation (KFCG) Algorithm in the transform domain has been proposed. KFCG is used for feature extraction in both the training and testing phases. Three methods for codebook generation have been used. In the 1st method, codebooks are generated from the speech samples by using Discrete Fourier Transform (DFT). In the 2nd method, the codebooks are generated using Discrete Cosine Transform (DCT). In the 3rd method, the codebooks are generated using the Discrete Sine Transform (DST). For speaker identification, the codebook of the test sample is similarly generated and compared with the codebooks of the reference samples stored in the database. The results obtained for the above methods in the transform domain are compared with the results obtained in the time domain analysis. The results show that KFCG gives better results in transform domain than in time domain. Also the results improve as the vector dimension while generating the codebook is increased.

References
  • Lawrence Rabiner, Biing-Hwang Juang and B.Yegnanarayana, ‚ÄúFundamental of Speech Recognition‚Äù, Prentice-Hall, Englewood Cliffs, 2009.
  • S Furui, ‚Äú50 years of progress in speech and speaker recognition research‚Äù, ECTI Transactions on Computer and Information Technology, Vol. 1, No.2, November 2005.
  • D. A. Reynolds, ‚ÄúAn overview of automatic speaker recognition technology‚Äù, Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP‚Äô02), 2002, pp. IV-4072‚ÄìIV-4075.
  • Joseph P. Campbell, Jr., Senior Member, IEEE, ‚ÄúSpeaker Recognition: A Tutorial‚Äù, Proceedings of the IEEE, vol. 85, no. 9, pp. 1437-1462, September 1997.
  • F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-Garc√≠a, D.Petrovska-Delacr√©taz, and D. A. Reynolds, ‚ÄúA tutorial on text-independent speaker verification,‚Äù EURASIP J. Appl. Signal Process., vol. 2004, no. 1, pp. 430‚Äì451, 2004.
  • D. A. Reynolds, ‚ÄúExperimental evaluation of features for robust speaker identification,‚Äù IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639‚Äì643, Oct. 1994.
  • Tomi Kinnunen, Evgeny Karpov, and Pasi Fr¬®anti, ‚ÄúRealtime Speaker Identification‚Äù, ICSLP2004.
  • Marco Grimaldi and Fred Cummins, ‚ÄúSpeaker Identification using Instantaneous Frequencies‚Äù, IEEE Transactions on Audio, Speech, and Language Processing, vol., 16, no. 6, August 2008.
  • Zhong-Xuan, Yuan & Bo-Ling, Xu & Chong-Zhi, Yu. (1999). ‚ÄúBinary Quantization of Feature Vectors for Robust Text-Independent Speaker Identification‚Äù in IEEE Transactions on Speech and Audio Processing, Vol. 7, No. 1, January 1999. IEEE, New York, NY, U.S.A.
  • R. M. Gray.: ‚ÄòVector quantization‚Äô, IEEE ASSP Marg., pp. 4-29, Apr. 1984.
  • Y. Linde, A. Buzo, and R. M. Gray.: ‚ÄòAn algorithm for vector quantizer design,‚Äù IEEE Trans. Commun.‚Äô, vol. COM-28, no. 1, pp. 84-95, 1980.
  • A. Gersho, R.M. Gray.: ‚ÄòVector Quantization and Signal Compression‚Äô, Kluwer Academic Publishers, Boston, MA, 1991.
  • F. K. Soong, et. al., ‚ÄúA vector quantization approach to speaker recognition‚Äù, At & T Technical Journal, 66, pp. 14-26, 1987.
  • A. E. Rosenberg and F. K. Soong, ‚ÄúEvaluation of a vector quantization talker recognition system in text independent and text dependent models‚Äù, Computer Speech and Language 22, pp. 143-157, 1987.
  • Jeng-Shyang Pan, Zhe-Ming Lu, and Sheng-He Sun.: ‚ÄòAn Efficient Encoding Algorithm for Vector Quantization Based on Subvector Technique‚Äô, IEEE Transactions on image processing, vol 12 No. 3 March 2003.
  • F. Soong, E. Rosenberg, B. Juang, and L. Rabiner, "A Vector Quantization Approach to Speaker Recognition", AT&T Technical Journal, vol. 66, March/April 1987, pp. 1426.
  • Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md. Saifur Rahman , ‚ÄúSpeaker Identification using Mel Frequency Cepstral Coefficients‚Äù, 3rd International Conference on Electrical & Computer Engineering ICECE held at Dhaka, Bangladesh , 28-30 December 2004.
  • Poonam Bansal, Amrita Dev, Shail Bala Jain, ‚ÄúAutomatic Speaker Identification using Vector Quantization‚Äù, Asian Journal of Information Technology 6 (9): 938-942, 2007.
  • Jyoti Singhai, ‚ÄúAutomatic Speaker Recognition :An Approach using DWT based Feature Extraction and Vector Quantization‚Äù, IETE Technical Review, vol. 24, No 5, pp 395-402, September-October 2007
  • H. B. Kekre, Tanuja K. Sarode, ‚ÄúSpeech Data Compression using Vector Quantization‚Äù, WASET International Journal of Computer and Information Science and Engineering (IJCISE), Fall 2008, Volume 2, Number 4, pp.: 251-254, 2008. http://www.waset.org/ijcise.
  • H. B. Kekre, Tanuja K. Sarode, ‚ÄúNew Fast Improved Codebook Generation Algorithm for Color Images using Vector Quantization,‚Äù International Journal of Engineering and Technology, vol.1, No.1, pp. 67-77, September 2008.
  • H. B. Kekre, Tanuja K. Sarode, ‚ÄúFast Codebook Generation Algorithm for Color Images using Vector Quantization,‚Äù International Journal of Computer Science and Information Technology, Vol. 1, No. 1, pp: 7-12, Jan 2009.
  • H. B. Kekre, Tanuja K. Sarode, ‚ÄúAn Efficient Fast Algorithm to Generate Codebook for Vector Quantization,‚Äù First International Conference on Emerging Trends in Engineering and Technology, ICETET-2008, held at Raisoni College of Engineering, Nagpur, India, 16-18 July 2008, Avaliable at online IEEE Xplore.
  • H B Kekre, Vaishali Kulkarni, ‚ÄúSpeaker Identification by using Vector Quantization‚Äù, International Journal of Engineering Science and Technology, May 2010 edition.
  • H B Kekre, Vaishali Kulkarni, ‚ÄúPerformance Comparison of Speaker Recognition using Vector Quantization by LBG and KFCG‚Äù, International Journal of Computer Applications, vol. 3, July 2010.
  • H.B. Kekre, Archana Athawale, Tanuja K. Sarode, Kalpana Sagvekar, ‚ÄúComparative Performance of Information Hiding in Vector Quantized Codebooks using LBG, KPE, KMCG and KFCG‚Äù, International Journal of Computer Science and Information Security, 2010 Vol: 8 Issue: 2,pp 89-95.
  • H B Kekre, Archana Athawale, Tanuja Sarode and Kalpana Sagvekar, ‚ÄúIncreased Capacity of Information Hiding using Mixed Codebooks of Vector Quantization Algorithms: LBG, KPE and KMCG, International Journal of Advances in Computational Sciences and Technology, Volume 3 Number 2 (2010) pp. 245‚Äì256.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Vector Quantization (VQ) Code Vectors Code Book Discrete Fourier Transform (DFT) Discrete Sine Transform (DST) Discrete Cosine Transform (DCT)

Powered by PhDFocusTM