Research Article

Speech Synthesis - Automatic Segmentation

by  Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 98 - Issue 4
Published: July 2014
Authors: Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora
10.5120/17172-7253
PDF

Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora . Speech Synthesis - Automatic Segmentation. International Journal of Computer Applications. 98, 4 (July 2014), 29-31. DOI=10.5120/17172-7253

                        @article{ 10.5120/17172-7253,
                        author  = { Poonam Bansal,Amita Pradhan,Ankita Goyal,Astha Sharma,Mona Arora },
                        title   = { Speech Synthesis - Automatic Segmentation },
                        journal = { International Journal of Computer Applications },
                        year    = { 2014 },
                        volume  = { 98 },
                        number  = { 4 },
                        pages   = { 29-31 },
                        doi     = { 10.5120/17172-7253 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2014
                        %A Poonam Bansal
                        %A Amita Pradhan
                        %A Ankita Goyal
                        %A Astha Sharma
                        %A Mona Arora
                        %T Speech Synthesis - Automatic Segmentation%T 
                        %J International Journal of Computer Applications
                        %V 98
                        %N 4
                        %P 29-31
                        %R 10.5120/17172-7253
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, after an a review of the previous work done in this field, the most frequently used approach using Hidden Markov Model (HMM) is used for implementation for phonetic segmentation. A baseline HMM phonetic segmentation tool is used for segmentation and analysis of speech at phonetic level. The results are approximately same as obtained using manual segmentation.

References
  • Automatic Phonetic Segmentation, D. T. Toledano, L. A. H. Gómez , Member, IEEE, and L. V. Grande IEEE transactions on speech and audio processing, Vol. 11, No. 6, Nov 2003.
  • The HTK Book, S. Young, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, Version 2. 1: Cambridge University, 1997.
  • Automatic speech segmentation with HTK, Kyle Gorman, Department of Linguistics,University of Pennsylvania, nstitute for Research in Cognitive Science .
  • HTK Tutorial,Giampiero Salvi, KTH (Royal Institute of Technology),Dep. of Speech, Music and Hearing, Drottning Kristinas v. 31,SE-100 44, Stockholm, Sweden .
  • L21, Introduction to Speech Processing | Ricardo Gutierrez Osuna | CSE@TAMU
  • P. Cosi, D. Falavigna, and M. Omologo, "A preliminary statistical evaluation of manual and automatic segmentation discrepancies," in Proceedings EUROSPEECH, 1991, pp. 693–696.
  • A. Ljolje, J. Hirschberg, and J. P. H. Van Santen,"Automatic speech segmentation for oncatenative inventory selection," in Progress in Speech Synthesis, J. P. H. Van Santen, Ed: Springer, 1997, pp. 305–311.
  • A. Ljolje and M. D. Riley, "Automatic segmentation of speech for TTS," in Proceedings EUROSPEECH, 1993, pp. 1445–1448.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

HMM HTK Phonetic Segmentation MFCC Speech Synthesis Viterbi

Powered by PhDFocusTM