Speech Synthesis - Automatic Segmentation

Poonam Bansal; Amita Pradhan; Ankita Goyal; Astha Sharma; Mona Arora

Research Article

Speech Synthesis - Automatic Segmentation

by Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 98 - Issue 4

Published: July 2014

Authors: Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora

10.5120/17172-7253

PDF

Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora . Speech Synthesis - Automatic Segmentation. International Journal of Computer Applications. 98, 4 (July 2014), 29-31. DOI=10.5120/17172-7253

                        @article{ 10.5120/17172-7253,
                        author  = { Poonam Bansal,Amita Pradhan,Ankita Goyal,Astha Sharma,Mona Arora },
                        title   = { Speech Synthesis - Automatic Segmentation },
                        journal = { International Journal of Computer Applications },
                        year    = { 2014 },
                        volume  = { 98 },
                        number  = { 4 },
                        pages   = { 29-31 },
                        doi     = { 10.5120/17172-7253 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }

                        %0 Journal Article
                        %D 2014
                        %A Poonam Bansal
                        %A Amita Pradhan
                        %A Ankita Goyal
                        %A Astha Sharma
                        %A Mona Arora
                        %T Speech Synthesis - Automatic Segmentation%T 
                        %J International Journal of Computer Applications
                        %V 98
                        %N 4
                        %P 29-31
                        %R 10.5120/17172-7253
                        %I Foundation of Computer Science (FCS), NY, USA

Abstract

In this paper, after an a review of the previous work done in this field, the most frequently used approach using Hidden Markov Model (HMM) is used for implementation for phonetic segmentation. A baseline HMM phonetic segmentation tool is used for segmentation and analysis of speech at phonetic level. The results are approximately same as obtained using manual segmentation.

References

Automatic Phonetic Segmentation, D. T. Toledano, L. A. H. Gómez , Member, IEEE, and L. V. Grande IEEE transactions on speech and audio processing, Vol. 11, No. 6, Nov 2003.
The HTK Book, S. Young, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, Version 2. 1: Cambridge University, 1997.
Automatic speech segmentation with HTK, Kyle Gorman, Department of Linguistics,University of Pennsylvania, nstitute for Research in Cognitive Science .
HTK Tutorial,Giampiero Salvi, KTH (Royal Institute of Technology),Dep. of Speech, Music and Hearing, Drottning Kristinas v. 31,SE-100 44, Stockholm, Sweden .
L21, Introduction to Speech Processing | Ricardo Gutierrez Osuna | CSE@TAMU
P. Cosi, D. Falavigna, and M. Omologo, "A preliminary statistical evaluation of manual and automatic segmentation discrepancies," in Proceedings EUROSPEECH, 1991, pp. 693–696.
A. Ljolje, J. Hirschberg, and J. P. H. Van Santen,"Automatic speech segmentation for oncatenative inventory selection," in Progress in Speech Synthesis, J. P. H. Van Santen, Ed: Springer, 1997, pp. 305–311.
A. Ljolje and M. D. Riley, "Automatic segmentation of speech for TTS," in Proceedings EUROSPEECH, 1993, pp. 1445–1448.

Index Terms

Computer Science

Information Sciences

No index terms available.

Keywords

HMM HTK Phonetic Segmentation MFCC Speech Synthesis Viterbi