Research Article

Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System

by  Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 74 - Issue 11
Published: July 2013
Authors: Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal
10.5120/12929-9841
PDF

Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal . Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System. International Journal of Computer Applications. 74, 11 (July 2013), 20-22. DOI=10.5120/12929-9841

                        @article{ 10.5120/12929-9841,
                        author  = { Aaron M. Oirere,Ratnadeep R. Deshmukh,Pukhraj P. Shrishrimal },
                        title   = { Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System },
                        journal = { International Journal of Computer Applications },
                        year    = { 2013 },
                        volume  = { 74 },
                        number  = { 11 },
                        pages   = { 20-22 },
                        doi     = { 10.5120/12929-9841 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2013
                        %A Aaron M. Oirere
                        %A Ratnadeep R. Deshmukh
                        %A Pukhraj P. Shrishrimal
                        %T Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System%T 
                        %J International Journal of Computer Applications
                        %V 74
                        %N 11
                        %P 20-22
                        %R 10.5120/12929-9841
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Speech corpus being the basic requirement for the development of Automatic speech recognition (ASR) system, it should be done with much accuracy in order to enhance the performance of the system. This paper describes the proposed procedure to abide while collecting the speech corpus of Swahili language from the native and non native speaker for the development of Automatic Speech Recognition system in Swahili language.

References
  • Pukhraj Shrishrimal, R. R. Deshmukh, Vishal Waghmare, 2012 "Indian Language Speech Database: A Review", International Journal of Computer Application (IJCA), Vol 47, No. 5, (June – 2012), pp. 17-21.
  • Guy De Pauw and Gilles-Maurice de Schryver,2008 "Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes" Lexikos 18 (AFRILEX-reeks/series 18: 2008): 303-318
  • G. De Pauw, G. M. de Schryver, and P. W. Wagacha, 2006 "Data-driven part-of-speech tagging of Kiswahili". In P. Sojka, I. Kope?cek, and K. Pala, editors, Proceedings of Text, Speech and Dialogue, 9th International Conference, volume 4188 of Lecture Notes in Computer Science, pages 197–204, Berlin, Germany, Springer Verlag.
  • Gakuru, Mucemi Iraki, Frederick K. Tucker, Roger Shalonova, Ksenia Ngugi, Kamanda 2005, "Development of a Kiswahili text to speech system", In INTERSPEECH-2005, 1481-1484.
  • http://en. wikipedia. org/wiki/Languages_of_the_Democratic_Republic_of_the_Congo dated 27/06/2012
  • E. A. Alpers, 1975 "Ivory and Slaves in East Central Africa", London, pp. 98– 99 ;
  • T. Vernet 2002, "Les cités-Etats Swahili et la puissance omanaise" (1650– 1720), Journal des Africanistes, 72(2), pp. 102–105.
  • Thomas J. Hinnebusch, 1992 "Ethnologue list of countries where Swahili is spoken", "Swahili", International Encyclopedia of Linguistics, Oxford, pp. 99–106
  • David Dalby, 1999/2000, "The Linguasphere Register of the World's Languages and Speech Communities", Linguasphere Press, Volume Two, pg. 733–735
  • Arvi Hurskainen, 2004 "Helsinki Corpus of Swahili. Compilers": Institute for Asian and African Studies (University of Helsinki) and CSC.
  • Guy De Pauw, Peter Waiganjo Wagacha, Gilles-Maurice de Schryver, 2011 "Exploring the SAWA corpus: collection and deployment of a parallel corpus English—Swahili", International Journal of Lang Resources & Evaluation, Springer Verlag, vol 45, pp 331-344.
  • Deen, Kamil Ud 2002 "The acquisition of Swahili verbal morphology", Palmela, Portugal. Costa, Joao & Freitas, Maria (Eds), in the proceedings to G. A. L. A conference (2002c) pp. 41-48.
  • Gakuru, Mucemi , Frederick K. Iraki, Roger Tucker, Ksenia Shalonova, Kamanda Ngugi, 2005"Development of a Kiswahili text to speech system", In INTERSPEECH-2005, pp1481-1484.
  • Hadrien Gelas, Laurent Besacier, F. Pellegrino, 2012 "Developments of Swahili resources for an automatic speech recognition system", SLTU – Workshop on Spoken Language Technologies for Under-Resourced Languages, Cape-Town, South Africa.
  • Aaron M. Oirere, Ratnadeep R. Deshmukh, Pukhraj P. Shrishrimal, Vishal B. Waghmare, "Swahili Text and Speech Corpus: A Review", Asian Journal of Computer Science and Information Technology, Vol. 2, No. 11, (Nov-2012), pp. 286-290.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Swahili Swahili Text corpus Phonetics Text Corpus and Speech Corpus Automatic Speech Recognition

Powered by PhDFocusTM