International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 187 - Issue 2 |
Published: May 2025 |
Authors: Rekha S. Kotwal, Geetanjali Jindal |
![]() |
Rekha S. Kotwal, Geetanjali Jindal . AI Powered Speech Recognition System using Wavelet Multi-Resolution Analysis with One- Dimentional CNN-LSTM. International Journal of Computer Applications. 187, 2 (May 2025), 72-81. DOI=10.5120/ijca2025924807
@article{ 10.5120/ijca2025924807, author = { Rekha S. Kotwal,Geetanjali Jindal }, title = { AI Powered Speech Recognition System using Wavelet Multi-Resolution Analysis with One- Dimentional CNN-LSTM }, journal = { International Journal of Computer Applications }, year = { 2025 }, volume = { 187 }, number = { 2 }, pages = { 72-81 }, doi = { 10.5120/ijca2025924807 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2025 %A Rekha S. Kotwal %A Geetanjali Jindal %T AI Powered Speech Recognition System using Wavelet Multi-Resolution Analysis with One- Dimentional CNN-LSTM%T %J International Journal of Computer Applications %V 187 %N 2 %P 72-81 %R 10.5120/ijca2025924807 %I Foundation of Computer Science (FCS), NY, USA
The objective of current project is for developing deep learning (DL)-based speech emotion detection system that may identify and categorize emotional states including happiness and sadness. For capturing spatial and temporal patterns in audio input, system uses mel-spectrogram features, that are processed employing hybrid model that combines "convolutional neural networks (CNNs)" and "long short-term memory networks (LSTMs)". Pre-trained model's efficacy in this field is further demonstrated by refinement of transformer-based Wav2Vec2 model for emotion classification. The provided methods accurately identify speech emotions, thus being beneficial for customer service, healthcare, and human-computer interaction.