|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 2 |
| Published: May 2025 |
| Authors: Rekha S. Kotwal, Geetanjali Jindal |
10.5120/ijca2025924807
|
Rekha S. Kotwal, Geetanjali Jindal . AI Powered Speech Recognition System using Wavelet Multi-Resolution Analysis with One- Dimentional CNN-LSTM. International Journal of Computer Applications. 187, 2 (May 2025), 72-81. DOI=10.5120/ijca2025924807
@article{ 10.5120/ijca2025924807,
author = { Rekha S. Kotwal,Geetanjali Jindal },
title = { AI Powered Speech Recognition System using Wavelet Multi-Resolution Analysis with One- Dimentional CNN-LSTM },
journal = { International Journal of Computer Applications },
year = { 2025 },
volume = { 187 },
number = { 2 },
pages = { 72-81 },
doi = { 10.5120/ijca2025924807 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2025
%A Rekha S. Kotwal
%A Geetanjali Jindal
%T AI Powered Speech Recognition System using Wavelet Multi-Resolution Analysis with One- Dimentional CNN-LSTM%T
%J International Journal of Computer Applications
%V 187
%N 2
%P 72-81
%R 10.5120/ijca2025924807
%I Foundation of Computer Science (FCS), NY, USA
The objective of current project is for developing deep learning (DL)-based speech emotion detection system that may identify and categorize emotional states including happiness and sadness. For capturing spatial and temporal patterns in audio input, system uses mel-spectrogram features, that are processed employing hybrid model that combines "convolutional neural networks (CNNs)" and "long short-term memory networks (LSTMs)". Pre-trained model's efficacy in this field is further demonstrated by refinement of transformer-based Wav2Vec2 model for emotion classification. The provided methods accurately identify speech emotions, thus being beneficial for customer service, healthcare, and human-computer interaction.