International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 187 - Issue 30 |
Published: August 2025 |
Authors: Zannirah Muhammed Sammani, Mohammed Abo Rizka |
![]() |
Zannirah Muhammed Sammani, Mohammed Abo Rizka . A Hybrid LSTM-CNN Approach for Multimodal Sentiment Analysis: Combining Text and Image Features. International Journal of Computer Applications. 187, 30 (August 2025), 34-42. DOI=10.5120/ijca2025925526
@article{ 10.5120/ijca2025925526, author = { Zannirah Muhammed Sammani,Mohammed Abo Rizka }, title = { A Hybrid LSTM-CNN Approach for Multimodal Sentiment Analysis: Combining Text and Image Features }, journal = { International Journal of Computer Applications }, year = { 2025 }, volume = { 187 }, number = { 30 }, pages = { 34-42 }, doi = { 10.5120/ijca2025925526 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2025 %A Zannirah Muhammed Sammani %A Mohammed Abo Rizka %T A Hybrid LSTM-CNN Approach for Multimodal Sentiment Analysis: Combining Text and Image Features%T %J International Journal of Computer Applications %V 187 %N 30 %P 34-42 %R 10.5120/ijca2025925526 %I Foundation of Computer Science (FCS), NY, USA
An efficient deep learning framework is proposed for sentiment analysis that leverages both textual and visual modalities. The architecture integrates Long Short-Term Memory (LSTM) networks for capturing sequential dependencies in textual data with Convolutional Neural Networks (CNNs) for analyzing visual content. This multimodal fusion enhances sentiment classification accuracy. The model is assessed on two benchmark datasets—Memes and MVSA—and its performance is compared to traditional machine learning models such as Support Vector Machines and Logistic Regression, as well as the transformer-based VisualBERT. Although VisualBERT achieves slightly higher accuracy (83.18% on Memes and 81.29% on MVSA), the proposed approach delivers comparable results (77.70% and 80.42%, respectively) while maintaining a much lower computational footprint. This balance between performance and efficiency highlights the model’s practical value for applications where computational resources are limited or real-time analysis is required.