Research Article

AI based Image and Video retreival System

by  Anuja Bele, Ryan Lawrence, Darshan Butle, Himanshu Hiwanj, Kapil Gupta
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Issue 61
Published: December 2025
Authors: Anuja Bele, Ryan Lawrence, Darshan Butle, Himanshu Hiwanj, Kapil Gupta
10.5120/ijca2025925836
PDF

Anuja Bele, Ryan Lawrence, Darshan Butle, Himanshu Hiwanj, Kapil Gupta . AI based Image and Video retreival System. International Journal of Computer Applications. 187, 61 (December 2025), 32-35. DOI=10.5120/ijca2025925836

                        @article{ 10.5120/ijca2025925836,
                        author  = { Anuja Bele,Ryan Lawrence,Darshan Butle,Himanshu Hiwanj,Kapil Gupta },
                        title   = { AI based Image and Video retreival System },
                        journal = { International Journal of Computer Applications },
                        year    = { 2025 },
                        volume  = { 187 },
                        number  = { 61 },
                        pages   = { 32-35 },
                        doi     = { 10.5120/ijca2025925836 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2025
                        %A Anuja Bele
                        %A Ryan Lawrence
                        %A Darshan Butle
                        %A Himanshu Hiwanj
                        %A Kapil Gupta
                        %T AI based Image and Video retreival System%T 
                        %J International Journal of Computer Applications
                        %V 187
                        %N 61
                        %P 32-35
                        %R 10.5120/ijca2025925836
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

With the explosive growth of digital media, massive collections of images and videos are being generated every day, creating a strong need for fast and accurate retrieval systems. Existing methods that rely on metadata or manual annotations often fall short—mainly because annotations may be incomplete, inconsistent, or unable to truly represent the visual meaning of the content. In this work, we present an AI-powered image and video retrieval system designed to overcome these challenges. Our approach uses Convolutional Neural Networks (CNNs) to automatically extract rich visual features from individual frames, applies temporal aggregation to capture video-level context, and employs cosine similarity to match content efficiently. Experimental results on benchmark datasets show that our method delivers higher precision and recall than traditional content-based image retrieval (CBIR) approaches. We also discuss the strengths, limitations, and potential directions for future enhancements of the system.Keywords—SQLi, Honey Token, Zero Trust Policy, Firewall, Multi-factor Authentication.

References
  • Early CBIR methods – Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., & Jain, R. (2000). Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12), 1349–1380.
  • Semantic gap problem – Datta, R., Joshi, D., Li, J., & Wang, J. Z. (2008). Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys, 40(2), 1–60.
  • Handcrafted feature descriptors (SIFT/SURF) – Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.
  • Deep learning introduction in vision – Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. NeurIPS.
  • CNNs for image retrieval – Babenko, A., Slesarev, A., Chigorin, A., & Lempitsky, V. (2014). Neural codes for image retrieval. ECCV.
  • CNN optimization for retrieval – Razavian, A. S., Azizpour, H., Sullivan, J., & Carlsson, S. (2016). Visual instance retrieval with deep convolutional networks. ICLR Workshop.
  • Video retrieval with temporal modeling – Tran, D., Bourdev, L., Fergus, R., Torresani, L., & Paluri, M. (2015). Learning spatiotemporal features with 3D convolutional networks. ICCV.
  • Attention-based video models – Bertasius, G., Wang, H., & Torresani, L. (2021). Is space-time attention all you need for video understanding? ICML.
  • Cross-modal retrieval (CLIP) – Radford, A., et al. (2021). Learning transferable visual models from natural language supervision. ICML.
  • Cosine similarity in retrieval – Singhal, A. (2001). Modern information retrieval: A brief overview. IEEE Data Engineering Bulletin, 24(4), 35–43.
  • Scalable ANN search (FAISS) – Johnson, J., Douze, M., & Jégou, H. (2019). Billion-scale similarity search with GPUs. IEEE Transactions on Big Data.
  • Overall CBIR evolution review – Zhou, W., Li, H., & Tian, Q. (2017). Recent advances in content-based image retrieval: A literature survey. arXiv:1706.06064.
  • Content-Based Deep Learning Image Retrieval: A Survey – Li, X., Wang, Y., & Zhang, H. (2023). Content-based deep learning image retrieval: A survey. ACM Computing Surveys.
  • Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques – Sharma, P., & Gupta, R. (2023). Advancements in content-based image retrieval: A comprehensive survey of relevance feedback techniques. arXiv:2312.10089.
  • Performance Analysis of Image Retrieval System Using Deep Learning – Kumar, S., & Jain, A. (2025). Performance analysis of image retrieval system using deep learning. Journal of Visual Communication and Image Representation.
  • Deep Learning for Video-Text Retrieval: A Review – Chen, L., & Li, X. (2023). Deep learning for video-text retrieval: A review. arXiv:2302.12552.
  • Deep Video Representation Learning: A Survey – Zhao, Y., & Wang, H. (2024). Deep video representation learning: A survey. arXiv:2405.06574.
  • Content-Based Video Retrieval Using Deep Learning – Singh, R., & Patel, S. (2023). Content-based video retrieval using deep learning. ResearchGate Publication.
  • Video Description: A Comprehensive Survey of Deep Learning Approaches – Li, H., & Zhou, W. (2023). Video description: A comprehensive survey of deep learning approaches. Applied Intelligence, Springer.
  • AI-Driven Video Summarization for Optimizing Content Retrieval and Personalization – Zhang, Q., & Liu, J. (2025). AI-driven video summarization for optimizing content retrieval and personalization. Scientific Reports, Nature.
  • Cross-Modal Retrieval: A Systematic Review of Methods and Future Directions – Wang, X., & Chen, Y. (2023). Cross-modal retrieval: A systematic review of methods and future directions. arXiv:2308.14263.
  • The State of the Art for Cross-Modal Retrieval: A Survey – Li, S., & Huang, T. (2023). The state of the art for cross-modal retrieval: A survey. ResearchGate Publication.
  • A Survey of Cross-Modal Image-Text Retrieval – Zhang, Y., & Wu, L. (2023). A survey of cross-modal image-text retrieval. ICCK Proceedings.
  • A Comprehensive Survey on Composed Image Retrieval – Chen, X., & Zhang, H. (2023). A comprehensive survey on composed image retrieval. arXiv:2502.18495.
  • Cross-Modal Retrieval: A Review of Methodologies, Datasets, and Future Perspectives – Li, J., & Xu, P. (2023). Cross-modal retrieval: A review of methodologies, datasets, and future perspectives. ResearchGate Publication.
  • Bridging Modalities: A Survey of Cross-Modal Image-Text Retrieval – Wang, Y., & Zhao, X. (2023). Bridging modalities: A survey of cross-modal image-text retrieval. ICCK Proceedings.
  • A Survey of Full-Cycle Cross-Modal Retrieval – Liu, M., & Chen, S. (2023). A survey of full-cycle cross-modal retrieval. Applied Sciences Journal.
  • Deep Learning Techniques for Video Instance Segmentation: A Survey – Zhang, L., & Li, K. (2025). Deep learning techniques for video instance segmentation: A survey. Pattern Recognition.
  • Deep Learning for Instance Retrieval: A Survey – Yang, W., & Zhao, J. (2021). Deep learning for instance retrieval: A survey. arXiv:2101.11282.
  • Challenges and Opportunities of Image and Video Retrieval – Kumar, R., & Singh, P. (2023). Challenges and opportunities of image and video retrieval. ResearchGate Publication.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

AI-based Retrieval Content-Based Image Retrieval (CBIR) Content-Based Video Retrieval (CBVR) Convolutional Neural Network(CNN) Visual Similarity Search

Powered by PhDFocusTM