Research Article

Implementation of Exploratory Data Analysis (EDA) in Python

by  Ahmad Farhan Alshammari
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Issue 32
Published: August 2025
Authors: Ahmad Farhan Alshammari
10.5120/ijca2025925577
PDF

Ahmad Farhan Alshammari . Implementation of Exploratory Data Analysis (EDA) in Python. International Journal of Computer Applications. 187, 32 (August 2025), 34-42. DOI=10.5120/ijca2025925577

                        @article{ 10.5120/ijca2025925577,
                        author  = { Ahmad Farhan Alshammari },
                        title   = { Implementation of Exploratory Data Analysis (EDA) in Python },
                        journal = { International Journal of Computer Applications },
                        year    = { 2025 },
                        volume  = { 187 },
                        number  = { 32 },
                        pages   = { 34-42 },
                        doi     = { 10.5120/ijca2025925577 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2025
                        %A Ahmad Farhan Alshammari
                        %T Implementation of Exploratory Data Analysis (EDA) in Python%T 
                        %J International Journal of Computer Applications
                        %V 187
                        %N 32
                        %P 34-42
                        %R 10.5120/ijca2025925577
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

The goal of this research is to develop an exploratory data analysis model in Python. Exploratory Data Analysis (EDA) is used to understand the nature of data. It helps to identify the main characteristics of data (patterns, trends, and relationships). The application of exploratory data analysis helps to build a solid foundation for more advanced analysis. The basic steps of exploratory data analysis are explained: importing libraries, reading data, displaying data, displaying general information, computing descriptive statistics, cleaning data (duplicates, missing values, and outliers), and analyzing data (univariate, bivariate, and multivariate). The developed model was tested on an experimental dataset. The model successfully performed the basic steps of exploratory data analysis and provided the required results.

References
  • Sammut, C., & Webb, G. I. (2011). "Encyclopedia of Machine Learning". Springer.
  • Jung, A. (2022). "Machine Learning: The Basics". Springer.
  • Kubat, M. (2021). "An Introduction to Machine Learning". Springer.
  • Li, H. (2023). "Machine Learning Methods". Springer.
  • Dey, A. (2016). "Machine Learning Algorithms: A Review". International Journal of Computer Science and Information Technologies, 7 (3), 1174-1179.
  • Bonaccorso, G. (2018). "Machine Learning Algorithms: Popular Algorithms for Data Science and Machine Learning". Packt Publishing.
  • Jo, T. (2021). "Machine Learning Foundations: Supervised, Unsupervised, and Advanced Learning". Springer.
  • Jordan, M. I., & Mitchell, T. M. (2015). "Machine Learning: Trends, Perspectives, and Prospects". Science, 349 (6245), 255-260.
  • Forsyth, D. (2019). "Applied Machine Learning". Springer.
  • Chopra, D., & Khurana, R. (2023). "Introduction to Machine Learning with Python". Bentham Science Publishers.
  • Müller, A. C., & Guido, S. (2016). "Introduction to Machine Learning with Python: A Guide for Data Scientists". O'Reilly Media.
  • Zollanvari, A. (2023). "Machine Learning with Python: Theory and Implementation". Springer.
  • Raschka, S. (2015). "Python Machine Learning". Packt Publishing.
  • Sarkar, D., Bali, R., & Sharma, T. (2018). "Practical Machine Learning with Python". Apress.
  • Igual, L., & Seguí, S. (2017). "Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications". Springer.
  • VanderPlas, J. (2017). "Python Data Science Handbook: Essential Tools for Working with Data". O'Reilly Media.
  • Yale, K., Nisbet, R., & Miner, G. D. (2018). "Handbook of Statistical Analysis and Data Mining Applications". Academic Press.
  • Unpingco, J. (2022). "Python for Probability, Statistics, and Machine Learning". Springer.
  • James, G., Witten, D., Hastie, T., Tibshirani, R., & Taylor, J. (2023). "An Introduction to Statistical Learning: With Applications in Python". Springer.
  • Navlani, A., Fandango, A., & Idris, I. (2021). "Python Data Analysis". Packt Publishing.
  • Unpingco, J. (2021). "Python Programming for Data Analysis". Springer.
  • McKinney, W. (20128). "Python for Data Analysis". O'Reilly Media.
  • Embarak, O. (2018). "Data Analysis and Visualization using Python ". Apress.
  • Denis, D. J. (2021). "Applied Univariate, Bivariate, and Multivariate Statistics Using Python: A Beginner's Guide to Advanced Data Analysis". John Wiley & Sons.
  • Mukhiya, S. K., & Ahmed, U. (2020). "Hands-On Exploratory Data Analysis with Python". Packt Publishing.
  • Chen, D. (2018). "Pandas for Everyone: Python Data Analysis". Addison-Wesley.
  • Molin, S. (2019). "Hands-On Data Analysis with Pandas". Packt Publishing.
  • Myatt, G. J. (2014). "Making Sense of Data: A Practical Guide to Exploratory Data Analysis and Data Mining". John Wiley & Sons.
  • Vigni, M. L., Durante, C., & Cocchi, M. (2013). "Exploratory Data Analysis". In Data Handling in Science and Technology. 28, 55-126. Elsevier.
  • Komorowski, M., Marshall, D. C., Salciccioli, J. D., & Crutain, Y. (2016). "Exploratory Data Analysis". In Secondary Analysis of Electronic Health Records, pp. 185-203, Springer.
  • Sahoo, K., Samal, A. K., Pramanik, J., & Pani, S. K. (2019). "Exploratory Data Analysis using Python". International Journal of Innovative Technology and Exploring Engineering, 8 (12), 4727-4735.
  • Heydt, M. (2017). "Learning Pandas: High-Performance Data Manipulation and Analysis in Python". Packt Publishing.
  • Miller, C. (2018). "Hands-On Data Analysis with Numpy and Pandas". Packt Publishing.
  • Tukey, J. W. (1977). "Exploratory Data Analysis". Addison-Wesley.
  • Python: http://www.python.org
  • Numpy: http://www.numpy.org
  • Pandas: http://pandas.pydata.org
  • Matplotlib: http://www. matplotlib.org
  • Seaborn: http://seaborn.pydata.org
  • NLTK: http://www.nltk.org
  • SciPy: http://scipy.org
  • SK Learn: http://scikit-learn.org
  • Kaggle: http://www.kaggle.com
  • Jupyter: http://www.jupyter.org
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Artificial Intelligence Machine Learning Data Science Data Analysis Exploratory Data Analysis EDA Univariate Bivariate Multivariate Python Programming.

Powered by PhDFocusTM