Research Article

Implementation of Extractive Text Summarization using Word Frequency in Python

by  Ahmad Farhan Alshammari
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Issue 47
Published: Feb 2023
Authors: Ahmad Farhan Alshammari
10.5120/ijca2023922583
PDF

Ahmad Farhan Alshammari . Implementation of Extractive Text Summarization using Word Frequency in Python. International Journal of Computer Applications. 184, 47 (Feb 2023), 23-26. DOI=10.5120/ijca2023922583

                        @article{ 10.5120/ijca2023922583,
                        author  = { Ahmad Farhan Alshammari },
                        title   = { Implementation of Extractive Text Summarization using Word Frequency in Python },
                        journal = { International Journal of Computer Applications },
                        year    = { 2023 },
                        volume  = { 184 },
                        number  = { 47 },
                        pages   = { 23-26 },
                        doi     = { 10.5120/ijca2023922583 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2023
                        %A Ahmad Farhan Alshammari
                        %T Implementation of Extractive Text Summarization using Word Frequency in Python%T 
                        %J International Journal of Computer Applications
                        %V 184
                        %N 47
                        %P 23-26
                        %R 10.5120/ijca2023922583
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

The goal of this research is to develop an extractive text summarization program using word frequency in Python. The steps of text summarization process are: preprocessing text, word-tokenization, creating bag of words, calculating word frequency, sentence-tokenization, calculating sentence score, calculating average score, and making summary. The developed program was examined on an experimental text from Wikipedia. The program performed the steps of text summarization and provided the required summary.

References
  • Lloret, E. & Palomar, M. (2012). "Text Summarization in Progress: A Literature Review". Artificial Intelligence Review. 37, 1-41.
  • Luhn, H. P. (1958). "The Automatic Creation of Literature Abstracts". IBM Journal of research and development, 2(2), 159-165.
  • Matplotlib: https://www. matplotlib.org
  • Nenkova, A., &McKeown, K. (2012). "A Survey of Text Summarization Techniques". In Mining Text Data, Springer, 43-76.
  • NLTK: https://www.nltk.org
  • Numpy: https://www.numpy.org
  • Orasan, Constantin. (2009). "Comparative Evaluation of Term-Weighting Methods for Automatic Summarization". Journal of Quantitative Linguistics. 16, 67-95.
  • Pandas: https:// pandas.pydata.org
  • Python: https://www.python.org
  • Salton, G. (1989). "Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer". Addison- Wesley Publishing Company, USA.
  • Salton, G. & McGill, M. (1983). "Introduction to Modern Information Retrieval". McGraw Hill Book Co, New York.
  • Salton, G., & Buckley, C. (1988). "Term-Weighting approaches in Automatic Text Retrieval". Information Processing and Management, 24(5), 513-523.
  • SciKit: https://scikit-learn.org
  • Sparck Jones, K. (1972). "A Statistical interpretation of Term Specificity and its Application in Retrieval". Journal of Documentation, 28(1), 11-21.
  • Wikipedia: https://en.wikipedia.org
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Artificial Intelligence Machine Learning Text Summarization Natural Language Processing Tokenization Word Frequency Sentence Score Summary Python.

Powered by PhDFocusTM