Research Article

Keyword based Automatic Summarization of HTML Documents

by  Shivangi Gupta, Mukesh Rawat
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 127 - Issue 8
Published: October 2015
Authors: Shivangi Gupta, Mukesh Rawat
10.5120/ijca2015906421
PDF

Shivangi Gupta, Mukesh Rawat . Keyword based Automatic Summarization of HTML Documents. International Journal of Computer Applications. 127, 8 (October 2015), 24-29. DOI=10.5120/ijca2015906421

                        @article{ 10.5120/ijca2015906421,
                        author  = { Shivangi Gupta,Mukesh Rawat },
                        title   = { Keyword based Automatic Summarization of HTML Documents },
                        journal = { International Journal of Computer Applications },
                        year    = { 2015 },
                        volume  = { 127 },
                        number  = { 8 },
                        pages   = { 24-29 },
                        doi     = { 10.5120/ijca2015906421 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2015
                        %A Shivangi Gupta
                        %A Mukesh Rawat
                        %T Keyword based Automatic Summarization of HTML Documents%T 
                        %J International Journal of Computer Applications
                        %V 127
                        %N 8
                        %P 24-29
                        %R 10.5120/ijca2015906421
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Automatic summarization [5] can be defined as the procedure to create a short version of a text by a computer program. Its product still contains the most important points of the existing text. Multi-document summarization [6] can be defined as an automatic procedure which extracts information from multiple texts that is written about the same topic. Resulting summary report allows individual users or professional information consumers, to quickly familiarize themselves with information that is contained in a large cluster of documents. Multi-document summarization creates information reports that are both concise and comprehensive.

References
  • James Allan, Jaime Carbonell, George Doddington,Jonathan Yamron, and Yiming Yang. 1998. Topic detection and tracking pilot study: Final report. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop.
  • Chinatsu Aone, M. E. Okurowski, J. Gorlinsky, and B. Larsen. 1997. A scalable summarization system using robust NLP. In Proceedings of the ACL'97/EACL'97 Workshop on Intelligent Scalable Text Summarization, pages 66-73, Madrid, Spain.
  • Breck Baldwin and Thomas S. Morton. 1998. Dynamic coreference-based summarization. In Proceedings of the Third Conference on Empirical Methods in Natural Language Processing (EMNLP-3), Granada, Spain, June.
  • Regina Barzilay and Michael Elhadad. 1997. Using lexical chains for text summarization. In Proceedings of the CL'97/EACL'97 Workshop on Intelligent Scalable Text Summarization, pages 10-17, Madrid, Spain.
  • A. Siddharthan, A. Nenkova, and K. McKeown. Syntactic simplification for improving content selection in multi-document summarization. In Proc.of COLING, 2004.
  • L. Vanderwende, H. Suzuki, and C. Brockett. Microsoft Research at DUC2006: Taskfocused summarization with sentence simplification and lexical expansion. In Proc. of DUC, 2006.
  • X. Wan and J. Yang. Improved affinity graph based multi-document summarization. In Proceedings of HLT-NAACL, Companion Volume: Short Papers, pages 181–184, 2006.
  • D.Zajic, B. Dorr, andR. Schwartz.Automatic headline generation for newspaper stories. In Proc. of DUC, 2002.
  • D. Zajic, B. Dorr, J. Lin, C. Monz, and R. Schwartz. A sentence-trimming approach to multidocument summarization. In Proc. of DUC, 2005.
  • Satoshi Sekine and C Nobata. Sentence Extraction with Information Extraction Technique. In Workshop on Text Summarization, 2001.
  • Christopher D. Manning, Prabhakar Raghavan,” An Introduction to information retrieval”,Cambridge University Press Cambridge, England, Online edition (c) 2009.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Automatic summarization multi-document summarization multiple texts pre- processing of text.

Powered by PhDFocusTM