Research Article

Sentence Boundary Disambiguation: A User Friendly Approach

by  Pritam Singh Negi, M.M.S. Rauthan, H.S. Dhami
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 7 - Issue 8
Published: October 2010
DOI: 10.5120/1269-1738

Pritam Singh Negi, M.M.S. Rauthan, H.S. Dhami. Sentence Boundary Disambiguation: A User Friendly Approach. International Journal of Computer Applications. 7, 8 (October 2010), 33-37. DOI=10.5120/1269-1738

Abstract

In the present work we develop a sentence boundary disambiguation algorithm built on maximum entropy and stop word removal modules. The algorithm works with almost 99% accuracy and outperforms the existing paragraph breaker software developed by the Text Mining Group, School of Computer Science, Manchester University, United Kingdom.
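To make the maximum entropy idea concrete, the following is a minimal sketch, not the authors' implementation. It treats every period as a candidate boundary and trains a binary maximum entropy model (equivalent to logistic regression) on a handful of toy sentences. The abbreviation list, the four features, the training sentences, and all function names are assumptions chosen for illustration. The stop word removal module is not sketched, since the abstract does not say how it is used.

```python
import math
import re

# Hypothetical toy abbreviation list; a real system would need a far larger one.
ABBREVIATIONS = {"dr", "mr", "prof", "u.s"}

# Toy training data (assumed for illustration): one sentence per entry.
TRAIN_SENTENCES = [
    "Dr. Smith arrived at 10.30 today.",
    "He met Prof. Jones in the lab.",
    "The U.S. economy grew by 2.5 percent.",
    "She said hello.",
    "Mr. Brown lives in Boston.",
    "We left early.",
]


def features(text, i):
    """Binary features for the period at index i of text."""
    prev = re.search(r"\S*$", text[:i]).group(0)   # token ending at the period
    rest = text[i + 1:]
    nxt = rest.split(None, 1)[0] if rest.strip() else ""
    return [
        1.0,                                              # bias
        1.0 if prev.lower() in ABBREVIATIONS else 0.0,    # known abbreviation
        1.0 if nxt[:1].isupper() else 0.0,                # next token capitalized
        1.0 if rest == "" or rest[0].isspace() else 0.0,  # followed by space or end
    ]


def build_training_set(sentences):
    """Join sentences with spaces and label each period as boundary (1) or not (0)."""
    text = " ".join(sentences)
    gold, pos = set(), 0
    for s in sentences:
        pos += len(s)
        gold.add(pos - 1)  # index of the sentence-final period
        pos += 1           # the joining space
    return [(features(text, m.start()), 1 if m.start() in gold else 0)
            for m in re.finditer(r"\.", text)]


def boundary_probability(w, x):
    """P(boundary | features) under the binary maximum entropy model."""
    z = sum(wj * xj for wj, xj in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))


def train(examples, epochs=5000, lr=0.1):
    """Fit weights by full-batch gradient ascent on the log-likelihood."""
    w = [0.0] * len(examples[0][0])
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in examples:
            p = boundary_probability(w, x)
            for j, xj in enumerate(x):
                grad[j] += (y - p) * xj
        w = [wj + lr * gj for wj, gj in zip(w, grad)]
    return w


def split_sentences(text, w):
    """Split text at every period the model scores as a boundary."""
    out, start = [], 0
    for m in re.finditer(r"\.", text):
        i = m.start()
        if boundary_probability(w, features(text, i)) > 0.5:
            out.append(text[start:i + 1].strip())
            start = i + 1
    tail = text[start:].strip()
    if tail:
        out.append(tail)
    return out


WEIGHTS = train(build_training_set(TRAIN_SENTENCES))
print(split_sentences("Dr. Lee paid 3.50 dollars. Then he left.", WEIGHTS))
```

The sketch only handles periods and a few hand-picked features; question marks, quotation marks, and unseen abbreviations would need further features and much more training data.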

Index Terms
Computer Science
Information Sciences
Keywords

Sentence Boundary, Information retrieval, Evaluation
