Research Article

Arabic Sentences Classification via Deep Learning

by  Dania Sagheer, Fadel Sukkar
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 182 - Issue 5
Published: Jul 2018
Authors: Dania Sagheer, Fadel Sukkar
10.5120/ijca2018917555
PDF

Dania Sagheer, Fadel Sukkar . Arabic Sentences Classification via Deep Learning. International Journal of Computer Applications. 182, 5 (Jul 2018), 40-46. DOI=10.5120/ijca2018917555

                        @article{ 10.5120/ijca2018917555,
                        author  = { Dania Sagheer,Fadel Sukkar },
                        title   = { Arabic Sentences Classification via Deep Learning },
                        journal = { International Journal of Computer Applications },
                        year    = { 2018 },
                        volume  = { 182 },
                        number  = { 5 },
                        pages   = { 40-46 },
                        doi     = { 10.5120/ijca2018917555 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2018
                        %A Dania Sagheer
                        %A Fadel Sukkar
                        %T Arabic Sentences Classification via Deep Learning%T 
                        %J International Journal of Computer Applications
                        %V 182
                        %N 5
                        %P 40-46
                        %R 10.5120/ijca2018917555
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a Convolutional Neural Network CNN Models to classify Arabic sentences into three topics. These sentences are derived from Essex Arabic Summaries Corpus (EASC) corpus, tokenized to words and transformed to sequences of word indices. All sequences are padded to be in the same length. The models of Convolution Neural Network are built on top of word embedding layer. The word embedding layer is either pre-trained or jointed into the model. Dropout and l2 weight regularization are used to overcome the overfitting during training. The CNN models achieve high performance in accuracy for Arabic sentences classification.

References
  • Sagheer Dania, Sukkar Fadel, "A hybrid Intelligent System for Abstractive summarization”, International Journal of computer Applications, vol (168),No (9), June, 2017.
  • Xiang Zhang,Yann LeCun ,“Text understanding from scratch”, arXiv:1502.01710v5 [cs.LG], Apr, 2016.
  • Aris Kosmopoulos, “large scale hierarchical text classification”, Ph.D. thesis department of informatics Athens university of economics and business, 2015.
  • Mrs. Manisha Pravin Mali, Dr. Mohammad Atique, "Applications of Text Classification using Text Mining", International Journal of Engineering Trends and Technology (IJETT), Volume 13, Number 5, Jul 2014.
  • Ian Goodfellow, Yoshua Bengio, Aaron Courville, “Deep Learning”, An MIT Press book, 2016
  • Marc Moreno, Lopez,Jugal Kalita "Deep Learning applied to NLP", arXiv:1703.03091v1 [cs.CL] 9 Mar 2017
  • Francois chollet, “Deep learning with python”, Manning publication Co, 2018
  • PengWang, JiamingXu, BoXu,Cheng-LinLiu, HengZhang FangyuanWang, HongweiHao, “Semantic Clustering and Convolutional Neural Network for Short Text Categorization”, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Short Papers), pages 352–357, Beijing, China, July 26-31, 2015.
  • Daojian Zeng, Kang Liu, Siwei Lai, Guangyou Zhouand JunZhao, “Relation Classification via Convolutional Deep Neural Network”, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pages 2335–2344, Dublin, Ireland, 2014.
  • Yoon Kim, “Convolutional Neural Networks for Sentence Classification”, arXiv:1408.5882v2 [cs.CL], 2016
  • Ye Zhang, Byron C. Wallace, "A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification", arXiv:1510.03820v4 [cs.CL], 2016
  • NalK alchbrenner, Edward Grefenstette, Phil Blunsom, “A Convolutional Neural Network for Modelling Sentences”, arXiv:1404.2188v1, [cs.CL], 2014
  • Rie Johnson, Tong Zhang, "Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding", arXiv:1504.01255v3 [stat.ML], 2015
  • Xiang Zhang Junbo Zhao Yann Le Cun, “Character-level Convolutional Networks for Text Classification”, arXiv:1509.01626v3 [cs.LG] , 2016
  • Alexis Conneau, Holger Schwenk, Yann Le Cun, Lo¨ıc Barrault , “Very Deep Convolutional Networks for Text Classification”, arXiv:1606.01781 [cs.CL], 2017
  • Rie Johnson, Tong Zhang Baidu, “Effective Use of Word Order for Text Categorization with Convolutional Neural Networks”, Human Language Technologies: Annual Conference of the North American Chapter of the ACL, pages 103–112, Denver, Colorado, May 31 – June 5, 2015
  • Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, Eduard Hovy, “Hierarchical Attention Networks for Document Classification”, Proceedings of NAACL-HLT 2016, pages 1480–1489, San Diego, California, June 12-17, 2016.
  • Abdelghani Dahou,Shengwu Xiong, Junwei Zhou, Mohamed Houcine Haddoud and Pengfei Duan “Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification”, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 2418–2427, Osaka, Japan, 2016.
  • Arabic Corpora. Essex Arabic Summaries Corpus: https://www.essex.ac.uk/linguistics/research/arabic/arabiccorpora/easc.aspx
  • Tomas Mikolov, Kai Chen, Greg Corrado, Jeffry Dean, “Efficient Estimation of Word Representations in Vector Space”, arXiv:1301.3781v3 [cs.CL] 7 Sep 2013.
  • Abu Bakr Soliman, Kareem Eissa, Samhaa R. El-Beltagy “AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP”, 3rd International Conference on Arabic Computational Linguistics, ACLing 2017, 5-6 November 2017, Dubai, United Arab Emirates
  • Josh Patterson and Gibson, “Deep learning, A practitioner’s approach”, O’Reilly, Media, Inc., 2017
  • Hamed Habibi Aghdam, Elnaz Jahani Heravi, “Guide to Convolutional Neural Networks. A Practical Application to Traffic-Sign Detection and Classification”, Springer International Publishing AG 2017.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Classification Convolutional neural network Word Embedding

Powered by PhDFocusTM