Research Article

Fraud Website Detection using Data Mining

by  Urvashi Prajapati, Neha Sangal, Deepti Patole
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 141 - Issue 3
Published: May 2016
Authors: Urvashi Prajapati, Neha Sangal, Deepti Patole
10.5120/ijca2016909590
PDF

Urvashi Prajapati, Neha Sangal, Deepti Patole . Fraud Website Detection using Data Mining. International Journal of Computer Applications. 141, 3 (May 2016), 40-44. DOI=10.5120/ijca2016909590

                        @article{ 10.5120/ijca2016909590,
                        author  = { Urvashi Prajapati,Neha Sangal,Deepti Patole },
                        title   = { Fraud Website Detection using Data Mining },
                        journal = { International Journal of Computer Applications },
                        year    = { 2016 },
                        volume  = { 141 },
                        number  = { 3 },
                        pages   = { 40-44 },
                        doi     = { 10.5120/ijca2016909590 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2016
                        %A Urvashi Prajapati
                        %A Neha Sangal
                        %A Deepti Patole
                        %T Fraud Website Detection using Data Mining%T 
                        %J International Journal of Computer Applications
                        %V 141
                        %N 3
                        %P 40-44
                        %R 10.5120/ijca2016909590
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Phishing attack is used to steal confidential information of user. Fraud websites appear similar to genuine websites with the logo and graphics of trusted website. Fraud Website Detection application aims to detect fraud websites using data mining techniques. This project provides intelligent solution to phishing attack. W3C standard defines characteristics which can be used to distinguish fraud and legal website. This application extracts some characteristics from URL and source code of a website. These features are used for classification. RIPPER algorithm is used to classify the websites. After classifying the websites, the application sends notification email to the administrator using WHOIS protocol. The administrator may block the fraud website after verification.

References
  • Peter Stavroulakis, Mark Stamp,“Handbook of Information and Communication Security”,Springer.
  • Phishing statistics, https://docs.apwg.org/reports/apwg_trends_report_q4_2014.pdf
  • M. Dunlop, S. Groat, and D. Shelly," GoldPhish: Using Images for Content-Based Phishing Analysis", in the Fifth International Conference on Internet Monitoring and Protection, 2010.
  • JRip algorithm pseudocode, http://weka.sourceforge.net/doc.dev/weka/classifiers/rules/JRip.html
  • Robert Stahlbock, Sven F. Crone, Stefan Lessmann ,“Data Mining: Special Issue in Annals of Information Systems”,Springer.
  • Zhongyu Lu, “Information Retrieval Methods for Multidisciplinary Applications”, Information Science Reference.
  • Mohammad, R., Thabtah, F., & McCluskey, L. (2012). ― An assessment of features related to phishing websites using an automated technique‖. In The 7th international conference for internet technology and secured transactions(ICITST-2012). London: ICITST.
  • Omar Abdullah Batarfi Mona Ghotaish Alkhozae, “Phishing websites detection based on phishing characteristics in the webpage source code,” International Journal of Information and Communication Technology Research, October 2011.
  • Garth O. Bruen ,“WHOIS Running the Internet: Protocol, Policy, and Privacy”,Wiley.
  • Ihab Shraim, Laura Mather, Patrick Cain, Rod Rasmussen, “Advisory on Utilization of Whois Data For Phishing Site Take Down ”, APWG Internet Protocol Committee,March 2008.
  • Data for legal websites, http://www.dmoz.org/
  • Data for Fraud websites, https://www.phishtank.com/
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Phishing JRip RIPPER Fraud website source code data mining WHOIS

Powered by PhDFocusTM