Research Article

Multifactor Affiliation Analysis: A Multifactor Dimensionality Reduction based Learning Model for Knowledge Discovery and Similarity Measure in 2-way Data Classification

by  Aditya C.R., M.B. Sanjay Pande
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 130 - Issue 5
Published: November 2015
Authors: Aditya C.R., M.B. Sanjay Pande
10.5120/ijca2015906947
PDF

Aditya C.R., M.B. Sanjay Pande . Multifactor Affiliation Analysis: A Multifactor Dimensionality Reduction based Learning Model for Knowledge Discovery and Similarity Measure in 2-way Data Classification. International Journal of Computer Applications. 130, 5 (November 2015), 1-5. DOI=10.5120/ijca2015906947

                        @article{ 10.5120/ijca2015906947,
                        author  = { Aditya C.R.,M.B. Sanjay Pande },
                        title   = { Multifactor Affiliation Analysis: A Multifactor Dimensionality Reduction based Learning Model for Knowledge Discovery and Similarity Measure in 2-way Data Classification },
                        journal = { International Journal of Computer Applications },
                        year    = { 2015 },
                        volume  = { 130 },
                        number  = { 5 },
                        pages   = { 1-5 },
                        doi     = { 10.5120/ijca2015906947 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2015
                        %A Aditya C.R.
                        %A M.B. Sanjay Pande
                        %T Multifactor Affiliation Analysis: A Multifactor Dimensionality Reduction based Learning Model for Knowledge Discovery and Similarity Measure in 2-way Data Classification%T 
                        %J International Journal of Computer Applications
                        %V 130
                        %N 5
                        %P 1-5
                        %R 10.5120/ijca2015906947
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Extracting useful information from the datasets of high dimension and representing the learnt knowledge in an efficient way is a challenge in knowledge discovery and data mining. Although many pattern recognition, knowledge discovery and data mining techniques are available in literature, there is a need for techniques that represent the high dimensional data in a low dimension by preserving useful information for supervised learning. In this work, we design a novel model which effectively captures both inter-feature and intrafeature relationships in the sample space for knowledge discovery by performing dimensionality reduction, using a modified version of multi-factor dimensionality reduction based algorithm. The model uses the learnt knowledge to quantify the similarity of a test sample with respect to a specific class. The evaluation of the model on Fisher

References
  • Marylyn D Ritchie Alison A Motsinger. Multifactor dimensionality reduction: An analysis strategy for modelling and detecting gene gene interactions in human genetics and pharmacogenomics studies. Human Genomics, 2(5)(5):318–328, march 2006.
  • Phipps Arabie, Lawrence J Hubert, and Geert De Soete. Clustering and classification. World Scientific, 1996.
  • Seung-Seok Choi, Sung-Hyuk Cha, and Charles C Tappert. A survey of binary similarity and distance measures. Journal of Systemics, Cybernetics and Informatics, 8(1):43–48, 2010.
  • E R Davies. Machine vision: theory, algorithms, practicalities. Signal Processing and Its Applications, 1996.
  • Michel Marie Deza and Elena Deza. Encyclopedia of distances. Springer, 2009.
  • Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth. From data mining to knowledge discovery in databases. AI magazine, 17(3):37, 1996.
  • Usama M Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, et al. Knowledge discovery and data mining: Towards a unifying framework. In KDD, volume 96, pages 82–88, 1996.
  • Usama M Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy. Advances in knowledge discovery and data mining. 1996.
  • Jiawei Han. Data mining techniques. In ACM SIGMOD Record, volume 25, page 545. ACM, 1996. Markus Hegland. Data mining techniques. Acta Numerica 2001, 10:313–355, 2001.
  • E. Dallas Johnshon. Applied multivariate methods for data analysis. Kanas University, Duxbury Press, 1998.
  • M. Lichman. UCI machine learning repository, 2013.
  • Oded Maimon and Lior Rokach. Data mining and knowledge discovery handbook, volume 2. Springer, 2005.
  • Alison A Motsinger Marylyn D Ritchie. Multi factor dimensionality reduction for detecting gene-gene and geneenvironment interactions in pharmacogenomics studies. Pharmogenomics, 2005.
  • Alison A Motsinger and Marylyn D Ritchie. Multifactor dimensionality reduction: an analysis strategy for modelling and detecting gene-gene interactions in human genetics and pharmacogenomics studies. Human genomics, 2(5):318, 2006.
  • George Nagy. Twenty years of document image analysis in pami. IEEE Transactions on Pattern Analysis & Machine Intelligence, (1):38–62, 2000.
  • Thair Nu Phyu. Survey of classification techniques in data mining. In Proceedings of the International MultiConference of Engineers and Computer Scientists, volume 1, pages 18– 20, 2009.
  • Marylyn D Ritchie and Alison A Motsinger. Multifactor dimensionality reduction for detecting gene-gene and geneenvironment interactions in pharmacogenomics studies. 2005.
  • Avi Silberschatz and Alexander Tuzhilin. What makes patterns interesting in knowledge discovery systems. Knowledge and Data Engineering, IEEE Transactions on, 8(6):970–974, 1996.
  • Wikipedia. Multifactor dimensionality reduction — wikipedia, the free encyclopedia, 2014. [Online; accessed 15-September-2015].
  • Wikipedia. Euclidean distance—wikipedia, the free encyclopedia, 2015. [Online; accessed 15-September-2015].
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Multifactor Dimensionality Reduction Knowledge Discovery Similarity Measure Classification

Powered by PhDFocusTM