International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 128 - Issue 2 |
Published: October 2015 |
Authors: Anagha N. Chaudhari |
![]() |
Anagha N. Chaudhari . A Novel Approach for Development of an Expert IR System using Dimensionality Reduction Techniques and Clustering Approaches for High Dimensionality Dataset. International Journal of Computer Applications. 128, 2 (October 2015), 48-53. DOI=10.5120/ijca2015906459
@article{ 10.5120/ijca2015906459, author = { Anagha N. Chaudhari }, title = { A Novel Approach for Development of an Expert IR System using Dimensionality Reduction Techniques and Clustering Approaches for High Dimensionality Dataset }, journal = { International Journal of Computer Applications }, year = { 2015 }, volume = { 128 }, number = { 2 }, pages = { 48-53 }, doi = { 10.5120/ijca2015906459 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2015 %A Anagha N. Chaudhari %T A Novel Approach for Development of an Expert IR System using Dimensionality Reduction Techniques and Clustering Approaches for High Dimensionality Dataset%T %J International Journal of Computer Applications %V 128 %N 2 %P 48-53 %R 10.5120/ijca2015906459 %I Foundation of Computer Science (FCS), NY, USA
In day to day life huge amount of electronic data is generated from various resources. Such data is literally large and not easy to work with for storage and retrieval. This type of data can be treated with various efficient techniques for cleaning, compression and sorting of data. Preprocessing can be used to remove basic English stop-words from data making it compact and easy for further processing; later dimensionality reduction techniques make data more efficient and specific. This data later can be clustered for better information retrieval. This paper elaborates the various dimensionality reduction and clustering techniques applied on sample dataset C50test of 2500 documents giving promising results, their comparison and better approach for relevant information retrieval.