Research Article

The DF-ICF Algorithm- Modified TF-IDF

by Puneet Goswami, Vidya Kamath
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 93 - Issue 13
Published: May 2014
DOI: 10.5120/16276-6036

Puneet Goswami, Vidya Kamath. The DF-ICF Algorithm- Modified TF-IDF. International Journal of Computer Applications. 93, 13 (May 2014), 28-30. DOI=10.5120/16276-6036

@article{10.5120/16276-6036,
  author    = {Puneet Goswami and Vidya Kamath},
  title     = {The DF-ICF Algorithm- Modified TF-IDF},
  journal   = {International Journal of Computer Applications},
  year      = {2014},
  volume    = {93},
  number    = {13},
  pages     = {28-30},
  doi       = {10.5120/16276-6036},
  publisher = {Foundation of Computer Science (FCS), NY, USA}
}
%0 Journal Article
%D 2014
%A Puneet Goswami
%A Vidya Kamath
%T The DF-ICF Algorithm- Modified TF-IDF
%J International Journal of Computer Applications
%V 93
%N 13
%P 28-30
%R 10.5120/16276-6036
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Tf-idf is an algorithm commonly used in large-scale data processing. It assigns a weight to a particular term within a document, and that weight is proportional to the importance of the term. This paper uses the idea behind tf-idf to design the df-icf algorithm, which finds the importance of a particular document within a given corpus.
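
To make the term-weighting idea concrete, the following is a minimal sketch of the standard textbook tf-idf computation (relative term frequency multiplied by the logarithm of the inverse document frequency). It is not the df-icf formula from the paper, which the abstract does not state. The function name `tf_idf` and the toy corpus are assumptions chosen only for illustration.

```python
import math
from collections import Counter


def tf_idf(corpus):
    """Return one {term: weight} dict per document.

    `corpus` is a list of documents, each a list of tokens.
    Hypothetical helper: uses the common textbook variant
    tf = count / document length, idf = ln(N / document frequency).
    """
    n_docs = len(corpus)
    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter(term for doc in corpus for term in set(doc))

    weights = []
    for doc in corpus:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus (illustrative only, not data from the paper).
corpus = [
    ["big", "data", "analytics"],
    ["big", "data", "retrieval"],
    ["page", "ranking"],
]
scores = tf_idf(corpus)
```

In this variant a term that occurs in every document receives weight zero, while a term confined to one document receives the highest weight for its frequency, which is the sense in which the weight tracks the importance of a term.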

Index Terms
Computer Science
Information Sciences
Keywords

DF-ICF, TF-IDF, Document frequency, Term frequency, Corpus, Page Ranking
