Research Article

XML based Keyword Search

by G.Seethalakshmi, J. Swathi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 107 - Issue 15
Published: December 2014
Authors: G.Seethalakshmi, J. Swathi
DOI: 10.5120/18824-0241

G.Seethalakshmi, J. Swathi. XML based Keyword Search. International Journal of Computer Applications. 107, 15 (December 2014), 1-3. DOI=10.5120/18824-0241

@article{10.5120/18824-0241,
  author    = {G. Seethalakshmi and J. Swathi},
  title     = {XML based Keyword Search},
  journal   = {International Journal of Computer Applications},
  year      = {2014},
  volume    = {107},
  number    = {15},
  pages     = {1-3},
  doi       = {10.5120/18824-0241},
  publisher = {Foundation of Computer Science (FCS), NY, USA}
}

%0 Journal Article
%D 2014
%A G.Seethalakshmi
%A J. Swathi
%T XML based Keyword Search
%J International Journal of Computer Applications
%V 107
%N 15
%P 1-3
%R 10.5120/18824-0241
%I Foundation of Computer Science (FCS), NY, USA

Abstract

The success of information retrieval (IR) style keyword search on the web has led to the emergence of XML based keyword search. The differences between text databases and XML databases raise three new challenges: 1) The user's search intention must be identified, i.e., the XML node types that the user wants to search for and to search via. 2) Similarities in tag names, tag values, and tag structure must be identified. 3) A new scoring function is needed to estimate the relevance of each search result (an XML document) to the given query. Existing systems do not address these challenges, which leads to low-quality results in terms of query relevance. This paper proposes an IR-style approach that uses the statistics of the underlying XML data to address these challenges. First, it proposes specific guidelines that a search engine should meet in both search intention identification and relevance-oriented ranking of search results. Then, based on these guidelines, it proposes a novel XML TF*IDF ranking strategy to rank the individual matches of all possible search intentions.
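
As a rough illustration of the idea of ranking XML matches from statistics of the data itself, the sketch below applies a plain textbook TF*IDF score to subtrees of a small XML document. It is not the XML TF*IDF formula proposed in the paper, which the abstract does not spell out. The sample data, the function names `tokens` and `rank`, the choice to treat each child of the root as one candidate result, and the smoothed IDF `log(1 + N/df)` are all assumptions made for this example.

```python
import math
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample data, invented for this illustration.
SAMPLE = """
<bib>
  <book><title>XML keyword search</title><author>Liu</author></book>
  <book><title>Database systems</title><author>Chen</author></book>
  <article><title>Ranking XML search results</title><author>Liu</author></article>
</bib>
"""

def tokens(elem):
    """Return the lowercased words found anywhere in an element's subtree."""
    return " ".join(elem.itertext()).lower().split()

def rank(xml_text, query):
    """Score each child of the root against the query with plain TF*IDF.

    Assumption: every direct child of the root is one candidate result.
    Returns (score, tag, title) tuples, highest score first.
    """
    root = ET.fromstring(xml_text)
    candidates = list(root)
    bags = [Counter(tokens(c)) for c in candidates]
    n = len(candidates)
    scored = []
    for cand, bag in zip(candidates, bags):
        score = 0.0
        for kw in query.lower().split():
            # Document frequency: number of candidates containing the keyword.
            df = sum(1 for b in bags if kw in b)
            if df == 0:
                continue  # keyword occurs nowhere, contributes nothing
            idf = math.log(1 + n / df)  # smoothed IDF, one common variant
            score += bag[kw] * idf      # term frequency times IDF
        scored.append((score, cand.tag, cand.findtext("title")))
    return sorted(scored, key=lambda s: s[0], reverse=True)

if __name__ == "__main__":
    for score, tag, title in rank(SAMPLE, "xml ranking"):
        print(f"{score:.3f}  <{tag}>  {title}")
```

In this toy setting the returned tag (`book` or `article`) stands in for the node type being searched for. Identifying that type automatically, and the node types searched via, is the part of the problem the paper's guidelines address, and the sketch does not attempt it.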

Index Terms
Computer Science
Information Sciences
Keywords

XML Search ranking.
