WWW2008 Posters - WWW 2008: Posters
Skip to main content.

Posters


Track: Posters

Paper Title:
RACE: Finding and Ranking Compact Connected Trees for Keyword Proximity Search over XML Documents

Authors:

  • Guoliang Li(Tsinghua University)
  • Jianhua Feng(Tsinghua University)
  • Jianyong Wang(Tsinghua University)
  • Bei Yu(National University of Singapore)
  • Yukai He(Tsinghua University)

Abstract:
In this paper, we study the problem of keyword proximity search over XML documents and leverage the efficiency and effectiveness. We take the disjunctive semantics among input keywords into consideration and identify meaningful compact connected trees as the answers of keyword proximity queries. We introduce the notions of Compact Lowest Common Ancestor (CLCA) and Maximal CLCA (MCLCA) and propose Compact Connected Trees (CCTrees) and Maximal CCTrees (MCCTrees) to efficiently and effectively answer keyword queries. We propose a novel ranking mechanism, textsf{RACE}, to Rank compAct Connected trEes, by taking into consideration both the structural similarity and the textual similarity. Our extensive experimental study shows that our method achieves both high search efficiency and effectiveness, and outperforms existing approaches significantly.

PDF version












Inquiries can be sent to: Email contact: program-chairs at www2008.org

Valid XHTML 1.0 Transitional