CFP Navigation
  • Important Dates

  • Refereed Papers Track

  • Alternate Tracks

  • Posters

  • Tutorials and Workshops

  • Developers Day

  • Panels

  • Chairs & PC


Refereed Papers Track

  • Search
  • The Web has over three billion pages stored at millions of servers. HTML is the embedding medium of choice, with evolving XML-based data exchange standards. The extreme heterogeneity in organization and structure makes it difficult to harness the power in this critical information resource. While search engines have made great strides in recent years, especially in terms of hyperlink analysis, we anticipate further exciting developments in Web search, specifically, new technologies that will help break through the barriers of syntactic search and content heterogeneity. The focus of the Search area of the WWW2004 refereed papers track includes, but is not limited to, the following topics:

    • Indexing, information retrieval, metasearch
    • Hyperlink graph analysis and its application to search and ranking
    • Machine learning and mining for unstructured, semistructured, and relational data
    • Query mechanisms and index structures for semi-structured and XML querying
    • Novel models for text, hypertext and Web evolution
    • Statistical analysis of Web-level, site-level, page-level, markup-level, and language-level structure
    • Natural language analysis, especially robust statistical techniques
    • Integration and reconciliation of semantically similar structure and  schema across diverse sites
    • Information extraction and message understanding
    • Question answering
    If you would like information, or to volunteer, please contact Mae Isaac, mkisaac@us.ibm.com