WWW2006 - CiteSeerX: an Architecture and Web Service Design for an Academic Document Search Engine

CiteSeerX: an Architecture and Web Service Design for an Academic Document Search Engine

  • Huajing Li, Pennsylvania State University, USA
  • Isaac Councill, Pennsylvania State University, USA
  • Wang-Chien Lee, Pennsylvania State University, USA
  • C. Lee Giles, Pennsylvania State University, USA

Track: Posters

CiteSeer is a scientific literature digital library and search engine that automatically crawls and indexes scientific documents in computer and information science. After nearly ten years as a public search engine, CiteSeer is running into scaling problems: handling more documents, adding new features, and serving more users. Its monolithic architecture prevents it from making effective use of new web technologies and from providing new services. After analyzing the problems of the current system, we propose a new architecture and data model, CiteSeerX. CiteSeerX will overcome the existing problems and provide scalability, better performance, new services, and new system features.

Citation

Li, H., Councill, I., Lee, W., and Giles, C. L. 2006. CiteSeerx: an architecture and web service design for an academic document search engine. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 883-884.
DOI= http://doi.acm.org/10.1145/1135777.1135926

Other items being presented by these speakers

  • Probabilistic Models for Discovering E-Communities (E* Applications: E-Communities, E-Learning, E-Commerce, E-Science, E-Government, and E-Humanities Track)
