WWW2006 - GoGetIt!: A Tool for Generating Structure-Driven Web Crawlers
| Skip to main content | Skip to navigation |

Register Now!

GoGetIt!: A Tool for Generating Structure-Driven Web Crawlers

  • Marcio Vidal, Universidade Federal do Amazonas, Brazil
  • Altigran Silva, Federal University of Amazonas, Brazil
  • Edleno Moura, Federal University of Amazonas, Brazil
  • Joao Marcos Cavalcanti, Federal University of Amazonas, Brazil

Track: Posters

We present GoGetIt!, a tool for generating structure-driven crawlers that requires a minimum effort from the users. The tool takes as input a sample page and an entry point to a Web site and generates a structure-driven crawler based on navigation patterns, sequences of patterns for the links a crawler has to follow to reach the pages structurally similar to the sample page. In the experiments we have performed, structure-driven crawlers generated by GoGetIt! were able to collect all pages that match the samples given, including those pages added after their generation.

Other items being presented by these speakers

  • GoGetIt!: Structure-Driven Crawler Generation by Example (Posters Track)

Organised by

ECS Logo

in association with

BCS Logo ACM Logo

Platinum Sponsors

Sponsor of The CIO Dinner


Become a sponsor or exhibitor
Valid XHTML 1.0! IFIP logo WWW Conference Committee logo Web Consortium logo Valid CSS!