WWW2006 - Detecting Nepotistic Links by Language Model
| Skip to main content | Skip to navigation |

Register Now!

Detecting Nepotistic Links by Language Model

  • András A. Benczúr, Computer and Automation Research Institute, Hungarian Academy of Sciences, Hungary
  • Istvan Biro, Eotvos University, Budapest, Hungary
  • Károly Csalogány, Computer and Automation Research Institute, Hungarian Academy of Sciences, Hungary
  • Mate Uher, Computer and Automation Research Institute, Hungarian Academy of Sciences, Hungary

Full text:

Poster:

Track: Posters

In this short note we demonstrate the applicability of hyperlink downweighting by means of language model disagreement. The method filters out hyperlinks with no relevance to the target page without the need of white and blacklists or human interaction. We fight both comment spam in blogs and guestbooks as well as various forms of nepotism such as common maintainers, ads or link exchanges. Our method is tested on a 31 M page crawl of the .de domain with a manually classified 1000-page random sample.

Citation

Benczúr, A. A., Bíró, I., Csalogány, K., and Uher, M. 2006. Detecting nepotistic links by language model disagreement. In Proceedings of the 15th International Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 939-940.
DOI= http://doi.acm.org/10.1145/1135777.1135954

Other items being presented by these speakers

  • To Randomize or Not To Randomize: Space Optimal Summaries for Hyperlink Analysis (Search Track)

Organised by

ECS Logo

in association with

BCS Logo ACM Logo

Platinum Sponsors

Sponsor of The CIO Dinner


Become a sponsor or exhibitor
Valid XHTML 1.0! IFIP logo WWW Conference Committee logo Web Consortium logo Valid CSS!