WWW2008 Posters - WWW 2008: Posters


Track: Posters

Paper Title:
Collaborative Filtering on Skewed Datasets


  • Somnath Banerjee (Hewlett-Packard Labs)
  • Krishnan Ramanathan (Hewlett-Packard Labs)

Many real-life datasets have skewed event distributions, in which the probability of observing a few events far exceeds that of the others. In this paper, we observe that on skewed datasets state-of-the-art collaborative filtering methods perform worse than a simple probabilistic model. Our test bench includes a real ad click-stream dataset, which is naturally skewed. The same conclusion holds even for a popular movie rating dataset when we pose the binary prediction problem of whether or not a user will give a movie the maximum rating.
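To make the binary prediction setup concrete, the following is a minimal illustrative sketch. It is not the authors' model, which this abstract does not specify. It assumes ratings arrive as (user, item, rating) triples, assumes a maximum rating of 5, and uses a smoothed per-item frequency as a stand-in for "a simple probabilistic model". The function names and the smoothing parameters `alpha` and `beta` are hypothetical.

```python
from collections import defaultdict


def binarize(ratings, max_rating=5):
    """Turn (user, item, rating) triples into (user, item, label) triples.

    label is 1 if the rating equals max_rating (assumed to be 5), else 0.
    """
    return [(u, i, 1 if r == max_rating else 0) for u, i, r in ratings]


def fit_item_rates(events, alpha=1.0, beta=1.0):
    """Estimate P(label = 1 | item) as a smoothed per-item frequency.

    alpha and beta are pseudo-counts (an assumption of this sketch) that
    keep the estimate away from 0 and 1 for rarely seen items.
    Returns (per-item rates, global rate used for unseen items).
    """
    positives = defaultdict(int)
    totals = defaultdict(int)
    for _, item, label in events:
        positives[item] += label
        totals[item] += 1

    global_rate = (sum(positives.values()) + alpha) / (
        sum(totals.values()) + alpha + beta
    )
    rates = {
        item: (positives[item] + alpha) / (totals[item] + alpha + beta)
        for item in totals
    }
    return rates, global_rate


def predict(rates, global_rate, item):
    """Return the estimated probability of a positive label for an item."""
    return rates.get(item, global_rate)
```

On a skewed dataset most labels are 0, so a frequency-based baseline of this kind can already be hard to beat. That is the kind of comparison the abstract describes, although the exact model used in the paper may differ.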


Inquiries can be sent to: Email contact: program-chairs at www2008.org
