Internet-Scale Collection of Human-Reviewed Data
  • Qi Su (Yahoo! Inc)
  • Dmitry Pavlov (Yahoo! Inc)
  • Jyh-Herng Chow (Yahoo! Inc)
  • Wendell Baker (Yahoo! Inc)
Enterprise data processing and content aggregation systems often require extensive use of human-reviewed data (e.g., for training and monitoring machine learning-based applications). Today these needs are often met by in-house efforts or offshore contracting. Emerging applications attempt to automate the collection of human-reviewed data at Internet scale. We conduct extensive experiments to study the effectiveness of one such application. We also study the feasibility of using Yahoo! Answers, a general question-answering forum, for collecting human-reviewed data.
