Simon Willison’s Weblog

Subscribe

Items tagged mrjob, yelp in Oct, 2010

Filters: Year: 2010 × Month: Oct × mrjob × yelp × Sorted by date


mrjob: Distributed Computing for Everybody. Yelp use MapReduce with Hadoop (running on Amazon’s EMR service) to power all sorts of interesting features on the site, including spelling suggestions, review highlights, top searches and “people who viewed X also viewed...”. mrjob is their new open source Python framework for writing MapReduce jobs against the Hadoop streaming API. # 29th October 2010, 11:55 pm

Types

Years

Months

Tags