Simon Willison’s Weblog

Subscribe

October 2010

Oct. 29, 2010

mrjob: Distributed Computing for Everybody. Yelp use MapReduce with Hadoop (running on Amazon’s EMR service) to power all sorts of interesting features on the site, including spelling suggestions, review highlights, top searches and “people who viewed X also viewed...”. mrjob is their new open source Python framework for writing MapReduce jobs against the Hadoop streaming API.

# 11:55 pm / mrjob, yelp, recovered

Oct. 30, 2010

Running Yahoo! Pipes on Google App Engine. “The pipe2py package can compile a Yahoo! Pipe into pure Python source code, or it can interpret the pipe on-the-fly”—makes smart use of Python generators, and comes with tools to run the resulting compiled code on Google App Engine.

# 12:11 am / recovered

WebKit is Dropping HTML5 “popstate” Events (via) Really nasty bug with WebKit’s pushState support, discovered by Ben Cherry from Twitter. popState events get dropped if the user navigates while an outbound network request is in progress.

# 7:41 am / bugs, html5, pushstate, twitter, webkit, recovered

A predictable web of data—the why of YQL. Christian Heilmann is moving from Yahoo! to Mozilla to head up their evangelism team, and has marked the occasion by releasing the first chapter of a proposed book on YQL.

# 7:44 am / christian-heilmann, evangelism, mozilla, yahoo, yql, recovered

2010 » October

MTWTFSS
    123
45678910
11121314151617
18192021222324
25262728293031