Feed Sign in with OpenID OpenID

Simon Willison’s Weblog

3 items tagged “hadoop”

3 and 1/2 minutes to sort a Terabyte, and a look at Hadoop’s code structure. Bill de hÓra uses some clever static analysis tools to explore Hadoop’s 100,000+ lines of code. 0 7th July 2008, 2:15 pm

Python + Hadoop = Flying Circus Elephant. Last.fm have released Dumbo, a Python module that lets you easily write Hadoop map/reduce tasks using Python and generators. 0 31st May 2008, 2:14 pm

Writing An Hadoop MapReduce Program In Python. Hadoop (the open source map/reduce framework) can interact with any program that reads from stdin and outputs on stdout—so it’s trivial to drop in Python scripts for the map and reduce steps. 0 9th October 2007, 11:33 am

A django site