Simon Willison’s Weblog

Subscribe

Wednesday, 21st October 2009

Introducing BERT and BERT-RPC. Justification for inventing a brand new serialisation protocol: Thrift and Protocol Buffers both use IDLs and code generation, XML “is not convertible to a simple unambiguous data structure in any language I’ve ever used” and JSON lacks support for unencoded binary data. The result is BERT—Binary ERlang Term—which extracts a format from Erlang in much the same way that JSON extracted one from JavaScript. # 10:11 pm

How We Made GitHub Fast. Detailed overview of the new GitHub architecture. It’s a lot more complicated than I would have expected—lots of moving parts are involved in ensuring they can scale horizontally when they need to. Interesting components include nginx, Unicorn, Rails, DRBD, HAProxy, Redis, Erlang, memcached, SSH, git and a bunch of interesting new open source projects produced by the GitHub team such as BERT/Ernie and ProxyMachine. # 9:14 pm

Introducing Cloudera Desktop. It’s a GUI for Hadoop, and under the hood is a whole stack of open source software, including Python, Django, MooTools, Twisted, lxml, CherryPy, Mako, Java and AspectJ. # 6:48 pm

You count the “value” that is lost by people who would have made money selling rival goods, but can’t now because they can’t compete with free. But you don’t count the value that is created by people who build upon the freely given goods. [...] In other words, you only look at the first-order effects. It’s the same mistake a lot of people make when they accuse open source developers of “dumping” and ruining the market for competing software. That’s true, in a very narrow sense, but it ignores all the other people who took that software and used it to create something else of value.

Mark Pilgrim # 9:59 am

Comcast: Twitter Has Changed The Culture Of Our Company. “Frank Eliason (@Comcastcares on Twitter) now has 11 people working under him simply to respond to information about Comcast being broadcast on Twitter.” # 9:56 am