Simon Willison’s Weblog

Subscribe

Saturday, 13th December 2008

Freebase Sets (via) Give it some topics and it will tell you what they have in common and show further topics matching the same rules. Kind of like the old Google Labs sets tool but this one shows its workings.

# 9:26 am / freebase, freebasesets, sets

YQL—converting the web to JSON with mock SQL. YQL just got a whole lot more interesting to me—I had no idea they were exposing an HTML and RSS scraping tool over a JSONP API in addition to all of the Yahoo! web service methods.

# 9:39 am / html, json, jsonp, scraping, screenscraping, sql, yahoo, yql

Yahoo! Query Language Console. Neat developer tool for playing around with YQL.

# 9:39 am / console, yahoo, yql

ETags And Modification Times In Django. Part of Malcolm’s series of tutorials on implementing advanced HTTP concepts in Django.

# 9:49 am / caching, django, etags, http, malcolmtredinnick

Scaling memcached at Facebook. Fascinating techie details on how Facebook forked memcache to use UDP and increase performance from 50,000 requests a second to 200,000. Now running on 800 servers with 28 TB of memory, and their code is on GitHub. (They may scale like crazy, but they can’t put their blog entry title in the title element?)

# 10:08 am / facebook, memcached, scaling, udp

ZooBorns. Best blog idea ever: news and photos of baby animals born in zoos around the world. Nicely categorised as well.

# 10:18 pm / animals, babyanimals, blogs, cute, zooborns, zoos

There. Is. No. Long-Term. Data. Storage. Solution. There is only a series of short-term solutions punctuated by data migration from one medium to the next.

Mark Pilgrim

# 11:36 pm / backups, mark-pilgrim

2008 » December

MTWTFSS
1234567
891011121314
15161718192021
22232425262728
293031