Simon Willison’s Weblog

Subscribe
Atom feed

Blogmarks

Filters: Sorted by date

Delighting with Data. Tom Taylor’s full transcript and slides for his recent talk at Oxford Geek Night—talks about Twitter bots, wikinear, iamnear.net and various other small but neat data repurposing projects.

# 1st July 2008, 1:24 pm / fireeagle, iamnear, oxfordgeeknight, tom-taylor, wikinear

Evil GIFs: Partial Same Origin Bypass with Hybrid Files. First there were PNGs that had crossdomain.xml files embedded in them, now there are GIFs that contain Java applets (as JAR files). At this point I’d say don’t even bother trying to validate uploaded files, just make sure they’re served off an entirely different domain instead where XSS doesn’t matter.

# 1st July 2008, 8:58 am / applets, crossdomainxml, gifs, javaapplets, pngs, security, uploads, validation, xss

Javascript protocol fuzz results. If your HTML sanitizer uses blacklisting rather than whitelisting here are a few more weird ways of injecting javascript: in to a link that you need to worry about—but you should really switch to whitelisting http:// and https:// instead.

# 30th June 2008, 3:57 pm / blacklisting, firefox, fuzztesting, html, javascript, sanitization, security, whitelisting

The end of LugRadio. Wow. LugRadio was a podcast before the term podcast had even been coined. It will be sorely missed.

# 30th June 2008, 2:03 pm / lugradio, podcasts, stuart-langridge

Enough Already with the Connections! Comet doesn’t mean making long-lived HTTP connections (which most browsers do anyway thanks to HTTP keep-alive), it means making long-held HTTP requests. I’m guilty of spreading this misinformation in the past.

# 30th June 2008, 9:27 am / comet, connections, correction, http, keepalive, requests

Dark Launches, Gradual Ramps and Isolation: Testing the Scalability of New Features on your Web Site. Smart advice from Dare Obasanjo that extend the “dark launch” idea illustrated by Facebook chat a few weeks ago.

# 29th June 2008, 2:22 pm / dare-obasanjo, darklaunches, facebook, gradualramps, isolation, scaling, systemarchitecture

Dissecting today’s Internet traffic spikes (via) Theo Schlossnagle on how the increasing popularity of interest aggregation services such as Digg and Reddit result in traffic spikes that dwarf the old Slashdot effect, making a the old rules of thumb for capacity planning irrelevant.

# 29th June 2008, 2:12 pm / capacity-planning, digg, reddit, scaling, slashdotting, theo-schlossnagle

Microformats and accessibility: the soap opera that never ends. “Be sure to tune in next week, when we’ll drown a leading accessibility expert to see if she’s a witch.”

# 29th June 2008, 8:44 am / accessibility, funny, mark-pilgrim, microformats, witch

Graphite. Real-time graphing package for server monitoring, similar to RRDTool. Created by the team at Orbitz, using Django and ExtJS for the frontend and Cairo to generate the graphs.

# 28th June 2008, 11:53 pm / cairo, django, extjs, graphing, graphite, monitoring, orbitz, python, rrdtool

RefactorMyCode.com. Neat community for discussing improvements to code snippets. Login using OpenID.

# 28th June 2008, 11:46 pm / community, openid, refactoring, refactormycode

BBC iPlayer Beta. Preview of the new version of the iPlayer. Nice to be able to listen to Radio programmes in the same interface as TV without having to use the cramped popup window.

# 28th June 2008, 9:35 pm / bbc, iplayer, radio

How to sell your software for $20,000 (via) The best article I’ve read on software entrepreneurship in ages.

# 28th June 2008, 9:21 am / business, entrepreneurship, software, startups

Module Pattern Provides No Privacy... at least not in JavaScript(TM) (via) JavaScript variables hidden inside a closure aren’t as hidden as I thought—it turns out you can pass a closure as the second argument to eval (at least in Firefox) and “steal” private variables back out of it.

# 27th June 2008, 7:01 pm / closures, firefox, javascript, modulepattern, pete-michaux

Capital FM London Traffic Map. We launched this today at GCap (née Global Radio). I’m particularly impressed with how well the team handled clustering the traffic cameras on the Google map.

# 27th June 2008, 6:22 pm / capitalfm, clustering, gcap, google-maps, london, traffic

Browser Uploads to S3 using HTML POST Forms. I didn’t know you could do this: create a regular HTML form that gives people permission to upload direct to your own S3 bucket, using a signed JSON policy statement in a hidden form field to prevent third parties from abusing your S3 account.

# 27th June 2008, 12:11 pm / amazon, aws, forms, json, post, s3, signing

The Cron Commandments. How to write well-behaved cron scripts, from Dean Wilson.

# 27th June 2008, 9:48 am / cron, dean-wilson, unix

He/She/They: Grammar and Facebook. Facebook are going to start requiring gender information because foreign language translations wind up being too confusing when that information is not available. Aside: I wish they’d implement proper title elements on their blog posts.

# 27th June 2008, 9:06 am / facebook, gender, grammar, i18n, l10n, usability

How-to: Full-text search in Google App Engine. Use search.SearchableModel instead of db.Model—it’s pretty rough at the moment which is probably why it’s still undocumented.

# 27th June 2008, 8:25 am / full-text-search, google-app-engine, python, search

sfical.py. Neat idea: write a CGI script that turns a proprietary API (in this case the SalesForce events API) in to standard ical format, then run it on your Mac’s local Apache server and subscribe to it from iCal.

# 27th June 2008, 8:09 am / apache, cgi, icalendar, mac, macos, salesforce, simon-fell

OAuth for Google Data APIs (via) Awesome. Now, how’s OAuth support shaping up over at Twitter (who are serious offenders when it comes to encouraging the password anti-pattern, despite Twitter engineers being key to the creation of the original OAuth spec)?

# 27th June 2008, 7:49 am / apis, google, google-data, oauth, twitter

CookBookNewFormsFieldOrdering. Handy tip—change the order of fields in a Django newforms instance by over-riding form.fields.keyOrder (since fields is a SortedDict).

# 27th June 2008, 1:02 am / django, newforms, python, tip

BUG: XSS Security flaw in BaseCamp Messages (via) BaseCamp lets users include HTML and JavaScript in messages, on the basis that anyone with a BaseCamp account is a trusted party. I’m not convinced: you could use this to circumvent BaseCamp’s access control stuff and read messages you’re not meant to. On the flip side, you could also use this to add brand new features to BaseCamp by using JavaScript in a message as a server-side equivalent to Greasemonkey.

# 26th June 2008, 9:39 am / 37-signals, basecamp, greasemonkey, javascript, security, xss

mod_rpaf for Apache. A more secure alternative to Django’s equivalent middleware: sets the REMOTE_ADDR of incoming requests from whitelisted load balancers to the X-Forwarded-For header, without any risk that if the load balancers are missing attackers could abuse it to spoof their IP addresses.

# 24th June 2008, 5:02 pm / apache, django, http, load-balancing, middleware, modrpaf, rpaf, security, xforwardedfor

Oxford Geek Night 7: 25 June 2008. I won’t be able to make this one, but a reminder for anyone in the area that the seventh Oxford Geek Night takes place tomorrow night at the Jericho Tavern.

# 24th June 2008, 2:55 pm / events, oxford, oxford-geek-nights

Django snippets: Command to dump data as a python script. Extremely useful—dumps the data for an application as an executable Python script which will re-import it in to another database without any risk of colliding with existing IDs, sorting out foreign keys along the way.

# 24th June 2008, 12:07 pm / django, django-snippets, import, python

The basics of creating a tumblelog with Django (via) Ryan Berg suggests having a StreamItem model that links uses a GenericForeignKey to link to other content types, then using signals to cause a StreamItem to be created for every other model type. I should switch to doing that on this blog: at the moment I have to query three separate tables to build the tumblelog part which results in messy code for ordering and pagination.

# 24th June 2008, 11:09 am / contenttypes, django, genericforeignkey, python, ryan-berg, tumblelog

Tailor. “Tailor is a tool to migrate or replicate changesets between ArX, Bazaar, Bazaar-NG, CVS, Codeville, Darcs, Git, Mercurial, Monotone, Subversion and Tla repositories.”—written in Python.

# 24th June 2008, 9:59 am / bazaar, codeville, cvs, darcs, dvcs, git, mercurial, monotone, python, subversion, tailor, tla, version-control

Removing Microformats from bbc.co.uk/programmes. “Until these issues are resolved the BBC semantic markup standards have been updated to prevent the use of non-human-readable text in abbreviations.”

# 23rd June 2008, 9:04 pm / abbr, accessibility, bbc, microformats

jsontime. Nat and I threw this together this morning—it runs on Google App Engine and exposes Python’s pytz timezone library over JSONP.

# 21st June 2008, 7:07 pm / api, google-app-engine, javascript, json, jsontime, projects, python, pytz

Years

Tags