Simon Willison’s Weblog

Subscribe

Entries

Filters: Sorted by date

Conversation with Martin Fowler

A conversation with Martin Fowler, parts I, II, III and IV.

OSAF hire Robin Dunn

If you weren’t already exited about the OSAF, maybe it’s time you were. I just spotted on Mitch Kapor’s blog that they’ve hired Robin Dunn (of wxPython fame, and winner of the ActiveState 2002 Python award) on a 6 month contract to work on improvements to wxWindows and wxPython. They’ve also hired Mitchell Baker, Mozilla.org’s “Chief Lizard Wrangler” on a part time contract that helps fund her work on Mozilla while letting OSAF take advantage of her extensive experience of co-ordinating community open source efforts. Hiring Robin Dunn makes very good sense, seeing as the bulk of Chandler’s code will be written in Python with wxPython used for the front end. Any worries that Chandler might turn out to be vapourware have vanished from my mind—I can’t wait to see what they come up with.

XFML for Radio

Spotted on Guide to ease: Bill Kearney has written an XFML tool for Radio Userland. This sounds great—I’d love to see an example of a Radio blog that is using this.

Syndication is not publication

Mark Pilgrim pretty much single handedly killed the discussion thread on syndicating weblog content with XHTML started a few days ago by Anil Dash. Stuart’s reply to Mark’s post is definitely worth a read though.

Python as middleware

OpenEnterpriseTrends.com: Python Power: Growing Respect for an Open Source Integration Tool. Another excellent piece of Python advocacy, this time highlighting Python’s power and flexibility as a middleware tool to glue together varous large enterprise systems.

PHP training update

The aforementioned PHP training system was a mixed success. On the one hand, I discovered that trying to teach a room full of people is a lot tougher than I had expected (to cut a long story short, I stumbled through about five minutes of reiterating the same few points before all present agreed that the session would be best run using the ask-questions-when-you-get-stuck format). Luckily I had prepared some pretty comprehensive support material the night before. I’m chalking that one down to experience.

Coursework coursework...

No, I’m not dead—I’m just weighed under with coursework. Two large group projects and a whole bunch of personal pieces are conspiring to make me feel gulity whenever I load up a browser window. It’ll all be over by December the 17th thank Goodness. For the moment though I’m taking a well earned break to catch up on the goings on in the blogosphere over the past few days.

Nervous

No time to update tonight. I’m frantically putting together a 2 hour training course on HTML, PHP and mySQL which I shall be presenting tomorrow as my first ever experience of teaching a large(ish) group. Should be interesting.

[... 79 words]

Validator documentation

It seems the W3C have made some changes to their beta validator’s XML output option. The bad news is that this has (temporarily) broken my web service interface, but the good news is that the feature is now documented on the W3C’s site. Hopefully I’ll get a chance to fix the web service interface in the near future, but it will remain a toy rather than a tool for as long as the validator’s interface has not been frozen.

CSS filter guide

Spotted on the Web Manager’s Weblog, centricle : css filters is a list of all of the current CSS browser hiding techniques (all 17 of them!) along with a table showing which hack will hide things from which browser. Best of all, the name of each workaround is a link which will pop open a new window testing the hack in your browser.

Syndicating blogs with XHTML

Anil Dash suggests using structured XHTML as a blog syndication format. Scott Andrew points out that this has semantic problems in that it would mean using the class attribute to add additional meaning to a document. I was going to say that this would be an ideal opportunity to mix different namespaces in one XML document (as described by Lachlan Cannon) but techno-weenie beat me to it.

Taxomita

XFMLManager, Peter Van Dijck’s XFML construction and navigation tool to which I have contributed various chunks of XML related code, has been renamed Taxomita and given a brand new site. Peter hopes to have a public beta out within the next few weeks. It also has a mailing list. Meanwhile, XFML has been cropping up all over the place even despite the current lack of software, with the most recent sighting occurring over at Bill Kearney’s Syndication News.

OWASP Security guide

The Open Web Application Security Project (OWASP) have a free guide to building secure web applications, which covers a large range of common problems such as cross site scripting and SQL injection vulnerabilities. The report is a 60 page PDF and although I haven’t had time to go through it yet it looks like an excellent read.

phpPatterns

phpPatterns is a brand new site which advocates and documents the use of object oriented design patterns with PHP. It’s a great concept and the site already has some impressive content (although it could really do with a PHP references tutorial). The site is a project of Harry Fuecks, a regular contributor to SitePoint’s PHP forums.

Polluting the web

Hixie and Aaron Swartz are debating Hixie’s infamous Sending XHTML as text/html Considered Harmful on a W3C mailing list. While I am just as guilty of sending XHTML as text/html as anyone else (I’ve been meaning to fix this for a while but just haven’t found time yet) I’ll stick in an argument that Hixie hasn’t used yet. Sending XHTML as text/html basically amounts to pollution of the web. As far as the XML user agents of the future are concerned (which are supposedly one of the main reasons we use XHTML) an invalid XHTML document is unparseable and thus unreadable. Anyone who has tried to keep an XHTML blog valid will know how difficult it is to keep it that way, and without a browser refusing to display the document (as happened with Mozilla and diveintomark the other day thanks to an XML content type and a missing end tag) it can be all to easy to contribute to the pollution. As it stands, a massive proportion of the supposedly XHTML web may as well be just so many random floating bytes.

Get the look

Adam Polselli is a talented young designer who posts regularly on the SitePoint Forums. Get The Look is his new site (based on an article he wrote for SitePoint) which provides five excellent site templates and explains the design principles that were used to create them. I’m personally not interested in free site templates (though with my terrible design skills I probably should be) but the site is well worth visiting for the excellent advice provided with each design.

Information wants to be free

In Apple and the Pirate Everyman, Tom Coates discusses Apple’s attitude to copy protection and open standards. Choice quote:

[... 102 words]

Mimeo

Brian Graf has some very positive things to say about Mimeo, an innovative new web services based company in the States:

[... 156 words]

RESTLog

Joe Gregorio’s RESTLog is a fascinating piece of technology and a great example of the RESTian model of web service in action. Everything is built on XML and HTTP—new blog entries are POSTed to the index page as RSS 2.0 item elements, edits are done with the little-used HTTP PUT method and the DELETE method can be used to delete items. Content negotiation is in effect, so browsers recieve HTML while aggregators theoretically get RSS (though in practise existing aggregators fail to send a content negotiation header so an alternative URL scheme must be used). It’s very clever stuff. Further techie details are being posted on a regular basis, but a good overview can be had by reading RESTLog Overview and RESTLog Interface. The system was inspired by the RESTWiki’s RestidyingBlogger.

Aquari-gone-ics

Aquarionics is Gone (well, not entirely—the old site can still be found here). Aquarion has committed to building his new blogging system, Epistula 2, and vowed not to add any more entries until the new system is capable of taking them. Development of the new system will continue in the open at playground.aquarionics.com, which is currently throwing an HTTP 500 error but is likely to be an interesting place to watch in the next few days.

Interesting but ultimately useless

Via Stuart, Tantek has an intriguing new (valid) hack for adding HTML documentation to an external javascript file. The hack uses some clever multi-language comments to hide the HTML in the file from the script interpreter, while ensuring that the documentation remains readable when the file is interpreted as HTML. Unfortunately the trick does not work in Mozilla, as that browser respects the Content-Type served with the document (whereas IE “guesses” the content is HTML from clues in the document).

Mark’s tinkerings

Mark’s back, and he’s been tinkering. In addition to a whole bunch of changes made yesterday in response to Hixie’s markup critique, Mark now boasts php.net style shortcut URLs, an RDF powered about page and an automatically generated list of acronyms (in a semantically correct definition list). I use a similar technique (extracting structural markup tags using regular expressions) in my personal course notes—there’s something very rewarding about being able to extract information from a properly marked up document.

Changing backgrounds

Via Adrian Holovaty, Matt Jones points out an innovative new feature of the BBC’s new home page—background colours that adapt to your browsing preferences. Interesting stuff.

Search engines don’t care!

I’ve suspected this for ages, but finally it can be categorically announced that search engines just don’t care about the meta keywords attribute. The only major engine that still notices it is Inktomi, and even they treat it as a very low priority. Finally something concrete to refer people who still demand meta tags to. The description tag is still occasionally worth adding though.

Different browsers different DOMs

Hixie has put together a fascinating article describing how the Mozilla, Opera 7 beta and IE6 DOMs deal with incorrectly nested HTML tags. Internet Explorer goes as far as creating a malformed tree just to represent badly formed HTML!

Syndicated further reading recommendations

I frequently find myself reading something on someone elses blog and thinking “that’s interesting, and it fits in well with XXX that I read the other day”. I often end up blogging a link to both just to satisfy my need for completeness. Wouldn’t it be interesting if there was some standard for formalising this kind of further reading recommendation? I’m not sure exactly how it would work (it would almost certainly be XML based but I don’t know if it would require a new format or integrate with an existing one) but it could be an interesting avenue to explore. I think it’s a significantly different problem to the ones solved by XFML (external shared metadata) and Pingback for it to be worth committing some thought cycles to. Any ideas?

Content inventory tips

Peter has been blogging the progress of a 3828 page content inventory he is working on. Day Two describes his method of working with Excel, Day Three provides three useful inventory tips. Christina Wodtke’s Information Architecture: Blueprints for the Web has a nice overview of the content inventory process which recommends a dual monitor setup and links (well, footnotes) to these tips by Noel Franus. Peter has also commented on my decision to go with the blue RSS button in favour of the standard orange XML button—I’ve posted my reasons in a comment attached to his post.

A plea for sense

Peter Van Dijck: We need some sense in the naming of XML feed buttons.. I couldn’t agree more—in this day of syndication formats left, right and center the orange XML button could mean just about anything. Speaking of which, I’ve finally got around to adding a pretty blue RSS button to this site.

Condiment Clothing goodness

Thanks to CafeShops, pretty much anyone can set up a store to flog their own branded merchandise. Long time a favourite obscure site of mine, the Condiment Packet Museum now has their own line of goods. Dig those boxer shorts :)

OmniWeb CSS hack

The hack we’ve all been waiting for: How to hide CSS from OmniWeb. OmniWeb is a Mac browser which understands the @import rule but horribly mangles CSS layouts beyond all repair. Thanks to this hack CSS sites with a noticeable percentage of OmniWeb users (and anyone else who cares) can give OmniWeb the same treatment normally reserved for Netscape 4.