Simon Willison’s Weblog

Subscribe

Saturday, 22nd September 2007

robots.txt Adventure. Interesting notes from crawling 4.6 million robots.txt, including 69 different ways in which the word “disallow” can be mis-spelled.

# 12:36 am / robots-txt, crawling, andrew-wooster

Django GridContainer. Media Temple’s virtualized Django hosting is now accepting applications for beta testers.

# 12:01 pm / mediatemple, django, hosting, virtualization, gridcontainer

A typical phishing email will have a generic greeting, such as 'Dear User'. Note: All PayPal emails will greet you by your first and last name.

PayPal's Phishing Guide

# 2:33 pm / phishing, email, paypal, doh, security

Sun OpenID IdP: protocol and implementation review. Sun employees are posting lots of useful insights gathered during the implementation of their OpenID provider.

# 8:22 pm / openid, sunmicrosystems, sun

Sun’s OpenID IdP: Data Governance. Lauren Wood explains the checklist used to ensure Sun’s OpenID provider adequately respected user privacy and data governance (what happens to the data that is stored).

# 8:50 pm / openid, sun, sunmicrosystems, lauren-wood, governance, data-governance, privacy

Hello JS-CTYPES, Goodbye Binary Components. Mark Finkle is porting Python’s ctypes functionality to the Mozilla platform, to allow binary XPCOM components to be defined in pure JavaScript.

# 11:57 pm / mark-finkle, ctypes, javascript, mozilla, python, xpcom