Simon Willison’s Weblog

Saturday, 22nd September 2007

robots.txt Adventure. Interesting notes from crawling 4.6 million robots.txt files, including 69 different ways in which the word “disallow” can be misspelled.
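A crawler that wants to tolerate misspellings like these could fuzzy-match directive names before interpreting them. The following is only a rough sketch of that idea, not anything taken from the linked notes: the function names, the list of known directives and the 0.8 similarity cutoff are all my own assumptions.

```python
import difflib

# Directive names this sketch recognises (an assumed, non-exhaustive list).
KNOWN_DIRECTIVES = ["user-agent", "disallow", "allow", "sitemap", "crawl-delay"]


def normalize_directive(name, cutoff=0.8):
    """Return the closest known directive name, or None if nothing is similar enough.

    The cutoff of 0.8 is an arbitrary assumption, not a tuned value.
    """
    cleaned = name.strip().lower()
    matches = difflib.get_close_matches(cleaned, KNOWN_DIRECTIVES, n=1, cutoff=cutoff)
    return matches[0] if matches else None


def parse_line(line):
    """Parse one 'Field: value' line into (directive, value), or None if unusable."""
    # Everything after '#' is a comment in robots.txt.
    line = line.split("#", 1)[0].strip()
    if ":" not in line:
        return None
    field, value = line.split(":", 1)
    directive = normalize_directive(field)
    if directive is None:
        return None
    return directive, value.strip()
```

With this, parse_line("Dissallow: /private/") is treated the same as a correctly spelled Disallow line, while a field name that resembles nothing in the list is ignored.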

# 12:36 am / andrew-wooster, crawling, robots-txt

Django GridContainer. Media Temple’s virtualized Django hosting is now accepting applications for beta testers.

# 12:01 pm / django, gridcontainer, hosting, mediatemple, virtualization

A typical phishing email will have a generic greeting, such as 'Dear User'. Note: All PayPal emails will greet you by your first and last name.

PayPal's Phishing Guide

# 2:33 pm / doh, email, paypal, phishing, security

Sun OpenID IdP: protocol and implementation review. Sun employees are posting lots of useful insights gathered during the implementation of their OpenID provider.

# 8:22 pm / openid, sun, sunmicrosystems

Sun’s OpenID IdP: Data Governance. Lauren Wood explains the checklist used to ensure Sun’s OpenID provider adequately respected user privacy and data governance (what happens to the data that is stored).

# 8:50 pm / datagovernance, governance, laurenwood, openid, privacy, sun, sunmicrosystems

Hello JS-CTYPES, Goodbye Binary Components. Mark Finkle is porting Python’s ctypes functionality to the Mozilla platform, to allow binary XPCOM components to be defined in pure JavaScript.
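For anyone who hasn't used ctypes on the Python side, here is a small sketch of the style of binding being ported: you declare a C function's signature in script code and call it directly, with no compiled glue in between. It assumes CPython, where ctypes.pythonapi exposes the interpreter's own C API; the interpreter_version helper is a name I made up for illustration.

```python
import ctypes


def interpreter_version():
    """Call the C function Py_GetVersion() exported by the running CPython binary."""
    get_version = ctypes.pythonapi.Py_GetVersion
    get_version.argtypes = []               # the C function takes no arguments
    get_version.restype = ctypes.c_char_p   # C 'const char *' comes back as bytes
    return get_version().decode("ascii")


if __name__ == "__main__":
    print(interpreter_version())
```

The same pattern (load a library, set argtypes and restype, call) applies to any shared library loaded with ctypes.CDLL.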

# 11:57 pm / ctypes, javascript, mark-finkle, mozilla, python, xpcom