11th April 2005
In Meme tracking with Greasemonkey, Jon Udell introduces a userscript which grabs the number of references from del.icio.us and bloglines and appends that information to the top of every page you visit. To be fair on Jon, the version he has released defaults to only doing this for pages on Infoworld.com but modifying it to run on every web page is trivial.
The obvious downside of this kind of script is the amount of additional web traffic it induces. Every page you load in your browser induces an extra HTTP request to both del.icio.us and bloglines. Times that by several hundred users and those sites are going to be serving thousands of requests every minute.
Can this kind of thing scale? The Google toolbar retrieves the PageRank for every page you view, and the Alexa toolbar (and thousands of spyware applications) request information for every page viewed as well. The difference is that the developers host their own servers, and are responsible for their own bandwidth bills.
There are also very serious privacy implications involved in this kind of activity. Right now, Joshua Schachter’s del.icio.us access logs are collecting a detailed history of Jon Udell’s browsing history—and that of anyone else using the script.
This is a frustrating quandry, because the technique used in Jon’s script can be extended in almost limitless ways. Sadly, in a world where bandwidth and server resources are limited such scripts must be approached with caution.
More recent articles
- Datasette Enrichments: a new plugin framework for augmenting your data - 1st December 2023
- llamafile is the new best way to run a LLM on your own computer - 29th November 2023
- Prompt injection explained, November 2023 edition - 27th November 2023
- I'm on the Newsroom Robots podcast, with thoughts on the OpenAI board - 25th November 2023
- Weeknotes: DevDay, GitHub Universe, OpenAI chaos - 22nd November 2023
- Deciphering clues in a news article to understand how it was reported - 22nd November 2023
- Exploring GPTs: ChatGPT in a trench coat? - 15th November 2023
- Financial sustainability for open source projects at GitHub Universe - 10th November 2023
- ospeak: a CLI tool for speaking text in the terminal via OpenAI - 7th November 2023
- DALL-E 3, GPT4All, PMTiles, sqlite-migrate, datasette-edit-schema - 30th October 2023