More on screen scraping
4th February 2003
In response to yesterday’s screen scraping post, Richard Jones describes a screen scraping technique that uses PyWebPerf, a Python performance measuring tool.
I forgot to mention it in the article, but Snoopy is a PHP web client library which can retrieve content and emulate a browser interacting with forms. I’ve used it for simple screen scraping before, but it still lacks some of the more impressive functionality that WWW::Mechanize demonstrates.