More on screen scraping
4th February 2003
In response to yesterday’s screen scraping post, Richard Jones describes a screen scraping technique that uses PyWebPwerf, a Python performance measuring tool.
I forgot to mention it in the article, but Snoopy is a PHP web client library which can retrieve content and emulate a browser interacting with forms. I’ve used it for simple screen scraping before, but it still lacks some of the more impressive functionality that WWW::Mechanize demonstrates.
More recent articles
- LLM 0.27, the annotated release notes: GPT-5 and improved tool calling - 11th August 2025
- Qwen3-4B-Thinking: "This is art - pelicans don't ride bikes!" - 10th August 2025
- My Lethal Trifecta talk at the Bay Area AI Security Meetup - 9th August 2025