A plan for spam

16th August 2002

Paul Graham: A Plan for Spam. Paul suggests using content-based filters that learn from users explicitly marking messages as spam or legitimate mail. The system then picks emails apart looking for common terms (in both the body and the header of the message) that can be used later to identify spam messages. He claims his tests have let through only 5 per 1000 spams, with 0 false positives. Impressive stuff, and great reading for the excellent explanations of some advanced algorithmic and statistical techniques.
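To make the idea concrete, here is a minimal sketch of a filter that learns from messages a user has marked as spam or legitimate. It is not Paul's exact formula: it assumes a plain naive Bayes combination with add-one smoothing, and the names `TokenFilter`, `train` and `spam_probability` are made up for illustration.

```python
import math
import re
from collections import Counter

# Assumption: a token is a run of letters, digits, "$", "'" or "-".
TOKEN_RE = re.compile(r"[A-Za-z0-9$'-]+")


def tokenize(text):
    """Split a raw message (headers and body together) into lowercase tokens."""
    return [token.lower() for token in TOKEN_RE.findall(text)]


class TokenFilter:
    """Hypothetical content-based filter trained on user-labelled messages."""

    def __init__(self):
        # Number of messages per label that contain each token.
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    def train(self, text, label):
        """Record one message the user marked as 'spam' or 'ham'."""
        self.counts[label].update(set(tokenize(text)))
        self.messages[label] += 1

    def spam_probability(self, text):
        """Combine per-token evidence as log odds, then convert to a probability."""
        log_odds = math.log((self.messages["spam"] + 1) / (self.messages["ham"] + 1))
        for token in set(tokenize(text)):
            # Add-one smoothing so an unseen token never yields a zero probability.
            p_spam = (self.counts["spam"][token] + 1) / (self.messages["spam"] + 2)
            p_ham = (self.counts["ham"][token] + 1) / (self.messages["ham"] + 2)
            log_odds += math.log(p_spam / p_ham)
        # Numerically stable logistic function.
        if log_odds >= 0:
            return 1 / (1 + math.exp(-log_odds))
        return math.exp(log_odds) / (1 + math.exp(log_odds))


# Tiny made-up training set, purely for illustration.
filt = TokenFilter()
filt.train("Subject: cheap pills\n\nBuy cheap pills now", "spam")
filt.train("Subject: win money\n\nClick now to win money", "spam")
filt.train("Subject: meeting notes\n\nNotes from the meeting attached", "ham")
filt.train("Subject: lunch\n\nAre you free for lunch tomorrow", "ham")

spam_score = filt.spam_probability("cheap money now")
ham_score = filt.spam_probability("meeting lunch tomorrow")
```

With this toy data, a message built from terms seen only in spam scores well above 0.5 and one built from terms seen only in legitimate mail scores well below it. A real filter would need far more training mail and a carefully chosen threshold to keep false positives down.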

Posted 16th August 2002 at 11:40 pm


Simon Willison’s Weblog
