Blacklisting Comment Spam
I’m fed up with comment spam. From now on, any comment I judge to be spam will be deleted, and the domains linked to from that comment will be blacklisted. Any future comments that contain links to those domains will be refused. My blacklist will be made available as a simple text file, one domain per line, at blacklist.txt. You are welcome to grab a copy of that file once every 24 hours and use it as part of your own comment spam prevention system. I will manually approve all domains that are added to it to ensure only domains of a dubious nature end up blacklisted.
If you start using a similar system, drop me a line and I will start using your blacklist as well (note that I will not merge it with my own public list). If I find you have been blacklisting innocent domains I will cancel my subscription to your blacklist. In this way, I hope to build a decentralised web of trust whereby other people’s recommendations help my system combat spam better.
nice one.
Oh BTW, here's your Python blacklist script.
dee - 2nd September 2003 20:04 - #
Tom Gilder - 2nd September 2003 20:16 - #
Simon Willison - 2nd September 2003 20:19 - #
Tim Fountain - 2nd September 2003 20:47 - #
The WordPress Development Blog was hit by this same comment spammer, earlier (cleaned up, since). We'll probably end up implementing a built-in admin interface for blacklists in a future release....
Dougal Campbell - 2nd September 2003 21:47 - #
Van Gale - 3rd September 2003 03:50 - #
Ian Lloyd - 3rd September 2003 08:55 - #
Sam Newman - 3rd September 2003 09:48 - #
Ian Lloyd - 3rd September 2003 13:31 - #
Sam Newman - 3rd September 2003 14:37 - #
I set one up too. Let's get this Web of Trust started.
Adrian Holovaty - 3rd September 2003 17:04 - #
Why a flat file?
Why not do the RBL-type thing and publish these as a DNS Zone? If an A record query returns "127.0.0.2", then that IP address is banned.
(You can find implementation details lots of places, including here.)
Using DNS is surely a lot more light-weight than shipping a flat-file around (or even than using HTTP to query a database).
Jacques Distler - 4th September 2003 04:52 - #
coda - 4th September 2003 10:40 - #
john - 4th September 2003 13:51 - #
Joe Grossberg - 6th September 2003 16:00 - #
Simon Willison - 6th September 2003 16:09 - #
Joe Grossberg - 7th September 2003 06:30 - #
Yoz - 9th September 2003 16:08 - #
Hey Simon, nice to know someone else has been thinking about these things as well.
Here is my comment URL blacklist. Up to the minute and replete with spammy goodness. We should really think about making this as standard as a robots.txt or foaf.rdf file... Maybe put it in meta tags for autodiscovery as well...
Oh, also, I just created a non-hack MT solution for keeping the blacklisted comments from ever showing up on the site: Killing Comment Spam Dead. Would love to have your input. I am very seriously considering starting a site devoted to the comment spam clearinghouse idea as I mention on that entry. Contact me if you want to collaborate.
Jay - 28th September 2003 00:29 - #
Ben - 13th October 2003 10:28 - #
Thanks for the blacklist file, that list formed the base entries in the service.
A.Sleep - 12th November 2003 12:58 - #
jerry - 25th December 2003 03:38 - #
Doug Alcorn - 3rd February 2004 18:42 - #
Kingsley Tart - 28th February 2004 14:29 - #
hgh - 4th May 2004 22:52 - #
A. Visitor - 3rd July 2004 06:09 - #
Bill Anness - 11th March 2005 20:08 - #
Rosa - 16th May 2005 16:01 - #
Linux guru - 25th December 2005 17:51 - #
WaLes - 12th January 2006 21:00 - #
kibol - 1st February 2006 14:37 - #