<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: how to solve the comment spam problem</title>
	<atom:link href="http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/feed/" rel="self" type="application/rss+xml" />
	<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/</link>
	<description>A computer programmer&#039;s commentary on news, Linux, programming, music, web design, trivia, humor, usability and whatever else strikes his fancy at the time.</description>
	<lastBuildDate>Tue, 22 May 2012 06:10:27 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: DavidH</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4940</link>
		<dc:creator>DavidH</dc:creator>
		<pubDate>Tue, 09 Nov 2004 20:11:30 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4940</guid>
		<description>I don&#039;t know if I&#039;ll understand it, but I&#039;m curious what you did. Thanks</description>
		<content:encoded><![CDATA[<p>I don&#8217;t know if I&#8217;ll understand it, but I&#8217;m curious what you did. Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dan</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4933</link>
		<dc:creator>dan</dc:creator>
		<pubDate>Mon, 08 Nov 2004 23:06:10 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4933</guid>
		<description>That&#039;s not a bad idea but it would only be a matter of time until they started looking for a javascript function to submit.

On a brighter note, I&#039;ve just figured out how to stop the current rash of comments, and so far it has been 100% effective.

I want to keep it on the down low because if they learn how I did it, they&#039;ll probably try to thwart it, so I don&#039;t want to spill the beans.  It&#039;s similar to what I was doing before, but I went a bit further.  It&#039;s not likely to be of much use for other sites since they may not be dealing with the same spambot, but if you&#039;re interested post a comment here and I&#039;ll e-mail you what I did.</description>
		<content:encoded><![CDATA[<p>That&#8217;s not a bad idea but it would only be a matter of time until they started looking for a javascript function to submit.</p>
<p>On a brighter note, I&#8217;ve just figured out how to stop the current rash of comments, and so far it has been 100% effective.</p>
<p>I want to keep it on the down low because if they learn how I did it, they&#8217;ll probably try to thwart it, so I don&#8217;t want to spill the beans.  It&#8217;s similar to what I was doing before, but I went a bit further.  It&#8217;s not likely to be of much use for other sites since they may not be dealing with the same spambot, but if you&#8217;re interested post a comment here and I&#8217;ll e-mail you what I did.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mckay</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4931</link>
		<dc:creator>mckay</dc:creator>
		<pubDate>Mon, 08 Nov 2004 23:00:16 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4931</guid>
		<description>If they&#039;re just bots maybe they&#039;re just looking for an input-&gt;submit element. Maybe you could fool them by using a javascript function to submit the form instead of a &#039;submit&#039; button.</description>
		<content:encoded><![CDATA[<p>If they&#8217;re just bots maybe they&#8217;re just looking for an input->submit element. Maybe you could fool them by using a javascript function to submit the form instead of a &#8216;submit&#8217; button.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dan</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4889</link>
		<dc:creator>dan</dc:creator>
		<pubDate>Mon, 08 Nov 2004 06:27:22 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4889</guid>
		<description>I guess I misunderstood your initial solution.  I just found text that was common to about 40% of the comment spam I get and filtered it out.  If I had fields that were always identical, I would gladly use that as a 100% spam filter, but I do not have that luxury and I prefer not to add fields.  I&#039;m still looking for a better solution.</description>
		<content:encoded><![CDATA[<p>I guess I misunderstood your initial solution.  I just found text that was common to about 40% of the comment spam I get and filtered it out.  If I had fields that were always identical, I would gladly use that as a 100% spam filter, but I do not have that luxury and I prefer not to add fields.  I&#8217;m still looking for a better solution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Levi</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4886</link>
		<dc:creator>Levi</dc:creator>
		<pubDate>Mon, 08 Nov 2004 05:42:56 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4886</guid>
		<description>I&#039;m not sure you caught the meaning I meant to convey, because what you did doesn&#039;t sound like what I did at all.  I have two fields, A and B, that when they&#039;re part of a spam comment, always have the same text C in them.  The text C varies so much that bayesian training was going incredibly slowly, but it was a universal fact that in a single spam, A = B despite the variability of C between postings.

For now, this is a 100% spam filter.  If we&#039;re being hit by the same bots, perhaps you could capitalize on it by introducing a new field and seeing if it gets the same input as another field.</description>
		<content:encoded><![CDATA[<p>I&#8217;m not sure you caught the meaning I meant to convey, because what you did doesn&#8217;t sound like what I did at all.  I have two fields, A and B, that when they&#8217;re part of a spam comment, always have the same text C in them.  The text C varies so much that bayesian training was going incredibly slowly, but it was a universal fact that in a single spam, A = B despite the variability of C between postings.</p>
<p>For now, this is a 100% spam filter.  If we&#8217;re being hit by the same bots, perhaps you could capitalize on it by introducing a new field and seeing if it gets the same input as another field.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dan</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4884</link>
		<dc:creator>dan</dc:creator>
		<pubDate>Mon, 08 Nov 2004 04:40:50 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4884</guid>
		<description>I&#039;ve started doing the same thing because some of the comments contain the same text, but it&#039;s only catching 15-20 a day, meaning I still get 20-30.

I&#039;m also confused by the tactics being used on another blog.  They&#039;re comment spamming the blog with URLs that don&#039;t exist, using a person&#039;s first and last name.  I don&#039;t understand how they can benefit from that.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve started doing the same thing because some of the comments contain the same text, but it&#8217;s only catching 15-20 a day, meaning I still get 20-30.</p>
<p>I&#8217;m also confused by the tactics being used on another blog.  They&#8217;re comment spamming the blog with URLs that don&#8217;t exist, using a person&#8217;s first and last name.  I don&#8217;t understand how they can benefit from that.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Levi</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4876</link>
		<dc:creator>Levi</dc:creator>
		<pubDate>Mon, 08 Nov 2004 02:35:57 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4876</guid>
		<description>My blog is running on Drupal software, which allows for a &#039;Subject&#039; field.  For some reason, every single spam I&#039;ve got has identical subject and name fields.  I just added some logic to the spam module to assume a comment that has identical subject and name fields to be spam, and now I am 100% spam-free.  The bonus is that these are still analyzed and stuck in the bayesian filter, so hopefully if the spammers grow wise to this the filter will be well-trained enough to still work well.</description>
		<content:encoded><![CDATA[<p>My blog is running on Drupal software, which allows for a &#8216;Subject&#8217; field.  For some reason, every single spam I&#8217;ve got has identical subject and name fields.  I just added some logic to the spam module to assume a comment that has identical subject and name fields to be spam, and now I am 100% spam-free.  The bonus is that these are still analyzed and stuck in the bayesian filter, so hopefully if the spammers grow wise to this the filter will be well-trained enough to still work well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dan</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4789</link>
		<dc:creator>dan</dc:creator>
		<pubDate>Fri, 05 Nov 2004 22:03:45 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4789</guid>
		<description>I just found out about &lt;a href=&quot;http://weblog.sinteur.com/index.php?p=7967&quot;&gt;a plugin that checks various blacklists&lt;/a&gt; for the IP of the commentor and denies the comment.  We&#039;ll see how well it works.  The previous link discusses the code and here is the &lt;a href=&quot;http://charles.gagalac.us/wordpress-plugins/ip-blocklist/&quot;&gt;source code for the plugin&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>I just found out about <a href="http://weblog.sinteur.com/index.php?p=7967">a plugin that checks various blacklists</a> for the IP of the commentor and denies the comment.  We&#8217;ll see how well it works.  The previous link discusses the code and here is the <a href="http://charles.gagalac.us/wordpress-plugins/ip-blocklist/">source code for the plugin</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dan</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4779</link>
		<dc:creator>dan</dc:creator>
		<pubDate>Fri, 05 Nov 2004 17:02:55 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4779</guid>
		<description>That&#039;s an interesting idea, but they may not be smart enough to realize no one is seeing the traffic and celebrate and their newfound popularity.</description>
		<content:encoded><![CDATA[<p>That&#8217;s an interesting idea, but they may not be smart enough to realize no one is seeing the traffic and celebrate and their newfound popularity.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eric James Stone</title>
		<link>http://dan.hersam.com/2004/11/04/how-to-solve-the-comment-spam-problem/comment-page-1/#comment-4773</link>
		<dc:creator>Eric James Stone</dc:creator>
		<pubDate>Fri, 05 Nov 2004 05:56:26 +0000</pubDate>
		<guid isPermaLink="false">/?p=1058#comment-4773</guid>
		<description>Hmm.  The spammers may be looking for traffic, but do they want traffic that nobody sees?

Strip the URLs out of comment spam.  Add them to a file as the SRC of an image tag.  At the very bottom of your page, include an invisble IFRAME calling that file.  The result is a hit to their server every time someone visits your blog, but they get no benefit out of it because no one sees their page at all.  (By calling their page as an image source, you prevent them from causing popups or other nasty things.)

Sure, one blogger alone doing this won&#039;t make a difference.  But if a lot of bloggers did it...</description>
		<content:encoded><![CDATA[<p>Hmm.  The spammers may be looking for traffic, but do they want traffic that nobody sees?</p>
<p>Strip the URLs out of comment spam.  Add them to a file as the SRC of an image tag.  At the very bottom of your page, include an invisble IFRAME calling that file.  The result is a hit to their server every time someone visits your blog, but they get no benefit out of it because no one sees their page at all.  (By calling their page as an image source, you prevent them from causing popups or other nasty things.)</p>
<p>Sure, one blogger alone doing this won&#8217;t make a difference.  But if a lot of bloggers did it&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>


<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using apc
Page Caching using apc
Database Caching 1/13 queries in 0.035 seconds using apc
Object Caching 319/320 objects using apc

Served from: dan.hersam.com @ 2012-05-22 10:21:47 -->
