<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
<channel>
	<title>Comments for Shane's Blog</title>
	
	<link>http://sbutler.com/blog</link>
	<description>data mining and things i find interesting</description>
	<lastBuildDate>Thu, 22 Mar 2007 00:10:41 +1100</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/sbutler-comments" /><feedburner:info uri="sbutler-comments" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:browserFriendly></feedburner:browserFriendly><item>
		<title>Comment on Smart SPAM &amp; Fighting it by John Aitchison</title>
		<link>http://sbutler.com/blog/2006/05/smart-spam/comment-page-1/#comment-3905</link>
		<dc:creator>John Aitchison</dc:creator>
		<pubDate>Thu, 22 Mar 2007 00:10:41 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/05/smart-spam/#comment-3905</guid>
		<description>Interesting ideas. 

I guess I am not as hopeful as you that a centralized solution like Gmail would be smart enough quickly enough to defeat spammers. 

For example I get a lot of spam emails with near duplicate but natural enough looking content - the problem is that the content (which has been harvested from the web) is not the real message (which is often male performance enhancing drugs) and swamps the Bayesian filters.  I am not sure that "Bayesian like" classification tools will ever be smart enough for this.

I think I have a blog entry on this .. no, I just checked, it is not up, so I will put it up now. Here it is http://dsanalytics.com/dsblog/why-bayesian-spam-filters-are-doomed_92


btw, the medium of choice for people under 20 seems to be texting. Email is seen as old hat, and when it is used it is usually through e.g. Hotmail, which does seem to do a good job of spam suppression.

Email is still, however, the medium of business communication although I am intrigued by the gmail idea of conversations (not universally loved though - see http://philwilson.org/blog/2005/06/gmail-conversations.html)</description>
		<content:encoded><![CDATA[<p>Interesting ideas. </p>
<p>I guess I am not as hopeful as you that a centralized solution like Gmail would be smart enough quickly enough to defeat spammers. </p>
<p>For example I get a lot of spam emails with near duplicate but natural enough looking content &#8211; the problem is that the content (which has been harvested from the web) is not the real message (which is often male performance enhancing drugs) and swamps the Bayesian filters.  I am not sure that &#8220;Bayesian like&#8221; classification tools will ever be smart enough for this.</p>
<p>I think I have a blog entry on this .. no, I just checked, it is not up, so I will put it up now. Here it is <a href="http://dsanalytics.com/dsblog/why-bayesian-spam-filters-are-doomed_92" rel="nofollow">http://dsanalytics.com/dsblog/why-bayesian-spam-filters-are-doomed_92</a></p>
<p>btw, the medium of choice for people under 20 seems to be texting. Email is seen as old hat, and when it is used it is usually through e.g. Hotmail, which does seem to do a good job of spam suppression.</p>
<p>Email is still, however, the medium of business communication although I am intrigued by the gmail idea of conversations (not universally loved though &#8211; see <a href="http://philwilson.org/blog/2005/06/gmail-conversations.html" rel="nofollow">http://philwilson.org/blog/2005/06/gmail-conversations.html</a>)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Getting to know R Graphs by » Some inspiration from Statistical Graphics (R, Graphviz etc) [ Data Sciences Analytics ]</title>
		<link>http://sbutler.com/blog/2006/04/getting-to-know-r-graphs/comment-page-1/#comment-3901</link>
		<dc:creator>» Some inspiration from Statistical Graphics (R, Graphviz etc) [ Data Sciences Analytics ]</dc:creator>
		<pubDate>Wed, 21 Mar 2007 23:50:21 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/04/getting-to-know-r-graphs/#comment-3901</guid>
		<description>[...] graphics or their ability to convey nuances (sometimes words rule, OK?) I am indebted to Shane’s Blog for a pointer to  the R Graph [...]</description>
		<content:encoded><![CDATA[<p>[...] graphics or their ability to convey nuances (sometimes words rule, OK?) I am indebted to Shane&#8217;s Blog for a pointer to  the R Graph [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Future of Radio by Pierre</title>
		<link>http://sbutler.com/blog/2006/03/pandora-musicminer/comment-page-1/#comment-2351</link>
		<dc:creator>Pierre</dc:creator>
		<pubDate>Wed, 03 Jan 2007 13:01:35 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/03/pandora-musicminer/#comment-2351</guid>
		<description>There is also "last.fm", a music recommender system based on collaborative filtering methods.</description>
		<content:encoded><![CDATA[<p>There is also &#8220;last.fm&#8221;, a music recommender system based on collaborative filtering methods.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on DD-WRT, Another Alternative by Jerry</title>
		<link>http://sbutler.com/blog/2006/01/dd-wrt-another-alternative/comment-page-1/#comment-2216</link>
		<dc:creator>Jerry</dc:creator>
		<pubDate>Thu, 14 Dec 2006 12:38:19 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/01/dd-wrt-another-alternative/#comment-2216</guid>
		<description>Working online configuration menus for DD-WRT V23 SP2 running on a Buffalo WHR-G54S wireless router: 

http://www.clearnet.com.au/xcart/Rips/DD-WRT_Buffalo_WHR_G54S/Buffalo_AP/192.168.20.43/index.asp.htm</description>
		<content:encoded><![CDATA[<p>Working online configuration menus for DD-WRT V23 SP2 running on a Buffalo WHR-G54S wireless router: </p>
<p><a href="http://www.clearnet.com.au/xcart/Rips/DD-WRT_Buffalo_WHR_G54S/Buffalo_AP/192.168.20.43/index.asp.htm" rel="nofollow">http://www.clearnet.com.au/xcart/Rips/DD-WRT_Buffalo_WHR_G54S/Buffalo_AP/192.168.20.43/index.asp.htm</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on DD-WRT, Another Alternative by Jerry</title>
		<link>http://sbutler.com/blog/2006/01/dd-wrt-another-alternative/comment-page-1/#comment-2109</link>
		<dc:creator>Jerry</dc:creator>
		<pubDate>Sat, 25 Nov 2006 19:37:22 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/01/dd-wrt-another-alternative/#comment-2109</guid>
		<description>DD-WRT is now up to v23 SP2 - released this September:

http://www.dd-wrt.com/dd-wrtv2/down.php?path=downloads%2Fdd-wrt.v23+SP2/

See a working online config menu for DD-WRT v23 SP1 here: http://www.clearnet.com.au/ssl/xcart/Rips/DD-WRT/192.168.1.1/index.asp.htm</description>
		<content:encoded><![CDATA[<p>DD-WRT is now up to v23 SP2 &#8211; released this September:</p>
<p><a href="http://www.dd-wrt.com/dd-wrtv2/down.php?path=downloads%2Fdd-wrt.v23+SP2/" rel="nofollow">http://www.dd-wrt.com/dd-wrtv2/down.php?path=downloads%2Fdd-wrt.v23+SP2/</a></p>
<p>See a working online config menu for DD-WRT v23 SP1 here: <a href="http://www.clearnet.com.au/ssl/xcart/Rips/DD-WRT/192.168.1.1/index.asp.htm" rel="nofollow">http://www.clearnet.com.au/ssl/xcart/Rips/DD-WRT/192.168.1.1/index.asp.htm</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Using Gmail for Backups by Haochi</title>
		<link>http://sbutler.com/blog/2006/05/gmail-for-backups/comment-page-1/#comment-229</link>
		<dc:creator>Haochi</dc:creator>
		<pubDate>Wed, 03 May 2006 00:46:15 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/05/gmail-for-backups/#comment-229</guid>
		<description>I think you can only back up files that are smaller than 10MB.</description>
		<content:encoded><![CDATA[<p>I think you can only back up files that are smaller than 10MB.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on WEKA in Jython or even C# by Dalibor Topic</title>
		<link>http://sbutler.com/blog/2006/03/weka-jython-csharp/comment-page-1/#comment-226</link>
		<dc:creator>Dalibor Topic</dc:creator>
		<pubDate>Mon, 03 Apr 2006 01:27:21 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/03/weka-jython-csharp/#comment-226</guid>
		<description>Heya, you may want to check out gcj &amp; how PyLucene does it, if you want to call Java code from Python without having to port your code to a JVM.

cheers,
dalibor topic</description>
		<content:encoded><![CDATA[<p>Heya, you may want to check out gcj &amp; how PyLucene does it, if you want to call Java code from Python without having to port your code to a JVM.</p>
<p>cheers,<br />
dalibor topic</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Got Zeitgeist? Mining Online Trends by Shane</title>
		<link>http://sbutler.com/blog/2006/03/mining-online-trends/comment-page-1/#comment-225</link>
		<dc:creator>Shane</dc:creator>
		<pubDate>Tue, 07 Mar 2006 09:12:36 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/03/mining-online-trends/#comment-225</guid>
		<description>Nice, thanks for the link (and the book :P)</description>
		<content:encoded><![CDATA[<p>Nice, thanks for the link (and the book <img src='http://sbutler.com/blog/wp-includes/images/smilies/icon_razz.gif' alt=':P' class='wp-smiley' /> )</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Got Zeitgeist? Mining Online Trends by Swaroop C H</title>
		<link>http://sbutler.com/blog/2006/03/mining-online-trends/comment-page-1/#comment-224</link>
		<dc:creator>Swaroop C H</dc:creator>
		<pubDate>Tue, 07 Mar 2006 07:01:57 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2006/03/mining-online-trends/#comment-224</guid>
		<description>Have you seen http://buzz.yahoo.com ?</description>
		<content:encoded><![CDATA[<p>Have you seen <a href="http://buzz.yahoo.com" rel="nofollow">http://buzz.yahoo.com</a> ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Online Advertising Gone Mad? by Shane’s Blog » Blog Archive » More on Online Advertising</title>
		<link>http://sbutler.com/blog/2006/01/online-advertising/comment-page-1/#comment-223</link>
		<dc:creator>Shane’s Blog » Blog Archive » More on Online Advertising</dc:creator>
		<pubDate>Wed, 22 Feb 2006 00:58:45 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2005/11/adsense/#comment-223</guid>
		<description>[...] I previously blogged about advertisers going troppo for The Million Dollar Homepage. Well, online advertising is hot all over the net and is gaining on all traditional mediums fast. Australian Internet advert spending was AUD$620 Million last year, and will be AUD$1 Billion by the end of the year. Interestingly Internet advertising only accounts for around 6% of Australian advertising budgets even though Australians are spending 15% of their media contact time online. [...]</description>
		<content:encoded><![CDATA[<p>[...] I previously blogged about advertisers going troppo for The Million Dollar Homepage. Well, online advertising is hot all over the net and is gaining on all traditional mediums fast. Australian Internet advert spending was AUD$620 Million last year, and will be AUD$1 Billion by the end of the year. Interestingly Internet advertising only accounts for around 6% of Australian advertising budgets even though Australians are spending 15% of their media contact time online. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Online Advertising Gone Mad? by Michael Frans</title>
		<link>http://sbutler.com/blog/2006/01/online-advertising/comment-page-1/#comment-200</link>
		<dc:creator>Michael Frans</dc:creator>
		<pubDate>Wed, 04 Jan 2006 04:55:43 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/2005/11/adsense/#comment-200</guid>
		<description>My hat goes off to Alex Tew, he certainly found a unique idea to sell ad space. I actually took his idea one step further... selling ad space by the BYTE! There are so many "pixel pushers" out there now that I needed to put a spin on the pixel advertising concept. Check out http://www.buckabyte.com</description>
		<content:encoded><![CDATA[<p>My hat goes off to Alex Tew, he certainly found a unique idea to sell ad space. I actually took his idea one step further&#8230; selling ad space by the BYTE! There are so many &#8220;pixel pushers&#8221; out there now that I needed to put a spin on the pixel advertising concept. Check out <a href="http://www.buckabyte.com" rel="nofollow">http://www.buckabyte.com</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on OpenWRT on a Linksys WRT54G by Shane’s Blog » Blog Archive » DD-WRT, Another Alternative</title>
		<link>http://sbutler.com/blog/2005/12/openwrt-on-wrt54g/comment-page-1/#comment-199</link>
		<dc:creator>Shane’s Blog » Blog Archive » DD-WRT, Another Alternative</dc:creator>
		<pubDate>Mon, 02 Jan 2006 12:30:49 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/?p=75#comment-199</guid>
		<description>[...] sbutler.com  Blog -  Research -  CV -  Photos -  Contact    « OpenWRT on a Linksys WRT54G [...]</description>
		<content:encoded><![CDATA[<p>[...] sbutler.com  Blog &#8211;  Research &#8211;  CV &#8211;  Photos &#8211;  Contact    &laquo; OpenWRT on a Linksys WRT54G [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on TV for Geeks by Shane</title>
		<link>http://sbutler.com/blog/2005/09/tv-for-geeks/comment-page-1/#comment-29</link>
		<dc:creator>Shane</dc:creator>
		<pubDate>Wed, 28 Sep 2005 03:39:55 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/?p=57#comment-29</guid>
		<description>Also http://www.filefarmer.com/techshows/ has links to lots of other shows.</description>
		<content:encoded><![CDATA[<p>Also <a href="http://www.filefarmer.com/techshows/" rel="nofollow">http://www.filefarmer.com/techshows/</a> has links to lots of other shows.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on TV for Geeks by jB: no - that's definitely not good enough</title>
		<link>http://sbutler.com/blog/2005/09/tv-for-geeks/comment-page-1/#comment-13</link>
		<dc:creator>jB: no - that's definitely not good enough</dc:creator>
		<pubDate>Fri, 09 Sep 2005 10:48:38 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/?p=57#comment-13</guid>
		<description>&lt;strong&gt;It's interview time&lt;/strong&gt;

Mark Stephens a.k.a. Robert X. Cringely started his weekly series of interviews named NerdTV over at http://www.pbs.org/cringely/nerdtv/ which includes talks with 'celebrities' like Sun Microsystems' Bill Joy (Sept. 20th), Apple's co-founder Steve ...</description>
		<content:encoded><![CDATA[<p><strong>It&#8217;s interview time</strong></p>
<p>Mark Stephens a.k.a. Robert X. Cringely started his weekly series of interviews named NerdTV over at <a href="http://www.pbs.org/cringely/nerdtv/" rel="nofollow">http://www.pbs.org/cringely/nerdtv/</a> which includes talks with &#8216;celebrities&#8217; like Sun Microsystems&#8217; Bill Joy (Sept. 20th), Apple&#8217;s co-founder Steve &#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on The Joys of Gentoo by Shane</title>
		<link>http://sbutler.com/blog/2005/08/the-joys-of-gentoo/comment-page-1/#comment-12</link>
		<dc:creator>Shane</dc:creator>
		<pubDate>Wed, 07 Sep 2005 23:10:21 +0000</pubDate>
		<guid isPermaLink="false">http://sbutler.com/blog/?p=50#comment-12</guid>
		<description>Yes, I'd agree generally, but in practice I have not always found it to be the case. My system was not working properly as a result of udev rules not being updated (because I wasn't using etc-update). Also, I have noticed that some software, such as OpenVPN, seems to constantly move its config files around!</description>
		<content:encoded><![CDATA[<p>Yes, I&#8217;d agree generally, but in practice I have not always found it to be the case. My system was not working properly as a result of udev rules not being updated (because I wasn&#8217;t using etc-update). Also, I have noticed that some software, such as OpenVPN, seems to constantly move its config files around!</p>
]]></content:encoded>
	</item>
</channel>
</rss>
