<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" version="2.0">

<channel>
	<title>Data Mining, Down Under</title>
	
	<link>http://www.dataminingdownunder.com</link>
	<description>Welcome to "Data Mining, Down Under", a blog by Aussie data miner Shane Butler.</description>
	<lastBuildDate>Tue, 23 Feb 2010 09:34:28 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/DataMiningDownUnder" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="dataminingdownunder" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Reducing Churn Through Social Network Analysis</title>
		<link>http://www.dataminingdownunder.com/2010/02/tim-manns-syddm/</link>
		<comments>http://www.dataminingdownunder.com/2010/02/tim-manns-syddm/#comments</comments>
		<pubDate>Tue, 23 Feb 2010 09:33:16 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Australia]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[optus]]></category>
		<category><![CDATA[SPSS]]></category>
		<category><![CDATA[telcos]]></category>
		<category><![CDATA[teradata]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=307</guid>
		<description><![CDATA[Earlier in the month local data miner Tim Manns presented at the Sydney Data Miners group.  Tim spoke on some work been doing at Optus around using mobile call patterns to establish social networks and using these networks to reduce customer churn.  Interestingly, there are also applications in many other areas, including data cleansing, for [...]]]></description>
			<content:encoded><![CDATA[<p>Earlier in the month local data miner <a href="http://timmanns.blogspot.com/">Tim Manns</a> presented at the <a href="http://www.meetup.com/datarati/">Sydney Data Miners</a> group.  Tim spoke on some work been doing at <a href="http://www.optus.com.au">Optus</a> around using mobile call patterns to establish social networks and using these networks to reduce customer churn.  Interestingly, there are also applications in many other areas, including data cleansing, for example, where one person has purchased two mobile phones and given one to their spouse or child.  Using this analysis we can try to determine which account is likely to be the actual account holder and infer the details (such as age) of the other customer.</p>
<p>For a full write up of Tim&#8217;s work, check out <a href="http://jtonedm.com/2009/10/20/know-your-customers-by-knowing-who-they-know-paw/">James Taylor&#8217;s PAW 2009 summary</a> or head over to Tim&#8217;s <a href="http://timmanns.blogspot.com/">data mining blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2010/02/tim-manns-syddm/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>PMML Tree Model to Code Converter</title>
		<link>http://www.dataminingdownunder.com/2010/01/pmml-tree-model-to-code-converter/</link>
		<comments>http://www.dataminingdownunder.com/2010/01/pmml-tree-model-to-code-converter/#comments</comments>
		<pubDate>Sat, 30 Jan 2010 00:33:58 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[PMML]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=301</guid>
		<description><![CDATA[Lately I&#8217;ve been trying to come up with a generic way to deploy models on any platform.  So I&#8217;d like to share some early code that takes a PMML TreeModel and converts it to R code.  The intention is to get the R code generation working right, then extend to support generation for other languages. [...]]]></description>
			<content:encoded><![CDATA[<p>Lately I&#8217;ve been trying to come up with a generic way to deploy models on any platform.  So I&#8217;d like to share some early code that takes a PMML TreeModel and converts it to R code.  The intention is to get the R code generation working right, then extend to support generation for other languages.  Anyway, <a href="http://www.dataminingdownunder.com/pmmltreemodel2R.R">here it is</a> (remember &#8212; early alpha, very rough still!!).  Updates to follow soon!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2010/01/pmml-tree-model-to-code-converter/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>AusDM 09 &amp; Analytic Challenge</title>
		<link>http://www.dataminingdownunder.com/2009/07/ausdm09-2/</link>
		<comments>http://www.dataminingdownunder.com/2009/07/ausdm09-2/#comments</comments>
		<pubDate>Tue, 07 Jul 2009 13:47:52 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Australia]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[ausdm]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=282</guid>
		<description><![CDATA[Australian Data Mining conference (AusDM09) will be held in Melbourne next December and Dr Phil Brierley of Tiberius Data Mining has put out the call for proposals for an analytic challenge to accompany the conference.  Competitions are quite popular in data mining circles and provide a good training ground for new practitioners to get access [...]]]></description>
			<content:encoded><![CDATA[<p>Australian Data Mining conference (AusDM09) will be held in Melbourne next December and Dr Phil Brierley of <a href="http://www.tiberius.biz/" target="_blank">Tiberius Data Mining</a> has put out the call for proposals for an analytic challenge to accompany the conference.  Competitions are quite <a href="http://www.kdnuggets.com/datasets/competitions.html">popular</a> in data mining circles and provide a good training ground for new practitioners to get access to real data and solve real problems.  They also often have surprising results, such as the team who used <a href="http://www.cybaea.net/Blogs/Data/How-to-win-the-KDD-Cup-Challenge-with-R-and-gbm.html">laptop with 2GB RAM</a> to beat IBM&#8217;s mighty clusters.</p>
<p>For businesses, this is a great opportunity to find out what is available by having others suggest new ideas and methods, or even to test your internally deployed models against the best of the best. <strong>So if you&#8217;re a business who has data, please consider being invloved!</strong> For further details, see the <a href="http://ausdm09.togaware.com/competition.html">competition webpage</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/07/ausdm09-2/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>PMML 4.0 Released</title>
		<link>http://www.dataminingdownunder.com/2009/06/pmml-40/</link>
		<comments>http://www.dataminingdownunder.com/2009/06/pmml-40/#comments</comments>
		<pubDate>Thu, 18 Jun 2009 09:50:37 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Industry]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[DMG]]></category>
		<category><![CDATA[PMML]]></category>
		<category><![CDATA[Zementis]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=278</guid>
		<description><![CDATA[The DMG has released a new version of the PMML open format for representing predictive models.  The new version includes support for ensembles, new model types and more built in functions to name just a few of the enhancements.  For a detailed summary, see the Zementis blog.
]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.dmg.org">DMG</a> has released a new version of the PMML open format for representing predictive models.  The new version includes support for ensembles, new model types and more built in functions to name just a few of the enhancements.  For a detailed summary, see the <a href="http://adapasupport.zementis.com/2009/06/pmml-40-is-here.html">Zementis blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/06/pmml-40/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ten Data Mining Mistakes to Avoid</title>
		<link>http://www.dataminingdownunder.com/2009/05/ten-mistakes-to-avoid/</link>
		<comments>http://www.dataminingdownunder.com/2009/05/ten-mistakes-to-avoid/#comments</comments>
		<pubDate>Fri, 15 May 2009 10:19:37 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[john elder]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=258</guid>
		<description><![CDATA[Some really good advice here from John Elder in a series of video tutorials on data mining mistakes to avoid.  Tip #5, regarding contaminating the project with future data is a good one, although sometimes it can be quite tricky (if not impossible) to &#8216;rewind&#8217; the data!  I believe the video series is [...]]]></description>
			<content:encoded><![CDATA[<p>Some really good advice here from John Elder in a <a href="http://www.youtube.com/view_play_list?p=79E8168EA02996A3&#038;sort_field=title">series of video tutorials on data mining mistakes to avoid</a>.  Tip #5, regarding contaminating the project with future data is a good one, although sometimes it can be quite tricky (if not impossible) to &#8216;rewind&#8217; the data!  I believe the video series is a part of the launch of <a href="http://www.elsevierdirect.com/datamining">The Handbook of Statistical Analysis and Data Mining Applications</a>.  You can watch part one below or head over to YouTube for the <a href="http://www.youtube.com/view_play_list?p=79E8168EA02996A3&#038;sort_field=title">entire series</a>.</p>
<p><object width="532" height="323"><param name="movie" value="http://www.youtube.com/v/Rd60vmoMMRY&#038;hl=en&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/Rd60vmoMMRY&#038;hl=en&#038;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="532" height="323"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/05/ten-mistakes-to-avoid/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RapidMiner to get dual GUIs</title>
		<link>http://www.dataminingdownunder.com/2009/05/rapidminer-v5-gui/</link>
		<comments>http://www.dataminingdownunder.com/2009/05/rapidminer-v5-gui/#comments</comments>
		<pubDate>Wed, 13 May 2009 16:55:44 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[rapid miner]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=252</guid>
		<description><![CDATA[A forum post by Ingo Mierswa of Rapid-I indicates the upcoming RapidMiner v5 will feature two GUIs: the existing tree-based designer and a new graph-based designer!  I&#8217;m quite excited about this because I&#8217;ve personally found the existing UI a bit clunky.  Details and screenshots over at the
user forum.
]]></description>
			<content:encoded><![CDATA[<p>A forum post by Ingo Mierswa of Rapid-I indicates the upcoming RapidMiner v5 will feature two GUIs: the existing tree-based designer and a new graph-based designer!  I&#8217;m quite excited about this because I&#8217;ve personally found the existing UI a bit <a href="http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/">clunky</a>.  Details and screenshots over at the<br />
<a href="http://rapid-i.com/rapidforum/index.php?topic=527.msg3324#msg3324">user forum</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/05/rapidminer-v5-gui/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SAS hints at future R integration</title>
		<link>http://www.dataminingdownunder.com/2009/02/sas-hints-at-future-r-integration/</link>
		<comments>http://www.dataminingdownunder.com/2009/02/sas-hints-at-future-r-integration/#comments</comments>
		<pubDate>Tue, 17 Feb 2009 11:03:23 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[SAS]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=238</guid>
		<description><![CDATA[In more R news, it appears SAS isn&#8217;t as worried about airplane safety as originally thought, and has indicated they will include R support in an upcoming update to the SAS/IML product.  For details see NYTimes &#38; Adventures in Consulting.
]]></description>
			<content:encoded><![CDATA[<p>In more R news, it appears SAS isn&#8217;t as worried about <a href="http://blogs.sas.com/sascom/index.php?/archives/434-This-post-is-rated-R.html">airplane safety</a> as originally thought, and has indicated they will include R support in an upcoming <a href="http://support.sas.com/rnd/app/studio/Rinterface2.html">update</a> to the SAS/IML product.  For details see <a href="http://bits.blogs.nytimes.com/2009/02/16/sas-warms-to-open-source-one-letter-at-a-time/">NYTimes</a> &amp; <a href="http://minequest.com/WordPress/?p=109">Adventures in Consulting</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/02/sas-hints-at-future-r-integration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>R in the New York Times</title>
		<link>http://www.dataminingdownunder.com/2009/01/r-project-in-nyt/</link>
		<comments>http://www.dataminingdownunder.com/2009/01/r-project-in-nyt/#comments</comments>
		<pubDate>Thu, 08 Jan 2009 02:32:17 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[S-Plus]]></category>
		<category><![CDATA[SAS]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=232</guid>
		<description><![CDATA[The New York Times has an interesting story on the increasing use of R for data analysis within academia and industry.  Several large corporates are cited as having selected R over commercial conterparts such as S and SAS.
[via Slashdot]
Update: For more R news, see also Ajay Ohri&#8217;s interview with Dr Graham Williams, the author of [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.nytimes.com">The New York Times</a> has an interesting story on the <a href="http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html">increasing use of R</a> for data analysis within academia and industry.  Several large corporates are cited as having selected <a href="http://www.r-project.org">R</a> over commercial conterparts such as <a href="http://www.insightful.com/">S</a> and <a href="http://www.sas.com">SAS</a>.</p>
<p style="text-align: right;">[<a href="http://developers.slashdot.org/article.pl?sid=09/01/07/2316227">via Slashdot</a>]</p>
<p><strong>Update:</strong> For more R news, see also <a href="http://www.decisionstats.com/2009/01/interview-dr-graham-williams/">Ajay Ohri&#8217;s interview</a> with <a href="http://www.togaware.com">Dr Graham Williams</a>, the author of <a title="Rattle data mining suite for R" href="http://rattle.togaware.com">Rattle</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2009/01/r-project-in-nyt/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>RapidMiner 4.3 Released</title>
		<link>http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/</link>
		<comments>http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/#comments</comments>
		<pubDate>Fri, 28 Nov 2008 04:19:56 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[Tips & Tutorials]]></category>
		<category><![CDATA[lift chart]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[rapid miner]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=211</guid>
		<description><![CDATA[Rapid-I has released an new and improved version of the open source data mining suite RapidMiner (formely called YALE).  I&#8217;ve been evaluating RapidMiner lately as a possible addition to my data mining toolbox.  I&#8217;ve found the biggest hurdle in learning how to use it is probably the GUI.  It is a tree-based GUI which I [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://rapid-i.com">Rapid-I</a> has released an new and <a href="http://rapid-i.com/content/view/133/1/">improved</a> version of the open source data mining suite <a href="http://rapidminer.com">RapidMiner</a> (formely called YALE).  I&#8217;ve been evaluating RapidMiner lately as a possible addition to my data mining toolbox.  I&#8217;ve found the biggest hurdle in learning how to use it is probably the GUI.  It is a tree-based GUI which I find much harder to understand than the graph-style approach used by <a href="http://www.spss.com/clementine/">many</a> <a href="http://www.sas.com/technologies/analytics/datamining/miner/">others</a>.  However RapidMiner is quite a powerful tool, and the Community Edition is free, so there is probably a lot of benefit in getting used to the strange GUI.</p>
<p>The built in tutorial is a really good way to get a grasp of the system and I highly recommend spending some time on this if you are interested in learning RapidMiner.  I would also recommend a series of <a href="http://www.neuralmarkettrends.com/tutorials/">RapidMiner video turtorials</a> over at <a href="http://www.neuralmarkettrends.com/">Neural Market Trends</a> that are worth checking out too.</p>
<div id="attachment_216" class="wp-caption aligncenter" style="width: 211px"><a href="http://rapid-i.com/images/stories/rapidi/yale/releases/4_3/01_lift.jpg"><img class="size-full wp-image-216" title="RapidMiner 4.3" src="http://www.dataminingdownunder.com/wp-content/uploads/2008/11/rmnewsml.jpg" alt="RapidMiner 4.3 includes a 3d lift chart" width="201" height="150" /></a><p class="wp-caption-text">RapidMiner 4.3 includes a 3D lift chart</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2008/11/rapidminer-43-released/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>SAS Forum (Australia) presentations available online</title>
		<link>http://www.dataminingdownunder.com/2008/09/sas-forum-australia-presentations/</link>
		<comments>http://www.dataminingdownunder.com/2008/09/sas-forum-australia-presentations/#comments</comments>
		<pubDate>Mon, 29 Sep 2008 23:30:52 +0000</pubDate>
		<dc:creator>Shane Butler</dc:creator>
				<category><![CDATA[Australia]]></category>
		<category><![CDATA[Industry]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[banking]]></category>
		<category><![CDATA[case study]]></category>
		<category><![CDATA[customer analytics]]></category>
		<category><![CDATA[fraud]]></category>
		<category><![CDATA[government]]></category>
		<category><![CDATA[SAS]]></category>

		<guid isPermaLink="false">http://www.dataminingdownunder.com/?p=3</guid>
		<description><![CDATA[The SAS Forum (Australia) was held in Sydney back in August.  I was unable to attend but luckily the presentations have been put online.  Here are some that I found interesting:

Make Sure Your Insight is Insightful: Analytical Marketing at NAB by Antony Ugoni (National Australia Bank)
Model Deployment and Management &#8211; The ATO Story by Warwick [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.sasforum.com/anz/index.php?option=com_content&amp;view=article&amp;id=151&amp;Itemid=93">SAS Forum (Australia)</a> was held in Sydney back in August.  I was unable to attend but luckily the presentations have been put <a href="http://www.sasforum.com/anz/index.php?option=com_content&amp;view=article&amp;id=151&amp;Itemid=93">online</a>.  Here are some that I found interesting:</p>
<ul>
<li><a href="http://www.sasforum.com/anz/presentations/NAB%20-%20Antony%20Ugoni.pdf">Make Sure Your Insight is Insightful: Analytical Marketing at NAB</a> by Antony Ugoni (National Australia Bank)</li>
<li><a href="http://www.sasforum.com/anz/presentations/Model%20Deployment%20and%20Management%20-%20The%20ATO%20Story.pdf">Model Deployment and Management &#8211; The ATO Story</a> by Warwick Graco (Australian Taxation Office)<a href="http://www.iapa.org.au"></a></li>
<li><a href="http://www.sasforum.com/anz/presentations/Offlode%20-%20Paul%20Bracewell.pdf">Putting Cheques in Place to Identify Fraud</a> by Dr Paul Bracewell (Offlode NZ) and Flavio Palaci (Marsh Australia)</li>
<li><a href="http://www.sasforum.com/anz/presentations/Customer%20Value%20Creation%20Using%20Analysis.pdf">Customer Value Creation Using Analytics</a> by Arun VS (Satyam)</li>
<li><a href="http://www.sasforum.com/anz/presentations/SAS%20-%20Bill%20Gibson.pdf">Analysing Performance and Tuning your SAS Application</a> by Bill Gibson (SAS)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.dataminingdownunder.com/2008/09/sas-forum-australia-presentations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
