<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" version="2.0">

<channel>
	<title>Data Mining Research</title>
	
	<link>http://www.dataminingblog.com</link>
	<description>Data mining crossroads - research, applications, news, list of blogs and customized search engine about data mining.</description>
	<lastBuildDate>Sat, 12 May 2012 18:11:10 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/dataminingblog" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="dataminingblog" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:emailServiceId xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">dataminingblog</feedburner:emailServiceId><feedburner:feedburnerHostname xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">http://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>Outlier detection in two review articles (Part 1)</title>
		<link>http://www.dataminingblog.com/outlier-detection-in-two-review-articles-part-1/</link>
		<comments>http://www.dataminingblog.com/outlier-detection-in-two-review-articles-part-1/#comments</comments>
		<pubDate>Sat, 12 May 2012 18:11:10 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1640</guid>
		<description><![CDATA[If you need to read two review articles about outlier detection, the first one is&#8230;
Outlier Detection: A Survey
The first one, Outlier Detection: A Survey, is written by Chandola, Banerjee and Kumar. They define outlier detection as the problem of &#8220;[...] finding patterns in data that do not conform to expected normal behavior&#8220;. After an introduction [...]]]></description>
			<content:encoded><![CDATA[<p>If you need to read two review articles about outlier detection, the first one is&#8230;<em></em></p>
<p><strong>Outlier Detection: A Survey</strong></p>
<p>The first one, <em>Outlier Detection: A Survey</em>, is written by Chandola, Banerjee and Kumar. They define outlier detection as the problem of &#8220;<em>[...] finding patterns in data that do not conform to expected normal behavior</em>&#8220;. After an introduction to what outliers are, authors present current challenges in this field. In my experience, non-availability of labeled data is a major one.</p>
<p>The authors proposes three types of supervisions. In supervised outlier detection we make the assumption that labeled data are available. Semi-supervised outlier detection assumes that only one class of labeled data is available. Techniques which models normal instances as the only class are more popular (since normal instances are easier to obtain). The third approach, unsupervised outlier detection, is the most widely used one. The paper continues by describing three types of outliers. Authors then describes several applications of outliers detection in areas such as intrusion detection, fraud detection, industrial damage detection, image processing, etc.</p>
<p>Techniques used for outlier detection are then described. It is surprising to read that most data mining techniques can be applied to the task of outlier detection. For example: neural networks, SVM, rule-based, clustering, nearest neighbors, regression, etc. The articles continues with several other techniques. Authors also describe ways to evaluate results of outlier detection with false positive, false negative and ROC curve. To be noted the 19 pages (!) of references to other articles in the field. One of their main conclusions is that &#8220;<em>[...] outlier detection is not a well-formulated problem</em>&#8220;. It is your job, as a data miner, to formulate it correctly.</p>
<p>Link to <a href="http://www.cs.umn.edu/tech_reports_upload/tr2007/old_files/07-017.pdf">Outlier Detection: A Survey</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Foutlier-detection-in-two-review-articles-part-1%2F&amp;linkname=Outlier%20detection%20in%20two%20review%20articles%20%28Part%201%29"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/Qf-Zhn6DBIsMn0NzYlfV8EXJIYY/0/da"><img src="http://feedads.g.doubleclick.net/~a/Qf-Zhn6DBIsMn0NzYlfV8EXJIYY/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/Qf-Zhn6DBIsMn0NzYlfV8EXJIYY/1/da"><img src="http://feedads.g.doubleclick.net/~a/Qf-Zhn6DBIsMn0NzYlfV8EXJIYY/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=dRTj2cR50Iw:XEEh82bAoFw:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=dRTj2cR50Iw:XEEh82bAoFw:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=dRTj2cR50Iw:XEEh82bAoFw:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=dRTj2cR50Iw:XEEh82bAoFw:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=dRTj2cR50Iw:XEEh82bAoFw:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=dRTj2cR50Iw:XEEh82bAoFw:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=dRTj2cR50Iw:XEEh82bAoFw:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=dRTj2cR50Iw:XEEh82bAoFw:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/dRTj2cR50Iw" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/outlier-detection-in-two-review-articles-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining Interview: Meta Brown</title>
		<link>http://www.dataminingblog.com/data-mining-interview-meta-brown/</link>
		<comments>http://www.dataminingblog.com/data-mining-interview-meta-brown/#comments</comments>
		<pubDate>Sun, 29 Apr 2012 16:00:24 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1621</guid>
		<description><![CDATA[Meta Brown, General Manager of Analytics at LinguaSys, has kindly accepted to answer a few questions for Data Mining Research. I would like to thank Meta for her time and her advices in Analytics.
Data Mining Research: Who are you and what is your story?
Meta Brown: I&#8217;m a practical, plain-talking data analyst and engineer.  I use [...]]]></description>
			<content:encoded><![CDATA[<p><em><a href="http://www.dataminingblog.com/wp-content/uploads/Meta1.jpg"><img class="alignright size-full wp-image-1623" title="Meta" src="http://www.dataminingblog.com/wp-content/uploads/Meta1.jpg" alt="Meta" width="137" height="135" /></a>Meta Brown, General Manager of Analytics at LinguaSys, has kindly accepted to answer a few questions for Data Mining Research. I would like to thank Meta for her time and her advices in Analytics.</em></p>
<p><strong>Data Mining Research: Who are you and what is your story?</strong></p>
<p><strong>Meta Brown:</strong> I&#8217;m a practical, plain-talking data analyst and engineer.  I use data to tell true stories, and teach others to do the same. So many of my clients do worthwhile things I could not do myself &#8211; improve medical care, provide valuable social services, create and sell useful products &#8211; that it is my pleasure to help each of them find ways to work a little better (or a lot) through analytics.</p>
<p><strong>DMR: What services do you offer as a consultant in analytics?</strong></p>
<p><strong>MB:</strong> Guidance for organizations initiating or expanding analytics programs. Analytics only improves your bottom line if you begin with a realistic plan for making that happen. My role is to help you create and act on that plan, beginning with identification of goals and working backwards to define the steps you must take to meet those goals. This is an empowerment process that leaves your organization stronger, and more independent.</p>
<p>My availability for consulting is limited now, as my primary role is General Manager of Analytics at LinguaSys, a language technology firm. My work there is all about the coolest in text analytics.</p>
<p><strong>DMR: What is the most important thing you have learned as a consultant in analytics?</strong><strong></strong></p>
<p><strong>MB:</strong> Consulting is valuable to those who are open to changing what they do. Others are wasting their money. Consultants have expertise, the ability to offer valuable information and advice, but only clients have the power to act on that input. Advice that is merely good, but executed brilliantly, yields far more value than brilliant advice without action.</p>
<p><strong>DMR: What advice would you give to a business analytics manager?</strong><strong></strong></p>
<p><strong>MB:</strong> Set goals based on the primary concerns of decision makers, and make certain that most of your goals are things you&#8217;re sure you can do well and on time. Break up big goals into bite-size goals, and document your success as you go. This ensures survival, and leaves a little room for risk.</p>
<p>You can find more information about Meta Brown on her website: <a href="http://www.metabrown.com/">www.metabrown.com</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fdata-mining-interview-meta-brown%2F&amp;linkname=Data%20Mining%20Interview%3A%20Meta%20Brown"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/yeka965bZsyLgY7UUi5lBl6m2TQ/0/da"><img src="http://feedads.g.doubleclick.net/~a/yeka965bZsyLgY7UUi5lBl6m2TQ/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/yeka965bZsyLgY7UUi5lBl6m2TQ/1/da"><img src="http://feedads.g.doubleclick.net/~a/yeka965bZsyLgY7UUi5lBl6m2TQ/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=VSmiSkXk-ak:fSj7pB6mip8:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=VSmiSkXk-ak:fSj7pB6mip8:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=VSmiSkXk-ak:fSj7pB6mip8:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=VSmiSkXk-ak:fSj7pB6mip8:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=VSmiSkXk-ak:fSj7pB6mip8:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=VSmiSkXk-ak:fSj7pB6mip8:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=VSmiSkXk-ak:fSj7pB6mip8:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=VSmiSkXk-ak:fSj7pB6mip8:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/VSmiSkXk-ak" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/data-mining-interview-meta-brown/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining Book Review: The Value of Business Analytics</title>
		<link>http://www.dataminingblog.com/data-mining-book-review-the-value-of-business-analytics/</link>
		<comments>http://www.dataminingblog.com/data-mining-book-review-the-value-of-business-analytics/#comments</comments>
		<pubDate>Fri, 13 Apr 2012 17:59:54 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1615</guid>
		<description><![CDATA[Today&#8217;s book review is about The Value of Business Analytics &#8211; Identifying The Path to Profitability, from Evan Stubbs. I won&#8217;t keep any suspense: the book is excellent! It&#8217;s a must have for any person trying to apply analytics within a company. The book is published by Wiley/SAS, but don&#8217;t worry, there is no promotion of SAS [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/Value-of-BA.jpg"><img class="alignleft size-full wp-image-1617" title="Value of BA" src="http://www.dataminingblog.com/wp-content/uploads/Value-of-BA.jpg" alt="Value of BA" width="194" height="259" /></a>Today&#8217;s book review is about <em>The Value of Business Analytics &#8211; Identifying The Path to Profitability</em>, from Evan Stubbs. I won&#8217;t keep any suspense: the book is excellent! It&#8217;s a must have for any person trying to apply analytics within a company. The book is published by Wiley/SAS, but don&#8217;t worry, there is no promotion of SAS tools. The book is clearly about going from analytics to action. In other words, how do you transform analytics results into an operational process in the company. Such kind of books exist for a long time in the field of Business Intelligence, but not in the data mining area. Out of the <em>Analytics at Work</em>, I cannot think of books that answer the question of industrializing analytics. I have been waiting for Stubbs&#8217; book for years, and you will understand why below.</p>
<p>After a short introduction, the book is divided into chapters with the following business analytics topics: importance, delivery, value, communication, execution plan and measurement framework. The final chapter brings everything together. Stubbs clearly shows the importance of business analytics (BA) and how it is useful for competitive advantage. Stubbs not only explains why, but how to find these advantages. The book goes one step further from <em>Analytics at Work</em>, by providing methodologies and procedures to achieve what is advised by the author. In addition to the process of making value from analytics, Stubbs define the analytics team members, one by one. To be noted, the excellent and comprehensive glossary at the end of the book.</p>
<p>The presence of checklists for BA managers are very appreciated. Stubbs emphasizes on the importance of communication when dealing with BA. Fundamental aspects of communications are explained with focus on BA. The funny thing with Stubbs&#8217; book is that I can find myself in several of his examples. <em>The Value of Business Analytics</em> is definitely THE book for managing analytics projects. After reading his book, it&#8217;s clear for me that Stubbs has a lot of experience in the field. I will finish with a quote that summarizes the book: &#8220;<em>Value is created only when action is taken, not when insight is generated.</em>&#8221;</p>
<p><a href="http://www.amazon.com/gp/product/1118012399/ref=as_li_tf_tl?ie=UTF8&amp;tag=dataminirese-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1118012399">The Value of Business Analytics: Identifying the Path to Profitability</a><img style="border:none !important; margin:0px !important;" src="http://www.assoc-amazon.com/e/ir?t=dataminirese-20&amp;l=as2&amp;o=1&amp;a=1118012399" border="0" alt="" width="1" height="1" /></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fdata-mining-book-review-the-value-of-business-analytics%2F&amp;linkname=Data%20Mining%20Book%20Review%3A%20The%20Value%20of%20Business%20Analytics"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/KUXjXpT0Z-H3bbK-ZXEzfJlDj-I/0/da"><img src="http://feedads.g.doubleclick.net/~a/KUXjXpT0Z-H3bbK-ZXEzfJlDj-I/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/KUXjXpT0Z-H3bbK-ZXEzfJlDj-I/1/da"><img src="http://feedads.g.doubleclick.net/~a/KUXjXpT0Z-H3bbK-ZXEzfJlDj-I/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Y4--yjPnlwA:LzkoAkbWkVY:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Y4--yjPnlwA:LzkoAkbWkVY:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Y4--yjPnlwA:LzkoAkbWkVY:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Y4--yjPnlwA:LzkoAkbWkVY:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Y4--yjPnlwA:LzkoAkbWkVY:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Y4--yjPnlwA:LzkoAkbWkVY:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Y4--yjPnlwA:LzkoAkbWkVY:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Y4--yjPnlwA:LzkoAkbWkVY:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/Y4--yjPnlwA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/data-mining-book-review-the-value-of-business-analytics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Data Mining Guest Post: Gaurav Vohra</title>
		<link>http://www.dataminingblog.com/data-mining-guest-post-gaurav-vohra/</link>
		<comments>http://www.dataminingblog.com/data-mining-guest-post-gaurav-vohra/#comments</comments>
		<pubDate>Sun, 01 Apr 2012 16:31:44 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1608</guid>
		<description><![CDATA[It&#8217;s my pleasure to welcome Gaurav Vohra on Data Mining Research for a guest post. He writes about analytics in India. Thanks Gaurav, for your contribution!
How India became the global hub for analytics?
The economic reforms at  the start of the 90s led to liberalization of government policies and  encouraged multi-nationals such as GE, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/Gaurav.jpg"><img class="alignright size-full wp-image-1609" title="Gaurav" src="http://www.dataminingblog.com/wp-content/uploads/Gaurav.jpg" alt="Gaurav" width="146" height="192" /></a><em>It&#8217;s my pleasure to welcome Gaurav Vohra on Data Mining Research for a guest post. He writes about analytics in India. Thanks Gaurav, for your contribution!</em></p>
<p><strong>How India became the global hub for analytics?</strong></p>
<p>The economic reforms at  the start of the 90s led to liberalization of government policies and  encouraged multi-nationals such as GE, American Express and British  airways to enter India. Initially these companies sought to move their  back-end operations such as transaction processing and customer service.  These areas were traditionally considered low-value and resource  intensive and these companies realised huge cost savings due to the low  cost of labor in India.</p>
<p>As  offshoring matured, so did the kind of work that was offshored. GE and  American Express were amongst the first companies to identify an  abundance of analytic talent in India. They started to move some of the  high-end work such as data analysis and predictive modelling to India.  The results were very successful and with the availability of PhDs and  other highly skilled resources in India at about one-tenth the cost in  the US, this strategy was a clear win for the businesses.</p>
<p>Soon,  financial services companies such as Citibank and HSBC, online  companies like Google and Amazon, and retailers such as Target, Tesco  and Supervalu also set up their analytics teams in India.</p>
<p>Indian  IT companies had already made huge strides in the global arena and they  also realized that offering analytics services to their clients was the  ideal way to move up the value chain. IT giants like Infosys, Wipro,  Cognizant and TCS also jumped into the fray.</p>
<p><strong>Last 10 years</strong></p>
<p>All  this happened in 90s and the first half of the 2000s and by this time,  there were several people in India who had spent significant time in  this field. Entrepreneurs with strong analytics experience set up  companies such as MuSigma, Fractal analytics, Manthan systems and  Marketelligent – niche analytics companies serving clients across the  globe.</p>
<p>Finally,  the latest entrants in the field of analytics in India are Indian  businesses themselves. Financial services companies such as ICICI and  HDFC led the way for Indian businesses. Now analytics usage has spread  to retail, telecom, e-commerce, insurance, healthcare and even sports.</p>
<p>There  are no official figures available around the size of the analytics  industry in India. As per crude estimates, the analytics industry in  India employs anywhere between 80000 to 300000 people, depending on how  you define analytics. And it is growing at a rapid pace.</p>
<p>Analytics salaries in India have grown by 400% in the last 10 years and even the recession had little impact on this growth.</p>
<p><strong>Future prospects</strong></p>
<p>The  future of analytics industry in India looks very promising. The growth  is fuelled not only by global companies but also by the domestic market  which is increasingly turning to analytics to stay competitive. The  author firmly believes that businesses not only in India but around the  world will continue to rely increasingly on analytics and it is truly a  career for the future.</p>
<p><strong>Author bio</strong></p>
<p><em>Gaurav Vohra is an alumnus of IIM Bangalore with over 10 years of experience in the field of analytics. <span>Gaurav has been in the analytics industry from its initial days and h</span>is  career has spanned companies like Capital One and Information resources  Inc., recognized as thought-leaders in the analytics space. Gaurav is now the co-founder of <strong>Jigsaw academy</strong>,  a training institute that aims to meet the growing demand for talent in  the field of analytics by providing industry-relevant training to  develop business-ready professionals. You can visit Gaurav’s blog at <strong><span style="color: #0f243e;"><a href="http://blog.jigsawacademy.in/" target="_blank">http://blog.jigsawacademy.in</a></span></strong></em></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fdata-mining-guest-post-gaurav-vohra%2F&amp;linkname=Data%20Mining%20Guest%20Post%3A%20Gaurav%20Vohra"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/iiitP1CYFlPgsg8wZCyIJCXAE6U/0/da"><img src="http://feedads.g.doubleclick.net/~a/iiitP1CYFlPgsg8wZCyIJCXAE6U/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/iiitP1CYFlPgsg8wZCyIJCXAE6U/1/da"><img src="http://feedads.g.doubleclick.net/~a/iiitP1CYFlPgsg8wZCyIJCXAE6U/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=rUfbFHHnGLc:Bel8-HRic3M:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=rUfbFHHnGLc:Bel8-HRic3M:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=rUfbFHHnGLc:Bel8-HRic3M:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=rUfbFHHnGLc:Bel8-HRic3M:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=rUfbFHHnGLc:Bel8-HRic3M:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=rUfbFHHnGLc:Bel8-HRic3M:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=rUfbFHHnGLc:Bel8-HRic3M:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=rUfbFHHnGLc:Bel8-HRic3M:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/rUfbFHHnGLc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/data-mining-guest-post-gaurav-vohra/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Statistical Analysis: Common Mistakes</title>
		<link>http://www.dataminingblog.com/statistical-analysis-common-mistakes/</link>
		<comments>http://www.dataminingblog.com/statistical-analysis-common-mistakes/#comments</comments>
		<pubDate>Wed, 21 Mar 2012 18:02:42 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1585</guid>
		<description><![CDATA[Whether you are a beginner in the field or an expert in statistics, the article by Dubey and Rajaram, 5 Common Mistakes People Make in the Name of Statistical Analysis, is a must read. The paper starts with this excellent example:
&#8220;Imagine you are a regional sales head for a major retailer in U.S. and you [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/caution21.png"><img class="alignright size-full wp-image-1588" title="caution2" src="http://www.dataminingblog.com/wp-content/uploads/caution21.png" alt="caution2" width="140" height="140" /></a>Whether you are a beginner in the field or an expert in statistics, the article by Dubey and Rajaram, <em>5 Common Mistakes People Make in the Name of Statistical Analysis</em>, is a must read. The paper starts with this excellent example:</p>
<p>&#8220;<em>Imagine you are a regional sales head for a major retailer in U.S. and you want to know what drives sales in your top performing stores. Your research team comes back with a revealing insight &#8211; the most significant predictor in their model is the average number of cars present in stores’ parking lots.</em>&#8221;</p>
<p>The five following mistakes are detailed and explained by the authors:</p>
<ol>
<li>Sophistication in statistics compensates for lack of data and/or business understanding.</li>
<li>Extracting meaning out of randomness</li>
<li>Correlation versus causation – modeling will help uncover causal relationships</li>
<li>Extrapolating the models way beyond the permissible limits</li>
<li>Imputing missing values with mean or median is the best way of treating missing values</li>
</ol>
<p>This article is an excellent reminder for practitioners and I strongly advise it.</p>
<p>Read the full article: <a href="http://www.information-management.com/newsletters/statistics-analytics-data-quality-mistakes-10021692-1.html">5 Common Mistakes People Make in the Name of Statistical Analysis</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fstatistical-analysis-common-mistakes%2F&amp;linkname=Statistical%20Analysis%3A%20Common%20Mistakes"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/KW_g3TJWDmKusw3F-LQO4Exa6-w/0/da"><img src="http://feedads.g.doubleclick.net/~a/KW_g3TJWDmKusw3F-LQO4Exa6-w/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/KW_g3TJWDmKusw3F-LQO4Exa6-w/1/da"><img src="http://feedads.g.doubleclick.net/~a/KW_g3TJWDmKusw3F-LQO4Exa6-w/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Q2jE2yVWCFQ:7S4KnxeXcF0:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Q2jE2yVWCFQ:7S4KnxeXcF0:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Q2jE2yVWCFQ:7S4KnxeXcF0:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Q2jE2yVWCFQ:7S4KnxeXcF0:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Q2jE2yVWCFQ:7S4KnxeXcF0:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Q2jE2yVWCFQ:7S4KnxeXcF0:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Q2jE2yVWCFQ:7S4KnxeXcF0:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Q2jE2yVWCFQ:7S4KnxeXcF0:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/Q2jE2yVWCFQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/statistical-analysis-common-mistakes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Data Mining Book Review: Handbook of Statistical Analysis and Data Mining Applications</title>
		<link>http://www.dataminingblog.com/data-mining-book-review-handbook-of-statistical-analysis-and-data-mining-applications/</link>
		<comments>http://www.dataminingblog.com/data-mining-book-review-handbook-of-statistical-analysis-and-data-mining-applications/#comments</comments>
		<pubDate>Sat, 10 Mar 2012 21:35:24 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1571</guid>
		<description><![CDATA[The book from Robert Nisbet, John Elder and Gary Miner is a fresh addition to any data miner library. First surprise: the book is in full colors with a lot of pictures which is a good point. With a focus on data mining applications, the book also covers introduction and more profound data mining concepts. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/saadm.PNG"><img class="alignleft size-full wp-image-1572" title="saadm" src="http://www.dataminingblog.com/wp-content/uploads/saadm.PNG" alt="saadm" width="150" height="185" /></a>The book from Robert Nisbet, John Elder and Gary Miner is a fresh addition to any data miner library. First surprise: the book is in full colors with a lot of pictures which is a good point. With a focus on data mining applications, the book also covers introduction and more profound data mining concepts. The book isn&#8217;t very technical and doesn&#8217;t contain equations and complex formulas. However, complex data mining challenges are addressed. The main advantage of the book is to give examples of each concept using three famous data mining tools: SPSS, SAS and STATISTICA.</p>
<p>The biggest added value of the book, and what makes it a unique resource in the field, are the tutorials. Several applications are detailed: aviation safety, unsatisfied customers, credit scoring and process control among others. Whatever your field of application is, you will find useful examples in the book. Even if the case studies aren&#8217;t in your field of interest, reading them takes you to an amazing journey in the world of analytics.</p>
<p>To be noted the interesting chapters about text mining and fraud detection. In the last part of the book, five chapters provide expert advices to both beginners and experienced data miners. The chapter &#8220;Top 10 Data Mining Mistakes&#8221; is remarkable. To conclude, this book is about applying data mining, rather than programming it. It gives excellent examples of enterprise applications in analytics.</p>
<p><a href="http://www.amazon.com/gp/product/0123747651/ref=as_li_tf_tl?ie=UTF8&amp;tag=dataminirese-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=0123747651">Handbook of Statistical Analysis and Data Mining Applications</a><img style="border:none !important; margin:0px !important;" src="http://www.assoc-amazon.com/e/ir?t=dataminirese-20&amp;l=as2&amp;o=1&amp;a=0123747651" border="0" alt="" width="1" height="1" /></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fdata-mining-book-review-handbook-of-statistical-analysis-and-data-mining-applications%2F&amp;linkname=Data%20Mining%20Book%20Review%3A%20Handbook%20of%20Statistical%20Analysis%20and%20Data%20Mining%20Applications"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/NWvtn3VpumCIssPObvQ5x0zOKa8/0/da"><img src="http://feedads.g.doubleclick.net/~a/NWvtn3VpumCIssPObvQ5x0zOKa8/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/NWvtn3VpumCIssPObvQ5x0zOKa8/1/da"><img src="http://feedads.g.doubleclick.net/~a/NWvtn3VpumCIssPObvQ5x0zOKa8/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=ogzC4wE05jU:uMJdo8EOYIE:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=ogzC4wE05jU:uMJdo8EOYIE:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=ogzC4wE05jU:uMJdo8EOYIE:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=ogzC4wE05jU:uMJdo8EOYIE:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=ogzC4wE05jU:uMJdo8EOYIE:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=ogzC4wE05jU:uMJdo8EOYIE:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=ogzC4wE05jU:uMJdo8EOYIE:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=ogzC4wE05jU:uMJdo8EOYIE:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/ogzC4wE05jU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/data-mining-book-review-handbook-of-statistical-analysis-and-data-mining-applications/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Selling Data Mining to Management</title>
		<link>http://www.dataminingblog.com/selling-data-mining-to-management/</link>
		<comments>http://www.dataminingblog.com/selling-data-mining-to-management/#comments</comments>
		<pubDate>Sun, 19 Feb 2012 16:26:13 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1576</guid>
		<description><![CDATA[Preparing data and building data mining models are two very well documented steps of analytics projects. However, whatever interesting your results are, they are useless if no action is taken. Thus, the step from analytics to action is a crucial one in any analytics project. Imagine you have the best data and found the best [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/sellingDM.PNG"><img class="alignright size-full wp-image-1578" title="sellingDM" src="http://www.dataminingblog.com/wp-content/uploads/sellingDM.PNG" alt="sellingDM" width="200" height="150" /></a>Preparing data and building data mining models are two very well documented steps of analytics projects. However, whatever interesting your results are, they are useless if no action is taken. Thus, the step from analytics to action is a crucial one in any analytics project. Imagine you have the best data and found the best model of all time. You need to industrialize the data mining solution to make your company benefits from them. Often, you will first need to sell your project to the management.</p>
<p>I recently read three interesting articles on this topic. The first one, <em>Selling Information Governance to Business Leaders</em>, by Sunil Soares, gives four tips explaining the value of analytics to business leaders. The first tip, focusing on business outcome, is extremely important. You may have a perfect data mining application, predicting your target with 99% of accuracy. If you can&#8217;t transform this 99% accuracy into a usable and industrialized solution for the company, the project will not bring any ROI for the company.</p>
<p>The second paper, <em>Selling a Data Mining Project to Management</em> is written by Casey Klimasauskas. It proposes an approach for selling a new data mining project to management. Part of the strategy deals with developing relationships with the stakeholders from the very beginning of the project. This minimizes the risk of future objections and increase the supporters of the project.</p>
<p>Finally, <em>What the C-suite should know about analytics</em>, is an interesting article written by Kishore S. Swaminathan. Found in the sascom magazine (but originally from Accenture), the article lists five areas to focus on in order to bring analytics to the management. The advices given in this article are very relevant. I particularly liked the 10 characteristics of an analytic leader. For more information, I provided the links to these three articles below.</p>
<p><a href="http://www.information-management.com/newsletters/governance-ROI-BI-business-rules-GRC-10021663-1.html">Selling Information Governance to Business Leaders</a></p>
<p><a href="http://www.b-eye-network.com/view/15734">Selling a Data Mining Project to Management</a></p>
<p><a href="http://www.sas.com/news/sascom/2012q1/business_analytics.html">What the C-suite should know about analytics</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fselling-data-mining-to-management%2F&amp;linkname=Selling%20Data%20Mining%20to%20Management"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/L3c8u2vxXdxwE75wFdItFrtQDUs/0/da"><img src="http://feedads.g.doubleclick.net/~a/L3c8u2vxXdxwE75wFdItFrtQDUs/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/L3c8u2vxXdxwE75wFdItFrtQDUs/1/da"><img src="http://feedads.g.doubleclick.net/~a/L3c8u2vxXdxwE75wFdItFrtQDUs/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=48_pVMzG-e8:oR-iHP5kE8A:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=48_pVMzG-e8:oR-iHP5kE8A:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=48_pVMzG-e8:oR-iHP5kE8A:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=48_pVMzG-e8:oR-iHP5kE8A:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=48_pVMzG-e8:oR-iHP5kE8A:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=48_pVMzG-e8:oR-iHP5kE8A:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=48_pVMzG-e8:oR-iHP5kE8A:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=48_pVMzG-e8:oR-iHP5kE8A:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/48_pVMzG-e8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/selling-data-mining-to-management/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Analytics and Data Science Reports</title>
		<link>http://www.dataminingblog.com/analytics-and-data-science-reports/</link>
		<comments>http://www.dataminingblog.com/analytics-and-data-science-reports/#comments</comments>
		<pubDate>Sun, 05 Feb 2012 18:04:55 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1581</guid>
		<description><![CDATA[I would like to share with you two reports regarding analytics that I read recently. The first one is Intelligence for Everyone: Transforming Business Analytics Across the Enterprise is written by Irfan Khan from SAP. According to the author, big data is a big lie. The fear from the industry regarding big data is not [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/report.png"><img class="alignright size-full wp-image-1582" title="report" src="http://www.dataminingblog.com/wp-content/uploads/report.png" alt="report" width="180" height="180" /></a>I would like to share with you two reports regarding analytics that I read recently. The first one is <em>Intelligence for Everyone: Transforming Business Analytics Across the Enterprise</em> is written by Irfan Khan from SAP. According to the author, big data is a big lie. The fear from the industry regarding big data is not justified. The article proposes use cases in analytics for different industry sectors.</p>
<p>The second paper, <em>Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field</em> (EMC Data Science Community), proposes a benchmark of the data science community (understand data mining and business intelligence practitioners). The study tries to differentiate data scientists and business intelligence people. For that, about 500 practitioners have been interviewed. I find their concluding paragraph very relevant:</p>
<p>&#8220;<em>Once companies have brought in the right talent, they need to create an environment conducive to effective data science. That means building high-performing, cross-functional teams that include a variety of roles, including programmers, statisticians, and graphic designers, and aligning them to directly support interested business decision makers. They should also loosen restrictions on data in the enterprise, allowing employees to more freely run data-driven experiments. Finally, data scientists should be given free access to run experiments on data, without bureaucratic obstacles, so that they can rapidly translate their own intellectual curiosity into business results.</em>&#8221;</p>
<p><a href="http://www.emc.com/collateral/about/news/emc-data-science-study-wp.pdf">Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field</a></p>
<p><a href="http://www.sybase.com/analyticsguide">Intelligence for Everyone: Transforming Business Analytics Across the Enterprise</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fanalytics-and-data-science-reports%2F&amp;linkname=Analytics%20and%20Data%20Science%20Reports"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/QhbtvRVuYk-GFGTjkK3LYEJhl9E/0/da"><img src="http://feedads.g.doubleclick.net/~a/QhbtvRVuYk-GFGTjkK3LYEJhl9E/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/QhbtvRVuYk-GFGTjkK3LYEJhl9E/1/da"><img src="http://feedads.g.doubleclick.net/~a/QhbtvRVuYk-GFGTjkK3LYEJhl9E/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=3Vt0--HIZRk:7PyKzrkxS2E:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=3Vt0--HIZRk:7PyKzrkxS2E:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=3Vt0--HIZRk:7PyKzrkxS2E:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=3Vt0--HIZRk:7PyKzrkxS2E:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=3Vt0--HIZRk:7PyKzrkxS2E:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=3Vt0--HIZRk:7PyKzrkxS2E:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=3Vt0--HIZRk:7PyKzrkxS2E:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=3Vt0--HIZRk:7PyKzrkxS2E:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/3Vt0--HIZRk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/analytics-and-data-science-reports/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining Guest Post: Eric Greenwood</title>
		<link>http://www.dataminingblog.com/data-mining-guest-post-eric-greenwood/</link>
		<comments>http://www.dataminingblog.com/data-mining-guest-post-eric-greenwood/#comments</comments>
		<pubDate>Wed, 25 Jan 2012 18:38:23 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1554</guid>
		<description><![CDATA[Today, Eric Greenwood, expert in data storage, is our guest blogger on Data Mining Research. Thanks for his post and feel free to comment about his input.
Data Mining, Refining, and Storage
As companies are able to more effectively store and retrieve information via online storage services, the relevance of data mining as the crucial first step [...]]]></description>
			<content:encoded><![CDATA[<p><em>Today, Eric Greenwood, expert in data storage, is our guest blogger on Data Mining Research. Thanks for his post and feel free to comment about his input.</em></p>
<p><strong>Data Mining, Refining, and Storage</strong></p>
<p>As companies are able to more effectively store and retrieve information via online storage services, the relevance of data mining as the crucial first step towards intelligent business decisions will become an important tool that was previously unavailable for small and medium-sized businesses and may usher in a new era of success for those who can afford to make the change.</p>
<p><strong>Online Storage: Addition by Subtraction</strong></p>
<p>The effect on large and small companies of using services that provide online storage should be evaluated by looking at what the process takes away rather than what it necessarily adds.  More and more companies are finding themselves practically bombarded with the exponential growth of data that is available about their customers, provided in no small part by the increasing number of internet-connected sensors that people own—smart phones, tablets, laptops, to name just a few.</p>
<p>Just consider this snippet taken from a white paper produced by Globalknowledge, a leader in IT and business skills training:</p>
<p><em>“According to International Data Corporation (IDC), ‘The proliferation of devices, compliance, improved systems performance, online commerce and increased replication to secondary or backup sites is contributing to an annual doubling of the amount of information transmitted over the Internet.’”</em></p>
<p>Indeed, a veritable ocean of information has been created, and its unexplored depths contain the potential for game-changing discoveries.  Unfortunately for medium-sized businesses, or even larger businesses that find themselves underprepared, the challenges of collecting their relevant data efficiently and storing it securely are enough to keep them too busy to participate in this feeding frenzy of information.  That’s where the developing industry of online storage comes in.</p>
<p>Online storage as a service is a young industry, and as with any young industry there are still some very real customer concerns that need to be addressed before it can be widely accepted as a solution.  Specifically, those who wish to provide online storage to customers at a large scale will need to bridge the security gaps that are still associated with it; however, as those issues are resolved with time, purveyors of online storage services that position themselves to accommodate this massive influx of data from small to medium-sized businesses may find that they have customers knocking down their doors.</p>
<p>The introduction of online storage as a service to small and medium-sized businesses that are having difficulty managing their data has a more profound effect on their bottom lines than it might initially seem.  That’s because rather than spending too much time trying to manage data, companies can shift their primary focus toward understanding it, which is a monumental task unto itself.</p>
<p>If the process of local storage and management was a difficult burden, then the process of knowledge discovery required to understand it is a seemingly insurmountable one—how can any person possibly hope to make sense of petabytes of information?  The answer to this question, and the key first step toward vital business intelligence for curious companies, is data mining.  As an automated process, data mining can help companies identify patterns in their data that can be analyzed in detail and used in predictive or product-oriented capacities.</p>
<p>In a dark room, data mining is like emergency lights that come on and provide much needed guidance and direction.  Without the benefit of online storage to relieve the mounting pressure from data overload, though, small and medium-sized businesses are in danger of being too busy to see the light.</p>
<p><em>Eric Greenwood is a technophile whose interests span the range of data management, online storage, business intelligence and much more – Read more of his work at the blog </em><a href="http://www.onlinestorage.org/"><em>Online Storage</em></a><em>!</em></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fdata-mining-guest-post-eric-greenwood%2F&amp;linkname=Data%20Mining%20Guest%20Post%3A%20Eric%20Greenwood"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/F_cbTg4d0qWWXlhQDKuo4FkWL5A/0/da"><img src="http://feedads.g.doubleclick.net/~a/F_cbTg4d0qWWXlhQDKuo4FkWL5A/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/F_cbTg4d0qWWXlhQDKuo4FkWL5A/1/da"><img src="http://feedads.g.doubleclick.net/~a/F_cbTg4d0qWWXlhQDKuo4FkWL5A/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Sb7gmwWTb8w:YXoNkyO4tR4:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Sb7gmwWTb8w:YXoNkyO4tR4:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Sb7gmwWTb8w:YXoNkyO4tR4:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Sb7gmwWTb8w:YXoNkyO4tR4:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Sb7gmwWTb8w:YXoNkyO4tR4:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Sb7gmwWTb8w:YXoNkyO4tR4:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=Sb7gmwWTb8w:YXoNkyO4tR4:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=Sb7gmwWTb8w:YXoNkyO4tR4:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/Sb7gmwWTb8w" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/data-mining-guest-post-eric-greenwood/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining Book Review: Decision Management Systems</title>
		<link>http://www.dataminingblog.com/data-mining-book-review-decision-management-systems/</link>
		<comments>http://www.dataminingblog.com/data-mining-book-review-decision-management-systems/#comments</comments>
		<pubDate>Mon, 16 Jan 2012 16:54:09 +0000</pubDate>
		<dc:creator>Sandro Saitta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.dataminingblog.com/?p=1542</guid>
		<description><![CDATA[I recently read the last book from James Taylor, Decision Management Systems: A Practical Guide to Using Business Rules and Predictive Analytics. As a data miner, I was interested by the subtitle of the book. Although, the book is really well written, I&#8217;m a bit disappointed regarding the content for someone in analytics. I was [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dataminingblog.com/wp-content/uploads/DMS.PNG"><img class="alignleft size-full wp-image-1543" title="DMS" src="http://www.dataminingblog.com/wp-content/uploads/DMS.PNG" alt="DMS" width="173" height="260" /></a>I recently read the last book from James Taylor, <em>Decision Management Systems: A Practical Guide to Using Business Rules and Predictive Analytics</em>. As a data miner, I was interested by the subtitle of the book. Although, the book is really well written, I&#8217;m a bit disappointed regarding the content for someone in analytics. I was expecting real methodologies and examples to move from analytics to actions in the company. How to successfully apply predictive analytics in the industry. The book only partly answers this question and gives mainly examples of <em>business rules</em> and how they are applied in companies.</p>
<p>If you come from the business side (e.g. C-level), the book may be interesting but the explanations about predictive analytics are quite light and you won&#8217;t see all benefits of these techniques in the company. I know that the main focus of the book is not about teaching analytics. It seems also not to be about filling the gap between analytics and action. I&#8217;m thus a bit confused about the real objective of the book. It is also explaining concepts at a very high level of abstraction. It is thus not directly usable in practice.</p>
<p>The book is divided in three parts. In the first part, James explains what are DMS and why they are useful for the company. The second part focuses on building these DMS. The third part is about the enablers (people, processes and technology), i.e. the aspects that will allow such DMS to be a successful initiative. Personally, I found the book very interesting starting from chapter 6 (Design and Implement Decision Services). The topic of fraud detection and prevention is very well studied throughout the book.</p>
<p>A very strange choice has been made to repeat in full text the expression <em>Decision Management Systems</em> hundreds of times. It thus makes the reading sometimes a bit tiring. The simple use of the abbreviation DMS would have solved this issue. To conclude, I found the book interesting and well written. However, keep in mind that it is written with a very high level of abstraction. You will thus have a clear understanding of the domain, but no practical advices.</p>
<p><a href="http://www.amazon.com/gp/product/0132884380/ref=as_li_tf_tl?ie=UTF8&amp;tag=dataminirese-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=0132884380">Decision Management Systems: A Practical Guide to Using Business Rules and Predictive Analytics</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.dataminingblog.com%2Fdata-mining-book-review-decision-management-systems%2F&amp;linkname=Data%20Mining%20Book%20Review%3A%20Decision%20Management%20Systems"><img src="http://www.dataminingblog.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>
<p><a href="http://feedads.g.doubleclick.net/~a/t0yAhOoGZicDrsZdx1396Q9N_0k/0/da"><img src="http://feedads.g.doubleclick.net/~a/t0yAhOoGZicDrsZdx1396Q9N_0k/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/t0yAhOoGZicDrsZdx1396Q9N_0k/1/da"><img src="http://feedads.g.doubleclick.net/~a/t0yAhOoGZicDrsZdx1396Q9N_0k/1/di" border="0" ismap="true"></img></a></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/dataminingblog?a=-2-mYIx3gT8:CNOgysiCIOA:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=-2-mYIx3gT8:CNOgysiCIOA:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=-2-mYIx3gT8:CNOgysiCIOA:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=-2-mYIx3gT8:CNOgysiCIOA:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=-2-mYIx3gT8:CNOgysiCIOA:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=-2-mYIx3gT8:CNOgysiCIOA:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/dataminingblog?i=-2-mYIx3gT8:CNOgysiCIOA:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/dataminingblog?a=-2-mYIx3gT8:CNOgysiCIOA:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/dataminingblog?d=dnMXMwOfBR0" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/dataminingblog/~4/-2-mYIx3gT8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.dataminingblog.com/data-mining-book-review-decision-management-systems/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

