<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss1full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:cc="http://web.resource.org/cc/" xmlns="http://purl.org/rss/1.0/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">

<channel rdf:about="http://datamining.typepad.com/data_mining/">
<title>Data Mining: Text Mining, Visualization and Social Media</title>
<link>http://datamining.typepad.com/data_mining/</link>
<description />
<dc:language>en-US</dc:language>
<dc:creator />
<dc:date>2012-01-26T21:37:21-05:00</dc:date>
<admin:generatorAgent rdf:resource="http://www.typepad.com/" />


<items>
<rdf:Seq><rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/bitly-users-vote-with-their-clicks-on-vatican-scandal.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/super-hot-news.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/journalists-tweets-provide-additional-story-context.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/visualizing-the-roi-of-news-articles.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/exploring-news.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/did-web-search-kill-artificial-intelligence.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/farewell-to-blogpulse.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/bing-visualizes-corporate-diversity.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/google-adds-new-satellite-overlay-control.html" />
<rdf:li rdf:resource="http://datamining.typepad.com/data_mining/2012/01/bing-has-the-answer.html" />
</rdf:Seq>
</items>

<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rdf+xml" href="http://feeds.feedburner.com/DataMining" /><feedburner:info uri="datamining" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><geo:lat>40.468968</geo:lat><geo:long>-79.918639</geo:long><feedburner:emailServiceId>DataMining</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><feedburner:browserFriendly>This is an XML content feed. It is intended to be viewed in a newsreader or syndicated to another site, subject to copyright and fair use.</feedburner:browserFriendly></channel>

<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/bitly-users-vote-with-their-clicks-on-vatican-scandal.html">
<title>Bitly Users Vote with their Clicks on Vatican Scandal</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/eaqb9X2Yu34/bitly-users-vote-with-their-clicks-on-vatican-scandal.html</link>
<description>By far the biggest story right now according to clicks on the bit.ly links to articles published by Reuters (as shown on d8taplex) is: This story doesn't make it on to Google News front page or even their page of...</description>
<content:encoded><![CDATA[<p>By far the biggest story right now according to clicks on the bit.ly links to articles published by Reuters (as shown on <a href="http://bit.ly/rfQPgd" target="_self">d8taplex</a>) is:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef01676125b59a970b-pi" style="display: inline;"><img alt="Vatican" class="asset  asset-image at-xid-6a00d8341c994053ef01676125b59a970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef01676125b59a970b-500wi" title="Vatican" /></a><br />This story doesn&#39;t make it onto the Google News front page or even its page of world news. It&#39;s not on the BBC&#39;s front page or in its European news section. This could simply be because Reuters got the news late and everyone has moved on, but I see an article on HuffPo on this (from the same source) posted 3 hours ago.</p>
<p>&#0160;</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=eaqb9X2Yu34:cHfJRzbV8lQ:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=eaqb9X2Yu34:cHfJRzbV8lQ:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=eaqb9X2Yu34:cHfJRzbV8lQ:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=eaqb9X2Yu34:cHfJRzbV8lQ:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/eaqb9X2Yu34" height="1" width="1"/>]]></content:encoded>


<dc:subject>news</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-26T21:37:21-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/bitly-users-vote-with-their-clicks-on-vatican-scandal.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/super-hot-news.html">
<title>Super-hot News</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/Nq8Jfok9rh4/super-hot-news.html</link>
<description>A simple list of articles may tell you that the ones at the top are 'more something' (relevant, popular, etc.) than those lower, but not by how much. I just noticed on the d8taplex news page that this article, entitled...</description>
<content:encoded><![CDATA[<p>A simple list of articles may tell you that the ones at the top are &#39;more something&#39; (relevant, popular, etc.) than those lower, but not by how much.</p>
<p>I just noticed on the <a href="http://bit.ly/rfQPgd" target="_self">d8taplex news page</a> that this article, entitled &#39;<a href="http://bit.ly/yg9C9b" target="_self">Starbucks to sell alcohol in some U.S. cafes</a>&#39;, just hit 11,430 clicks on bit.ly. This is an order of magnitude more bit.ly clicks than most articles get.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef016760fa5c8c970b-pi" style="display: inline;"><img alt="Starbucks" class="asset  asset-image at-xid-6a00d8341c994053ef016760fa5c8c970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef016760fa5c8c970b-500wi" title="Starbucks" /></a><br /><br /></p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=Nq8Jfok9rh4:Zgm5lHRVyMo:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=Nq8Jfok9rh4:Zgm5lHRVyMo:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=Nq8Jfok9rh4:Zgm5lHRVyMo:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=Nq8Jfok9rh4:Zgm5lHRVyMo:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/Nq8Jfok9rh4" height="1" width="1"/>]]></content:encoded>


<dc:subject>news</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-23T21:06:45-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/super-hot-news.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/journalists-tweets-provide-additional-story-context.html">
<title>Journalists' Tweets Provide Additional Story Context</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/69hwJcC1U7E/journalists-tweets-provide-additional-story-context.html</link>
<description>I'm watching the bit.ly counts for the Reuters story on the RIM guard change continue to jump up on the d8taplex news page. Because the d8taplex page links to the Twitter accounts of the journalists when they are available, I took...</description>
<content:encoded><![CDATA[<p>I&#39;m watching the bit.ly counts for the <a href="http://www.reuters.com/article/2012/01/23/us-rim-idUSTRE80M04920120123" target="_self">Reuters story</a> on the <a href="http://online.wsj.com/article_email/SB10001424052970204624204577177184275959856-lMyQjAxMTAyMDIwMjEyNDIyWj.html" target="_self">RIM guard change</a> continue to jump up on the <a href="http://bit.ly/rfQPgd" target="_self">d8taplex news page</a>.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162fffbf03a970d-pi" style="display: inline;"><img alt="Rim" class="asset  asset-image at-xid-6a00d8341c994053ef0162fffbf03a970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162fffbf03a970d-320wi" title="Rim" /></a></p>
<p>Because the d8taplex page links to the Twitter accounts of the journalists when they are available, I took a look at what Alastair Sharp was sharing on Twitter. Alastair provides a couple of tweets on the topic, including a link to a video interviewing the new CEO.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e5f1d7c8970c-pi" style="display: inline;"><img alt="Alastairsharp" class="asset  asset-image at-xid-6a00d8341c994053ef0168e5f1d7c8970c" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e5f1d7c8970c-500wi" title="Alastairsharp" /></a></p>
<p>I&#39;ll figure out a way to integrate recent tweets from the journalists into the interface in the near future.</p>
<p>&#0160;</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=69hwJcC1U7E:KZ2cwbpoJ9g:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=69hwJcC1U7E:KZ2cwbpoJ9g:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=69hwJcC1U7E:KZ2cwbpoJ9g:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=69hwJcC1U7E:KZ2cwbpoJ9g:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/69hwJcC1U7E" height="1" width="1"/>]]></content:encoded>


<dc:subject>news</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-22T23:49:44-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/journalists-tweets-provide-additional-story-context.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/visualizing-the-roi-of-news-articles.html">
<title>Visualizing the ROI of News Articles</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/5yOby-YuIFI/visualizing-the-roi-of-news-articles.html</link>
<description>By filtering on a contributor's name (in the example below, I've filtered on the enigmatic Cynthia Johnston) the d8taplex news page can instantly show to what degree the contributor's articles are getting attention (according to bit.ly stats). Here we see...</description>
<content:encoded><![CDATA[<p>By filtering on a contributor's name (in the example below, I've filtered on <a href="http://datamining.typepad.com/data_mining/2012/01/web-search-its-worse-than-you-think.html" target="_self">the enigmatic Cynthia Johnston</a>) the <a href="http://bit.ly/rfQPgd" target="_self">d8taplex news page</a> can instantly show to what degree the contributor's articles are getting attention (according to bit.ly stats).</p>
<p>Here we see that from the pool of recent articles with CJ's name on them, one has made it to the hot column, while nine are still on probation looking for some clicks.</p>
<p><a style="display: inline;" href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e5ee2a6c970c-pi"><img class="asset  asset-image at-xid-6a00d8341c994053ef0168e5ee2a6c970c" title="Cynthia" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e5ee2a6c970c-500wi" alt="Cynthia" /></a></p>
<p>Over time, we will likely see articles move from the right to the left while new ones populate the unloved column.</p>
<p>By filtering on country names, or other keywords, one can get a sense of where attention is being paid.</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=5yOby-YuIFI:X6S5C7jUNbU:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=5yOby-YuIFI:X6S5C7jUNbU:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=5yOby-YuIFI:X6S5C7jUNbU:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=5yOby-YuIFI:X6S5C7jUNbU:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/5yOby-YuIFI" height="1" width="1"/>]]></content:encoded>


<dc:subject>news</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-22T13:41:01-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/visualizing-the-roi-of-news-articles.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/exploring-news.html">
<title>Exploring News</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/DCQeb75Eo6c/exploring-news.html</link>
<description>In experimenting with news aggregation and mining on the d8taplex site, I've come up with the following questions: Why are some news articles picked up and others not? News sources such as Reuters create articles that are either directly consumed...</description>
<content:encoded><![CDATA[<p>In experimenting with news aggregation and mining on the <a href="http://d8taplex.com/hapaxPage.html" target="_self">d8taplex</a> site, I&#39;ve come up with the following questions:</p>
<ol>
<li>Why are some news articles picked up and others not? News sources such as Reuters create articles that are either directly consumed or which are picked up by other publications and passed along.</li>
<li>Who are these people writing these articles? What are their interests, areas of expertise and personalities?</li>
<li>What is the role of the editor and how do they influence the selection and form of the content produced by the news machine?</li>
</ol>
<p>The next round of experimentation with news aggregation has resulted in the current new site. It has the following features.</p>
<p>Firstly, it presents two lists of articles. On the left are those articles which are currently getting a reasonable amount of buzz (as determined by bit.ly clicks). The second column, on the right, presents those articles which are not yet receiving much attention. As you read the left column, you will probably realise that you are already familiar with these stories.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ffeb0d5a970d-pi" style="display: inline;"><img alt="Hapax2" class="asset  asset-image at-xid-6a00d8341c994053ef0162ffeb0d5a970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ffeb0d5a970d-500wi" title="Hapax2" /></a></p>
<p>The amount of bit.ly juice is indicated by the number prefixed by &#39;B:&#39;. The article block consists of the title, the time stamp of the article, the link and the list of contributors.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ffeb111f970d-pi" style="display: inline;"><img alt="Article" class="asset  asset-image at-xid-6a00d8341c994053ef0162ffeb111f970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ffeb111f970d-500wi" title="Article" /></a><br />Secondly, dynamic filtering lets one quickly see which unloved articles relate to stories that are already getting attention. For example, in the screenshot below we can see that one article about Gingrich is getting buzz while four others that mention him are languishing on the right.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ffeb0bd8970d-pi" style="display: inline;"><img alt="Gingrich" class="asset  asset-image at-xid-6a00d8341c994053ef0162ffeb0bd8970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ffeb0bd8970d-500wi" title="Gingrich" /></a><br />Thirdly, it connects the user with the personality of the author and editor by providing, where possible, various facets of information such as job title, Twitter account, email address, phone number, role, languages spoken, etc. Some of these are gleaned from the Reuters site while others are mined from various social network sites and so on.</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e5e0f05a970c-pi" style="display: inline;"><img alt="Profile" class="asset  asset-image at-xid-6a00d8341c994053ef0168e5e0f05a970c" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e5e0f05a970c-320wi" title="Profile" /></a></p>
<p>Please <a href="http://d8taplex.com/hapaxPage.html" target="_self">take a look at the site</a> and let me know what you think. There are many things that I&#39;d like to do with it, such as surfacing social metrics for the authors, predicting the potential upside for an article based on the historical bit.ly scores of the authors, etc.</p>
<p>If you click through to an article from the site, you will be contributing to its bit.ly score!</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=DCQeb75Eo6c:al2T5g7ccp4:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=DCQeb75Eo6c:al2T5g7ccp4:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=DCQeb75Eo6c:al2T5g7ccp4:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=DCQeb75Eo6c:al2T5g7ccp4:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/DCQeb75Eo6c" height="1" width="1"/>]]></content:encoded>


<dc:subject>news</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-20T19:15:14-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/exploring-news.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/did-web-search-kill-artificial-intelligence.html">
<title>Did Web Search kill Artificial Intelligence?</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/aL9IMRnRJfU/did-web-search-kill-artificial-intelligence.html</link>
<description>The most commonly referenced definition of Artificial Intelligence (AI) is probably the Turing Test, which avoids tricky questions like 'what is intelligence' or 'what does it mean to think' and replaces them simply with the test of recognizing an...</description>
<content:encoded><![CDATA[<p>The most commonly referenced definition of Artificial Intelligence (AI) is probably the <a href="http://en.wikipedia.org/wiki/Turing_test" target="_self">Turing Test</a>, which avoids tricky questions like &#39;what is intelligence&#39; or &#39;what does it mean to think&#39; and replaces them simply with the test of recognizing an anonymous agent as a fellow human (and, therefore, intelligent).</p>
<p>A key aspect of this is, of course, that the interrogator is communicating with an equal in terms of the mode, pace and structure of dialogue (it is possible that a computer could succeed at this deception but reveal itself by being implausibly smarter than a human, but that is a question we can enjoy a little later).</p>
<p>As we have since learned, creating AIs is extremely hard and requires a very large portion of the smart people coming out of a broad spectrum of disciplines including software engineering, robotics, psychology, cognitive science and linguistics.</p>
<p>Another enabler is industrial context, which brings the pragmatics of real-world problems along with scale and funding that are often not available to academics and that, when they are available, do not benefit from the stark focus required to make progress that industry provides.</p>
<p>However, we currently have the following:</p>
<ul>
<li>Search engines that don&#39;t understand language and which attempt to mediate between people (searches by people and documents by people),</li>
<li>The best and the brightest coming to work for document-oriented web companies.</li>
</ul>
<p>I can&#39;t help but wonder where the AI project would be today if web search (as it is currently envisioned) hadn&#39;t gobbled up so much bandwidth.</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=aL9IMRnRJfU:EIu5d0sM0dM:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=aL9IMRnRJfU:EIu5d0sM0dM:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=aL9IMRnRJfU:EIu5d0sM0dM:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=aL9IMRnRJfU:EIu5d0sM0dM:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/aL9IMRnRJfU" height="1" width="1"/>]]></content:encoded>


<dc:subject>AI</dc:subject>
<dc:subject>search</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-15T13:42:53-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/did-web-search-kill-artificial-intelligence.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/farewell-to-blogpulse.html">
<title>Farewell To BlogPulse</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/8q4CMCxpB6A/farewell-to-blogpulse.html</link>
<description>Today, according to the announcement on the BlogPulse homepage, is the day that Nielsen will shutter the longstanding blog analysis and search site. BlogPulse was originally envisioned by Natalie Glance, myself and other colleagues at Intelliseek as a way to...</description>
<content:encoded><![CDATA[<p>Today, according to the announcement on the BlogPulse homepage, is the day that Nielsen will shutter the longstanding blog analysis and search site.</p>
<p>BlogPulse was originally envisioned by Natalie Glance, myself and other colleagues at Intelliseek as a way to track popular blog posts, people and phrases in the blogosphere. It launched with a sparse set of features in 2003 with the following design:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e578b9a4970c-pi" style="display: inline;"><img alt="Blogpulse1" class="asset  asset-image at-xid-6a00d8341c994053ef0168e578b9a4970c" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e578b9a4970c-500wi" title="Blogpulse1" /></a><br />including key phrases which, at that time, looked like this:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ff831c85970d-pi" style="display: inline;"><img alt="Blogpulse2" class="asset  asset-image at-xid-6a00d8341c994053ef0162ff831c85970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ff831c85970d-320wi" title="Blogpulse2" /></a></p>
<p>The site evolved to become a full blog search engine (I was actually against this at the time, preferring to focus on analytics) and launched what was my favourite feature in 2004: trend search. By that time the site was looking far more mature:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef01676077f935970b-pi" style="display: inline;"><img alt="Blogpulse3" class="asset  asset-image at-xid-6a00d8341c994053ef01676077f935970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef01676077f935970b-500wi" title="Blogpulse3" /></a><br /><br /></p>
<p>In that year, BlogPulse was named a &#39;trendsetting product&#39; by KMWorld.</p>
<p>On top of the evolving set of capabilities, we started to publish longer analytical articles including this analysis of the Swift Boat story:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e578d222970c-pi" style="display: inline;"><img alt="Blogpulse4" class="asset  asset-image at-xid-6a00d8341c994053ef0168e578d222970c" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e578d222970c-500wi" title="Blogpulse4" /></a></p>
<p>And these views of the 2005 Tsunami:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef01676078141d970b-pi" style="display: inline;"><img alt="Blogpulse5" class="asset  asset-image at-xid-6a00d8341c994053ef01676078141d970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef01676078141d970b-500wi" title="Blogpulse5" /></a><br /> <a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e578d5e0970c-pi" style="display: inline;"><img alt="Blogpulse6" class="asset  asset-image at-xid-6a00d8341c994053ef0168e578d5e0970c" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e578d5e0970c-500wi" title="Blogpulse6" /></a><br />While (I assume) these assets will no longer be available from their adoptive source, they are archived at the wonderful <a href="http://wayback.archive.org/web/*/http://blogpulse.com" target="_self">Web Archive</a>, so if you are feeling nostalgic for a simpler, slightly less caffeinated social media world, head on over and take a look.</p>
<p>One final note: I&#39;ve always felt that social media analysis, just as it gets comfortable with a particular form of content, gets distracted by a new, shiny channel, abandons the legacy data and starts furiously reinventing its wheels. This certainly happened with the blogosphere when Twitter and Facebook showed up and consumers started to believe that &#39;real time&#39; was something the blogosphere couldn&#39;t provide. From where I&#39;m standing, blogs are alive!</p>
<p>If you have any fond memories of BlogPulse, please drop them in the comments to this blog. If you have generated some interesting trend graphs, send me a link and I&#39;ll summarize them in an update to this post.</p>
<p>For the record, the number of discovered blogs reported by BlogPulse right now is&#0160;182,397,015.</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=8q4CMCxpB6A:RUaPsriBouw:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=8q4CMCxpB6A:RUaPsriBouw:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=8q4CMCxpB6A:RUaPsriBouw:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=8q4CMCxpB6A:RUaPsriBouw:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/8q4CMCxpB6A" height="1" width="1"/>]]></content:encoded>


<dc:subject>blogpulse</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-13T12:03:05-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/farewell-to-blogpulse.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/bing-visualizes-corporate-diversity.html">
<title>Bing Visualizes Corporate Diversity</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/mcaktorMIMA/bing-visualizes-corporate-diversity.html</link>
<description>I've just noticed this feature on Bing's finance pages which summarizes visually the diversity of a company's assets in terms of the stock price. Microsoft: Google I can imagine a mashup of this data using eggs and one or more...</description>
<content:encoded><![CDATA[<p>I&#39;ve just noticed this feature on <a href="http://finance.bing.com" target="_self">Bing&#39;s finance</a> pages which summarizes visually the diversity of a company&#39;s assets in terms of the stock price.</p>
<p>Microsoft:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e53fda51970c-pi" style="display: inline;"><img alt="Bingstock1" class="asset  asset-image at-xid-6a00d8341c994053ef0168e53fda51970c" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0168e53fda51970c-500wi" title="Bingstock1" /></a></p>
<p>Google:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ff4a1ce2970d-pi" style="display: inline;"><img alt="Bingstock2" class="asset  asset-image at-xid-6a00d8341c994053ef0162ff4a1ce2970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ff4a1ce2970d-500wi" title="Bingstock2" /></a></p>
<p>I can imagine a mashup of this data using eggs and one or more baskets.</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=mcaktorMIMA:ze5ftkOTX2A:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=mcaktorMIMA:ze5ftkOTX2A:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=mcaktorMIMA:ze5ftkOTX2A:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=mcaktorMIMA:ze5ftkOTX2A:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/mcaktorMIMA" height="1" width="1"/>]]></content:encoded>


<dc:subject>bing</dc:subject>
<dc:subject>search</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-09T12:23:08-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/bing-visualizes-corporate-diversity.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/google-adds-new-satellite-overlay-control.html">
<title>Google adds new Satellite Overlay Control</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/Vc6kdiPjMKk/google-adds-new-satellite-overlay-control.html</link>
<description>I've just noticed a new feature on Google Maps. To help with discoverability and understanding of the satellite imagery (as opposed to the basic map) they have introduced an overlay window in the top right-hand corner which exposes the...</description>
<content:encoded><![CDATA[<p>I&#39;ve just noticed a new feature on Google Maps. To help with discoverability and understanding of the satellite imagery (as opposed to the basic map) they have introduced an overlay window in the top right-hand corner which exposes the satellite image for that part of the map:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0167601d01f8970b-pi" style="display: inline;"><img alt="Satellite" class="asset  asset-image at-xid-6a00d8341c994053ef0167601d01f8970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0167601d01f8970b-320wi" title="Satellite" /></a></p>
<p>As you pan and scroll the map, the image changes to track these movements.</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=Vc6kdiPjMKk:IsdaxHP1xeg:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=Vc6kdiPjMKk:IsdaxHP1xeg:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=Vc6kdiPjMKk:IsdaxHP1xeg:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=Vc6kdiPjMKk:IsdaxHP1xeg:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/Vc6kdiPjMKk" height="1" width="1"/>]]></content:encoded>


<dc:subject>GIS</dc:subject>
<dc:subject>google</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-07T02:35:24-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/google-adds-new-satellite-overlay-control.html</feedburner:origLink></item>
<item rdf:about="http://datamining.typepad.com/data_mining/2012/01/bing-has-the-answer.html">
<title>Bing Has The Answer</title>
<link>http://feedproxy.google.com/~r/DataMining/~3/tt65N3-37_M/bing-has-the-answer.html</link>
<description>Initially, web search engines were restricted to returning results which were simple pointers to web pages. Modern search engines often include an 'answer' on the first page. This might be a block of information about movies (if you searched for...</description>
<content:encoded><![CDATA[<p>Initially, web search engines were restricted to returning results which were simple pointers to web pages. Modern search engines often include an &#39;answer&#39; on the first page. This might be a block of information about movies (if you searched for movies) or a small map with some restaurant information.</p>
<p>Bing and Google both present this type of result. If you are a skier, you&#39;ll know that before heading out you want to know the condition of the slopes. Searching on Bing for &#39;snoqualmie&#39; or &#39;whistler&#39; tells you just that:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0167601cd4ff970b-pi" style="display: inline;"><img alt="Answer1" class="asset  asset-image at-xid-6a00d8341c994053ef0167601cd4ff970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0167601cd4ff970b-500wi" title="Answer1" /></a></p>
<p>Google doesn&#39;t have any answer for skiers as far as I can tell. What I did note, though, was that if certain snow report websites surface in the results, summary data is extracted:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0167601ce6ae970b-pi" style="display: inline;"><img alt="Googlesnow" class="asset  asset-image at-xid-6a00d8341c994053ef0167601ce6ae970b" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0167601ce6ae970b-500wi" title="Googlesnow" /></a><br />This is a weaker approach, however, as it relies on the site having the right content and ranking, as opposed to Bing&#39;s strong analysis of intention and curation of appropriate answer data.</p>
<p>Bing, with its rich image and video based home page, gets to show its personality, and these answers shouldn&#39;t surprise anyone who&#39;s seen today&#39;s video:</p>
<p><a href="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ff280035970d-pi" style="display: inline;"><img alt="Bingsnow" class="asset  asset-image at-xid-6a00d8341c994053ef0162ff280035970d" src="http://datamining.typepad.com/.a/6a00d8341c994053ef0162ff280035970d-500wi" title="Bingsnow" /></a></p>
<p>BTW, there&#39;s a line at the end of Gosford Park spoken by the head of the staff which goes something like &#39;I&#39;m the perfect servant - I know what they want before they know it themselves.&#39;</p><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/DataMining?a=tt65N3-37_M:PrAPIF33o9I:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=tt65N3-37_M:PrAPIF33o9I:7Q72WNTAKBA"><img src="http://feeds.feedburner.com/~ff/DataMining?d=7Q72WNTAKBA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=tt65N3-37_M:PrAPIF33o9I:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/DataMining?a=tt65N3-37_M:PrAPIF33o9I:I9og5sOYxJI"><img src="http://feeds.feedburner.com/~ff/DataMining?d=I9og5sOYxJI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/DataMining/~4/tt65N3-37_M" height="1" width="1"/>]]></content:encoded>


<dc:subject>search</dc:subject>

<dc:creator>Matthew Hurst</dc:creator>
<dc:date>2012-01-07T02:28:20-05:00</dc:date>
<feedburner:origLink>http://datamining.typepad.com/data_mining/2012/01/bing-has-the-answer.html</feedburner:origLink></item>


</rdf:RDF><!-- ph=1 -->

