<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;D0EDQXY9cCp7ImA9WhVVFU8.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685</id><updated>2012-05-08T18:07:50.868-07:00</updated><category term="customers" /><category term="machine learning" /><category term="careers" /><category term="predictive analytics" /><category term="lifetime customer value" /><category term="causality" /><category term="cross channel" /><category term="CMO" /><category term="engineering" /><category term="ceo" /><category term="big data" /><title>Blog – Causata</title><subtitle type="html" /><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://blog.causata.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://blog.causata.com/" /><link rel="next" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default?start-index=26&amp;max-results=25&amp;redirect=false&amp;v=2" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>29</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/causata-blog" /><feedburner:info uri="causata-blog" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;CUIGQns7cSp7ImA9WhVVEEk.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-6951168790325941144</id><published>2012-05-02T08:37:00.003-07:00</published><updated>2012-05-03T04:12:03.509-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-05-03T04:12:03.509-07:00</app:edited><title>Causata at London Big Data week</title><content type="html">Last week was Big Data Week in London, with a packed week of excellent meetups on Big Data, Data Science, Data Visualisation, and the opportunities afforded by Big Data. We at Causata made the most of the week, attending lots of events and meeting some interesting people in the thriving London tech community.&lt;br /&gt;
&lt;br /&gt;
I spoke at the Data Science meetup and talked about what it takes to determine cause and effect from data, and how I believe that the key is to perform experiments. One of the great strengths of Causata's real-time decisioning is that it allows statistically rigorous, controlled experimentation, allowing us to move beyond correlation to understand causation. Here are the slides I showed.&lt;br /&gt;
—Jason&lt;br /&gt;
&lt;div id="__ss_12716335" style="border: 1px solid #cccccc; height: 497px; overflow: hidden; width: 595px;"&gt;
&lt;iframe frameborder="0" height="497" marginheight="0" marginwidth="0" scrolling="no" src="http://www.slideshare.net/slideshow/embed_code/12716335" style="margin: 0;" width="595"&gt;&lt;/iframe&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-6951168790325941144?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/rzlF-EsR5vA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/6951168790325941144/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2012/05/causata-at-london-big-data-week.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6951168790325941144?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6951168790325941144?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/rzlF-EsR5vA/causata-at-london-big-data-week.html" title="Causata at London Big Data week" /><author><name>Jason</name><uri>http://www.blogger.com/profile/08121121124929099130</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2012/05/causata-at-london-big-data-week.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0cGQX8zeip7ImA9WhVWFUo.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-7834746875178016430</id><published>2012-04-17T22:46:00.000-07:00</published><updated>2012-04-27T16:57:00.182-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-27T16:57:00.182-07:00</app:edited><title>The importance of getting data into production quickly</title><content type="html">&lt;a href="http://flowingdata.com"&gt;Flowing Data&lt;/a&gt; is a great website for data visualizations. I love the one showing &lt;a href="http://projects.flowingdata.com/walmart/"&gt;the sequence of Walmart store openings over time&lt;/a&gt; or the perfect choice of &lt;a href="http://flowingdata.com/2011/08/19/bikes-of-san-francisco/"&gt;bike for each San Francisco neighborhood&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
However, as someone who not only enjoys looking at data but is also focused on actually doing something with it, I felt today's &lt;a href="http://flowingdata.com/2012/04/17/why-1m-netflix-algorithm-never-went-to-production/"&gt;article&lt;/a&gt; really hit home.  &lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-fYbwRe5aOc4/T45SQemEdlI/AAAAAAAAACo/kyHdEtkWi-c/s1600/netflix.jpeg" imageanchor="1" style="clear:left; float:left;margin-right:1em; margin-bottom:1em"&gt;&lt;img border="0" height="120" width="160" src="http://2.bp.blogspot.com/-fYbwRe5aOc4/T45SQemEdlI/AAAAAAAAACo/kyHdEtkWi-c/s320/netflix.jpeg" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
For a few years the Netflix prize has brought the top data scientists on the planet into an annual open competition to set a new bar on best movie predictions. It's the modern day equivalent of the &lt;a href="http://blog.causata.com/2009/10/benchmarking-kdd-and-netflix.html"&gt;KDD Cup&lt;/a&gt;, which was where Causata's COO Paul Phillips cut his teeth. The article highlights that the algorithm from the latest winning team was deemed not practical enough to put into production. I'm actually not that surprised -- though to be fair I doubt the competition had any requirements for that.&lt;br /&gt;
&lt;br /&gt;
Nevertheless it does underscore a real gap I see in how people look at big data and making predictions with machine learning, especially when it comes to marketing or customer interactions.  It's just not appreciated enough that the whole point of going through the effort to build a statistical model is to extract business value from it and that doing so quickly is essential.  When I hear SAS analysts talk about how long it typically takes to put a model into production I'm always amazed.  The months of recoding SAS code into SQL, the many compromises along the way, and the heavy validation effort in a database or data warehouse raise questions about whether it's the right approach.&lt;br /&gt;
&lt;br /&gt;
This is a topic where I'm 100% sure we're way ahead at Causata.  If you've built a statistical model in a tool like SAS or R it can be imported into Causata in seconds and any individual can then be scored in the next instant. Causata's &lt;b&gt;real-time scoring&lt;/b&gt; also incorporates the latest data, including customer interactions in the last second. So if a new website visitor arrives via a high-value search term, a score will scream that they are a hot prospect. There are dozens of important use cases related to this capability. It's a huge deal. The philosophy for us is a bit like the agile software development process applied to data.  You don't want a long period of time to pass before releasing a statistical model into production. Release early, release often. &lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-q7UHdBZowVM/T45TsAtmvaI/AAAAAAAAAC0/LNi6zk-oOg0/s1600/porsche%2Bproduction%2Bline.jpeg" imageanchor="1" style="margin-left:1em; margin-right:1em"&gt;&lt;img border="0" height="183" width="275" src="http://3.bp.blogspot.com/-q7UHdBZowVM/T45TsAtmvaI/AAAAAAAAAC0/LNi6zk-oOg0/s320/porsche%2Bproduction%2Bline.jpeg" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Check out the full &lt;a href="http://techblog.netflix.com/2012/04/netflix-recommendations-beyond-5-stars.html"&gt;Netflix blog article&lt;/a&gt; that the Flowing Data piece refers to. The folks at Netflix explain how their business is evolving and this has changed the nature of the predictions and personalization they need to perform. This is just yet another reason why you need to get the data into production rapidly.&lt;br /&gt;
&lt;br /&gt;
My prediction... in the next 12 months this agility topic is going to be top of mind for a lot more people. We see this growing awareness among the top marketing and analytics customers we're working with.  How top of mind is it for you?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-7834746875178016430?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/TxXDz8-ACFs" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/7834746875178016430/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2012/04/importance-of-getting-data-into.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/7834746875178016430?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/7834746875178016430?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/TxXDz8-ACFs/importance-of-getting-data-into.html" title="The importance of getting data into production quickly" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-fYbwRe5aOc4/T45SQemEdlI/AAAAAAAAACo/kyHdEtkWi-c/s72-c/netflix.jpeg" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2012/04/importance-of-getting-data-into.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0IBRns6fSp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-4495249530230321091</id><published>2012-03-04T23:41:00.001-08:00</published><updated>2012-04-17T20:59:17.515-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:59:17.515-07:00</app:edited><title>Keystone partnership and eMetrics SF</title><content type="html">We announced a new partnership this past week with Keystone Solutions. They've got a great team with a similar digital marketing heritage to us. We're really excited about working more closely together. The full press release is &lt;a href="http://www.businesswire.com/news/home/20120301006708/en/Keystone-Solutions-Selects-Causata-Premiere-Partner-deliver"&gt;here&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both;"&gt;
&lt;a href="http://3.bp.blogspot.com/-t9GZETXLyWg/T1Rrd5d9qpI/AAAAAAAAACE/LAEJS6qLz6c/s1600/Logo_KeystoneSolutions_1081x246.png" imageanchor="1"&gt;&lt;img border="0" height="73" src="http://3.bp.blogspot.com/-t9GZETXLyWg/T1Rrd5d9qpI/AAAAAAAAACE/LAEJS6qLz6c/s320/Logo_KeystoneSolutions_1081x246.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Stay tuned for many more partnership announcements to come.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Many of the Causata team will be at eMetrics this week in San Francisco. Paul Phillips will be speaking on Tuesday morning. Don't miss it!&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; margin-left: -20px;"&gt;
&lt;a href="http://2.bp.blogspot.com/-hlsUPQnYasY/T1Rt4a0ie5I/AAAAAAAAACc/ApDN6l31UmY/s1600/eMetrics%2Blogo.png" imageanchor="1"&gt;&lt;img border="0" height="127" src="http://2.bp.blogspot.com/-hlsUPQnYasY/T1Rt4a0ie5I/AAAAAAAAACc/ApDN6l31UmY/s320/eMetrics%2Blogo.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-4495249530230321091?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/8_uoXz4idXc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/4495249530230321091/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2012/03/keystone-partnership-and-emetrics-sf.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4495249530230321091?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4495249530230321091?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/8_uoXz4idXc/keystone-partnership-and-emetrics-sf.html" title="Keystone partnership and eMetrics SF" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-t9GZETXLyWg/T1Rrd5d9qpI/AAAAAAAAACE/LAEJS6qLz6c/s72-c/Logo_KeystoneSolutions_1081x246.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2012/03/keystone-partnership-and-emetrics-sf.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkMCRnkyeyp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-4018721916257084327</id><published>2011-10-18T12:34:00.000-07:00</published><updated>2012-04-17T20:41:07.793-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:41:07.793-07:00</app:edited><title>Banking on data</title><content type="html">Causata’s CEO Paul Phillips (left) moderated a panel at the &lt;a href="http://www.bai.org/retaildelivery/index.aspx"&gt;BAI retail banking conference&lt;/a&gt; in Chicago last week.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;img border="0" height="180" src="http://2.bp.blogspot.com/-5nRUjR0MFjc/Tp3SFbGBX_I/AAAAAAAAABY/N1xBM2Es3ps/s320/BAI%2Bpanelists.png" width="320" /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;img border="0" height="30" src="http://3.bp.blogspot.com/-O8tOEu9Z3SM/Tp3ScfySNRI/AAAAAAAAABk/a_1qq_9Uu9M/s320/BAI%2Bpanel%2Blogos.png" width="320" /&gt;&lt;/div&gt;
&lt;br /&gt;
Paul was joined (left to right) by:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;li&gt; Michael Wexler, Director of Digital Insights and Marketing Effectiveness at Citibank&lt;br /&gt;
&lt;/li&gt;
&lt;li&gt; Mike Olson, CEO at Cloudera &lt;br /&gt;
&lt;/li&gt;
&lt;li&gt; and Andrew Rosen, CMO at Bank of the West&lt;br /&gt;
&lt;br /&gt;
The panel topic was “Winning and Losing with Customer Data in an Accelerating Digital World”.  Michael and Andrew shared some illuminating stories about how important they see customer data and just how big an opportunity there is to impact their business and the customer experience. Paul and Mike gave their Silicon Valley thought leader take on where the technology is today and how companies should best take advantage of it. There was a clear consensus that the companies that embrace data will be the ones that win.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;img border="0" height="74" src="http://3.bp.blogspot.com/-DCV1SimrhVI/Tp3SijGBUII/AAAAAAAAABw/_GlZ4g0Rkvg/s320/BAI%2Bimage.png" width="250" /&gt;&lt;/div&gt;
&lt;/li&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-4018721916257084327?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/U36HO22Ch1U" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/4018721916257084327/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/10/banking-on-data.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4018721916257084327?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4018721916257084327?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/U36HO22Ch1U/banking-on-data.html" title="Banking on data" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-5nRUjR0MFjc/Tp3SFbGBX_I/AAAAAAAAABY/N1xBM2Es3ps/s72-c/BAI%2Bpanelists.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/10/banking-on-data.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEINRHY7eip7ImA9WhdbEkg.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-6739190156974552505</id><published>2011-10-10T06:11:00.000-07:00</published><updated>2011-10-10T06:16:35.802-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-10T06:16:35.802-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="predictive analytics" /><category scheme="http://www.blogger.com/atom/ns#" term="big data" /><title>Determining the value of your data</title><content type="html">John Lovett has a &lt;a href="http://www.clickz.com/clickz/column/2114856/dilemma"&gt;thoughtful post&lt;/a&gt; at Clickz.com summarizing many of the struggles enterprises are having with Big Data.  One problem is enterprises often store as much data as possible, for as long as possible, fearing they’ll be discarding valuable information if they don’t.   Enterprises struggle to understand which data-points are valuable and which ones aren’t.  Lovett writes:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style:italic;"&gt;“Understanding what data matters to your business requires empathizing with business stakeholders, examining marketing programs, and getting to the mission-critical values of the organization. In my experience, I've found that simply asking business stakeholders what metrics or KPIs are most important to them is a futile endeavor.”&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;At Causata, we believe that the relative value of data should be determined by &lt;span style="font-style:italic;"&gt;how it can be used to drive your business forward and help you achieve your goals.&lt;/span&gt;  We begin by organizing all of the data around a customer.  You should be able to see every interaction a customer has had with your company.  Those interactions should be stored in time order, so you can see how events in the past, predict future behavior. &lt;br /&gt;&lt;br /&gt;The next step is to understand your business goals.  Most often these are centered on increasing profit either by selling more products and services or increasing customer satisfaction.  Take reducing attrition.  You’ll need to understand the behaviors that lead customers to terminate their relationship with your company.  You may look at things like:  How many products did a customer browse online in the past week?  How many times has she called to register a complaint over the past month?  When was the last time she visited a store location? Leverage as much of your data as possible, especially web and mobile data rich with customer intent.  Now conduct an analysis to see how each of these behaviors is correlated with customers who end the relationship with your firm.  In the process you’ve transformed your data into customer intelligence.  This intelligence can be used to power campaigns to reach out to those customers most likely to attrite.&lt;br /&gt;&lt;br /&gt;This is a tremendously valuable view of the data because it gives you the power to anticipate dissatisfied customers and reach out to them &lt;span style="font-style:italic;"&gt;before&lt;/span&gt; they leave.  In doing so, you’ll be able to significantly increase your business performance.  &lt;br /&gt;&lt;br /&gt;This can be extended to any business goal.   In the end you’re left with thousands of data-points that express customer intent.  This, in turn, can guide an assessment of the relative value of various data-points.  More importantly, it can be used to help you really understand how customer behavior relates to your business performance.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-6739190156974552505?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/G8sPxFrkNng" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/6739190156974552505/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/10/determining-value-of-your-data.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6739190156974552505?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6739190156974552505?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/G8sPxFrkNng/determining-value-of-your-data.html" title="Determining the value of your data" /><author><name>Brian Ivanovick</name><uri>http://www.blogger.com/profile/12004275696761180983</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/10/determining-value-of-your-data.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkICQnc5eyp7ImA9WhdQEU0.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-675988450315085344</id><published>2011-08-11T17:18:00.000-07:00</published><updated>2011-08-11T17:22:43.923-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-08-11T17:22:43.923-07:00</app:edited><title>A look at Causata</title><content type="html">There's a good &lt;a href="http://jtonedm.com/2011/08/11/first-look-%E2%80%93-causata/"&gt;post about Causata&lt;/a&gt; by James Taylor on his blog Everything Decision Management.&lt;br /&gt;
&lt;br /&gt;
Gareth&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-675988450315085344?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/4HHGfcw24GE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/675988450315085344/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/08/look-at-causata.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/675988450315085344?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/675988450315085344?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/4HHGfcw24GE/look-at-causata.html" title="A look at Causata" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/08/look-at-causata.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUANQnY5eCp7ImA9WhZbFUQ.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-8764327149429035118</id><published>2011-06-20T11:06:00.000-07:00</published><updated>2011-06-20T11:16:33.820-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-06-20T11:16:33.820-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="CMO" /><category scheme="http://www.blogger.com/atom/ns#" term="causality" /><category scheme="http://www.blogger.com/atom/ns#" term="big data" /><title>Management doesn’t value marketing</title><content type="html">&lt;div&gt;According to a study cited &lt;a href="http://www.marketingweek.co.uk/sectors/industry/73-of-ceos-say-marketers-lack-credibility/3027423.article"&gt;in this article&lt;/a&gt; over at Marketing Week, &lt;b&gt;73% of CEOs believe that marketers “lack business credibility because they fail to quantify the success of their campaigns”&lt;/b&gt;.   It goes on further to state that: “marketers focus too much on the ‘arty and fluffy’ side of marketing and not enough on its business science”.&lt;br /&gt;&lt;br /&gt;Putting a hard number on the value of a marketing campaign is an ongoing struggle for many marketers.  It requires a complete view of the customer, something that marketers may not have access to.  Imagine a marketing campaign for a bank that leverages banner advertising, call center and direct mail with the goal of increasing product holdings among existing customers.  Management rightfully wants to understand the business value created through this campaign.&lt;br /&gt;&lt;br /&gt;To begin to answer this question, you’ll need a complete view of the customer, both before and after the after the life of the campaign.  This complete view includes all customer interactions along with all other customer attributes.  What banners was the customer exposed to?  How many did she click on?  Was she targeted for with direct mail?  How did she respond?  What are her total account holdings before and after the campaign?  And so on.&lt;br /&gt;&lt;br /&gt;Without this view it’s almost impossible to understand the cause of changes in product holdings because you’ll only be analyzing part of the story.  This leads to approximations that can be divorced from business reality.  Marketers are often forced into approximations because some number is considered better than no number at all.  But this can lead to a big problem:  When marketers report numbers that fail to align with financial statements, distrust of marketing grows.&lt;br /&gt;&lt;br /&gt;Developing a complete view of the customer is a necessary condition to truly understand the success or failure of marketing campaigns.  Marketers are going to continue to have trouble understanding the value they’re creating without it.  And without understanding the true value they’re creating, business credibility may continue to elude them.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-8764327149429035118?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/SZdX-AHHVUc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/8764327149429035118/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/06/management-doesnt-value-marketing.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/8764327149429035118?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/8764327149429035118?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/SZdX-AHHVUc/management-doesnt-value-marketing.html" title="Management doesn’t value marketing" /><author><name>Brian Ivanovick</name><uri>http://www.blogger.com/profile/12004275696761180983</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/06/management-doesnt-value-marketing.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUYNQX8zeSp7ImA9WhZVEUo.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-3640230612197824557</id><published>2011-05-23T10:13:00.000-07:00</published><updated>2011-05-23T10:53:10.181-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-05-23T10:53:10.181-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="CMO" /><category scheme="http://www.blogger.com/atom/ns#" term="cross channel" /><category scheme="http://www.blogger.com/atom/ns#" term="big data" /><category scheme="http://www.blogger.com/atom/ns#" term="lifetime customer value" /><title>A CMO's Most Important Task</title><content type="html">The CMO Council’s report &lt;b&gt;&lt;i&gt;Renovate to Innovate: Building Performance-Driven Marketing Organizations (1)&lt;/i&gt;&lt;/b&gt; suggests that the most important task for CMOs is to drive their organizations to execute strategic, cross-functional campaigns:&lt;div&gt;&lt;blockquote&gt;&lt;/blockquote&gt;&lt;blockquote&gt;“Most importantly, chief marketing executives have to revitalize marketing group cultures and mindsets. Narrowly focused, risk-averse managers in isolated silos of tactical execution (research, PR, advertising, events, marketing services and Web) must be integrated into cohesive, cross-functional campaign teams.”&lt;/blockquote&gt;Prior to integration of this sort, organizations must first understand the lifetime value of each customer regardless of how the customer chooses to interact with the organization. Only then can cross-functional teams can be organized around a common goal: &lt;i&gt;maximizing lifetime customer value&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;In the past I worked with a major retailer who failed to understand the drivers of lifetime customer value. A tension quickly emerged between the brand team, who viewed value creation on a long-term horizon, and the e-commerce team, who wanted to maximize in-session purchases. The question became: What metric should we optimize for?&lt;br /&gt;&lt;br /&gt;Without an understanding of lifetime customer value, there is a significant risk in failure of cross-functional team alignment as each team clings to the metrics its used to. More importantly, there’s a real risk of creating teams that optimize the wrong metrics.&lt;br /&gt;&lt;br /&gt;Understanding lifetime customer value requires organizations to:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;&lt;b&gt;Connect the data&lt;/b&gt;: Connect data from different channels organized around the customer.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Understand the data&lt;/b&gt;: Understand the key drivers of lifetime customer value. It’s just as crucial to understand how this changes over time.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Act on the data&lt;/b&gt;: Leverage all the data in any channel, ideally in real-time.&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;A common understanding of lifetime customer value will provide a solid foundation for these cross-functional marketing teams. Enterprises risk misalignment and suboptimal performance without it.&lt;br /&gt;--&lt;br /&gt;1. http://www.cmocouncil.org/resources/forms/download-report/index.php?id=204&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-3640230612197824557?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/5A0inKRS9JU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/3640230612197824557/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/05/cmos-most-important-task.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3640230612197824557?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3640230612197824557?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/5A0inKRS9JU/cmos-most-important-task.html" title="A CMO's Most Important Task" /><author><name>Brian Ivanovick</name><uri>http://www.blogger.com/profile/12004275696761180983</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/05/cmos-most-important-task.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkUFR3k4eSp7ImA9WhZXGU8.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-796976301456379618</id><published>2011-05-09T01:03:00.000-07:00</published><updated>2011-05-09T01:03:36.731-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-05-09T01:03:36.731-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="ceo" /><title>Podcast: "Real-time customer intelligence with Paul Phillips"</title><content type="html">Causata CEO Paul Phillips was recently interviewed for &lt;a href="http://www.beyondwebanalytics.com/"&gt;Beyond Web Analytics&lt;/a&gt; by Adam Greco and Rudi Shumpert:&lt;br /&gt;
&lt;blockquote&gt;Join Adam and Rudi as they talk with Paul Phillips of Causata and real-time customer intelligence.   Paul provides the team with insight on what exactly Causata is and the business problems that they are helping their clients solve. The conversation explores the real-time customer intelligence, pulling multi-channel data into a single source, and dives into what this means for companies of all sizes.&lt;br /&gt;
&lt;/blockquote&gt;The full podcast is &lt;a href="http://www.beyondwebanalytics.com/2011/05/07/episode-45/"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-796976301456379618?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/a3xVl1FQLbM" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/796976301456379618/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/05/podcast-real-time-customer-intelligence.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/796976301456379618?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/796976301456379618?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/a3xVl1FQLbM/podcast-real-time-customer-intelligence.html" title="Podcast: &quot;Real-time customer intelligence with Paul Phillips&quot;" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/05/podcast-real-time-customer-intelligence.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UBSXY9eCp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-3098789286396541064</id><published>2011-02-25T06:50:00.000-08:00</published><updated>2012-04-17T20:54:18.860-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:54:18.860-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="big data" /><title>New Causata whitepapers page</title><content type="html">We've gathered together all the Causata whitepapers on a single &lt;a href="http://www.causata.com/whitepaper.html"&gt;page&lt;/a&gt; for easy access.  In addition to the whitepaper entitled &lt;a href="http://blog.causata.com/2011/02/five-challenges-to-web-data-integration.html"&gt;Web data integration: Build or Buy?&lt;/a&gt; two other technical whitepapers are available.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Causata Distributed Data&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
At Causata’s core is a &lt;a href="http://www.causata.com/res/pdf/Causata_Distributed_Data.pdf"&gt;data structure&lt;/a&gt; that captures every interaction between your business and your customers.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;a href="http://www.causata.com/res/pdf/Causata_Distributed_Data.pdf" imageanchor="1"&gt;&lt;img height="400" src="http://1.bp.blogspot.com/-Ql1gK3j9qdw/TWfA816EzBI/AAAAAAAAABc/UtQJesHFg78/s400/Picture%2B4.png" style="border-bottom-style: solid; border-bottom-width: 1px; border-color: initial; border-image: initial; border-left-style: solid; border-left-width: 1px; border-right-style: solid; border-right-width: 1px; border-top-style: solid; border-top-width: 1px;" width="307" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;b&gt;Causata Identity Association&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Causata has an &lt;a href="http://www.causata.com/res/pdf/Causata_Identity_Association.pdf"&gt;identity association layer&lt;/a&gt; that disambiguates customer identity information to maximize the use of data.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;a href="http://www.causata.com/res/pdf/Causata_Identity_Association.pdf" imageanchor="1"&gt;&lt;img height="400" src="http://3.bp.blogspot.com/--BG4temZazo/TWfBDDfBkrI/AAAAAAAAABk/i1SUQySBIdM/s400/Picture%2B5.png" style="border-bottom-style: solid; border-bottom-width: 1px; border-color: initial; border-image: initial; border-left-style: solid; border-left-width: 1px; border-right-style: solid; border-right-width: 1px; border-top-style: solid; border-top-width: 1px;" width="306" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-3098789286396541064?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/QVzARRMyzRw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/3098789286396541064/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/02/new-causata-whitepapers-page.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3098789286396541064?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3098789286396541064?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/QVzARRMyzRw/new-causata-whitepapers-page.html" title="New Causata whitepapers page" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-Ql1gK3j9qdw/TWfA816EzBI/AAAAAAAAABc/UtQJesHFg78/s72-c/Picture%2B4.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/02/new-causata-whitepapers-page.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0MGQX85eyp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-3296648995720645806</id><published>2011-02-25T06:33:00.001-08:00</published><updated>2012-04-17T20:57:00.123-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:57:00.123-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="customers" /><title>Causata and Intuit: working together</title><content type="html">We're very happy to be able to talk publicly about Causata's relationship with Intuit.  Intuit and Causata have been working together since 2010 on integration of diverse data sources using the Causata platform.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;a href="http://www.causata.com/solutions/spotlight" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/-Eau60pSbDYk/T447TevWusI/AAAAAAAAAC0/JYl8p_T4yCM/s1600/intuit.png" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
More &lt;a href="http://www.causata.com/solutions/spotlight"&gt;details&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-3296648995720645806?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/Kv9-hj2oCo4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/3296648995720645806/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/02/causata-and-intuit-working-together_25.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3296648995720645806?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3296648995720645806?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/Kv9-hj2oCo4/causata-and-intuit-working-together_25.html" title="Causata and Intuit: working together" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-Eau60pSbDYk/T447TevWusI/AAAAAAAAAC0/JYl8p_T4yCM/s72-c/intuit.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/02/causata-and-intuit-working-together_25.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0MDQH4_eCp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-4594743633554615304</id><published>2011-02-25T06:28:00.000-08:00</published><updated>2012-04-17T20:57:51.040-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:57:51.040-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="big data" /><title>The five challenges to web data integration</title><content type="html">Data captured from the web brings with it a tough new set of analytical and low latency demands if its full operational value is to be realized.  These challenges are:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;1. Web data is high volume&lt;/li&gt;
&lt;li&gt;2. Use cases are diverse and complex&lt;/li&gt;
&lt;li&gt;3. Web data is low information density&lt;/li&gt;
&lt;li&gt;4. Associating web observations is inherently ambiguous&lt;/li&gt;
&lt;li&gt;5. Web “in-market” predictors depreciate very rapidly&lt;/li&gt;
&lt;/ul&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;a href="hhttp://www.causata.com/whitepaper.html"&gt;&lt;img border="1" height="400" src="http://2.bp.blogspot.com/-qHV4o9DMNRQ/TWe95o5ONYI/AAAAAAAAABU/-X-kKf-XW9M/s400/Picture%2B2.png" style="border-bottom-style: solid; border-bottom-width: 1px; border-color: initial; border-image: initial; border-left-style: solid; border-left-width: 1px; border-right-style: solid; border-right-width: 1px; border-top-style: solid; border-top-width: 1px;" width="307" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;p&gt;The Causata whitepaper "&lt;a href="http://www.causata.com/whitepaper.html"&gt;Web data integration: Build or Buy?&lt;/a&gt;" explores these challenges in depth.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-4594743633554615304?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/ahFUKaMtwg4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/4594743633554615304/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/02/five-challenges-to-web-data-integration.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4594743633554615304?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4594743633554615304?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/ahFUKaMtwg4/five-challenges-to-web-data-integration.html" title="The five challenges to web data integration" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-qHV4o9DMNRQ/TWe95o5ONYI/AAAAAAAAABU/-X-kKf-XW9M/s72-c/Picture%2B2.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/02/five-challenges-to-web-data-integration.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0INSHc7fSp7ImA9Wx9VF0U.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-6197544819828479474</id><published>2011-02-03T09:25:00.000-08:00</published><updated>2011-02-03T18:13:19.905-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-02-03T18:13:19.905-08:00</app:edited><title>Big Data at Strataconf</title><content type="html">I and several of the Causata team are at &lt;a href="http://strataconf.com/strata2011"&gt;O'Reilly Strataconf&lt;/a&gt; this week, a conference all about big data and the exciting possibilities it offers.&lt;br /&gt;&lt;br /&gt;In a &lt;a href="http://strataconf.com/strata2011/public/schedule/detail/18313#"&gt;panel discussion in the opening session&lt;/a&gt; yesterday, Mike Olson of Cloudera made a very important point: big data is all very well, but things really get interesting when you combine diverse datasets.&lt;br /&gt;&lt;br /&gt;This is an emerging theme in the conference, I'm seeing it everywhere.&lt;br /&gt;&lt;br /&gt;In a tutorial on tools and techniques for the data analyst, Drew Conway and Hilary Mason took bit.ly data of web users who had clicked on links to Strataconf, and combined this with location data to plot a map. Interestingly for me, the Bay Area was only the second largest circle on the map - the largest was London! (Causata is in both).&lt;br /&gt;&lt;br /&gt;Pete Skomoroch of LinkedIn combined the Strataconf attendee list with LinkedIn data to produce a visualization of the skills, companies, and job titles of the conference attendees.&lt;br /&gt;&lt;br /&gt;Anthony Goldbloom of Kaggle announced the &lt;a href="http://www.heritagehealthprize.com/"&gt;$3 million Heritage Health Prize&lt;/a&gt;, for using data mining to predict hospital admissions. There followed an interesting discussion on the whether it was fair to include external data sets, as is increasingly done to win data mining competitions. The consensus was that it's fine if it brings better predictions, provided it doesn't deter entrants to the competition who don't have access to this data. For me this just highlights the need for tools and datasets to do this well. I saw an interesting demo of using Google Refine to reconcile data against publicly available datasets, and there has been much discussion about the emergence of data markets.&lt;br /&gt;&lt;br /&gt;All this is music to our ears at Causata: we derive powerful actionable insights precisely by combining customer data from multiple channels to give a comprehensive view of every  individual customer.&lt;br /&gt;&lt;br /&gt;Jason&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-6197544819828479474?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/O-2COSmUQhw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/6197544819828479474/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/02/big-data-at-strataconf.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6197544819828479474?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6197544819828479474?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/O-2COSmUQhw/big-data-at-strataconf.html" title="Big Data at Strataconf" /><author><name>Jason</name><uri>http://www.blogger.com/profile/08121121124929099130</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/02/big-data-at-strataconf.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0YMSX87fCp7ImA9Wx9VEk4.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-2440852754214701331</id><published>2011-01-06T05:44:00.000-08:00</published><updated>2011-01-28T09:19:48.104-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-01-28T09:19:48.104-08:00</app:edited><title>Video: Paul Phillips, Causata's CEO, on creating business value from customer data</title><content type="html">&lt;p&gt;In this short video, Paul Phillips, Causata's CEO, talks about how to create business value from customer data.  With many companies exploring &lt;a href="http://blog.causata.com/2010/12/integrating-web-data-first-step.html"&gt;how to use web data&lt;/a&gt; to make better customer decisions, Paul explains the importance of connecting, learning and acting on data.&lt;/p&gt;
&lt;script src="http://www.causata.com/res/video/flowplayer/flowplayer-3.2.4.min.js" type="text/javascript" charset="utf-8"&gt;&lt;/script&gt;
&lt;style&gt;
a.rtmp {
display:block;
width:512px;
height:288px; 
margin:25px 0;
text-align:center;
background-color:black;
}
&lt;/style&gt;
&lt;a class="rtmp" href="mp4:res/video/causataFinal_C_ending_iPhone"&gt;&lt;img src="http://3.bp.blogspot.com/_yvL50CtAcII/TSW6VBFLVzI/AAAAAAAAABI/ghQ9nOae79w/s1600/Blog-video.png" style="padding: 0px;"&gt;&lt;/a&gt;
&lt;script type="text/javascript"&gt;
--&gt;
flowplayer("a.rtmp", "http://www.causata.com/res/video/flowplayer/flowplayer.commercial-3.2.5-4.swf", { 
key: "#$3598dcc382733cc8da7", // key for *.causata.com
clip: { 
provider: 'rtmp'
}, 
plugins: {  
rtmp: {  
url: 'http://www.causata.com/res/video/flowplayer/flowplayer.rtmp-3.2.3.swf',     
netConnectionUrl: 'rtmp://stream.causata.com/cfx/st'  
}
}      
});
&lt;/script&gt;
&lt;a href="http://www.causata.com/technology/"&gt;Learn more&lt;/a&gt; about Causata Technology&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-2440852754214701331?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/nTKEzN1zZb0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/2440852754214701331/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2011/01/video-paul-phillips-causatas-ceo-on.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/2440852754214701331?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/2440852754214701331?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/nTKEzN1zZb0/video-paul-phillips-causatas-ceo-on.html" title="Video: Paul Phillips, Causata's CEO, on creating business value from customer data" /><author><name>Fiann O'Hagan</name><uri>http://www.blogger.com/profile/17845976576754870762</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/_yvL50CtAcII/TSW6VBFLVzI/AAAAAAAAABI/ghQ9nOae79w/s72-c/Blog-video.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2011/01/video-paul-phillips-causatas-ceo-on.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0cCR3s8eyp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-4537989088237135195</id><published>2010-12-16T06:35:00.000-08:00</published><updated>2012-04-17T20:51:06.573-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:51:06.573-07:00</app:edited><title>Integrating web data: the first step</title><content type="html">A hot topic at the moment is connecting web behavioral data to “offline” data –everything else that you know about your customers stored in legacy and backend systems. Clients tells us every day that using data from the web to inform and improve offline decisions is now a priority.&lt;br /&gt;
&lt;br /&gt;
Many organizations have developed their customer analytics in silos with separate data infrastructure in different parts of the business. It’s common that the dotcom group in a company is so detached from the traditional offline part of the business that no data is shared at all.  Not only are these companies not harnessing the power of the web data outside of the dotcom group, but they don’t have a good idea of which customers, are doing what online.&lt;br /&gt;
&lt;br /&gt;
This can be the case even if they’ve got a well-managed CRM system and a sophisticated web analytics team. Web analytics systems are designed to show aggregated information across web site visitors, rather than store information about individual customers and weren’t designed to be connected to other systems.&lt;br /&gt;
&lt;br /&gt;
At Causata we believe that connecting web data is essential.&lt;br /&gt;
&lt;br /&gt;
&lt;div style="text-align: left;"&gt;
&lt;img alt="Combine web and offline data" src="http://cdn.causata.com/blog/2010/12/WithWithoutWebData_02.png" style="margin-left: -120px;" /&gt;&lt;/div&gt;
&lt;br /&gt;
With all the predictive power of web data there are a lot of cool things you can do because web data gives you customer intent.  Customers are literally telling you “I like this TV”, “I’m interested in accessories”, “I’m in the market for a loan”, etc.  You should to be listening to this information and acting on it quickly or you’ll lose out on opportunities to help customers and grow your business. We’ve seen again and again that online behavior can provide much more predictive power than traditional customer segmentation when it comes to understanding key events such as product purchases and attrition.&lt;br /&gt;
&lt;br /&gt;
The conversations we have with clients generally focus on how they can understand why their customers behave the way they do and which customer actions can be influenced.  There are huge opportunities for gains from linking data. There is, however, a basic step that nearly everyone can and should take right away.&lt;br /&gt;
&lt;br /&gt;
If you’re a company that takes web data seriously then you probably already have access to some useful data. While many Fortune 1000 companies are experimenting with Google Analytics, which is free, about 40% of the Fortune 1000 care enough about web data to purchase a paid web analytics tool such as Coremetrics, Omniture, or Webtrends.&lt;br /&gt;
&lt;br /&gt;
&lt;div style="float: right; padding-left: 12px;"&gt;
&lt;img alt="40% of Fortune 1000 companies care about web data enough to use a paid web analytics solution" src="http://cdn.causata.com/blog/2010/12/WithWithoutWebData_03.png" /&gt;&lt;/div&gt;
If you’re using a paid web analytics tool you’ll probably have already marked up your web pages to capture key interaction information for reporting purposes. Most of these tools will also let you access months of historical web data in batch form. It’s possible to combine this with your offline data.&lt;br /&gt;
&lt;br /&gt;
The first step for integrating web data is to connect this historical batch data from your web analytics tool to your offline data. The challenge is to find a common key that will identify customers so you can join the two data sets.&lt;br /&gt;
&lt;br /&gt;
Ideally, you will already have a Customer ID or Transaction ID of some sort in your web markup that you can use as a key to link the web data to the offline data. Otherwise, you’ll want to consider a few small changes to the website – typically a single field added to a handful of web pages is enough to get started. The benefit is huge because once a user identifies themselves you can understand their browsing history with your offine data. This data matching process can be improved further the more keys you add: things like a Cart ID on a checkout funnel or an email address when someone clicks through to your site from an email campaign (among others) can be used to merge offline and online data.&lt;br /&gt;
&lt;br /&gt;
Once you have the online data connected it can be very enlightening. Suddenly it’s possible to see how people behave online, how online behavior differs according to different customer segments, and how web site activity relates to other offline activities in a store, a call center or anywhere else.&lt;br /&gt;
&lt;br /&gt;
&lt;div style="text-align: left;"&gt;
&lt;img alt="Chart of predictive factors for loan application" src="http://cdn.causata.com/blog/2010/12/WithWithoutWebData_01.png" style="margin-left: -17px;" /&gt;&lt;/div&gt;
&lt;br /&gt;
Of course, connecting the batch history from your web analytics tool to offline data isn’t the be-all-and-end-all – with the web you really want to be working in real-time – but it’s a great first step that everybody should be taking… now.&lt;br /&gt;
&lt;br /&gt;
Gareth&lt;br /&gt;
&lt;br /&gt;
&lt;span style="font-style: italic;"&gt;&lt;a href="http://www.causata.com/technology/"&gt;Learn more&lt;/a&gt; about Causata Technology&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-4537989088237135195?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/nc7tGkmpgGI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/4537989088237135195/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/12/integrating-web-data-first-step.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4537989088237135195?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/4537989088237135195?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/nc7tGkmpgGI/integrating-web-data-first-step.html" title="Integrating web data: the first step" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/12/integrating-web-data-first-step.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0YAQHY-eip7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-1215106383072567931</id><published>2010-12-16T04:24:00.000-08:00</published><updated>2012-04-17T20:52:21.852-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:52:21.852-07:00</app:edited><title>More prediction, less reporting. Freeing up an analyst’s time so they can make a greater impact</title><content type="html">Why do so many organizations struggle to wring value from their data?   For most the problem is not data quantity – it’s relevance and connectedness.  Much of the data scattered across enterprises is practically worthless until it’s processed and connected in one place.  Indeed, expecting valuable insights to spring from your data is like expecting a pile of coal to turn into diamonds.  Raw ingredients have little value without the right conditions.  &lt;br /&gt;
&lt;br /&gt;
Irrelevant, unmerged data is so common that data scientists, analysts, and modelers spend much of their time trying to squeeze it into more valuable forms. Among the people I’ve asked it’s not a stretch to say they spend over 90% of their time on this. They’ll also tell you they’d prefer to be spending their time building predictive models because they know that’s where they can create most business value.&lt;br /&gt;
&lt;br /&gt;
This picture gets worse when you start to tackle web data. Web data is notoriously sparse.  Real-data lacks the pretty normal distributions people see in textbooks and this is only amplified on the web.  Consider the example below showing page views: it may seem odd but most customers have none because not everyone is using the web yet, and the most active online customers with two or more page views are a small fraction of the overall population.  Are these active online customers making more purchases across all channels?  You can’t answer this question without connecting your data at the customer level across channels.  &lt;br /&gt;
&lt;br /&gt;
&lt;div style="margin-left: -18px; text-align: left;"&gt;
&lt;img alt="Chart of visitor count in the last 90 days by number of page views for each visitor" src="http://cdn.causata.com/blog/2010/12/MorePrediction_01.png" /&gt;&lt;/div&gt;
&lt;br /&gt;
All the work to collect the web data in the first place, structure it into a dataset that can be used in a statistical tool like SAS or R, etc. can take a lot of time. The world would be a much better place if some of this effort were done automatically for a data analyst. Causata was designed with this in mind.&lt;br /&gt;
&lt;br /&gt;
Causata automates the collection and connection of real-time web data so that it is structured around customers and can feed predictive models. Rich markup that characterizes customers is placed on web pages. This web data is combined with offline data and stored where it can be accessed easily using a unique identifier for each customer.  Causata provides a large set of predefined variables suitable for customer analysis and modeling.  More variables can be added as needed. These variables are constructed at query time from raw events so there is enormous flexibility for an analyst in terms of adding or changing variables at any time.  &lt;br /&gt;
&lt;br /&gt;
Causata displays the data graphically allowing an analyst to easily understand the distributions of the variables across any set of customers and also which variables have the most predictive power for a particular outcome. From there a dataset suitable for modeling can be exported to SAS/R. This means datasets for modeling can be created in minutes rather than weeks.&lt;br /&gt;
&lt;br /&gt;
&lt;div style="float: right; padding-left: 12px;"&gt;
&lt;img alt="Quote: allow data analysts to spend more time on high value tasks" src="http://cdn.causata.com/blog/2010/12/MorePrediction_02.png" /&gt;&lt;/div&gt;
A huge portion of the end-to-end effort of creating value from customer data has been automated in Causata. If you’re an analyst, suddenly you can spend much more of your time on what you enjoy. You can actually explore the statistical models you want, put them into production and give your results a big boost. &lt;br /&gt;
&lt;br /&gt;
Companies need to shift the analytical emphasis from reporting to operationalizing the power of predictive models by embracing opportunities to allow data analysts to spend more time on higher value tasks.  There are so many places in a business that stand to benefit by being a lot smarter with data - marketing, personalizing an online experience and service are just a few examples.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Gareth&lt;br /&gt;
&lt;br /&gt;
&lt;span style="font-style: italic;"&gt;&lt;a href="http://www.causata.com/technology/"&gt;Learn more&lt;/a&gt; about Causata Technology&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-1215106383072567931?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/3YnS1hZgPS4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/1215106383072567931/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/12/more-prediction-less-reporting-freeing.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/1215106383072567931?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/1215106383072567931?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/3YnS1hZgPS4/more-prediction-less-reporting-freeing.html" title="More prediction, less reporting. Freeing up an analyst’s time so they can make a greater impact" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/12/more-prediction-less-reporting-freeing.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Ak4DQX46fip7ImA9Wx9SEk4.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-5956708770615333254</id><published>2010-12-01T06:49:00.000-08:00</published><updated>2010-12-01T13:29:30.016-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-01T13:29:30.016-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="engineering" /><title>Simpson's Paradox meets marketing</title><content type="html">Part of Causata's name is drawn from the word 'causal'. &amp;nbsp;In delving into masses of customer data Causata's goal is to surface causal effects. &amp;nbsp;To do that Causata keeps incredibly granular data about events (any customer interaction) along a timeline. &amp;nbsp;By examining events across time and controlling which people are exposed to particular interactions (such as who sees certain content on a web site), Causata teases out causal effects.&lt;br /&gt;
&lt;br /&gt;
One of the surprising things when dealing with the data is how counterintuitive some results can be. &amp;nbsp;A great example of this is &lt;a href="http://en.wikipedia.org/wiki/Simpson's_paradox"&gt;Simpson's Paradox&lt;/a&gt; (it's not strictly a paradox, just very unexpected).&lt;br /&gt;
&lt;br /&gt;
Imagine a situation where two pieces of advertising (called Version A and Version B) have been prepared for use in print and on the web. &amp;nbsp;The advertisements are placed and the percentage of people responding to the ads is measured. &amp;nbsp;From this data a simple table can be drawn up showing the number of responses, the number of impressions and the response rate for each of the versions and media:&lt;br /&gt;
&lt;br /&gt;
&lt;table width="80%"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;th&gt;&lt;/th&gt;&lt;th align="left"&gt;Version A&lt;/th&gt;&lt;th align="left"&gt;Version B&lt;/th&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;th&gt;Web&lt;/th&gt;&lt;td&gt;1800/130000 (1.38%)&lt;/td&gt;&lt;td&gt;500/40000 (1.25%)&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;th&gt;Print&lt;/th&gt;&lt;td&gt;750/40000 (1.88%)&lt;/td&gt;&lt;td&gt;2200/130000 (1.69%)&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;
It's clear that Version A outperforms Version B in both print and on the web.  From that it would be obvious to conclude that Version A 'works better' than Version B.  But there's a problem.&lt;br /&gt;
&lt;br /&gt;
When the results are combined a different picture emerges:&lt;br /&gt;
&lt;br /&gt;
&lt;table width="80%"&gt;&lt;tbody&gt;
&lt;tr&gt;&lt;th&gt;&lt;/th&gt;&lt;th align="left"&gt;Version A&lt;/th&gt;&lt;th align="left"&gt;Version B&lt;/th&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;th&gt;Web + Print&lt;/th&gt;&lt;td&gt;2550/17000 (1.50%)&lt;/td&gt;&lt;td&gt;2700/170000 (1.59%)&lt;/td&gt;&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;
Here it's clear that Version B produces a better response rate. That is Simpson's Paradox in action.&lt;br /&gt;
&lt;br /&gt;
In this case this change occurred for two reasons: there's a large difference in the number of impressions each advertisement received (perhaps because of different sized print runs for a catalog or a difference in targeting on the web) and there's a &lt;a href="http://en.wikipedia.org/wiki/Confounding"&gt;confounding variable&lt;/a&gt; (a variable that needs to be taken into account when interpreting the results).&lt;br /&gt;
&lt;br /&gt;
Here the confounding variable is the medium (web or print).  Notice how for both versions of the advertisement print is more successful.  For whatever reason people seeing the ad in print respond better to it than those who see it on the web.  The choice of medium and the disparity in the number of impressions together mean that the wrong conclusion can be drawn if data is mixed.&lt;br /&gt;
&lt;br /&gt;
Simpson's Paradox occurs because of an incorrect interpretation of correlation and causality.  When looking at the combined data above it's tempting to say that Version B works better than Version A.  There's a leap from the numbers to the conclusion, and because the confounding variable is ignored in the combined numbers its effect is also ignored.&lt;br /&gt;
&lt;br /&gt;
Of course, the opposite is true.  Version A is better than Version B.  Once the true causal relationship is examined (the effect on response rate of versions A and B for web and separately for print) the true answer is revealed.&lt;br /&gt;
&lt;br /&gt;
If you want to dig deep into this read &lt;a href="http://bayes.cs.ucla.edu/R264.pdf"&gt;Simpson's Paradox: An Anatomy&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
At Causata we are mindful of Simpson's Paradox and its effect on conclusions drawn from data and are careful that our system is not itself fooled by this simple but tricky problem.&lt;br /&gt;
&lt;br /&gt;
John.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-5956708770615333254?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/op8G61wWnSM" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/5956708770615333254/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/12/simpsons-paradox.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/5956708770615333254?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/5956708770615333254?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/op8G61wWnSM/simpsons-paradox.html" title="Simpson's Paradox meets marketing" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/12/simpsons-paradox.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkAMQH05eCp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-3692007804884455194</id><published>2010-10-13T14:18:00.000-07:00</published><updated>2012-04-17T20:46:21.320-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:46:21.320-07:00</app:edited><title>Focusing on value 'to the customer'</title><content type="html">One of the most important things I learnt as Skype went from tiny company to household name was that our success depended on focusing on the user.  At every company meeting we'd hear the mantra about it being the user that matters.  Niklas Zennstrom, who founded the company, would tell us that if we ever found ourselves making a choice between what was best for the business and what was best for the user to choose the user every single time.  It’s something that as a consumer I wish every company embraced.&lt;br /&gt;
&lt;br /&gt;
This advice would be well taken by every corporation in America. Anyone who's struggled with their cell phone service can appreciate how important putting the user first really is. In fact, the number of times in the last year that I’ve dealt with companies where the person on the other end made the wrong choice shows there is a lot of room for improvement.&lt;br /&gt;
&lt;br /&gt;
At Causata we’re engaged with many of the largest companies in the world.  We speak to a lot of forward-looking people at retailers and financial companies and it is heartening to hear positive reactions to our approach to end user focus and customer value.&lt;br /&gt;
&lt;br /&gt;
We explain, as a data driven company would, that you need to be aware of the value of every customer, specifically the potential value of customers going forward, but we’re very clear that we don’t think it’s enough to be thinking about the value to the business, you need to be thinking about the value to the customer.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://2.bp.blogspot.com/_u7hF81ub7TY/TLb3XYUodlI/AAAAAAAAAAM/Js3RBOCvcB8/s1600/Emails+people+don%27t+want.png"&gt;&lt;img alt="" border="0" id="BLOGGER_PHOTO_ID_5527877573681903186" src="http://2.bp.blogspot.com/_u7hF81ub7TY/TLb3XYUodlI/AAAAAAAAAAM/Js3RBOCvcB8/s320/Emails+people+don%27t+want.png" style="display: block; height: 152px; margin-bottom: 10px; margin-left: -60px; margin-top: 0px; text-align: left; width: 320px;" /&gt;&lt;/a&gt;&lt;br /&gt;
A lot of companies blast out emails to millions of people. A couple of weeks later a small percentage of them respond and the company pats itself on the back. We’d say: Not so fast! What about the more than 90% of people whose time was wasted and whose inbox was cluttered with a message they didn't want to receive at that moment?  Most consumers would say that their time is precious. Was the value to the consumer considered against the expected value of that email to the company? It's disappointing to say but I'd be surprised if it were.&lt;br /&gt;
&lt;br /&gt;
At Causata, we perform predictions and calculate value at an individual customer level and know that you can predict with high accuracy who is likely to be interested in an email and who is likely to unsubscribe.  We make available a host of other user-focused information too. So, if a company wants to take a user focused approach to sending that email, they absolutely can.&lt;br /&gt;
&lt;br /&gt;
The world would be a better place if, as they conduct business - better yet at every customer interaction, companies didn’t just consider the value to them but the value to the customer.&lt;br /&gt;
&lt;br /&gt;
Gareth&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-3692007804884455194?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/kMvnV1m8nZQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/3692007804884455194/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/10/focusing-on-value-to-customer.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3692007804884455194?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3692007804884455194?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/kMvnV1m8nZQ/focusing-on-value-to-customer.html" title="Focusing on value 'to the customer'" /><author><name>Gareth O'Loughlin</name><uri>http://www.blogger.com/profile/07325321846459173339</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_u7hF81ub7TY/TLb3XYUodlI/AAAAAAAAAAM/Js3RBOCvcB8/s72-c/Emails+people+don%27t+want.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/10/focusing-on-value-to-customer.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Ck8AQn88fyp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-7768650494059823727</id><published>2010-10-06T09:10:00.000-07:00</published><updated>2012-04-17T20:47:23.177-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:47:23.177-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="engineering" /><title>What Causata means by real-time</title><content type="html">Throughout Causata's web site one term is repeated constantly: real-time.  But 'real-time' means different things to different people.  For example, anyone who has worked with a real-time operating system will instantly think of hard constraints on execution time for critical routines.&lt;br /&gt;
&lt;br /&gt;
At Causata we are actually using real-time in two senses: first to mean that the system responds within a small window of time when queried for a decision and second to mean that the system takes into account information shortly after it has arrived.&lt;br /&gt;
&lt;br /&gt;
Causata is designed to provide decisions and customer profiles based on information in its distributed data-store to improve existing touchpoint systems.  These profiles and decisions are designed to enhance interactions that a customer is having on the web, the phone, through a mobile device, via email, or at a teller.  That means that Causata needs to be able to make a decision or serve a profile rapidly when the customer is interacting with the business.  On the web that implies decisions and profiles in milliseconds.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;a href="http://2.bp.blogspot.com/_yvL50CtAcII/TKyfWrVhR0I/AAAAAAAAABA/Qs-XYG20GD8/s1600/CollectionPoints.png" imageanchor="1" style="margin-left: 0;"&gt;&lt;img border="0" height="149" src="http://2.bp.blogspot.com/_yvL50CtAcII/TKyfWrVhR0I/AAAAAAAAABA/Qs-XYG20GD8/s320/CollectionPoints.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
To fully use customer interaction data Causata has to be up to date.  The value of customer data depreciates quickly.  For example, on the web the customer's trail of clicks indicates what's on their mind at that very moment.  If a system can't accept and act upon the trail of clicks while the visitor is still on the site a great deal of information is wasted.  Causata is designed to receive and incorporate the most up to date information as it happens.&lt;br /&gt;
&lt;br /&gt;
And it's not just the web that benefits from real-time.  With real-time profiles and decisions based on the absolute latest data any touchpoint can use information from any other.  For example, emails being sent for marketing purposes can be personalized using information derived from Causata gathered from the web, call center, transactions and more.&lt;br /&gt;
&lt;br /&gt;
John.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-7768650494059823727?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/dn691KclPHU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/7768650494059823727/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/10/what-causata-means-by-real-time.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/7768650494059823727?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/7768650494059823727?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/dn691KclPHU/what-causata-means-by-real-time.html" title="What Causata means by real-time" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_yvL50CtAcII/TKyfWrVhR0I/AAAAAAAAABA/Qs-XYG20GD8/s72-c/CollectionPoints.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/10/what-causata-means-by-real-time.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Ck4EQ3o_fSp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-321705745870340656</id><published>2010-10-04T07:19:00.001-07:00</published><updated>2012-04-17T20:48:22.445-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:48:22.445-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="engineering" /><title>The challenge of identity association</title><content type="html">Because Causata wants to bring together all the information about a customer, a necessary first step is identifying which data is associated with which customer.  And, unfortunately, customer identification is quite a challenging problem.&lt;br /&gt;
&lt;br /&gt;
Imagine a customer researching digital cameras online. The customer visits the website of an electronics store and views a number of cameras; Causata records the web session with only an anonymous cookie tracking the customer.&lt;br /&gt;
&lt;br /&gt;
A week later, the customer is walking by that same electronics store and decides to walk in. Having already researched cameras, the customer heads straight for a specific digital camera.&lt;br /&gt;
&lt;br /&gt;
The customer purchases the camera with a credit card and presents a loyalty card. Causata records the transaction (down to the line item level) and the associated identity information as a retail session. The loyalty card has a name and address associated with it; it was captured when the customer signed up. Causata also has that information in its data store.&lt;br /&gt;
&lt;br /&gt;
The next day, at a different location of the same electronics store, the customer buys a memory card for the camera and presents the same credit card (but has forgotten their loyalty card). Causata again creates a retail session and records the purchase with the associated credit card.&lt;br /&gt;
&lt;br /&gt;
A week later, the customer logs in to the electronics store website to see how many loyalty card points they have. Causata records a web session capturing every click made plus the loyalty card information.  &lt;br /&gt;
&lt;br /&gt;
Causata notices that the last time this machine was used—when the customer was researching cameras—the same anonymous cookie was used. It can now associate the original research with the customer who eventually bought the camera.&lt;br /&gt;
&lt;br /&gt;
Finally, the customer drops in to the same electronics store near their office and buys a lens cloth using cash and presents a loyalty card. Causata records a retail session with the loyalty card information.&lt;br /&gt;
&lt;br /&gt;
All of these interactions build a complete picture of the customer:  a customer who researched online, continued on to buy in store, made follow-up purchases, and tracked their loyalty points. Clearly, this is a loyal customer.&lt;br /&gt;
&lt;br /&gt;
But without identity association it’s hard to see that complete picture.&lt;br /&gt;
&lt;br /&gt;
The image below shows the individual sessions and their relationships.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://3.bp.blogspot.com/_dt4ksD7hyDE/TKnhlACbyVI/AAAAAAAAAN4/IiwtgStC1X8/s1600/12345.png"&gt;&lt;img alt="" border="0" id="BLOGGER_PHOTO_ID_5524194443728243026" src="http://3.bp.blogspot.com/_dt4ksD7hyDE/TKnhlACbyVI/AAAAAAAAAN4/IiwtgStC1X8/s400/12345.png" style="display: block; height: 171px; margin-bottom: 10px; margin-left: -6px; margin-top: 0px; text-align: left; width: 400px;" /&gt;&lt;/a&gt;&lt;br /&gt;
After the customer has researched the camera online and bought it in store, the system records two separate, unlinked sessions: the anonymous web surfing session and the purchase.  When the customer buys the memory card with the same credit card used to buy the camera, those purchases are linked via the credit card.&lt;br /&gt;
&lt;br /&gt;
The picture becomes clearer when the customer logs in to the website to check their loyalty card points. The anonymous research web session is then linked to the purchase of the camera, and four separate sessions are joined together. Finally, the lens cloth is purchased, and this is added to the knowledge about the customer.&lt;br /&gt;
&lt;br /&gt;
The connections between the interactions in the diagram above can also have probabilities associated with them. For example, it’s highly likely that the camera purchase, lens cloth purchase, and online loyalty card point tracking are sessions performed by the same person. They are, after all, linked by an individual loyalty card.&lt;br /&gt;
&lt;br /&gt;
But the connection between the initial research session and the rest of the data is more tenuous. It’s possible that it’s the same person, but it’s also possible that more than one person used the same computer, thereby blurring the identity associated with the anonymous cookie.&lt;br /&gt;
&lt;br /&gt;
Much more about the technology behind Causata can be found &lt;a href="http://www.causata.com/technology/technical"&gt;here&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
John.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-321705745870340656?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/s050q8DOfE0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/321705745870340656/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/10/challenge-of-identity-association.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/321705745870340656?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/321705745870340656?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/s050q8DOfE0/challenge-of-identity-association.html" title="The challenge of identity association" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/_dt4ksD7hyDE/TKnhlACbyVI/AAAAAAAAAN4/IiwtgStC1X8/s72-c/12345.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/10/challenge-of-identity-association.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DU4MRno9eSp7ImA9Wx5WGEg.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-7576103686182427910</id><published>2010-09-30T07:44:00.001-07:00</published><updated>2010-09-30T07:46:27.461-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-09-30T07:46:27.461-07:00</app:edited><title>The Connected Customer</title><content type="html">When I talk about a connected customer I'm actually referring to three things&lt;br /&gt;
&lt;br /&gt;
1. Customers of a business are connected to the business in multiple ways through different channels&lt;br /&gt;
&lt;br /&gt;
2. The same customers are also constantly connected to the business because of the proliferation of mobile devices&lt;br /&gt;
&lt;br /&gt;
3. Businesses need a connected view of each customer that takes into account all the ways customers interact with them&lt;br /&gt;
&lt;br /&gt;
Causata's business is a consequence of these three.  Causata talks about being in the business of providing real-time customer intelligence.  What this means is gathering together all the clues a customer leaves when interacting with a business, analyzing them to discover customer desires, and providing the intelligence to systems that can act upon it.&lt;br /&gt;
&lt;br /&gt;
Customer clues come in many forms both inbound and outbound. Inbound customer clues include every click the customer makes on a business' web site, the transactions they perform with the business, and calls they make to a call center.  Outbound customer 'clues' are things like direct mail pieces that were sent to the customer, emails they've received, offers they've been shown on the web.&lt;br /&gt;
&lt;br /&gt;
When drawn together this total view of the customer and their interaction history provides a rich set of data to analyze.  But drawing it together can be very difficult.  Causata's software is designed to make gathering customer clues easy: we integrate with a wide range of systems through standard APIs.  &lt;br /&gt;
&lt;br /&gt;
Of course, it's not just API integration but also data integration.  That's why Causata's distributed datastore is schema-free and based on an indexed key/value store that we developed for the specific purpose of serving up real-time customer intelligence.  This is designed to make bringing together disparate sources of customer data easy.  And, happily, it has worked well in the real-world.&lt;br /&gt;
&lt;br /&gt;
One significant wrinkle in developing such a system is that the identity of a customer is both fluid and vague.  A customer might authenticate with a web site and provide strong identity information, might use a loyalty card in a store providing a different sort of identity, or simply be observed using the same credit card online and off.  Equally a non-authenticated user of a business' web site might be one of many people as computers are frequently shared.  To intelligently merge customer identity information Causata build 'identity association' into the platform.&lt;br /&gt;
&lt;br /&gt;
Once you've got robust data about every customer interaction and can identity the customer it's possible to perform statistical analysis or machine learning based on very rich information.  We've built both a user interface and machine learning facilities into Causata to provide both human and machine access to real-time customer intelligence.&lt;br /&gt;
&lt;br /&gt;
Humans can sit at the Causata Customer Insight tool and analyze their customer base quickly and easily.  Equally the intelligence coming out of Causata can be used to drive other tools.  For example, Causata can feed into a web content targeting system, or an email targeting platform, or a system producing checkout coupons.  &lt;br /&gt;
&lt;br /&gt;
Ultimately our goal is to produce rich information that other systems can use.  To do that we provide both access to raw information inside Causata and to the output of our machine learning algorithms.&lt;br /&gt;
&lt;br /&gt;
A lot more detailed information about the technology can be found &lt;a href="http://www.causata.com/technology/technical"&gt;here&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
John.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-7576103686182427910?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/BgvytuTNfTk" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/7576103686182427910/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/09/connected-customer.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/7576103686182427910?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/7576103686182427910?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/BgvytuTNfTk/connected-customer.html" title="The Connected Customer" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/09/connected-customer.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkEMRH06eCp7ImA9WhVXF08.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-6582490108177787411</id><published>2010-09-30T07:43:00.000-07:00</published><updated>2012-04-17T20:44:45.310-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-17T20:44:45.310-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="causality" /><category scheme="http://www.blogger.com/atom/ns#" term="engineering" /><title>The Arrow of Time</title><content type="html">In building a system to glean real-time intelligence from customer interactions we had to make a number of decisions about data storage.  One of these was how much data to store.  If a specific use for customer data is known (such as mining it for particular values or relationships, or building a statistical model based on certain characteristics) then the data can be summarized before storage.&lt;br /&gt;
&lt;br /&gt;
For example, instead of keeping a record of every transaction a customer makes, it might be good enough to store just the number of transactions and the average value.  Or store the same thing on a monthly basis.  This would greatly reduce the amount of data needing to be stored.  Of course, the problem with that is that you have to know up front what question you are going to ask of your data. &lt;br /&gt;
&lt;br /&gt;
At Causata we don't know what questions are going to be asked (we supply a flexible querying mechanism to ask any question) and we also wanted to build in future-proofness.  If you summarize data you are inherently throwing away information.  And that means throwing away information, means throwing away the chance to answer unseen questions.&lt;br /&gt;
&lt;br /&gt;
So, at Causata we decided that the only real solution was to store every single piece of information about customer interactions.  We call the smallest piece of information an event and store everything: every click on the company's web site (with all the available metadata), every transaction, every time an email is sent to the customer, etc. &lt;br /&gt;
&lt;br /&gt;
Naturally, that means that we have to manage a huge amount of data which resulted in our parallel, distributed architecture using key/value pairs.  And there's a massive benefit to doing this: we don't lose the arrow of time.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://2.bp.blogspot.com/_dt4ksD7hyDE/TKNCM2WFuuI/AAAAAAAAAMg/EknUVrMebXc/s1600/inspiration.png"&gt;&lt;img alt="" border="0" id="BLOGGER_PHOTO_ID_5522330356600388322" src="http://2.bp.blogspot.com/_dt4ksD7hyDE/TKNCM2WFuuI/AAAAAAAAAMg/EknUVrMebXc/s400/inspiration.png" style="display: block; height: 105px; margin-bottom: 10px; margin-top: 0px; text-align: left; width: 400px;" /&gt;&lt;/a&gt;&lt;br /&gt;
When data is summarized the relationships between events that can only been seen from their order is lost.  In the Causata system we keep everything a customer does (and the company does to the customer such as sending a direct mail piece) in time order.  That means we can reconstruct the complete customer history.&lt;br /&gt;
&lt;br /&gt;
And it unlocks something even more valuable: we can examine cause and effect.  We can, for example, look at everyone who received a particular marketing email and discover what they did next.  By comparing with those who did not receive the email we can uncover the real effect of the message.&lt;br /&gt;
&lt;br /&gt;
John.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-6582490108177787411?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/nbpfUwkn7z4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/6582490108177787411/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/09/arrow-of-time.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6582490108177787411?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/6582490108177787411?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/nbpfUwkn7z4/arrow-of-time.html" title="The Arrow of Time" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_dt4ksD7hyDE/TKNCM2WFuuI/AAAAAAAAAMg/EknUVrMebXc/s72-c/inspiration.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/09/arrow-of-time.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkYBR349eSp7ImA9Wx5REk4.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-3937876710595099967</id><published>2010-08-19T08:02:00.000-07:00</published><updated>2010-08-19T08:02:36.061-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-08-19T08:02:36.061-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="careers" /><title>Careers: Senior Interaction/UX Designer</title><content type="html">Causata, Inc. is recruiting a London-based Senior Interaction/UX Designer to join our venture-backed startup.  This challenging role involves creating a web-based user interface for a product that is used to visualize massive amounts of complex data.  You'll be working with the latest technologies and on a ground-breaking product.&lt;br /&gt;
&lt;br /&gt;
We know that a great user experience is at the heart of any successful product. As our Senior Interaction Designer we want you to be a driving force in gathering insight into our users' needs, behaviors and intentions. You'll be translating this into innovative interface designs for state-of-the-art software that enables our users to explore and interact with huge amounts of data. You will be responsible for the UX and the UI working closely with (and managing) a visual designer, engineers and the product team throughout all stages of the product cycle.&lt;br /&gt;
&lt;br /&gt;
Responsibilities:&lt;br /&gt;
• Taking part in requirements gathering and user research.&lt;br /&gt;
• Defining the user and interaction model.&lt;br /&gt;
• Creating a detailed user interface design for all our products.&lt;br /&gt;
• Developing high level and/or detailed storyboards, mockups and prototypes to effectively communicate interaction and design ideas.&lt;br /&gt;
• Championing user needs and the overall user experience.&lt;br /&gt;
&lt;br /&gt;
Requirements:&lt;br /&gt;
• Solid academic background, Masters degree or equivalent experience, specializing in design, human-computer interaction, ergonomics, computer science or cognitive psychology.&lt;br /&gt;
• At least five years experience in application design, online interaction design or information architecture.&lt;br /&gt;
• Outstanding conceptual design skills and familiarity with high-level deliverables (scenarios, personas, flow diagrams, etc.) as well as low-level interaction design best practice.&lt;br /&gt;
• Skilled in at least one method of UI prototyping (Axure, Flash Catalyst, Flex, Fireworks, Illustrator, JavaScript frameworks etc.)&lt;br /&gt;
• Contribute to the creation of our product roadmap.&lt;br /&gt;
• Rationalize design decisions to stakeholders and peers.&lt;br /&gt;
• Create detailed specifications as needed.&lt;br /&gt;
• Enthusiasm for IxD, UX, and IA with a broad knowledge of current practice.&lt;br /&gt;
• Excellent leadership, communication and teamwork skills.&lt;br /&gt;
• Experience or interest in data visualisation and statistical analysis tools would be highly beneficial.&lt;br /&gt;
&lt;br /&gt;
Visit &lt;a href="http://www.causata.com/about/careers"&gt;Careers&lt;/a&gt; to apply.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-3937876710595099967?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/wVTX8UDR-TA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/3937876710595099967/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/08/careers-senior-interactionux-designer.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3937876710595099967?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3937876710595099967?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/wVTX8UDR-TA/careers-senior-interactionux-designer.html" title="Careers: Senior Interaction/UX Designer" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/08/careers-senior-interactionux-designer.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0YBSHw-eCp7ImA9Wx5RE0k.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-8796841295162948089</id><published>2010-08-18T02:21:00.000-07:00</published><updated>2010-08-20T17:05:59.250-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-08-20T17:05:59.250-07:00</app:edited><title>The Evolving Enterprise</title><content type="html">&lt;blockquote&gt;"We have only two sources of competitive advantage:&lt;br /&gt;1. The ability to learn more about our customers faster than the competition&lt;br /&gt;2. The ability to turn that learning into action faster than the competition"&lt;br /&gt;&lt;/blockquote&gt;Quotation is attributed to Jack Welch, CEO General Electric&lt;br /&gt;&lt;br /&gt;If Jack Welch’s assertions are true (and I believe they are) then&lt;br /&gt;there is a major disconnect between the way enterprises currently&lt;br /&gt;think about data management and how they should be thinking about it.&lt;br /&gt;If rapid learning from customer data and acting upon that learning are&lt;br /&gt;the only real sources of competitive advantage then why aren’t&lt;br /&gt;enterprises worrying much more about organizing data to efficiently do&lt;br /&gt;just that? Why is the focus of data management today on basic tasks&lt;br /&gt;such as storage, retrieval and querying for reporting? This can’t be&lt;br /&gt;right.&lt;br /&gt;&lt;br /&gt;I expect this situation to change quickly. Over the last few years&lt;br /&gt;there has been a rapidly accelerating effort on developing powerful&lt;br /&gt;new analytical architectures, and these technologies are creating a&lt;br /&gt;growing appreciation that enterprise data has been an enormous&lt;br /&gt;unrealized asset.&lt;br /&gt;&lt;br /&gt;If monetary value really can be extracted from new analytical&lt;br /&gt;technologies in a way that very quickly generates returns of many&lt;br /&gt;times cost, then enterprises will be quick to implement these new&lt;br /&gt;technologies. What enterprises need right now more than anything else&lt;br /&gt;is hard evidence that this can actually be done.&lt;br /&gt;&lt;br /&gt;Rather tantalizingly, what we will see in this accelerating market are&lt;br /&gt;widening gaps between the technically possible and the commercially&lt;br /&gt;proven, and between the commercially proven and published case studies&lt;br /&gt;in the public domain.&lt;br /&gt;&lt;br /&gt;Here at Causata we like Jack Welch's words quoted above. It would be&lt;br /&gt;hard to more succinctly express the essence of what Causata's software&lt;br /&gt;encapsulates as a product than Welch’s points 1 and 2.&lt;br /&gt;&lt;br /&gt;Paul.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-8796841295162948089?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/l5vMmyagZG0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/8796841295162948089/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/08/evolving-enterprise.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/8796841295162948089?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/8796841295162948089?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/l5vMmyagZG0/evolving-enterprise.html" title="The Evolving Enterprise" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/08/evolving-enterprise.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUMHQn0-fCp7ImA9Wx5REU8.&quot;"><id>tag:blogger.com,1999:blog-6738477378021528685.post-3773099015622293389</id><published>2010-03-12T03:58:00.000-08:00</published><updated>2010-08-18T02:23:53.354-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-08-18T02:23:53.354-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="big data" /><title>Big Answers</title><content type="html">In a recent &lt;a href="http://www.economist.com/surveys/PrinterFriendly.cfm?story_id=15557465"&gt;Economist&lt;/a&gt; story two striking facts extracted from Big Data were reported:&lt;br /&gt;
&lt;blockquote&gt;Best Buy, a retailer, found that 7% of its customers accounted for 43% of its sales.&lt;br /&gt;
&lt;/blockquote&gt;and&lt;br /&gt;
&lt;blockquote&gt;In Britain the Royal Shakespeare Company (RSC) sifted through seven years of sales data for a marketing campaign that increased regular visitors by 70%.&lt;br /&gt;
&lt;/blockquote&gt;And it's not hard to find other stories indicating that enormous gains in revenue can be extracted from the data that companies already have. For example, the web site &lt;a href="http://www.napastyle.com/"&gt;NapaStyle.com&lt;/a&gt; reports a 10% lift in revenue per visitor after two months of mining customer clicks to understand behavior.  &lt;a href="http://worldkitchen.co.uk/"&gt;World Kitchen&lt;/a&gt; reported an average order value increase of 20% using similar technology.&lt;br /&gt;
&lt;br /&gt;
Anecdotal evidence says that Amazon.com is getting an additional 10% sales revenue by mining the books that its customers buy to recommend other titles they might like. And Netflix gets 60% of its rentals from people clicking through on similarly mined data guiding its recommendations.&lt;br /&gt;
&lt;br /&gt;
Startup &lt;a href="http://flightcaster.com/"&gt;FlightCaster&lt;/a&gt; is taking a Big Data approach mining airline records, weather and FAA real-time traffic information to provide accurate predictions of flight delays well ahead of the airlines themselves.  Their application provides flight delay information 6 hours before the airlines tell their own customers.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="https://www.foxaudiencenetwork.com/"&gt;Fox Audience Network&lt;/a&gt;, which handles advertising across all Fox properties (including MySpace), is using a large cluster of machines crunching Big Data to obtain an order of magnitude increase in advertising margins.&lt;br /&gt;
&lt;br /&gt;
And Europe's largest ad targeting network, &lt;a href="http://nugg.ad/"&gt;nugg.ad&lt;/a&gt; is &lt;a href="http://www.cloudera.com/blog/2010/03/why-europes-largest-ad-targeting-platform-uses-hadoop/"&gt;using&lt;/a&gt; Big Data to drive everything they do.&lt;br /&gt;
&lt;br /&gt;
Users of Big Data shouldn't be surprised by Big Answers: Big Data represents a massive untapped resource that all companies already have.  It's just a question of integrating the right software to obtain Big Answers.&lt;br /&gt;
&lt;br /&gt;
John.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6738477378021528685-3773099015622293389?l=blog.causata.com' alt='' /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/causata-blog/~4/K7cSexFvK8o" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://blog.causata.com/feeds/3773099015622293389/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://blog.causata.com/2010/03/big-answers.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3773099015622293389?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6738477378021528685/posts/default/3773099015622293389?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/causata-blog/~3/K7cSexFvK8o/big-answers.html" title="Big Answers" /><author><name>John Graham-Cumming</name><uri>http://www.blogger.com/profile/03426267581535741881</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://blog.causata.com/2010/03/big-answers.html</feedburner:origLink></entry></feed>

