<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">
    <title>Data Management &amp; Decision Support</title>
    
    
    <link rel="alternate" type="text/html" href="http://prudenza.typepad.com/dwh/" />
    <id>tag:typepad.com,2003:weblog-634712</id>
    <updated>2011-12-20T04:53:48-08:00</updated>
    <subtitle>Veni, vidi..... vici?

</subtitle>
    <generator uri="http://www.typepad.com/">TypePad</generator>
    <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/DataWarehousingSomeThoughts" /><feedburner:info uri="datawarehousingsomethoughts" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://hubbub.api.typepad.com/" /><entry>
        <title>Whitepaper: Kimball Revisited – Context is our guide</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/odndgkoFa4Q/whitepaper-kimball-revisited-context-is-our-guide.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/12/whitepaper-kimball-revisited-context-is-our-guide.html" thr:count="0" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e201675f091761970b</id>
        <published>2011-12-20T04:53:48-08:00</published>
        <updated>2011-12-21T02:48:18-08:00</updated>
        <summary>Taking sustainability and agility in data modelling to the next level Intro Both authors (Frederik Hofstra and myself) of this whitepaper are involved in a data warehouse project with a university in the Netherlands. This university implemented a new SAP...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p><strong>Taking sustainability and agility in data modelling to the next level</strong></p>
<p> </p>
<p><em>Intro</em></p>
<p>Both authors (Frederik Hofstra and myself) of this whitepaper are involved in a data warehouse project with a university in the Netherlands. This university implemented a new SAP module; Student LifeCycle Management. A comprehensive module covering all processes from enrolling students, to credit scoring, to progress reports, to eventually graduating. This module is also the main source for master data like the academic structure of programmes and modules, as well as the organisational structure (faculties, divisions, subdivisions, etc.) and the student information.</p>
<p>The strategy of the university is to deliver a service to their organisation that is capable of delivering data in various ‘truths’ to people or information systems and with an array of functionality (e.g.: reporting, analysis, etc.).</p>
<p>The solution architecture consists of two storage layers. The first one is based on Data Vault principles and heuristics and the second one is a dataset layer. The latter is the object of discussion in this paper. This solution architecture is currently available in production mode and is - as such - a proven solution.</p>
<p> </p>
<p><em>Abstract</em></p>
<p><em> </em>This paper is all about delivering quality data products. However, quality is by definition a relative construct. Its requirements can only be fixed by the time the user needs the data<strong>. </strong>Context - as we use the word in this paper - is a function of the <em>person involved </em>at a specific moment in <em>time</em> with the <em>quality attributes</em> he desires. And these contexts change all the time. Accommodating these constant changes is partly covered by continuous innovation in technology. However, the use of innovative technologies does not free us from proper software design. This paper addresses a software design perspective to offer the required agility.</p>
<p>It is a continuing effort in trying to evolve the work of giants like Dr. R. Kimball, W.H. Imnon, B. Devlin, C.Imhoff, D.Linstedt and many many more.</p>
<p><span class="asset  asset-generic at-xid-6a00d8354d01ac69e20162fe14e165970d"><a href="http://prudenza.typepad.com/files/2011---damhof-hofstra---kimballrevisited-contextisourguide.pdf"> </a><a href="http://prudenza.typepad.com/files/2011---damhof-hofstra---kimballrevisited-contextisourguide-1.pdf"> <span class="asset  asset-generic at-xid-6a00d8354d01ac69e201675f15647f970b"><a href="http://prudenza.typepad.com/files/2011---damhof-hofstra---kimballrevisited-contextisourguide-2.pdf">Download 2011 - Damhof, Hofstra - KimballRevisited-Contextisourguide</a></span></a></span></p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/12/whitepaper-kimball-revisited-context-is-our-guide.html</feedburner:origLink></entry>
    <entry>
        <title>Tales of a consultant in decision support and data management (3) - Saying 'I don't know'</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/VceiHUaCN78/tales-of-a-consultant-in-decision-support-and-data-management-3-saying-i-dont-know.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/12/tales-of-a-consultant-in-decision-support-and-data-management-3-saying-i-dont-know.html" thr:count="1" thr:updated="2011-12-16T03:28:03-08:00" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e201675ed4b4a3970b</id>
        <published>2011-12-16T00:44:10-08:00</published>
        <updated>2011-12-16T00:44:10-08:00</updated>
        <summary>In my previous post on this topic I talked about saying 'I don't know' when you do not know how long it takes to finish (say) a project. But, when pressured enough, engineers are going to guess - and I...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p>In my <a href="http://prudenza.typepad.com/dwh/2011/12/tales-of-a-consultant-in-decision-support-and-data-management-2-saying-i-dont-know.html" target="_blank">previous post</a> on this topic I talked about  saying 'I don't know' when you do not know how long it takes to finish (say) a project.</p>
<p>But, when pressured enough, engineers are going to guess - and I completely understand!</p>
<p>STOP PRESSURING US.</p>
<p>When planning, if you don't know the answer, don't guess. Explain in your plan exactly what you are <a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fde0c55f970d-pi" style="float: right;"><img alt="Test" class="asset  asset-image at-xid-6a00d8354d01ac69e20162fde0c55f970d" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fde0c55f970d-120wi" style="margin: 0px 0px 5px 5px;" title="Test" /></a> going to do to get the answer. Then, when more information is available, update the plan. </p>
<p>Now, why isn't senior management keen on this? And this is really bugging me, because I want to understand. We - as engineers - really don't know...really...we don't. Why would you insist on an answer you know is not correct at all??</p>
<p>My take on this: because it's not easy. It means that management have to participate throughout the project to make <em>decisions </em>about each tasks as sufficient information becomes available (Agile manifesto principle 4)<sup>1</sup>.</p>
<p> </p>
<p><sup>1</sup><span style="font-size: 8pt;">Integrating CMMI and Agile Development- Paul E. McMahon -2011</span></p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/12/tales-of-a-consultant-in-decision-support-and-data-management-3-saying-i-dont-know.html</feedburner:origLink></entry>
    <entry>
        <title>Agile Data Warehousing - Ralph Hughes visits Data Vault customers</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/WrVY3xbE5y4/agile-data-warehousing-ralph-hughes-visits-data-vault-customers.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/12/agile-data-warehousing-ralph-hughes-visits-data-vault-customers.html" thr:count="0" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e201543804531c970c</id>
        <published>2011-12-08T05:21:34-08:00</published>
        <updated>2011-12-08T23:54:15-08:00</updated>
        <summary>Wednesday November 16th 2011 Ralph Hughes from Ceregenics was in the Netherlands. Ralph is author of the book 'Agile Data Warehousing: Delivering World-Class Business Intelligence Systems Using Scrum and XP'. Ralph is currently under contract to write more books on...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p>Wednesday November 16<sup>th</sup> 2011 Ralph Hughes from <a href="http://www.ceregenics.com/">Ceregenics</a> was in the Netherlands. Ralph is author of the book <a href="http://www.amazon.com/Agile-Data-Warehousing-World-Class-Intelligence/dp/0595471676/ref=sr_1_1?ie=UTF8&amp;qid=1323338289&amp;sr=8-1">'Agile Data Warehousing: Delivering World-Class Business Intelligence Systems Using Scrum and XP'</a>. Ralph is currently under contract to write more books on the topic of agility in data warehouse development.</p>
<p>I had been in contact with Ralph for some time; he wanted to know more about data vault, getting the facts, how it is actually used, what customers use it, how they develop and deploy, how it contributes to agility and how it impacted the business.</p>
<p><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e201539431df6b970b-pi" style="float: right;"><img alt="IMG_7815" class="asset  asset-image at-xid-6a00d8354d01ac69e201539431df6b970b" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e201539431df6b970b-320wi" style="margin: 0px 0px 5px 5px;" title="IMG_7815" /></a></p>
<p>Of course, anything can be explained in writing or conceptually, but the 'real proof of the pudding, is in the eating'. Opportunity knocked when Ralph was in the Netherlands for his TDWI course on Agile data warehousing. He asked me whether or not I could arrange some customer visits in Amsterdam. Customers that use and deploy Data Vault and have attained a high agree of agility.</p>
<p><a href="http://www.xlntconsulting.com/about/">Tom Breur</a> and me were hosts for Ralph and we visited the <a href="http://www.vu.nl/en/index.asp">Free University</a> (client of mine) and <a href="http://www.binck.nl/">BinckBank</a> (client of Tom), both in Amsterdam. Hans Hultgren (Genesee Academy) happened to be in the Netherlands that week and joined us as well. We met with both management and technical team members of the university and BinckBank.</p>
<p>Both clients were particularly interesting because their data warehouses are in production and in a mode of constant change. Both clients showed a remarkable predictability and reliability in coping with these changes. Change equated to 'business as usual'. I remember Ralph asking an engineer 'how long does it take to deploy a new data element to the warehouse?' The engineer replied: 'do you want to know the lead-time including my coffee break?'.</p>
<p>Ralph, Tom, me and Hans were impressed with the accomplishments of these clients in getting their data warehouse deployment in control while constantly adding value/changes to the business in a predictable fashion. </p>
<p><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fd877fdf970d-pi" style="float: left;"><img alt="IMG_7828" class="asset  asset-image at-xid-6a00d8354d01ac69e20162fd877fdf970d" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fd877fdf970d-320wi" style="margin: 0px 5px 5px 0px;" title="IMG_7828" /></a>I will not transcribe the whole interview in this blog - that is simply too much - send me a note if you want to know more. Interesting differences between Free University and BinckBank were the fact that they used different automating techniques and also the level of business key integration differed slightly. Free University used templating (generating XML and import in Business Objects Data Services) for data warehouse automation and the data warehouse was driven by business keys. BinckBank used <a href="http://www.datawarehousemanagement.org/Quipu.aspx">Quipu</a> for data warehouse automation and the data warehouse was partly driven by business key, and some by surrogate key (see also my <a href="http://slidesha.re/lF03rr">presentation</a> on the Data Vault advanced seminar about different Data Vault species - slide 25-30). In terms of software development methods, BinckBank used the Scrum method and Free University was based on waterfall/iterative with lots of lean practises being used.</p>
<p>I will try to summarize both visits from the perspective of me and Tom, particularly slanted towards Agile software development, by asking my blog readers, three questions: </p>
<ol>
<li>Why is it that you can build and deploy extremely small particles in Data Vault and not in other approaches, without having an increase in the overhead and coordination of these particles? In other words; 'Divide and Conquer to beat the Size / Complexity Dynamic'<sup>1</sup></li>
<li>Why is it that you can re-engineer your existing model and guarantee that the changes remain local? Something that is hugely beneficial in data warehouses that - by definition - grow over time.</li>
<li>Why is it that - as your (Data Vault based) data warehouse grows - your costs grow ‘merely’ in linear fashion initially, and as you approach the end state marginal growth in cost <em>decreases</em> exponentially (as opposed to exponential cost <em>increase</em> for Kimball warehouses)?</li>
</ol>
<p><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fd87808b970d-pi" style="display: inline;"><img alt="P1060033" border="0" class="asset  asset-image at-xid-6a00d8354d01ac69e20162fd87808b970d image-full" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fd87808b970d-800wi" title="P1060033" /></a><br />I want to thank Free University as well as BinckBank for offering their time, their energy and enthusiasm to the general cause of knowledge sharing. Of course I want to thank Tom Breur and Hans Hultgren for putting in their time as well. </p>
<p>My special thanks of course to Ralph Hughes as being an open minded, inquisitive and knowledgeable peer. It was great being your host in the Netherlands. </p>
<p> </p>
<p><span style="font-size: 8pt;"><sup>1 - Gerald M. Weinberg - Quality Software Management - 1992</sup></span></p>
<p><span style="font-size: 8pt;">Photo #1: Left in the corner sits Ralph Hughes, next to him Tom Breur. On the other side the Free University; Jaap Roos (project manager), Dorien Heijting (Data Warehouse Engineer), Erwin Vreeman (Project Lead).</span></p>
<p><span style="font-size: 8pt;">Photo #2: Sitting with the american flag - Ralph Hughes and Hans Hultgren. At the top of the table - BinkBank: Michel Uittenbogaard (Data Warehouse Engineer) and on the right Paul Delgman (BI manager). </span></p>
<p><span style="font-size: 8pt;">Photo #3: Sitting near the window looking down: me, myself and I</span></p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/12/agile-data-warehousing-ralph-hughes-visits-data-vault-customers.html</feedburner:origLink></entry>
    <entry>
        <title>Tales of a consultant in decision support and data management (2) - Saying 'I don't know'</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/1dH3MSQb2P4/tales-of-a-consultant-in-decision-support-and-data-management-2-saying-i-dont-know.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/12/tales-of-a-consultant-in-decision-support-and-data-management-2-saying-i-dont-know.html" thr:count="0" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e2015394265853970b</id>
        <published>2011-12-07T07:00:43-08:00</published>
        <updated>2011-12-09T02:08:55-08:00</updated>
        <summary>It is hard to admit that you don't know something - this is human nature. But a lot of the time, the pressures mount up. CFO: "Ronald, we need the product to be done asap, when do you think it...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p><br />It is hard to admit that you don't know something - this is human nature. But a lot of the time, the pressures mount up. </p>
<p>CFO: "Ronald, we need the product to be done asap, when do you think it can be ready?"<br />Me: "well, I can tell you when we will finish, but I will not know exactly what will be done" (I don't know)<br />CFO: "Huh? Aren't you the expert? We pay you for being the expert, so spit it out"</p>
<p>Now - ask yourself - what kind of consultant are you? Will you eventually cave in to the pressure and call out a number? Or do you dig your heels and persist in 'I don't know'. If you call out a number; please be reminded that <em>the software graveyard is strewn with the carcasses of partially completed projects that were three to five times larger than anyone dreamed<span style="font-size: 8pt;"><sup>1</sup></span></em>.</p>
<p>There is a subtle difference between speed and progress. You can drive with 200 miles per hour, but without a map, your progress will be zero. In other words, you need some kind of 'map'/plan and some kind of thoughtful analysis of the problem you wish to solve.</p>
<p>But even so, a plan is just what it is; 'a plan'. THE PROJECT WILL NEVER GO ACCORDING TO PLAN. Why; because most plans I come across envision some kind of fixed end-result. This end-result is often described by means of requirements. Even if we were to embark on a monumental requirements gathering process where we interview all people involved and write down all quality attributes, we would still not have a complete requirements specification that covers the quality demands of our business users (simply because they themselves do not fully comprehend them<span style="font-size: 11px;"><sup>2</sup></span>). But  apart from that; such an elaborate process simply isn’t attainable.</p>
<div><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e2015394266e94970b-pi" style="float: left;"><img alt="Finishline1" class="asset  asset-image at-xid-6a00d8354d01ac69e2015394266e94970b" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e2015394266e94970b-320wi" style="margin: 0px 5px 5px 0px;" title="Finishline1" /></a>If the finish line is not precisely known, the number of miles to the finish line is not known either. So - writing down in your 'plan' that the project will take '42' weeks to finish, is just another way of caving in to the pressure of calling a number. Don't do it.....resist....Do not comply.</div>
<p> </p>
<p> </p>
<div>If the finish line is not known, why not say 'give me xx amount of time and I will know precisely when I will finish, but I don't know exactly what I will deliver'. Or as Mike Cohn wrote it: '<em>When we acknowledge that the result is both somewhat unknown as well as unknowable in advance, planning becomes a process of setting and revising goals that lead to a longer term objective'<sup>3</sup>.</em></div>
<p> </p>
<div>So - shouldn't we make a plan? Of course you should make a plan and a thoughful analysis of the problem you want to solve, but refrain from 'painting the finish line'. </div>
<p><span style="font-size: 8pt;"><sup>1 </sup>Humprey - Managing the Software Proces - 1990<br /></span><span style="font-size: 11px;"><sup>2</sup> Which is especially true for data products where the future use of the data is - by definition - not fully known<br /><sup>3</sup> M.Cohn - Agile estimating and Planning - 2006 </span></p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/12/tales-of-a-consultant-in-decision-support-and-data-management-2-saying-i-dont-know.html</feedburner:origLink></entry>
    <entry>
        <title>Tales of a consultant in decision support and data management (1) - IT Governance</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/yM3qzuasNQU/religious-wars-tool-x-or-tool-y.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/12/religious-wars-tool-x-or-tool-y.html" thr:count="2" thr:updated="2012-01-10T09:17:17-08:00" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e2015393e3c641970b</id>
        <published>2011-12-03T03:35:24-08:00</published>
        <updated>2011-12-03T03:35:24-08:00</updated>
        <summary>I often get asked by clients to help them jiggle1 their decision support or data management efforts. The pattern that unfolds can be described as follows: 1) As any consultant I always want to know why I am being hired....</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p><strong><br /></strong></p>
<p><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fd484a84970d-pi" style="display: inline;"><img alt="Rootcause" class="asset  asset-image at-xid-6a00d8354d01ac69e20162fd484a84970d" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20162fd484a84970d-500wi" title="Rootcause" /></a><br /><br />I often get asked by clients to help them jiggle<sup>1</sup> their decision support or data management efforts. The pattern that unfolds can be described as follows:</p>
<p>1) As any consultant I always want to know why I am being hired. In other words, I ask: 'what are your current problems?' 'Any new challenges on the horizon?'</p>
<p>2) Client; 'we need new tech/tools'</p>
<p>3) Me: 'Then you don't need me, just write a check to the vendor. But again, what problem are you trying to solve?'</p>
<p>4) Client: 'Look, we have all the tech money can buy (me: I’m not kidding, this actually happens), but it is just not working at the moment'</p>
<p>5) Me: 'Hmm, why is it not working for you?'</p>
<p>6) Client: 'it's expensive, changes take forever, business is extremely unhappy and we keep buying more expensive hardware to increase performance'</p>
<p>7) Me: 'ok, so leadtime, quality and costs are a problem?'</p>
<p>8) Client: 'aye'</p>
<p>9) Me: 'and why do you think new tech will help you out?'</p>
<p>10) Client: 'we think this new appliance stuff - super-dooper-in-memory-shared-nothing-all-in-one-box - can save our asses'</p>
<p>By now, a quote comes to mind from Einstein: 'The significant problems we face cannot be solved by the same level of thinking that created them.'</p>
<p>11) Me: 'so how does this new super-dooper tech stuff help increase quality?'</p>
<p>12) Client: 'ehhhhh'</p>
<p>13) Me: 'Is it an idea to first perform a root-cause analysis? Lets try - together with stakeholders - to specify the problem'</p>
<p>14) Client: "We know the problems - we just need tech'</p>
<p>Obviously I am polarizing a bit, but this pattern seems to repeat itself over and over again. Of course, it could also be me ;-), but I do not think I am that special.</p>
<p>There is a lot of publicly available research which proves that technology is an extremely small part of any IT failure. By far the biggest part of the problem is management itself. More precisely: IT governance, or the lack thereof, usually is the core of the problem. Allowing technology to be purchased without proper IT governance is the same as creating waste.</p>
<p>It would be much better to just donate the money to charity.  </p>
<p>If management in IT is incapable of clearly explaining by whom and how decisions are made regarding the use of IT, then how do you expect conformance to principles? Or how do you expect to attain some kind of sustainable robust/pervasive architecture? How do you expect infrastructure (including shared and standard applications) to accommodate the needs of the many? How do you cope with business needs and how to align it with IT? How are investments in IT justified? Who is prioritizing? Who monitors the ROI (and acts on deviations)? Etc.</p>
<p>Obviously in the pattern I described above, I urge my clients to invest in a proper root-cause analysis and retrospective (what went wrong and how can we improve). Then I will try to establish some leading principles, all in the context of data management and decision support. These leading principles will be translated to an '<em>organizing logic for data, BI applications and infrastructure, captured in a set of policies, relationships and technical choices to achieve desired business and technical standardization and integration'</em><em><sup>2</sup></em><em>.  </em></p>
<p>Technology choices, tool choices, vendor choices can be logically derived from this exercise. And don't be surprised if it turns out that you already have all the technology/tools/skills/people you need.....</p>
<p>15) - client: 'ehh....Ronald, that's a lot of work - are you sure - I just wanna buy tech.'</p>
<p>16) - me: 'aye - I am sure - but don’t be fooled - you, my dear management, are in the center of the problem that needs to be solved. It will be hard work...'</p>
<p>Two scenario's unfold now:</p>
<p>1) - Client: bye, bye, Ronald - I think we need another consultant. This happens to me occasionally, and I am perfectly fine with it. I am congruent with myself in my consulting because I sincerely believe this is how it can work. I would not be congruent with myself if I went along in a technology selection process of some kind or implementing some tool that will never solve the real problem.</p>
<p>2) - Client: ok - lets do this.</p>
<p> </p>
<p> </p>
<p><span style="font-size: 8pt;">1 - A jiggler gets an organization unstuck by providing a small change in how the client sees the world</span></p>
<p><span style="font-size: 8pt;"> </span><span style="font-size: 11px;">2 - IT governance, 2004, Weill, Ross</span></p>
<p> </p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/12/religious-wars-tool-x-or-tool-y.html</feedburner:origLink></entry>
    <entry>
        <title>A seminar about valuable innovation in data warehousing</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/O11sC9WqPko/a-seminar-about-valuable-innovation-in-data-warehousing.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/10/a-seminar-about-valuable-innovation-in-data-warehousing.html" thr:count="2" thr:updated="2012-01-24T18:05:58-08:00" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e201539228a28c970b</id>
        <published>2011-10-08T06:54:17-07:00</published>
        <updated>2011-10-08T06:54:53-07:00</updated>
        <summary>For those of you not present at the Data Vault automation seminar (oct.6 2011); hereby the foreword of the conference proceedings. We are still debating how we will publish these proceedings - will let you know. Attendees to the Data...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p><span style="font-size: 8pt;"><em>For those of you not present at the Data Vault automation seminar (oct.6 2011); hereby the foreword of the conference proceedings. We are still debating how we will publish these proceedings - will let you know</em></span>.</p>
<div>Attendees to the Data Vault Automation Seminar,<br /><br />We would like to welcome you to - what we expect to be - a wonderful conference dedicated to the innovation of passionate people in the industry of data warehousing and business intelligence. These people and their companies discovered methods to further enhance the maturity and professionalism in which we - as an industry - developed data warehouses. Of course, to the great benefit of clients; either to adhere to compliancy regulations, being able to deal with constant changes in the business, boost innovation in product development or to improve and control operational costs.<br /><br />Tom and myself came up with the idea of this conference simply because we both feel that there is a lot to gain from automating data warehouses. Of course ‘automating’ as such, did not came out of the blue. Examples of companies that laid the groundwork are Kalido and BIReady (where the latter is of Dutch origin). Then in 2007, Data Vault was introduced in the Netherlands<span style="font-size: 8pt;"><sup>1</sup></span> by Dan Linstedt, Ronald Damhof, and DNV/CIBIT. Back then we did several courses with mainly senior data warehouse consultants - and we had a blast! The rest is history. Data Vault skyrocketed and the number of Certified Data Vault modelers has since increased rapidly. Without any scientific research to back backing up this claim, it is estimated that the majority of new data warehouse projects in the Netherlands are now based on Data Vault.<br /><br />In this turmoil of new ways to deliver data warehouses, innovative people and organisations saw opportunities to support and increase the velocity and quality of the delivery process. One of those areas of opportunity is the automation of data warehouse production. And although we originally intended the seminar to be generic, we ended up with automation innovations particularly oriented towards Data Vault. <br /><br />This conference is a platform for the various innovations to show us their stuff and teach us how we can benefit from their hard work. It is also an opportunity for the attendees to ‘gauge the temperature’ the market of Data Vault innovations by listening and talking to the people behind these innovations. And finally we hope that you take the opportunity to catch up with the community. It is safe to say that just about everybody who has an opinion on Data Vault, will be present. <br /><br />We would like to thank DIKW- academy and the sponsors for sharing our passion and making it possible to keep the fee for this conference low, thereby contributing to sharing of knowledge and improving ourselves as a community.<br /><br /><br />Ronald Damhof &amp; Tom Breur</div>
<p> </p>
<p> </p>
<p> </p>
<p><span style="font-size: 11px;"><sup>1</sup> Data Vault was published way before 2007 by of course Dan Linstedt himself and others as well (M.Ketelaars for example)</span></p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/10/a-seminar-about-valuable-innovation-in-data-warehousing.html</feedburner:origLink></entry>
    <entry>
        <title>IQCP, a means to an end - it is all about quality</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/qkTlGq6NXK4/iqcp-a-means-to-an-end-it-is-all-about-quality.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/09/iqcp-a-means-to-an-end-it-is-all-about-quality.html" thr:count="3" thr:updated="2011-09-30T00:11:55-07:00" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e2015391acc536970b</id>
        <published>2011-09-17T03:28:02-07:00</published>
        <updated>2011-09-19T04:03:29-07:00</updated>
        <summary>May 2011 I was given an opportunity to be part of the first group of pioneers that field-tested the IQCP program. IQCP stands for Information Quality Certified Professional and is organized by IAIDQ - the International association for Information Quality...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p>May 2011 I was given an opportunity to be part of the first group of pioneers that field-tested the <a href="http://www.iaidq.org/iqcp/iqcp.shtml" target="_blank">IQCP program</a>. IQCP stands for <em>Information Quality Certified Professional</em> and is organized by <a href="http://http://www.iaidq.org/" target="_blank">IAIDQ</a> - <em>the International association for Information Quality and Data Quality.</em>  To be fair; I really do not like certifications. It is said that with certification, clients can have some kind of objective measurement regarding knowledge and skill. Well, for the majority of certifications this is utterly deceiving.</p>
<p>That being said, why did I go for IQCP?</p>
<p>For one thing; IAIDQ is a not-for-profit organisation and vendor-neutral. People involved in IAIDQ are people with a passion to spread the word with regard to quality in general and Information Quality in particular. </p>
<p>Second; the certification process was extremely well prepared, grounded in known research and adheres to widely accepted standards and regulations such as the ISO/IEC 17024 and the Standards for the Accreditation of Certification Programs, published by the USA National Commission for Certifying Agencies (NCCA, 2002).</p>
<p>Third; I was not mainly interested in the certification as such....To put in poetic; I was interested in the journey, not the destination. The IQCP certification is based on a reference list of books and articles, <a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20154357fe159970c-pi" style="float: right;"><img alt="Carrot_and_the_stick" class="asset  asset-image at-xid-6a00d8354d01ac69e20154357fe159970c" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e20154357fe159970c-120wi" style="margin: 0px 0px 5px 5px;" title="Carrot_and_the_stick" /></a> carefully selected by people who know their stuff. The certification was a ‘carrot on a stick’ to me, pressuring me to actually study and invest time. The deadline was end of July 2011 - I have never in my life read so many books, but I have never in my life been so authentically interested in a subject as well. I just kept reading, because there is so much clever stuff written out there.</p>
<p>Fourth and maybe the most important reason; in may 2011 I also happened to follow the legendary PSL (problem solving leadership) workshop given by <a href="http://www.geraldmweinberg.com/Site/Home.html" target="_blank">Jerry Weinberg</a>, <a href="http://www.estherderby.com/" target="_blank">Esther Derby</a> and <a href="http://www.jrothman.com/" target="_blank">Johanna Rothman</a>. What I have learned is immense and influences me daily. The root of this workshop is - in my opinion - founded in quality principles; learning, continuous improvement, root-causes, self organizing team, people, communication, understanding the problem, understanding context and so much more. The people I met in this workshop were incredible and a huge inspiration. A lot of the attendees came from the world of context-driven testing and agile software development. Both - in my opinion - strongly rooted in quality principles. </p>
<p>It somehow all got together when I was asked to participate in the first batch of people to go for IQCP.</p>
<p>For me; IQCP was the means to an end - I have studied amazing books (for a reference list download this <a href="http://www.iaidq.org/iqcp/doc/iqcp-reference-list.pdf" target="_blank">pdf</a>) go to written by people like Edwards Deming, Joseph Duran, Jerry Weinberg, Kaoru Ishikawa, Masaaki Imai, Richard Wang and so many more. My library has grown very rapidly lately and I have still so much more to read (learn). Somehow, all I have read, learned and still learning are pieces of a puzzle that fit seamlessly in my field of expertise #datamanagement #decisionsupport #datagovernance #dataquality #datavault #architecture #softwaredevelopment.</p>
<p>The journey never ends....</p>
<p>My gratitude goes to Tom Breur, IAIDQ, Jerry Weinberg, Esther Derby, Johanna Rothman, those crazy canadians ('it is what it is'), Testsidestory, Olav, Markus, Griffin and so many more.</p>
<p><span style="font-size: 8pt;">ps. I passed the exam.....I am now an Information Quality Certified professional </span></p>
<p> </p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/09/iqcp-a-means-to-an-end-it-is-all-about-quality.html</feedburner:origLink></entry>
    <entry>
        <title>Why Data Vault is supported by all ETL tools</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/HomjUP4dcVw/why-data-vault-is-supported-by-all-etl-tools.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/09/why-data-vault-is-supported-by-all-etl-tools.html" thr:count="0" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e2014e8b8aa6fb970d</id>
        <published>2011-09-14T01:08:12-07:00</published>
        <updated>2011-09-21T02:40:12-07:00</updated>
        <summary>Recently a discussion raged on LinkedIn regarding the 'ETL tools that support Data Vault OUT OF THE BOX' (link). I gotta be honest - I was annoyed by the discussion and was stupid enough to display this by commenting kind...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p>Recently a discussion raged on LinkedIn regarding the 'ETL tools that support Data Vault OUT OF THE BOX' (<a href="http://www.linkedin.com/groups/Which-ETL-tools-can-support-46641.S.69087633?qid=d4449fb0-a753-49c5-ba2f-4b433ac365a2&amp;trk=group_most_popular-0-b-ttl&amp;goback=%2Egmp_46641" target="_blank">link</a>). I gotta be honest - I was annoyed by the discussion and was stupid enough to display this  by commenting kind of harshly. I would like to apologize to everyone and especially to Daan.</p>
<p>In this blogpost I would like to explain my point of view regarding this question. </p>
<p>In the above mentioned discussion I commented very briefly 'All ETL tools support Data Vault". Allow me to explain this by paraphrasing an argument that was also used by Daan in the subsequent comments. He mentioned that technology brought about efficiency gains in the last 20 to 30 years. I agree with that, the data is quite clear about it ;-). Trying to explain these gains I leave to applied science, but I would like to take one tiny piece of the puzzle and put it in the context of my remark that 'all ETL tools support Data Vault'.</p>
<p>One of the 'variables' in the function of this tremendous leap - in my opinion - is <em>uniformity</em>. Organizing uniform systems (I use the term 'systems' in the broadest sense - People, Technology, Processes) opened the door towards repeatability, predictability, limiting waste and improving quality. In writing this I think Dr. W. <em>Edwards Deming</em> would agree with me.</p>
<p>Now, back to the subject of ETL and Data Vault. With Data Vault we design the system of modeling and logistics of data in advance. Bothg go hand in hand. What we want to achieve is uniformaty <em>as much as we possibly can</em>. Uniformity in modeling, <em>balanced</em><em> </em>with the uniformity in loading. </p>
<p>Let me elaborate some more.</p>
<p>In Data Vault and more generally speaking in 'systems thinking', all objects in a system are interrelated. How I construct a data model has a strong impact on the way I (can) construct the loading (ETL). With Data Vault we standardize the data model as much as we can (there are quite some heuristics in Data Vault, it should not be applied in some dogmatic way), in a limited number of constructs (hub, link, sat). But we also design the loading constructs, which are also extremely limited in number (hubload, linkload, satload). Every load construct has got a standardized pattern, see the figure regarding the pattern for a hub load.</p>
<p><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e201543569e7d9970c-pi" /> <a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e2014e8b8a887a970d-pi" style="display: inline;"><img alt="Schermafbeelding 2011-09-14 om 09.36.29" border="0" class="asset  asset-image at-xid-6a00d8354d01ac69e2014e8b8a887a970d image-full" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e2014e8b8a887a970d-800wi" title="Schermafbeelding 2011-09-14 om 09.36.29" /></a> <br />If I were to translate this to SQL it would be something like: INSERT &lt;distinct values&gt; to HUB where NOT EXIST in HUB. Of course any ETL tool would support such a simple construct! Data Vaults are thus being build with SSIS, Informatica, InfoSphere, Business Objects, Pentaho,SAS etc...</p>
<p>Please be advised that in real live Data Vault implementations the above is a simplified example. The principles however remain unchanged;</p>
<p>- A limited number of loading patterns</p>
<p>- The patterns are standardized in type</p>
<p>- The patterns are simple</p>
<p>- The patterns can be executed asynchronous</p>
<p>- The type of patterns can make use of parallel loading</p>
<p>I would like to summarize the above with two words; <em>uniformity</em><em> </em>and <em>automation.</em><em> </em>Because of uniformity in modeling and logistics we open the door towards repeatability/automation. Making it a lot cheaper to maintain, but also easy to change or supplement (testability is designed in the system, as well as repairability). Agile software development find great support by these kinds of systems (this is worthy of an entirely new blogpost ;-)).</p>
<p>We now can design a predictable system of loading data in a data model. We have created a uniform structure of the data in the data warehouse, opening the way for more uniformity towards Kimball datamarts as well (be it in-memory, on file, virtualised, etc..).</p>
<p>Uniformity and automation have ignited a wave of innovation in the Netherlands. Innovation led by independent consultants and consultancy firms - that saw great opportunity in the daily problems they face - to take the data logistics to a new level of automation; metadata driven ETL.</p>
<p><span style="font-size: 8pt;">(want to know more - come to the Data Vault automation seminar, October 6. Various suppliers of Data Vault automation software are talking and showing their stuff. The organizers decided to keep entrance fees extremely low (100 euro) in order for everyone to contribute, learn and improve themselves and the industry - <a href="http://www.dikw-academy.nl/opleidingen/21" target="_blank">register now</a>)  </span></p>
<p> </p>
<p> </p>
<p> </p>
<p> </p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/09/why-data-vault-is-supported-by-all-etl-tools.html</feedburner:origLink></entry>
    <entry>
        <title>For the love of Data (Quality)</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/Vks9tYwb9wI/for-the-love-of-data-quality.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/07/for-the-love-of-data-quality.html" thr:count="2" thr:updated="2011-11-28T01:14:44-08:00" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e2015433dc60c2970c</id>
        <published>2011-07-20T02:39:45-07:00</published>
        <updated>2011-07-20T03:01:14-07:00</updated>
        <summary>"Cleaning the lake or reducing the pollution from the factory" - is an analogy used by Thomas Redman. It perfectly paints a picture of data quality issues 'we' all face in our data management projects. In projects we often have...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p>"Cleaning the lake or reducing the pollution from the factory" - is an analogy used by Thomas Redman. It perfectly paints a picture of data quality issues 'we' all face in our data management projects. </p>
<p>In projects we often have to struggle against forces that 'just wanna create the freekin report'. Whether the data is wrong is of no concern. In these instances the goal apparently is the information system ('our DWH is running', 'The report is build' or 'SAP is live') and not the data. Put in other words; data is often treated as a by- product and the information system is the main product.</p>
<p>Lets take a closer look - by using (among others) Richard Wang's analogy with a manufacturing process<sup>1</sup>:</p>
<p><a href="http://prudenza.typepad.com/.a/6a00d8354d01ac69e2015433dc4df8970c-pi" style="display: inline;"><img alt="Schermafbeelding 2011-07-20 om 11.23.21" class="asset  asset-image at-xid-6a00d8354d01ac69e2015433dc4df8970c" src="http://prudenza.typepad.com/.a/6a00d8354d01ac69e2015433dc4df8970c-320wi" title="Schermafbeelding 2011-07-20 om 11.23.21" /></a></p>
<p>I have never seen any manager, CEO or foreman happy with a succesful implemented assembly line but a lousy product. Have you? In software engineering I sometimes have the feeling we lost touch....</p>
<p>The Information System - be it a Data Warehouse, a report or an ERP - is not the purpose, it is a means to an end. And the end should at least be <strong>sufficient data quality </strong>(where data quality is defined in the perpective of the customer, fit for his/her task)<strong>. </strong></p>
<p>The cool thing about the (somewhat) oversimplified analogy of Richard Y Wang is its usefulness for another reason. It stresses the <strong>system perspective</strong> you gotta have on dealing with data quality issues. You cannot go about 'cleaning your lake while the factory is still polluting'. Producing quality Information Products is executed by means of a system. Now, do not translate this 'system' into 'information system'. This system consists of people, processes and technology. Dealing with data quality issues requires a system perspective to really add value in terms of better quality products and a 'greener' environment.</p>
<p>So - do not blame your ERP department for creating bad data</p>
<p>So - do not blame your report builder for creating useless reports</p>
<p>So - do not blame the person entering the data</p>
<p>Maybe, something to consider; who do you think is accountable for organising the 'system'? Yes - management should embrace quality in its DNA...</p>
<p>I know I am corny - management should have read Deming, Juran, Crosby in their MBA's. Knowledge that is like half a century old.</p>
<p> </p>
<p><sup>1 - Richard Wang - A Product perspective on Total Data Quality Management - feb.1998, Communications of the ACM</sup></p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/07/for-the-love-of-data-quality.html</feedburner:origLink></entry>
    <entry>
        <title>Data Vault Introductions - Download</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/DataWarehousingSomeThoughts/~3/L46tVnQSBto/data-vault-introductions-download.html" />
        <link rel="replies" type="text/html" href="http://prudenza.typepad.com/dwh/2011/05/data-vault-introductions-download.html" thr:count="1" thr:updated="2011-07-13T06:59:07-07:00" />
        <id>tag:typepad.com,2003:post-6a00d8354d01ac69e201538e876afd970b</id>
        <published>2011-05-16T19:54:14-07:00</published>
        <updated>2011-07-02T04:23:01-07:00</updated>
        <summary>For those who want to familiarize themselves with Data Vault, I hereby offer the links to some papers I wrote (sometimes with others) and a presentation I gave recently (see other post for details): Published originally in Dutch for Database...</summary>
        <author>
            <name>Ronald Damhof</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://prudenza.typepad.com/dwh/">
<div xmlns="http://www.w3.org/1999/xhtml"><p>For those who want to familiarize themselves with Data Vault, I hereby offer the links to some papers I wrote (sometimes with others) and a presentation I gave recently (see <a href="http://prudenza.typepad.com/dwh/2011/05/advanced-data-vault-seminar-may-5-6-2011.html" target="_self">other post</a> for details):</p>
<p>Published originally in Dutch for Database Magazine, an article that is the first part of a triptych regarding the 'Next generation Enterprise Data Warehouaing', this part was an introduction into Data Vault named 'Letting go of the idea of a single version of the truth'  - August 2008 -  <span class="asset  asset-generic at-xid-6a00d8354d01ac69e2014e887adb0a970d"><a href="http://prudenza.typepad.com/files/damhof_dbm0508_eng-1.pdf">Download</a></span></p>
<p><span class="asset  asset-generic at-xid-6a00d8354d01ac69e2014e887adb0a970d">Published originally in Dutch for Database Magazine, an article that is the second </span>part of a triptych regarding the 'Next generation Enterprise Data Warehouaing', this part deals with the post-Data Vault processing, the business rules in particular - November 2008 -  <span class="asset  asset-generic at-xid-6a00d8354d01ac69e2014e887adca2970d"><a href="http://prudenza.typepad.com/files/damhof_dbm1108_eng2-1.pdf">Download</a></span></p>
<p>Published originally in Dutch for Database Magazine, an article that is the final part of a triptych regarding the 'Next generation Enterprise Data Warehouaing', this part deals with 'Development processes in Data Warehouse environments' - June 2009 - <a href="http://www.beyenetwork.nl/channels/1551/view/10744" target="_blank">link</a></p>
<p>Published in Belgium for BI-community.org, a keynote article titled: Data Vault, Business Objectives for next generation data warehousing - January 2011 -  <span class="asset  asset-generic at-xid-6a00d8354d01ac69e201538e875d20970b"><a href="http://prudenza.typepad.com/files/pages-from-infomagazine-bi-community-2011-01-january.pdf">Download</a></span></p>
<p>And finally a link to the presentaion I held on the Advanced Data Vault seminar, May 5 &amp; 6 in Baarn, the Netherlands: <a href="http://www.slideshare.net/rdamhof/data-warehousing-data-vault-as-evolutionary-step" target="_blank">link</a></p>
<p>**July 2nd,2011 - Tom Breur wrote a <a href="http://www.xlntconsulting.com/newsletter-archive/downloads/XLNT%20Consulting%20TomsTenDataTips%20201106%20Data%20Vault.pdf" target="_blank">good piece</a> "Tom's ten data tips", very much Data Vault related.</p></div>
</content>



    <feedburner:origLink>http://prudenza.typepad.com/dwh/2011/05/data-vault-introductions-download.html</feedburner:origLink></entry>
 
</feed><!-- ph=1 -->

