<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Tod means Fox</title>
	
	<link>http://blog.todmeansfox.com</link>
	<description>Supporting decisions through sound data management</description>
	<pubDate>Thu, 15 Oct 2009 05:14:48 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/TodMeansFox" /><feedburner:info uri="todmeansfox" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><geo:lat>42.350909</geo:lat><geo:long>-71.54753</geo:long><creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license><item>
		<title>Moving Day</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/FlwE7SW-Fjc/</link>
		<comments>http://blog.todmeansfox.com/2009/10/15/moving-day/#comments</comments>
		<pubDate>Thu, 15 Oct 2009 05:14:48 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Personal]]></category>

		<category><![CDATA[Announcement]]></category>

		<category><![CDATA[moving]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=923</guid>
		<description><![CDATA[Quick TmF update: Over the past 6 weeks, I&#8217;ve been in the process of moving to Holland. While it can be bad blog-form to make excuses for not posting in a while, I thought emigrating from Belgium to The Netherlands would be a good enough excuse.  After another week or two, I expect to [...]]]></description>
			<content:encoded><![CDATA[<p>Quick TmF update: Over the past 6 weeks, I&#8217;ve been in the process of moving to Holland. While it can be bad blog-form to make excuses for not posting in a while, I thought emigrating from Belgium to The Netherlands would be a good enough excuse.  After another week or two, I expect to be fully settled and back on track. </p>
<p>In the 3 years that TmF has been online, I&#8217;ve never gone this long without posting. Feels weird. </p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=FlwE7SW-Fjc:O0Cnyg_rqeY:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=FlwE7SW-Fjc:O0Cnyg_rqeY:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/FlwE7SW-Fjc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/10/15/moving-day/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/10/15/moving-day/</feedburner:origLink></item>
		<item>
		<title>Understanding the Business Cycle</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/IjctRPpFhtU/</link>
		<comments>http://blog.todmeansfox.com/2009/08/31/understanding-business-cycle/#comments</comments>
		<pubDate>Mon, 31 Aug 2009 04:55:44 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Decision Support]]></category>

		<category><![CDATA[econometrics]]></category>

		<category><![CDATA[economy]]></category>

		<category><![CDATA[financial]]></category>

		<category><![CDATA[friedman]]></category>

		<category><![CDATA[GDP]]></category>

		<category><![CDATA[keynes]]></category>

		<category><![CDATA[NBER]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=897</guid>
		<description><![CDATA[There&#8217;s been a lot of talk lately about how the US economy is crawling out of recession. You may have heard terms like &#8220;bottom&#8221; and &#8220;trough&#8221;, seen graphs of GDP growth, and read articles referencing NBER. 
You may be highly skeptical and unwilling to believe that the latest recession is a thing of the past; [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s been a lot of talk lately about how the US economy is crawling out of recession. You may have heard terms like &#8220;bottom&#8221; and &#8220;trough&#8221;, seen graphs of GDP growth, and read articles referencing NBER. </p>
<p>You may be highly skeptical and unwilling to believe that the latest recession is a thing of the past; after all, you likely know someone still looking for a job and you likely know someone else who recently lost a home. Or, you may be very optimistic, and knowing how the business cycle revolves, you are beginning to invest and think about growth. Understanding the business cycle will help you make better decisions &#8212; individually and in business. </p>
<p>Off the bat, I should state that economics is not an exact science (but you likely knew that already). And while there are a few axioms (supply/demand, money supply) and indicators (GDP, unemployment) that can help paint a realistic picture, most of what you see and read is based on analyst insight and experience, backed up with historical data and predictive models. There are faults all along the way. When you consider the complexity of the global economy, it should be clear that economics is more of an art than a science. The Business Cycle, though, seems to be something you can count on.</p>
<h3>What is the Business Cycle?</h3>
<p>Plainly put, the Business Cycle represents 4 phases of aggregate economic movement, ranging from periods of high growth to recession and back again. John Maynard Keynes (d. 1946) referred to these cycles as &#8220;waves of optimism and pessimism&#8221;, or to put it another way, waves of expansion and contraction. These phases were first written about more than 50 years ago by <a href="http://www.econlib.org/library/Enc/bios/Burns.html">Arthur Burns</a> in his book &#8220;Measuring Business Cycles&#8221;. </p>
<h4>The four phases</h4>
<ol style="padding-left:20px">
<li><strong>Peak</strong> - The highest point of economic output just before a downturn</li>
<li><strong>Recession</strong> - When the economy actually shrinks, or contracts</li>
<li><strong>Trough</strong> - The &#8220;bottom&#8221;</li>
<li><strong>Recovery</strong> - The economy has stopped shrinking is growing once more</li>
</ol>
<h4>Last 5 US Business Cycles</h4>
<table>
<hr />
<th>Peak YYYY-MM</th>
<th>Recession Period</th>
<th>Trough YYYY-MM</th>
<th>Recovery Period</th>
</hr>
<tr>
<td>1980-01</td>
<td>6</td>
<td>1980-07</td>
<td>12</td>
</tr>
<tr>
<td>1981-07</td>
<td>16</td>
<td>1982-11</td>
<td>93</td>
</tr>
<tr>
<td>1990-07</td>
<td>8</td>
<td>1991-03</td>
<td>120</td>
</tr>
<tr>
<td>2001-03</td>
<td>8</td>
<td>2001-11</td>
<td>73</td>
</tr>
<tr>
<td>2007-12</td>
<td>?</td>
<td>?</td>
<td>?</td>
</tr>
</table>
<p><em>Data Source: <a href="http://wwwdev.nber.org/cycles/cyclesmain.html">NBER</a></em></p>
<p>Notice that peaks and troughs are represented by month and year, while the other two phases are measured over a number of months. You&#8217;ll also notice that recession and recovery periods can vary greatly.</p>
<h3>Who determines when each phase begins and ends?</h3>
<p>In the US, the task is managed by the <a href="http://www.nber.org/">National Bureau of Economic Research</a> (NBER). NBER is a private, nonprofit, and <em>nonpartisan</em> research organization (stocked with Nobel Prize winning economists). They work on many economics projects and work closely with businesses and universities. They self-proclaim their dedication to promote &#8220;a greater understanding of how the economy works&#8221;. </p>
<p>For example, NBER most recently <a href="http://www.nber.org/cycles/dec2008.html">concluded</a> that &#8220;the last [US economic] expansion ended in December 2007&#8243;. We know that after such expansion, according to the Business Cycle, will come a period of recession, followed by a bottom, leading to a new period of expansion.</p>
<h3>Does everyone agree?</h3>
<p>In short, nope. <a href="http://www.econlib.org/library/Enc/bios/Friedman.html">Milton Friedman</a>, to name one prominent example, believed that the economy fluctuates rather than cycles. The new classical framework states that the economy is much more flexible than that which is implied by the business cycle framework. </p>
<p>There&#8217;s also an issue of market equilibrium. Having a somewhat predictable business cycle implies that the markets will be out of sync quite a lot &#8212; allowing some speculators and investors to take advantage of price differences at different phases of the cycle (this is called <a href="http://www.investopedia.com/terms/a/arbitrage.asp">arbitrage</a>, take a look into Rational Expectations Theory as well). Other differing methodologies include the credit/debt cycle, political cycles, and Marxian cycles.  </p>
<h3>Final thoughts</h3>
<p>Even against dissenter argument and alternate viewpoints, the Business Cycle framework still works and is easily observable. Economists at NBER continue to assign dates to peaks and troughs. Individuals, businesses and organizations still base many of their purchasing and hiring decisions on what phase we&#8217;re currently in. This likely won&#8217;t change any time soon. </p>
<p>The only problem with relying on NBER is that they lag reality. For example, we have very likely reached the bottom of the current cycle and are now in a phase of recovery. NBER might make it official at some point this year, maybe next year. Investors waiting for official announcements will find that they&#8217;re missing the boat. </p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=IjctRPpFhtU:vOuavdykoLQ:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=IjctRPpFhtU:vOuavdykoLQ:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/IjctRPpFhtU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/08/31/understanding-business-cycle/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/08/31/understanding-business-cycle/</feedburner:origLink></item>
		<item>
		<title>Data Warehousing ETL Checklist</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/i8Af8jKN1_U/</link>
		<comments>http://blog.todmeansfox.com/2009/08/27/data-warehousing-etl-checklist/#comments</comments>
		<pubDate>Thu, 27 Aug 2009 15:46:51 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Data Management]]></category>

		<category><![CDATA[Compliance]]></category>

		<category><![CDATA[Data Integration]]></category>

		<category><![CDATA[Data Profiling]]></category>

		<category><![CDATA[ETL]]></category>

		<category><![CDATA[Metadata]]></category>

		<category><![CDATA[Quality]]></category>

		<category><![CDATA[scope]]></category>

		<category><![CDATA[Security]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=872</guid>
		<description><![CDATA[IT Business Edge writer Michael Stevens recently published a document called the &#8220;Data Warehousing ETL Checklist&#8221; (link to article with document). To get the file you&#8217;ll need to register with ITBusinessEdge (if you don&#8217;t already have an account).  
The document isn&#8217;t a development checklist, but rather &#8220;a high-level checklist of the important topics&#8221;. Stevens [...]]]></description>
			<content:encoded><![CDATA[<p>IT Business Edge writer Michael Stevens recently published a document called the &#8220;Data Warehousing ETL Checklist&#8221; (<a href="http://www.itbusinessedge.com/cm/docs/DOC-1530?utm_source=itbe&#038;utm_medium=email&#038;utm_campaign=knpr0018&#038;nr=knpr0018">link to article with document</a>). To get the file you&#8217;ll need to register with ITBusinessEdge (if you don&#8217;t already have an account).  </p>
<p>The document isn&#8217;t a development checklist, but rather &#8220;a high-level checklist of the important topics&#8221;. Stevens has made a good effort to distill the various data integration topics and issues you&#8217;ll need to consider before you start your project.   </p>
<h3>Highlights and feedback</h3>
<p>Mind scope and do not &#8220;try to boil the ocean&#8221;. Good advice. But I disagree with Stevens on how to approach one aspect of the scoping issue. He suggests that &#8220;in some cases, it may be better for the business as a whole to leave certain data sources untouched&#8221;. As I wrote <a href="http://blog.todmeansfox.com/2009/08/12/scoping-data-warehouse-initiatives/">here</a>, I don&#8217;t think that scoping to the data source is a great approach. But you can decide. </p>
<p>I would also add that instead of looking at &#8220;Target System Content&#8221;, you should focus specifically on <a href="http://blog.todmeansfox.com/2009/08/12/scoping-data-warehouse-initiatives/">Business Processes</a> &#8212; and this is regardless of the type of system your integrating into (MDM, DW, OLAP, etc.).</p>
<p><a href="http://blog.todmeansfox.com/2007/12/07/etl-subsystem-1-data-profiling/">Data Profiling</a> is a very important topic, and I&#8217;m ecstatic to see it on the list. But if it were up to me, I would have made it second or third. Almost every single failed or over budget ETL project can be linked to data source woes (quality, data availability, inadequate documentation, etc.). Most of these woes can be relieved by conducting good data profiles &#8212; a process that can begin during the initial requirement gathering phase of the lifecycle. </p>
<p>The data profile will reveal which source systems are useful. There may be redundancy, data quality, and data access issues compelling you to exclude a particular source. This isn&#8217;t a scoping issue, though. Go back to the project sponsor and to the board; report your findings. You may have other issues in your organization to resolve first.</p>
<p>In addition to Data profiling, I&#8217;m very happy to see that Naming Conventions made the list. Tied in closely with naming is the concept of conformity. I&#8217;ve written about this before, so please check <a href="http://blog.todmeansfox.com/2008/03/17/etl-subsystem-8-data-conformance/">here</a>. In general, a label and its data must conform and be consistent throughout the entire system. And as Stevens points out, they must be agreed upon by the users.</p>
<p>For metadata, I also consider a third type: Process Metadata. This is technical data about all the processes (ETL, jobs, automated processes, reports, etc.). This data is then utilized directly by business users and administrators to monitor and audit the data warehouse. Is my report ready? consult the process metadata. Did that job finish? consult the process metadata.  </p>
<p>Lastly, don&#8217;t forget to think about auditing, compliance, and lineage. If you&#8217;re making decisions based on the data retrieved from your data warehouse, you&#8217;ll need ways to trace data lineage. You&#8217;ll need to know when data was loaded, what version of the ETL system and business logic was used, and even why a particular data item might have been changed in transit.</p>
<h3>Conclusion</h3>
<p>This is a good high level checklist that you can use before you begin to dig deeper. It gives you a snapshot of some of the high-level decisions you&#8217;ll need to make before you get started with your next data warehousing project. </p>
<p>Perhaps Stevens will incorporate some of my feedback into a version 2!</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=i8Af8jKN1_U:ApE-c6FQ7h4:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=i8Af8jKN1_U:ApE-c6FQ7h4:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/i8Af8jKN1_U" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/08/27/data-warehousing-etl-checklist/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/08/27/data-warehousing-etl-checklist/</feedburner:origLink></item>
		<item>
		<title>Scoping Data Warehouse Initiatives</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/Va43nC2D-zM/</link>
		<comments>http://blog.todmeansfox.com/2009/08/12/scoping-data-warehouse-initiatives/#comments</comments>
		<pubDate>Wed, 12 Aug 2009 03:53:47 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Business & IT Issues]]></category>

		<category><![CDATA[BI]]></category>

		<category><![CDATA[Data Integration]]></category>

		<category><![CDATA[DW]]></category>

		<category><![CDATA[Efficiency]]></category>

		<category><![CDATA[ETL]]></category>

		<category><![CDATA[Kimball]]></category>

		<category><![CDATA[MDM]]></category>

		<category><![CDATA[pm]]></category>

		<category><![CDATA[Productivity]]></category>

		<category><![CDATA[scope]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=879</guid>
		<description><![CDATA[Data warehousing is a complex operation. From start to finish (if there is a finish), project teams are faced with many challenges. In all phases of the lifecycle, there are opportunities for derailment. The best way to mitigate potential issues and stay on time and within budget is to carefully define and manage scope. Managing [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://blog.todmeansfox.com/wp-content/uploads/2009/08/focus.jpg" alt="focus Scoping Data Warehouse Initiatives" title="focus" width="126" height="104" class="alignleft size-full wp-image-882" />Data warehousing is a complex operation. From start to finish (if there is a finish), project teams are faced with many challenges. In all phases of the <a href="http://www.amazon.com/Data-Warehouse-Lifecycle-Toolkit-Developing/dp/0471255475">lifecycle</a>, there are opportunities for derailment. The best way to mitigate potential issues and stay on time and within budget is to carefully define and manage scope. Managing scope can be an ongoing struggle (especially if requirements are not clearly defined or justified). While this is really a PM101-type of topic, I feel there are some fine points in a DW/BI environment that are not mentioned enough.     </p>
<p>Consider the following:</p>
<h3>Programs verses projects</h3>
<p>I won&#8217;t get into a deep PM discussion here, but it is important to point out that data warehousing (or business intelligence, master data management, etc.) initiatives should be thought of as <em>programs</em> and not projects. This mindset will help in scoping.</p>
<p>A <strong>program</strong> (which might also be called a &#8220;project portfolio&#8221; in some circles) is basically just a set of related projects. With a program, the emphasis is on organizing, prioritizing, and allocating resources to the right projects. Program scope is more strategic, and answers long-term questions about what type of value the organization hopes to achieve from the initiative.</p>
<p>A <strong>project</strong>, on the other hand, is much more specific &#8212; with a set number of deliverables and goals that have a high immediate impact. The scope at the project level is therefore more tactical in nature: high impact, fast delivery. Be aware that some projects may never be given the green light (for example, if there is a low business impact or if there is a low feasibility rating because of data source or data quality complications).</p>
<p>What I find odd is that organizations still choose to tackle immense data warehousing initiatives in one or two shots, trying to deliver everything at once over a period of 18 or more months. This is the wrong approach (<a href="http://en.wikipedia.org/wiki/Six_phases_of_a_big_project">here&#8217;s why</a>). Break this large initiative into individual projects and try to deliver functionality every 6 to 8 weeks. </p>
<h3>The business process</h3>
<p>The best way to break down data warehousing programs into high-impact projects is along business process lines. A business process, as defined <a href="http://uis.georgetown.edu/departments/eets/dw/GLOSSARY0816.html#B">here</a>, is:</p>
<blockquote><p>The complete response that a business makes to an event. A business process entails the execution of a sequence of one or more process steps. It has a clearly defined deliverable or outcome. A Business Process is defined by the business event that triggers the process, the inputs and outputs, all the operational steps required to produce the output, the sequential relationship between the process steps, the business decisions that are part of the event response, and the flow of material and/or information between process steps.</p></blockquote>
<p>Some example of the above: inventory tracking, Internet sales, retail sales, marketing, tax assessment, tax collection, pitching, batting.</p>
<p>In any data warehousing environment, you can expect to have several business processes to model. Each business process you tackle will have elements touching upon different aspects of the data warehouse, including infrastructure, middleware, data modeling, ETL, business logic development, presentation elements, and so on. If you scope each project to the business process, you can deliver complete solutions in the shortest amount of time. (It should be obvious that the very first business process you implement will take the longest, as the team works out the core infrastructure. Most of this infrastructure will be reused by other business processes.)</p>
<h3>Avoid scoping to a data source</h3>
<p>Do not fall into the trap of scoping to a data source. Scoping to a data source is almost guaranteed to deliver mediocre outcomes. These projects typically include many unfinished or inadequate business processes all delivered at once some time in the distant future and long after the excitement over the initiative has subsided. </p>
<p>While it is true that only one or two data sources might exist in some organizations, it is not true that inventory, customers, sales, procurement, shipping, and other business processes need to be taken on at once. Create a single project for each business process, prioritize based on impact and feasibility, and then badabing badaboom, you deliver. Next.</p>
<p>Along the same lines, do not adjust your scope if the data source is unavailable, uncooperative, or lacking in quality. Instead, bring the fight to the data source (here is where a good, preferable C-Leveled, business sponsor can come in handy) and set things right. This is obviously a project risk, and also an organizational risk. If you are having problems extracting inventory data then maybe its time to put down your data warehousing gloves and get a new inventory system.</p>
<h3>Last thoughts</h3>
<p>Scoping the data warehouse is a difficult problem. Troubles start early on with the initial idea, it moves on through requirement gathering, and finally into the development phase of the lifecycle. There is not a lot of good advice in this area for data warehousing (if you happen to know of a good source, please send me a link or title). But I do find that if you work towards business processes, think in terms of programs and projects, and avoid the data source trap, scoping decisions will settle into the real needs of the business. </p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=Va43nC2D-zM:04cgHXtswoo:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=Va43nC2D-zM:04cgHXtswoo:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/Va43nC2D-zM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/08/12/scoping-data-warehouse-initiatives/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/08/12/scoping-data-warehouse-initiatives/</feedburner:origLink></item>
		<item>
		<title>The Three Faces of a Good ETLer</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/flDLYEjmZk4/</link>
		<comments>http://blog.todmeansfox.com/2009/07/24/faces-good-etler/#comments</comments>
		<pubDate>Fri, 24 Jul 2009 15:55:01 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Business & IT Issues]]></category>

		<category><![CDATA[BI]]></category>

		<category><![CDATA[Data Integration]]></category>

		<category><![CDATA[ETL]]></category>

		<category><![CDATA[Productivity]]></category>

		<category><![CDATA[programming]]></category>

		<category><![CDATA[Quality]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=865</guid>
		<description><![CDATA[Hiring a &#8220;data integration expert&#8221; or consultant for your next, greatest, data warehousing project? Don&#8217;t take it lightly. ETL personnel are critical to the success or failure of your project. 
The following are what I deem to be essential technology-related aspects, or faces, of a good ETL developer and/or architect (herein referred to as an [...]]]></description>
			<content:encoded><![CDATA[<p>Hiring a &#8220;<strong>data integration</strong> expert&#8221; or consultant for your next, greatest, <strong>data warehousing</strong> project? Don&#8217;t take it lightly. ETL personnel are critical to the success or failure of your project. </p>
<p>The following are what I deem to be essential technology-related aspects, or faces, of a good ETL developer and/or architect (herein referred to as an ETLer for lack of creativity). While you need to consider business and industry knowledge, personality, and experience in your team-building process, you should start by checking off the following on your interview sheet:</p>
<h3>First Face: the technologist</h3>
<p>Programming must come natural to an ETLer. Objects, logical constructs, expression construction, program flow, and the like, must be well understood. The truth is that no matter how much your vendor proclaims that their tool does it all, chances are excellent that some hand coding will be required. On top of that, ETL tools work a lot like procedural programs. Technologists are very good at putting their right foot forward, and will generally think of things to make the ETL flow perform better. They also think about logging, auditing, and exception handling; all important.</p>
<h3>Second Face: the theorist</h3>
<p>But a solid programming background is not enough. Knowledge of <strong>Data Integration</strong> theory and best practices are equally important. While I believe in and use <a href="http://blog.todmeansfox.com/2007/11/26/34-subsystems-of-etl-data-integration/">Kimball&#8217;s methodologies</a> for integrating data into a dimensional data warehouse, other methodologies exist that may be more suitable to your business and integration needs. Following a proven methodology, with slight modifications to suit your environment will get you further, faster. Having little or no theory behind what you&#8217;re doing gets you somewhere, slower. Identify your methodology, and then find someone who understands it. </p>
<h3>Third Face: the specialist</h3>
<p>Knowing the ins and outs of your ETL tool (SSIS, OWB, Datastage, Talend Open Studio, etc.) is essential. I would venture to guess that a solid programmer who has a great understanding of ETL theory will be able to get by using most tools with little learning curve. What I worry about (and you should too) are the nuances in the tooling that can stump even the best. These nuances (SSIS, my tool of *ehem* choice &#8212; sorry, I needed to clear my throat, has many of these nuances) can cost you many project hours and force rewrites if blocking issues are encountered. Tool knowledge is also essential to know when it is appropriate to forgo the tool because of I/O issues, or because hierarchical data is better handled elsewhere, or because business logic is best not bundled within a data flow. </p>
<h3>About Face</h3>
<p>While junior members of your data integration team can be one or two-faced (that came out funny), senior members and architects must have more meat on the bone. </p>
<p>I suppose this is why good ETLers are difficult to come by. The ETLer needs to have a healthy mix of programming talent, an approach discipline, and tool knowledge. Trained DBAs and software developers might have a lot to offer, as might a troop of certified tool jocks and method junkies, but to get your project in on time and within budget, don&#8217;t settle.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=flDLYEjmZk4:H3vhDtORSJw:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=flDLYEjmZk4:H3vhDtORSJw:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/flDLYEjmZk4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/07/24/faces-good-etler/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/07/24/faces-good-etler/</feedburner:origLink></item>
		<item>
		<title>10 Commandments of Data Integration</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/yIDDlHlBThM/</link>
		<comments>http://blog.todmeansfox.com/2009/06/30/10-commandments-data-integration/#comments</comments>
		<pubDate>Tue, 30 Jun 2009 14:20:52 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Data Management]]></category>

		<category><![CDATA[Auditing]]></category>

		<category><![CDATA[Cleansing]]></category>

		<category><![CDATA[Conformity]]></category>

		<category><![CDATA[Data Integration]]></category>

		<category><![CDATA[Data Profiling]]></category>

		<category><![CDATA[Quality]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=273</guid>
		<description><![CDATA[

You shall compile and document all requirements and mappings; segregate the work by business process. You may have more than one of these business processes, some of which may come before others.
Do not begin without first conducting a thorough data profile; otherwise, you will be punished for your inequities, as will the generations that come [...]]]></description>
			<content:encoded><![CDATA[<div style="padding-left:20px">
<ol>
<li style="padding-bottom:8px">You shall compile and document all requirements and mappings; segregate the work by business process. You may have more than one of these business processes, some of which may come before others.</li>
<li style="padding-bottom:8px">Do not begin without first conducting a thorough data profile; otherwise, you will be punished for your inequities, as will the generations that come after you.</li>
<li style="padding-bottom:8px">Do not think commandments one or two are in vain, lest you will become overrun by the dead line, scope creepers, and a great exodus of people from your tribe; if this happens to you, do not swear or curse, for you have been warned.</li>
<li style="padding-bottom:8px">Remember that latency and timeliness are equal in importance to non-volatility and having a traceable lineage; a staging area may lead you to this promised land. </li>
<li style="padding-bottom:8px">Honor the rules of data conformance.</li>
<li style="padding-bottom:8px">Do not kill dirty data: you shall clean them, or take them back to their sources for retribution.</li>
<li style="padding-bottom:8px">Do not commit the worst data integration transgression of all and ignore data quality, your ignorance will not be forgiven.</li>
<li style="padding-bottom:8px">Do not be shy about stealing your neighbor&#8217;s work, for his trials have led to best practices that you can make equally good use of.</li>
<li style="padding-bottom:8px">Do not rely solely on business keys; surrogates are your friend and will permit you to engage in slowly changing your dimensions.</li>
<li style="padding-bottom:8px">You shall covet a proper audit and log system; for on the day of judgment, you will need proof of your compliance.</li>
</ol>
</div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=yIDDlHlBThM:KgxZQnxUShs:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=yIDDlHlBThM:KgxZQnxUShs:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/yIDDlHlBThM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/06/30/10-commandments-data-integration/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/06/30/10-commandments-data-integration/</feedburner:origLink></item>
		<item>
		<title>Actionable Business-IT Alignment</title>
		<link>http://feedproxy.google.com/~r/TodMeansFox/~3/2ztfGLeka2A/</link>
		<comments>http://blog.todmeansfox.com/2009/06/22/actionable-business-it-alignment/#comments</comments>
		<pubDate>Mon, 22 Jun 2009 17:23:44 +0000</pubDate>
		<dc:creator>Tod McKenna</dc:creator>
		
		<category><![CDATA[Business & IT Issues]]></category>

		<category><![CDATA[alignment]]></category>

		<category><![CDATA[contribute]]></category>

		<category><![CDATA[financial]]></category>

		<category><![CDATA[governance]]></category>

		<category><![CDATA[innovation]]></category>

		<category><![CDATA[language]]></category>

		<category><![CDATA[participate]]></category>

		<category><![CDATA[understand]]></category>

		<guid isPermaLink="false">http://blog.todmeansfox.com/?p=317</guid>
		<description><![CDATA[For several years I&#8217;ve been following the Business-IT alignment movement. Alignment is necessary for fostering innovation and realizing greater profits as IT resources are used to their fullest to satisfy business goals and objectives. 
But many organizations still struggle to make the connection. 
Some suggest varying 3-, 5-, and 7-step approaches for better alignment. They [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://geekandpoke.typepad.com/geekandpoke/2009/02/businessit-alignment.html"><img alt="geek &#038; poke" src="http://geekandpoke.typepad.com/.a/6a00d8341d3df553ef011279142b0528a4-pi" title="geek &#038; poke" width="262" height="375" /></a>For several years I&#8217;ve been following the <a href="http://en.wikipedia.org/wiki/Business/IT_alignment">Business-IT alignment</a> movement. Alignment is necessary for fostering innovation and realizing greater profits as IT resources are used to their fullest to satisfy business goals and objectives. </p>
<p>But many organizations still struggle to make the connection. </p>
<p><a href="http://www.syntelinc.com/syntelligence/index.aspx?id=513">Some </a>suggest varying 3-, 5-, and 7-step approaches for better alignment. They typically consist of diving into new organizational structures, or implementing new frameworks and service models. Other sources say that the business is to blame, that IT is misunderstood and underutilized. While finally, <a href="http://blogs.zdnet.com/service-oriented/?p=1548">some</a> say it is a dead issue altogether.</p>
<p>I&#8217;m not convinced this is a dead issue, nor am I convinced that the business is to blame (especially nowadays). I&#8217;m also not convinced that 3-, 5-, and 7- step plans will magically work.</p>
<p>A few basic actions, initiated by IT, must happen first.</p>
<h3>The Actions</h3>
<p>Business has long ago recognized that better alignment with IT is essential. We can argue about how well they&#8217;ve understood IT and how it can best be utilized, but across the board, the recognition is there. IT, after many years of making the case for alignment, seems to be coming up short during this crucial period. I hope that the following actions, from an IT-perspective, can help:</p>
<ul>
<li><strong>Understand:</strong> Learn the language of the business. Plainly put, this means you need to become more financially intelligent. All businesses exist to earn a profit, and understanding how this works is the first critical step. You must understand where the estimates and assumptions are in the numbers, when you would want to depreciate or amortize, and what constitutes a capital expenditure or operating expense. Among many things, you must understand ROI, cash conversion, and how to use profitability ratios.</li>
<li><strong>Participate:</strong> Take part in strategic and other long-term planning initiatives. IT professionals must be able to see the company&#8217;s vision and turn the vision into actionable IT initiatives. If a representative from IT is not present during steering committee and other board meetings, make this a priority. You will need to convince the business that IT is capable of playing an important role in all long-term decision making. Fortunately, businesses already realize this need, but in many cases don&#8217;t feel that IT can effectively contribute (perhaps because IT doesn&#8217;t understand the business language).</li>
<li><strong>Contribute:</strong> Provide business with the ability to make fast and accurate decisions. This means pioneering smart business intelligence initiatives that can provide decision-makers with the tools and reports &#8212; distilled &#8212; that business can utilize. You need to prove your <em>flexibility</em>, <em>agility</em>, and <em>ability to understand</em> what the business really needs. Much of this is tied to understanding, and extended with participation. These initiatives also include business process improvements; tighter integration of business, meta, and master data; and managing performance.</li>
</ul>
<p>If IT can take action, better alignment can be achieved. This isn&#8217;t to say that the business can&#8217;t do more, but before the business can make IT a full partner, understanding, participation, and contributions from IT are a must.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=2ztfGLeka2A:ZeAewpjLDPA:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/TodMeansFox?a=2ztfGLeka2A:ZeAewpjLDPA:YwkR-u9nhCs"><img src="http://feeds.feedburner.com/~ff/TodMeansFox?d=YwkR-u9nhCs" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/TodMeansFox/~4/2ztfGLeka2A" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.todmeansfox.com/2009/06/22/actionable-business-it-alignment/feed/</wfw:commentRss>
		<feedburner:origLink>http://blog.todmeansfox.com/2009/06/22/actionable-business-it-alignment/</feedburner:origLink></item>
	</channel>
</rss>
