<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
<channel>
<title>ApplyIT</title>
<link>http://applyit.typepad.com/applyit/</link>
<description>The application of information technology to create value.</description>
<language>en-US</language>
<lastBuildDate>Fri, 06 May 2011 21:44:09 -0400</lastBuildDate>
<generator>http://www.typepad.com/</generator>

<docs>http://www.rssboard.org/rss-specification</docs>
<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/Applyit" /><feedburner:info uri="applyit" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
<title>FAST VP Customer Example</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/rTd3FhirWgE/fast-vp.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2011/05/fast-vp.html</guid>
<description>Customers prefer to spend as little as they need to on infrastructure.  With the price of disk capacity dropping each year, customers would love to buy the largest/cheapest drives that they can to support their workload.  This is a description of the smart new Symmetrix VMAX FAST VP software, with an example of a customer getting the benefits of simplified storage tiering.</description>
<content:encoded><![CDATA[<p>Customers prefer to spend as little as they need to on infrastructure. One of the tricks is to understand what you really need and where you may be able to cut. With the price of disk capacity dropping each year, customers would love to buy the largest/cheapest drives that they can to support their workload. This results in near-term savings when the gear is purchased, and longer term savings from reduced power, cooling, and maintenance bills. But how large can the drives be before they cannot sustain the business workload?</p>
<h1><span style="font-size: 16pt;"><strong>The Problem</strong></span></h1>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b015432261548970c-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: left;"><img alt="ILM" class="asset  asset-image at-xid-6a0133f1749bb8970b015432261548970c" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b015432261548970c-120wi" style="margin: 0px 5px 5px 0px;" title="ILM" /></a> Storage tiering is not a new idea. Back in 2003, EMC started a big campaign to help customers save money by tiering their data. It was called ILM - Information Lifecycle Management. The idea was to help customers put the right data on the right storage at the right time, saving them money. There were only two problems: the tiering placements were manual (like this <a href="http://www.emc.com/collateral/news/02-2004-ilm-ecm-tech-and-bus-meta-practice.pdf" target="_blank" title="ILM and Content Management">process example</a>), and to place it correctly you needed to understand what the profile for the data was currently and what it would be in the future.</p>
<p>Many customers decided that the <a href="http://blogs.computerworld.com/node/3302" target="_blank" title="ILM: rest in peace">manpower was more expensive</a> than just buying additional hardware &#39;to be safe.&#39; Sure, you could buy a mix of drive types, but buying all the same drives and spreading the workload across all of them is much easier to plan for. And if you did it right, the workload would push the drives hard (to about 70% busy) about the same time you &#39;ran out&#39; of space (at 75-90% full). Of course, if the workload turns out to have much different needs, you either waste space (the drives get too busy) or waste money (you bought more performance than you need).</p>

With storage capacities growing at a 60% annual rate for many large customers, they have to do something new. They are running out of data center space. They are running out of power and cooling. They cannot afford to just estimate and hope that what they buy is right. But they also cannot afford to have their staff constantly moving data around to deal with configuration challenges.<img alt="" src="http://static.typepad.com/.shared:v20110505.01-0-ga0218a6:typepad:en_us/js/tinymce/plugins/pagebreak/img/trans.gif" />
<h1><span style="font-size: 16pt;"><strong>The Solution</strong></span></h1>
<p>In December of 2010, EMC introduced Symmetrix VMAX Fully Automated Storage Tiering for Virtual Pools (FAST VP). This new software does what ILM set out to do 7 years before - place data on the right tier of storage in a timely manner. The difference is that now all of that storage fits in the same array, and the user gets to manage the movement by setting simple policies. Details on how this works appear in Barry&#39;s blog <a href="http://thestorageanarchist.typepad.com/weblog/2011/01/3018-fast-vpworlds-smartest-storage-tiering-part-1.html" target="_self" title="FAST VP Part 1">here</a> and <a href="http://thestorageanarchist.typepad.com/weblog/2011/01/3019-fast-vp-worlds-smartest-storage-tiering-part-2.html" target="_self" title="FAST VP Part 2">here</a>, and the Enterprise Stratey Group did a nice white paper covering their testing of<a href="http://www.emc.com/collateral/analyst-reports/esg-vmax-enginuity-5875.pdf" target="_blank" title="VMAX Enginuity 5875"> FAST VP with Oracle</a>. And I have included some additional notes in a section below.</p>
<p>In general, the plan is to replace an all high speed disk (10/15K drives) configuration with a tiered solution that combines Enterprise Flash, high speed, and bulk drives. The Flash drives are quite expensive for the capacity, so they can drive costs up. However, with FAST VP, the hottest data is moved to the small Flash capacity. By using about 3% Flash capacity, most workloads can move over 50% of their disk I/O to Flash (assuming 7.5 MB movements). Not only does this drive down the I/O requirements for the remaining drives, it also means that the <strong>top 50% of the disk I/O is happening at 1 ms</strong>, decreasing the response time for the most active space used by the applications.</p>
<p>So now that half of the workload is off of the remaining drives, using larger high speed drives presents less of a risk. After all, if the workload becomes hotter than was planned, additional Flash can be added to the array to take off more of the heavy I/O areas. Managing changes in I/O density is much easier with these automated policies.</p>
<p>And there is a lot of data in most systems that is just rarely accessed. In general, about 70% of the capacity does less than 10% of the overall I/O workload (assuming 7.5 MB movements). The bulk drives hold a lot of data and use less power, so the savings in purchase and operational costs per TB are both wonderful. And EMC prices all VMAX software 50% less for bulk capacity drives, providing even more savings. Many customer workloads will see improved performance, a smaller power and floor space configuration, and lower costs (both initial and ongoing) by moving to such a tiered design.</p>
<p>To map this out, EMC has Tier Advisor, which can analyze current workloads and model them on various storage configurations. When the array arrives, Symmetrix Management Console (SMC) makes it easy to define tiers and policies, and to apply them to Virtually Provisioned devices. Once FAST VP is running, Symmetrix Performance Advisor (SPA) gathers performance details on the array, metrics on data moving between tiers, and capacity levels of each tier.</p>
<h1><span style="font-size: 16pt;"><strong>The Results</strong></span></h1>
<p>I got a note last week from one of our Technical Consultants who had helped his customer to implement a FAST VP solution. They were very pleased with the implementation and the results - Oracle was running faster, and they could see how this was going to make storage management easier AND make storage cheaper for them going forward.</p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b014e88489eef970d-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: left;"><img alt="Improved Response Times" class="asset  asset-image at-xid-6a0133f1749bb8970b014e88489eef970d" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b014e88489eef970d-320wi" style="margin: 0px 5px 5px 0px;" title="Improved Response Times" /></a> The customer decided to build two policies, one for high priority production and the other for standard priority production. Since they were just getting started on growing into the array, they decided to start fairly conservatively. They set the high priority policy at 20% Flash and 100% high performance. With this policy, nothing gets demoted to bulk drives, and up to 20% can move up to Flash if the I/O density warrants it. They set the standard priority policy at 5%/65%/40%, which will force at least 30% of the space managed by this policy onto bulk storage. It also allows up to 5% to move up to Flash if it is justified. And note that these policies share the same 3 tiers of storage on the same pools and drives. The applications were running pretty well on the Fiber Channel space they were originally assigned to. Then FAST VP was activated, and response times for reads went down nicely. The <strong>blue line </strong>here is a day before FAST VP was live, and the <strong>green line</strong> is a week later. It was clear that the Flash drives were having the desired impact. (This discussion focuses on read time since the writes all go to cache, so the performance of the drives make little difference for writes as long as they can destage in time.)</p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b015432281e5e970c-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: right;"><img alt="Customer Tier Results" class="asset  asset-image at-xid-6a0133f1749bb8970b015432281e5e970c" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b015432281e5e970c-320wi" style="margin: 0px 0px 5px 5px;" title="Customer Tier Results" /></a> The layout that resulted in the storage pools also validated the percentages they had planned. The Flash tier was holding 2.7% of the allocated capacity and driving over 50% of the workload. The ATA tier was holding 32% of the allocated capacity and getting less than 1% of the workload. And the FC tier was taking the data in the middle. Over time, as they chose to increase the percentage of capacity in the array that is ATA, and allow the policies to move more data down, the effective costs will continue to improve.</p>
<h1><span style="font-size: 16pt;"><strong>Additional Information</strong></span></h1>
<p>There are several products in the market that are talking about automated tiering within or between arrays. And the fundamental concepts of <a href="http://en.wikipedia.org/wiki/Hierarchical_storage_management" target="_self" title="HSM">Hierarchical Storage Management</a> (HSM) are nothing new. But there are several things that make FAST VP different.</p>
<h2><em><span style="font-size: 14pt;">Engineered for Smart Efficiency</span></em></h2>
<p>VMAX was designed with FAST VP in mind. EMC did extensive reseach, collecting I/O profiles from millions of workloads on thousands of arrays. Based on this research, the VP extents (allocation units) were set at 768 KB. Looking at how to isolate the hot data from the warm data from the cold data, we worked to use the smallest size without too much overhead. A general goal was to get 50% of the workload onto something near 3% of the overall capacity (which we would place on Enterprise Flash Drives - EFDs) for over 90% of the workloads we reviewed. Our research showed that managing data at the sub-1 MB level, we could hit the 50% workload with about 2.5% of the capacity. With 7.5 MB (10 VMAX VP extents), the number was 3% - a small increase, and it cut the management overhead by a factor of 10. Managed at the 40 MB level, the number jumped to around 6%. And at 1 GB, the number jumped again to around 12%. So we went with the 10 VP extents per unit we would move with FAST VP (called an extent group).</p>
<h2><em><span style="font-size: 14pt;">Fast Reactions</span></em></h2>
<p>FAST VP policies set the percentage of data that can live on each of 3 tiers. Based on the percentages in the policy and the historical activity patterns, FAST VP sets performance thresholds for promotion and demotion. These thresholds are adjusted for each device in the array every 10 minutes.</p>
<p>For each disk I/O that happens from the device, a counter is updated to note the activity. The counters decay over time, so while there is historical data, there is also a bias toward more recent activity. This activity level is continuously being checked against the thresholds, with promotions being even more heavily biased toward recent data (move hot things up quickly, move things that were hot for a while down slowly). A huge jump in activity on an area of a LUN can get that space promoted in a few minutes.</p>
<p>And FAST VP is ready to make lots of moves. If the performance profile changes justify it, a VMAX may move over 10 TBs of data each day even on the default relocation rate (priority of reacting to change). And since the change is ongoing, this presents a background load, not a huge burst to get in the way of normal operations. Since the research was done before VMAX was designed, the bandwidth to keep up with any needed moves without getting in the way of production workloads was part of the orginal specifications.</p>
<h2><em><span style="font-size: 14pt;">Simple Tier Management</span></em></h2>
<p>FAST VP supports 3 tiers of storage, and it does this without mixing the drive types into mixed pools. The virtual pools are constructed from protect space on a given technology (flash, high speed 10/15K drives, or bulk drives) and RAID type. FAST VP Tiers are built from up to 4 pools that are based on the same technology. FAST VP Policies are used to combine 2 or 3 tiers based on different technologies.</p>
<p>What makes this really different is the granularity of control this gives the storage manager over how a given device will be treated. Assume that a customer has an array with 3% of the capacity on flash, 27% on high speed drives, and 70% on bulk storage. Now they want to support 2 business needs: production needs all 3 tiers, but test does not need any flash, and does not need much high speed. With FAST VP, we can build policies of 5%/100%/100% for production and 0%/25%/100% for test. They can all share the same drives, so that when test is not running, production has the full use of every drive in the array. And no matter what changes the business may need to support changing I/O levels, a simple policy adjustment will reallocate the storage based on the best use given the historical performance data. If a few production devices need special attention, such as a desire to keep them on flash even when their performace may not warrant that, adding a new policy can make this happen rapidly.</p>
<p>Now consider what happens when tiering is built from RAID groups and combining drives into mixed pools. To allow test to have access to high speed capacity, I have to dedicate that capacity to the test pool in RAID group increments. To allow production to move data down to bulk storage, I again have to dedicate space for that in RAID group increments. And if I need to move space between different pools, that also has to be done in RAID group increments. And when customers are using 2+ TB drives, even a single RAID group can get pretty big. In the end, each of the mixed pools has to have the right physical drives in it to meet the current need, plus unused space to allow for things to move between tiers. This does not make for a pretty management picture.</p>
<h2><em><span style="font-size: 14pt;">EMC World 2011</span></em></h2>
<p>For those attending EMC World 2011 in Las Vegas next week, I will be presenting &quot;Virtual Storage: Overview and Direction&quot; on Tuesday and Wednesday at 10 AM, which includes a review of the value of FAST VP. I would welcome your attendance at the session and a chance to discuss any questions you may have.</p>
<h1><span style="font-size: 16pt;"><strong>Conclusion</strong></span></h1>
<p>VMAX FAST VP makes it simple for customers to take advantage of the performance of Flash drives in the most cost effective way - by only putting the hottest data on them. It also makes it easy to save money by pushing inactive data to cheap bulk storage drives. The VMAX has the power and the smarts to move the right data between tiers while improving production throughputs and decreasing response times. Now the trusted Symmetrix brand is a smarter choice than ever.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/rTd3FhirWgE" height="1" width="1"/>]]></content:encoded>


<category>VMAX Tech</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Fri, 06 May 2011 21:44:09 -0400</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2011/05/fast-vp.html</feedburner:origLink></item>
<item>
<title>Simplified Storage Expansion</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/uG1-4kZrukM/simplified-storage-expansion.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2011/01/simplified-storage-expansion.html</guid>
<description>VMAX customers can more easily manage both private and shared LUN access to clustered servers thanks to Auto-provisioning Groups and the cascaded initiator groups they support. And simpler storage management is always helpful.</description>
<content:encoded><![CDATA[<p>Many customers thinking of a new storage array focus on the work required to set up a new server with storage. This is an important item to research. &#0160;And some storage vendors can make it a bit simpler than others to connect the host HBA WWNs to the capacity in the array. However, I submit that that there is a much more common storage management challenge, and one that can be made a lot simpler.</p>
<p><span style="font-size: 16pt;"><strong>The Problem</strong></span></p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0148c8367579970c-pi" style="float: left;"><img alt="VMware Infrastructure" border="0" class="asset  asset-image at-xid-6a0133f1749bb8970b0148c8367579970c" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0148c8367579970c-800wi" style="margin: 0px 5px 5px 0px;" title="VMware Infrastructure" /></a> Customers are constantly adding more capacity to their systems. Application retain more data. New applications are added to servers. Upgrades need more space. It seems that every few months each server is getting some additional storage capacity. So how hard should it be? After all, the servers and storage are already connected.</p>
<p>So let&#39;s make this a little more complicated. Many customers are now rolling out VMware ESX clusters of 8 or more servers. Some of these servers have more than 2 HBA ports. So when adding a device to the cluster, there may be 30 or more HBAs that all need to get their access updated.</p>
<div>
<p><span style="font-size: 16pt;"><strong> </strong></span></p>
</div>


<p><span style="font-size: 16pt;"><strong>The Solution</strong></span></p>
<p>The EMC Symmetrix VMAX has a unique way of managing host access to LUNs. Most arrays fit into one of two storage methods:</p>
<ol>
<li>Manage LUNs to each HBA, or</li>
<li>Manage LUNs to groups of HBAs (one one or more servers)</li>
</ol>
<p>The EMC Symmetrix DMX fits into the first camp. Each LUN to HBA relationship is managed independently. For large cluster environments (ESX or otherwise), manging access to each HBA is both time consuming and risky. It takes careful processes to ensure that the new LUNs are always matched with all of the right HBAs when it is added. Are the processes always updated when an HBA is replaced or a server is added to/removed from the cluster? This was a very workable design when SANs were smaller (and so were the clusters). A better option is needed.</p>
<p>The EMC CLARiiON line has been a great example in the second camp for years. Host groups are defined, and a new host can fairly easily be added to or removed from the group. HBA changes are made on the host and reflected in the group, so that the group view stays up to date. Adding a new LUN to the group takes seconds, and is reflected across all of the hosts in the group.</p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0148c836f389970c-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: right;"><img alt="AP-CascadedDiagram" class="asset  asset-image at-xid-6a0133f1749bb8970b0148c836f389970c" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0148c836f389970c-320wi" style="margin: 0px 0px 5px 5px;" title="AP-CascadedDiagram" /></a> The EMC Symmetrix VMAX with Auto-provisioning Groups (AP) takes this one step further. AP adds the idea of cascaded initiator groups - like nested host groups. So ESX servers can boot from the SAN, and each set of HBAs will be stored in an initiator group for that server. These are paired with private LUN(s) to create a storage view. Then a cluster-wide initiator group is created which contains the initiator group of each server - not by WWN, but using the (cascaded) names of the server initiator groups. The cluster-wide LUNs are then associated with the cluster-wide initiator group to expand the storage &#39;view&#39; for each host in the cluster. In this way, it remains easy to manage both the host access to private LUNs and the cluster access to shared LUNs.</p>
<p>A nice description of all the details of such an implementation with VMware can be found in the EMC White Paper <a href="http://www.emc.com/collateral/hardware/white-papers/h6209-symmetrix-vmax-vmware-virtual-infrastructure-wp.pdf" target="_blank" title="HERE">HERE</a>.</p>
<p><span style="font-size: 16pt;"><strong>The Results</strong></span></p>
<p>VMAX customers can easily manage large amounts of storage even in complex clustering environments. For cluster hosts that need private LUNs, whether for boot information, local configuration storage, logging, etc., each server can easily have a private storage view. Adding a private LUN just means adding it to the private view for that host. And changing an HBA is a simple update to the initiator group for that host.</p>
<p>For the shared LUNs, adding capacity is just as easy. Add the LUN to the shared view, and all servers in the cluster get access on all of their defined HBAs. And since the initiator group is cascaded, updates to the HBA information for a given server automatically flow up to the cluster view.</p>
<p><span style="font-size: 21px;"><strong>Additional Information</strong></span></p>
<p>I am confident that some folks reading this will say that the answer is easy: use &#39;Thin Provisioning&#39;&#0160;(for VMAX, Virtual Provisioning)&#0160;to give the servers huge LUNs, and let them consume the space as they need it. And for some environments this works very well. However, it also presents a new risk: storage bankruptcy. That is, an application (like an Oracle database) goes to write to some of the &#39;free&#39; space it has on its drive, only to have the write fail because the array (or at least that thin pool) is out of space. Hopefully, the application can crash cleanly and be recovered when new storage is added. Maybe not. The point is, storage management under Thin Provisioning changes from something that can slow down the implementation of a project to something that needs to be monitored 24x7 or it can create catastrophic data loss.</p>
<p>The point is, some customers have a very tough time ordering &#39;just in time&#39; storage. They need a purchasing process that allows them to purchase at a later time the storage they did not buy when the applications were implemented. If the project was justified based on the initial costs, then subsequent growth costs may be left on the budget of IT, rather than being paid by the business unit that is using the application. This presents a major process and accounting challenge for some businesses, which they will need to adjust to as IT moves into a service delivery model.</p>
<p>There is also the challenge that handing out huge LUNs invites the users of those systems to use the space. After all, who would not like to have space for an extra database export, or room to rebuild a set of image files, or.... Unused disk space has a way of filling up, and unless there is a reason to be careful about what it fills up with, there can be a lot of waste.</p>
<p>Hopefully this won&#39;t be taken the wrong way. I believe that Thin Provisioning is a great thing for customers (would we make if if EMC did not see the value?). I also think that it cannot live up to the industry hype (&quot;all systems at 100% utilization with less administrative work and no risk&quot; - if your thin pools are at 100% full, there is a lot of risk...). I recommend that customers not over-subscribe their production &#39;thin&#39; space until they have run out of space twice in a non-production setting. Once that has happened, they will understand what to watch for, how serious it is, and what needs to be done to restore normal operations (even if that means deleting some devices so that others have the space to function).</p>
<p>Better storage management tools is another option. There are some applications, such as EMC Ionix ControlCenter, that can mask the complexity of storage management from the users. By creating cluster entities and allocating storage to those entities rather than the individual servers, customers are able to achieve a simplified view of the ongoing operations. The actual array views of the access, however, are still those of the native array tools (where VMAX excels).&#0160;</p>
<p><span style="font-size: 21px;"><strong>Conclusion</strong></span></p>
<p>VMAX customers can more easily manage both private and shared LUN access to clustered servers thanks to Auto-provisioning Groups and the cascaded initiator groups they support. And simpler storage management is always helpful.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/uG1-4kZrukM" height="1" width="1"/>]]></content:encoded>


<category>VMAX Tech</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Mon, 31 Jan 2011 23:59:39 -0500</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2011/01/simplified-storage-expansion.html</feedburner:origLink></item>
<item>
<title>What did I just buy?</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/v6OoMplO9rA/what-did-i-just-buy.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2010/11/what-did-i-just-buy.html</guid>
<description>Customers can benefit from documenting the business/application requirements for their systems and leaning on their vendors to meet those needs.  The vendors understand the abilities of their systems much better than most customers ever will.  And, of course, the customer needs to 'trust but verify.'  In the process, everyone will learn more about the final solution before it goes into production.</description>
<content:encoded><![CDATA[<p>When a company looks to build a major new IT system, they have a set of business goals in mind. &#0160;Mapping those goals to specific pieces of technology to acquire is not always the easiest thing to do. &#0160;This is a discussion of how EMC worked with one customer to take some of the questions out of what they were getting and increase the customer confidence in the final solution at the same time.</p>
<p><span style="font-size: 16pt;"><strong>The Problem</strong></span></p>
<p>This customer deals with continuous data feeds, both coming in and going out. &#0160;They need to have the data integrated into their overall information framework within very specific time windows. &#0160;They know how much data they need to be able to move - after all, they are in the information business. &#0160;But how does that translate into a storage solution?</p>
<p><a href="http://www.culturekitchen.com/m_loutre/blog/news_you_can_use_trust_but_verify" style="float: left;"><img alt="Trust_but_Verify_Martians" border="0" class="asset  asset-image at-xid-6a0133f1749bb8970b0147e043301d970b" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0147e043301d970b-800wi" style="margin: 0px 5px 5px 0px;" title="Trust but Verify: Martians" /></a> They mapped their performance needs to a set of hardware requirements. &#0160;They took these requirements to multiple vendors, and they got very different solutions from each. &#0160;They have enough relationship with their vendors to know that all of the solutions would eventually meet the need. &#0160;However, it was possible that some were not going to support the full (planned) production workload with only the gear being proposed. &#0160;Price was the major factor in the purchase decision between workable solutions, and the vendors were well aware of this. &#0160;Might some be planning to fix things later with upgrades, or hoping that the planned workload would never appear? <em>&#0160;(Customers have to consider this possibility, even if only causes them to do enough checking to &#39;trust but verify&#39; - thus the pictured fake news clipping)</em></p>

The real trick was how they built the hardware requirements. &#0160;After all, they were not experts in storage systems. &#0160;How might they be able to better quantify what they were looking for AND to have a way to ensure that they were getting what they were expecting?
<p>The customer originally started out by asking for a collection of specific storage components - drives, cache, etc. &#0160;The vendor responses either had all of those components, or had substitutes and explanations for why the substitutes were suitable. &#0160;The differences in the architecture of the systems that they were looking at purchasing turned this into comparing apples, oranges, and grapefruit (as a general metaphor for the problem, not a specific comparison of these products). &#0160;So how were they really supposed to compare the proposed solutions?</p>
<p>Sure, the solutions all had to have the right usable capacity. &#0160;But their application had serious performance needs. &#0160;If they got into production and then found a problem with the system, would they be asking questions about the ability of the storage to keep up with the rest of the environment? &#0160;They needed a way to ensure that the solution was mapped back to their true business requirements.</p>
<p>This was also a challenge for the vendors. &#0160;Each had their own solution which they believed was well suited to the need as it was described to them. &#0160;<em>(I assume that the competition has the same goal of a satisfied customer at the end of the day.)</em> &#0160;So how could they convince the customer that their solution was complete enough to meet the business need?</p>
<p>As a part of the discussion with the customer, we noticed that they were making a leap from the performance that they needed for their application to the storage components that they were looking to buy. &#0160;This is a necessary step, but one we were convinced they should not be handling (or at least not alone).</p>
<p><span style="font-size: 16pt;"><strong>The Solution</strong></span></p>
<p>EMC proposed that the customer let the storage vendors do the translation from the application performance needs to the solution. &#0160;Since the solutions were not going to be simple to compare at the hardware level, and the customer did not have specific hardware need (it was a business need), why not make the purchase based directly on the business need?</p>
<p>We worked with the customer to detail their application performance profile. &#0160;They not only had their older production system as a model, but they had large-scale testing of the new version well enough developed to have a really good handle on their I/O profile. &#0160;We worked to build a model.</p>
<p>Now the customer had a new set of requirements: &#0160;T iops at 8 KB that are 20% write and 95% random, L iops at 8 KB that are 60% write and 100% sequential, and W iops at 32 KB that are 15% write and 80% random. &#0160;(The model here is a rough set of requirements, not the actual requirements of the customer.) &#0160;They could then ask the vendors to propose both a solution to meet these requirements and a method to demonstrate that the requirements were met.</p>
<p><span style="font-size: 16pt;"><strong>The Results</strong></span></p>
<p>The change in the discussion was immediate. &#0160;Now the vendors had a specific performance target to meet. &#0160;EMC believed that we met the new performance requirements with the configuration that we had been proposing. &#0160;We offered guarantee language that put a performance test in as a part of the acceptance criteria, which included the use of the <a href="http://iorate.org/" target="_blank" title="iorate performance tool">iorate performance tool</a>&#0160;and detailed the performance model translated into the iorate input files.</p>
<p>This was also much simpler on the customer. &#0160;Rather than trying to reason out all of the vendor claims about how the work was done, they were able to work from their specification of what they needed the array to handle. &#0160;It was then up to the vendors to propose configurations to meet the need, and to discuss how to document that to the customer.</p>
<p><span style="font-size: 21px;"><strong>Additional Information</strong></span></p>
<p>It turns out that there was additional value in actually performing the storage performance test. &#0160;There were some host settings that were not optimized for multi-path I/O. &#0160;As a result, the array was not able to meet the performance needs during the initial test - we were waiting on one HBA to do the work of 4. &#0160;By finding this early, the customer eliminated a problem that might not have otherwise been found until production was live, when it would be both harder to diagnose and more complex to change (as changes to a live production system always are). &#0160;This gave the team confidence in what they could expect out of the array and from an individual server to the array. &#0160;When questions came up later in the pre-production testing, the team knew if the storage was getting close to their tested values - and if not, then there was probably not a problem in the storage area.</p>
<p>As a proper disclaimer, those who read through the details of <a href="http://iorate.org/" target="_blank" title="iorate">iorate</a>, or read <a href="http://applyit.typepad.com/about.html" target="_self" title="my &#39;about&#39; page">my about page</a> or <a href="http://www.linkedin.com/in/vincewestin" target="_blank" title="LinkedIn profile">LinkedIn profile</a>, will find that I am the author of the iorate performace tool discussed here. &#0160;That has limited impact (beyond being familiar with it) on the choice of this tool for general storage testing. &#0160;The tool is array independent, and was developed to help another customer do storage performance profile testing back in the late 1990s. &#0160;Since then it has become fairly widely used across the industry (I have seen IBM propose it as a tool to test their storage, among others) and around the world (I get e-mail questions from all over). &#0160;I don&#39;t ask for anyone to trust that the code is independent - feel free to read through the source code, or even compile your own version.</p>
<p>One additional note on storage performance testing is that not all arrays test the same. &#0160;For example, the IBM XIV array does not save blocks of all zeros to disk (they log it as empty tracks of data). &#0160;So a performance tool that was only writing zeros (and then reading them back) would be very skewed if testing XIV against other platforms, since the XIV would not be doing any I/O to disk. &#0160;Since the real data will be non-zero, the testing data needs to be as well (iorate uses non-zero data by default). &#0160;Similarly, the NetApp arrays with WAFL work to turn all writes into sequential writes (much easier on the back-end disk drives). &#0160;As a result, the write performance seen from a &#39;fresh&#39; NetApp file system can be dramatically different from one that is very full/fragmented (so that the array has to find smaller places to put writes, forcing the RAID read/modify/write workload onto the drives). &#0160;Customers need to understand such attributes of their arrays to be able to model the performance that will be sustained for their actual production workloads.</p>
<p>This customer has a need to provide replicated data for disaster recovery. &#0160;Their current choice for this system is to use the database tools to perform the replication. &#0160;However, they have a complex (3 site) recovery model, and made sure that they understood the storage-based options that were available should the database replication not met their needs in the future.</p>
<p><span style="font-size: 16pt;"><strong>Conclusion</strong></span></p>
<p>Customers can benefit from documenting the business/application requirements for their systems and leaning on their vendors to meet those needs. &#0160;The vendors understand the abilities of their systems much better than most customers ever will. &#0160;And, of course, the customer needs to &#39;trust but verify.&#39; &#0160;In the process, everyone will learn more about the final solution before it goes into production.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/v6OoMplO9rA" height="1" width="1"/>]]></content:encoded>


<category>Customer Success</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Tue, 30 Nov 2010 09:12:44 -0500</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2010/11/what-did-i-just-buy.html</feedburner:origLink></item>
<item>
<title>Sharing Information</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/HXwFy_nHVYE/sharing-info.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2010/10/sharing-info.html</guid>
<description>I am a geek.  As such, I have a tendency to over-engineer solutions, especially when it comes to solving personal challenges.  And while I may be able to make my ‘solutions’ work, my family sometimes finds them difficult/frustrating/impossible to live with.  This is the story of how I made data sharing work in my home.</description>
<content:encoded><![CDATA[<p>I am a geek. &#0160;As such, I have a tendency to over-engineer solutions, especially when it comes to solving personal challenges. &#0160;And while I may be able to make my ‘solutions’ work, my family sometimes finds them difficult/frustrating/impossible to live with. &#0160;This is the story of how I made data sharing work in my home.</p>
<p><strong><span style="font-size: 16px;">The Problem</span></strong></p>
<p>I have a corporate laptop. &#0160;I also have a home desktop system. &#0160;I need to be able to easily move files between them – mostly personal documents and media. &#0160;Some of the files are rather large. &#0160;Thumb drives are inconvenient. &#0160;And since I have a home network, using anything other than the network to move these files just feels wrong.</p>
<p>I also need to share things with my family. &#0160;My wife has a home desktop system and a laptop as well. &#0160;The kids each have a desktop system (hand-me-downs). &#0160;And when someone creates a document they want the others to have access to, we need a way to do it easily.</p>
<p>Then there are the friends and family that come by. &#0160;Many use the WiFi network. &#0160;Then we want to share pictures (without cutting them down to nibs or posting them to a public area like Facebook). &#0160;And again, we want it to be easy.</p>
<p>Of course, I also have tax records and other documents that, while I would like to store them centrally for easy access, I have a real need to secure. &#0160;Sorry, I don’t trust the WiFi security for much.</p>
<p><strong><span style="font-size: 16px;">
</span></strong></p>
The Solution
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f4c8f393970b-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: left;"><img alt="Ix2-Dashboard" class="asset  asset-image at-xid-6a0133f1749bb8970b0133f4c8f393970b" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f4c8f393970b-120wi" style="margin: 0px 5px 5px 0px;" title="Ix2-Dashboard" /></a> After trying and failing with a Windows security solution (more below), I decided it was time for a simple NAS solution. &#0160;After all, I helped roll out one of the first NFS networks for CAD design while I worked at Lockeed, so I was not shy about the technology. &#0160;And EMC had recently purchased Iomega, which had just released a new low-end NAS device. &#0160;So I went online and bought a <a href="http://go.iomega.com/en-us/products/network-storage-desktop/storcenter-network-storage-solution/network-hard-drive-ix2-200/#tech_specsItem_tab" target="_blank" title="2 TB ix2">2 TB ix2</a>.</p>
<p>&#0160;<strong><span style="font-size: 16px;">The Results</span></strong></p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f4c8f302970b-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: left;"><img alt="Ix2-share" class="asset  asset-image at-xid-6a0133f1749bb8970b0133f4c8f302970b" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f4c8f302970b-320wi" style="margin: 0px 5px 5px 0px;" title="Ix2-share" /></a> I have to say that the Iomega team got the idea of simple down well for this little unit. &#0160;The ix2 connected to the network with DHCP and was ready to go in minutes. &#0160;The GUI allowed me to quickly set the configuration items I wanted (mirror the 1 TB SATA drives for data protection, lock the box down to a hard IP address so that it would always be easy to find, etc.). &#0160;Within 10 minutes I was sharing data on my network.</p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b013487e8d479970c-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: left;"><img alt="Ix2-settings" class="asset  asset-image at-xid-6a0133f1749bb8970b013487e8d479970c" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b013487e8d479970c-320wi" style="margin: 0px 5px 5px 0px;" title="Ix2-settings" /></a> Now that I had a shared repository for data, I could place things like the software distributions in a central place, and easily be able to reload them on systems that were wiped or replaced. &#0160;We can easily share data, like robot and project information for our FIRST LEGO League team, between systems. &#0160;And friends and family that come to visit can push pictures and more to and from the shared space simply and easily.</p>
<p>Then we had our first Mac come to visit. &#0160;I was not sure exactly how well this was going to work. &#0160;The answer turned out to be better than expected. &#0160;The Mac was able to quickly find and attach to the ix2. &#0160;And it could access all of the same files that the PCs were sharing. &#0160;Then we created a directory from the Mac with some new files. &#0160;The PCs could not read the directory. &#0160;The default permissions on the Mac had the file system locked down. &#0160;So we went to the Mac and opened up the permissions, and bingo the PCs were able to work with the files from the Mac. &#0160;Really easy sharing, and nicely secure.&#0160;</p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f4c8f0e1970b-popup" onclick="window.open( this.href, &#39;_blank&#39;, &#39;width=640,height=480,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39; ); return false" style="float: right;"><img alt="WinShare-FAIL" class="asset  asset-image at-xid-6a0133f1749bb8970b0133f4c8f0e1970b" src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f4c8f0e1970b-320wi" style="margin: 0px 0px 5px 5px;" title="WinShare-FAIL" /></a> Finally, I was ready to place my financial data onto the device. &#0160;So I created a new share that was password protected. &#0160;The security worked great – I had easy access, and everyone else was unable to open the share at all. &#0160;The only challenge was the Windows mount code – it seems that I have to mount the secure share before mounting the others. &#0160;Windows will not mount multiple devices from the same server with different authentication credentials, so the secure device has to be mounted first. &#0160;Figuring that out from the Microsoft error message was a challenge!</p>
<p>We have all of our digital photos shared on the ix2. &#0160;And a copy of the music repository for iTunes. &#0160;I noticed that TiVo can see the ix2 as a media source, and it has offered to play music straight from the device. &#0160;More useful sharing. &#0160;For now, I am using iTunes, but that sharing may come in handy later.</p>
<p><strong><span style="font-size: 16px;">Additional Information&#0160;</span></strong></p>
<p>My first solution was to use the Microsoft tools at hand. &#0160;So I put Windows 2003 Server in a VM on ESX 3.5 and created a real home Windows security domain. &#0160;I wound up making it the DHCP server (taking over from the WiFi box) to make the hierarchy work right. &#0160;Then I upgraded to XP Pro on the desktops to get the domain logins – which meant converting the file permissions and settings for everyone on their systems. &#0160;In the end, the security was overly restrictive and the systems were all more difficult to use. &#0160;I had a home rebellion. &#0160;These are great tools for the enterprise, but they did not work for my home environment.</p>
<p>At one point I had my own small CLARiiON CX3-20 in the basement, complete with Brocade switches and a pair of old 1 Gb FC HBAs for my server. &#0160;The power draw was significant, and the HVAC was working overtime. &#0160;Since I was not trying to run an OLTP system from home, I passed the solution to someone else for their lab and went back to the drawing board.</p>
<p>As for the purchase of the ix2, there is an EMC Employee discount. &#0160;After everything it said and done, that is about $5 cheaper than the online deals you can get on the box. &#0160;I would be glad to have it even if I were not working for EMC, or if EMC did not own them. &#0160;This is a solid, simple solution to a real problem I had.</p>
<p>Once I installed the ix2 and decided that I liked it, I added an Gb switch to the home network. &#0160;It just connects the ix2 and a few high-use devices (my desktop Dell and my Mac Mini, primarily). &#0160;The fact that the ix2 and most of the new systems all come with 1 Gb Ethernet, and the new switches being under $100, it seemed too cheap to pass up.</p>
<p><strong><span style="font-size: 16px;">Conclusion</span></strong></p>
<p>Data sharing has become a part of life in this digital era. &#0160;Having an easy way to share data at home between devices and platform types without a lot of effort makes the digital life a bit easier to enjoy.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/HXwFy_nHVYE" height="1" width="1"/>]]></content:encoded>


<category>Personal Tech</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Fri, 01 Oct 2010 23:02:10 -0400</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2010/10/sharing-info.html</feedburner:origLink></item>
<item>
<title>Better Oracle Performance in a Flash</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/6yyXiT8IKkU/oracle-flash.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2010/08/oracle-flash.html</guid>
<description>Enterprise Flash Drives (EFDs) have made a big splash in the storage world over the past 2 years.  Many customers are interested in how they might apply this new technology to speed up applications.  And given a cost of roughly 20 times that of the same size 15K Fibre Channel (FC) drive, they want to be sure they can get a good return on their investment.  Finding the right targets to make that happen can be fairly simple.</description>
<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Enterprise_Flash_Drive#Enterprise_Flash_Drives" target="_blank" title="Enterprise Flash Drive definition on Wikipedia">Enterprise Flash Drives</a>&#0160;(EFDs) have made a big splash in the storage world over the <a href="http://chucksblog.emc.com/chucks_blog/2008/07/enterprise-flas.html" target="_blank" title="EFD discussion on Chuck Hollis&#39; blog">past 2 years</a>. &#0160;Many customers are interested in how they might apply this new technology to speed up applications. &#0160;And given a cost of roughly 20 times that of the same size 15K Fibre Channel (FC) drive, they want to be sure they can get a good return on their investment. &#0160;Finding the right targets to make that happen can be fairly simple.</p>
<p><strong><span style="font-size: 16px;">The Problem</span></strong></p>
<p>A retail customer purchased a set of 8 EFDs with a new array. &#0160;They had a few systems that were critical to their overall operations that they were hoping to improve. &#0160;The business would get the most value from an improvement in the <a href="http://www.oracle.com/us/products/applications/peoplesoft-enterprise/index.htm" target="_blank" title="Oracle PeopleSoft Applications">PeopleSoft</a> HR batch runs, ensuring that payroll computations were completed on time. &#0160;They had upgraded to new <a href="http://h71028.www7.hp.com/enterprise/us/en/os/hpux11i-overview.html" target="_blank" title="HP HP-UX 11i Servers">HP-UX servers</a>, and found that they were now usually waiting for disk I/O.</p>
<p>The systems had been running on a <a href="http://www.emc.com/products/family/clariion-family.htm" target="_blank" title="EMC CLARiiON Family">CLARiiON</a> <a href="http://www.emc.com/products/series/cx3-ultrascale-series.htm" target="_blank" title="EMC CLARiiON CX3 Series">CX3</a> array with dedicated groups of 15K FC drives. &#0160;They were to be moved to a new <a href="http://www.emc.com/products/family/symmetrix-family.htm" target="_blank" title="EMC Symmetrix Family">Symmetrix</a> <a href="http://www.emc.com/products/series/symmetrix-dmx-4.htm" target="_blank" title="EMC Symmetrix DMX-4 Family">DMX-4</a> which had the EFDs. &#0160;The natural discussion was what data to place on this valuable EFD space.</p>
<p><strong><span style="font-size: 16px;">
</span></strong></p>
The Solution
<p>The initial idea from the customer was that the <a href="http://www.oracle.com/index.html" target="_blank" title="Oracle Software">Oracle</a> logs would be best served by this faster space, since Oracle transactions cannot complete until the log writes are on disk. &#0160;And the logs do tend to be very busy with lots of small writes. &#0160;However, the nature of the logs is that they are all small and sequential writes. &#0160;Disk array cache eats small sequential writes for breakfast. &#0160;Not only does the cache get to reduce the number of physical disk I/Os by coalescing them, but it also gets to perform optimized RAID writes. &#0160;So while <a href="http://en.wikipedia.org/wiki/RAID#Standard_levels" target="_blank" title="Redundant Array of Independent Disks (RAID)">RAID</a> 1 protection may be desired for data availability, RAID 5 will work great for the overall performance need. &#0160;And EFD would be wasted on this workload, where the only reads are sequential reads for archiving (or during recovery).</p>
<p>Since this is an OLTP system, most of the random I/O will be directed at the index files. &#0160;It happens that this database was already separated by index and data. &#0160;The total of the index space was about 1 TB, which fit well since they had a 7+1 RAID 5 group of 200 GB EFD, which provided more than the needed 1 TB. &#0160;New devices were created, and one weekend the file systems were moved over.</p>
<p><strong><span style="font-size: 16px;">The Results</span></strong></p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0134867603d7970c-pi" onclick="window.open(this.href,&#39;_blank&#39;,&#39;scrollbars=no,resizable=yes,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39;); return false" style="float: left;"><img alt="Dual Lightning Strike - originally from http://onlygizmos.com/when-lightning-strikes/2007/05/" border="0" class="asset asset-image at-xid-6a0133f1749bb8970b0134867603d7970c " src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0134867603d7970c-320pi" style="margin-top: 0px; margin-right: 5px; margin-bottom: 5px; margin-left: 0px;" title="Dual Lightning Strike - originally from http://onlygizmos.com/when-lightning-strikes/2007/05/" /></a>As expected, the results were magical. &#0160;The weekend batch runs were not just faster, they flew. &#0160;Overall wall clock time was cut roughly in half. &#0160;The maximum host device busy went from 100% to around 25%, clearly showing that the disks were no longer the bottleneck. &#0160;The combination of being able to have more I/Os on these devices and having the response times drop was clear.</p>
<p><strong>Then Monday morning came, and the phone lines on the IT Help Desk lit up</strong>. &#0160;Everyone using the PeopleSoft system was wondering what was going on. &#0160;They system had never been this fast. &#0160;The comments were effectively, &quot;whatever you did, do that again!&quot; &#0160;That is always a nice way for the IT team to start a week.</p>
<p>Getting both the batch performance improvement and the tremendous user feedback was like lightning striking twice. &#0160;Moving the rest of the database onto the DMX-4 also went well. &#0160;Performance was slightly better than it had been on the older CLARiiON, though the difference from this array move was nothing like the change to EFD.</p>
<p><strong><span style="font-size: 16px;">Additional Information&#0160;</span></strong></p>
<p>For those not familiar with EFD, this is the high end of solid state storage. &#0160;These devices use <a href="http://en.wikipedia.org/wiki/SLC_flash#Single-level_cell" target="_blank" title="Single Layer Cell definition on Wikipedia">Single Layer Cell</a> (SLC) technology, which provides faster access, lower error rates, and a much higher numbers of erases (re-writes) before the cells die than MLC. &#0160;EMC provides these drives with a similar maintenance cost to other hardware items as a percentage of purchase price - so having them wear out in much less than 5-6 years could get really expensive for EMC. &#0160;To help this, about 20% of the raw space is reserved for spare tracks (as opposed to a couple of cylinders on the average hard drive), so that the drive continues to operate normally even after some of the cells have failed. &#0160;And EMC holds a lot less space in reserve than other vendors shipping these same drives, perhaps because of the additional experience we got testing them for 2 years before we released them.</p>
<p><strong><em><span style="font-size: 14px;">EFD Magic</span></em></strong></p>
<p>The first magic of the EFD drives is that they can do a lot more I/O – think at least 30 times more iops per drive than a 15K FC drive before the response time starts to head up because of queuing. &#0160;The additional magic is that all of these I/Os are handled in less than 1ms. &#0160;This compares to around 6-7ms for random reads on a 15K FC drive. &#0160;And the response time and throughput for EFD are about the same whether the workload is random or sequential, and fairly independent of block size. &#0160;This is definitely a big change.</p>
<p><span style="font-size: medium;"><span style="font-size: 14px; line-height: 17px;"><strong><em>Data Placement</em></strong></span></span></p>
<p>For those looking for general Oracle LUN suggestions, the index files should be placed on the fastest random read drives, since that will deliver the most value. &#0160;This is what makes index data such a great target for EFD. &#0160;A large SGA can cut the amount of physical I/O dramatically, making the performance demands on the drives much lower – the fastest I/O is the one you never do. &#0160;If a large portion of the data can be served from the SGA, it is likely that EFD drives will have limited impact and may not be worth the investment. &#0160;Data files should also be placed on fast drives, though they are less important for most OLTP systems than the index files. &#0160;Log files can be placed on almost any cached RAID type, since the cache will handle the writes. &#0160;Logs should be placed on their own host devices (LUNs), so that their I/O does not wait for any other devices. &#0160;The archive logs are fine to place on large ATA space, especially since they are only used in a sequential manner and ATA drives have good sequential performance. &#0160;Systems will generally perform better if there are at least 10 active devices per HBA port driving I/O, so that queuing does not add latency. &#0160;These recommendations are in place for this customer, and have served them well.</p>
<p><strong><em><span style="font-size: 14px;">Array Cache</span></em></strong></p>
<p>The Oracle SGA can do a fine job of caching recently used data. &#0160;Customers need the array cache to work on holding onto data that the SGA was not able to retain. &#0160;Or reading ahead to get data into cache that the database is likely to use in the near future. &#0160;And, of course, the array can cache writes in a fault&#0160;tolerant design that is not&#0160;available&#0160;in the server memory. &#0160;If there are multiple workloads sharing a single array, and there usually are these days, then the ability to partition the cache by workload may be necessary to ensure that high bandwidth applications (like backups) don&#39;t swamp out the data from more response time sensitive applications. &#0160;The Symmetrix cache is very good in these areas. &#0160;When selecting an array to use for Oracle databases, customers should consider the cache management abilities of the array.</p>
<p><strong><em><span style="font-size: 14px;">Database Logs</span></em></strong></p>
<p>One final note on Oracle logs. &#0160;They should always be placed on separate physical drives (separate RAID groups) from the data in the database. &#0160;This is a consistent recommendation from Oracle, and applies to Microsoft SQL Server and most other databases as well.</p>
<p><strong><span style="font-size: 16px;">Conclusion</span></strong></p>
<p>Enterprise Flash Drives can make a huge difference in system and application performance. &#0160;By placing the most heavily used random read data on these devices, response times will drop and the number of I/Os will increase dramatically. &#0160;This can be a huge political win for an IT organization by driving increased satisfaction for internal and external customers. &#0160;It can also drive competitive advantage for the business.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/6yyXiT8IKkU" height="1" width="1"/>]]></content:encoded>


<category>Customer Success</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Wed, 25 Aug 2010 16:41:07 -0400</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2010/08/oracle-flash.html</feedburner:origLink></item>
<item>
<title>Big Warehouse, Small Backup</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/ZXhxZd8tsC8/warehousebackup.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2010/08/warehousebackup.html</guid>
<description>This customer has a data warehouse that drives internal research and is sold to external customers. The data needs to be accurate and available at all times. Their goal was a new storage architecture to maximize availability and data protection while optimizing operational complexity and cost. We were able to...</description>
<content:encoded><![CDATA[<p>This customer has a data warehouse that drives internal research and is sold to external customers. &#0160;The data needs to be accurate and available at all times. &#0160;Their goal was a new storage architecture to maximize availability and data protection while optimizing operational complexity and cost. &#0160;We were able to help them meet those goals.</p>
<p><span style="font-size: 16px;"><strong>The Problem</strong></span></p>
<p>The starting size of the data warehouse was 25 TB, which would increase by about 10% per year. &#0160;This was for 2 years of retained data. &#0160;Some data was available to load throughout the day, while other sources would provide bulk feeds during the night. &#0160;With 2 years of retention, the change rate for the database was under 2% per week. &#0160;In general, backups every few days were OK, since there was a separate record of all data imported into the system and the ‘current’ image could be recreated from even a few day old copy within a matter of hours.</p>
<p>The business required that the data be available, and that backups be able to restore the system to a point in time up to 1 year old (there had been times of some small feeds of errant data that were corrected weeks later, so prudence dictated that extended recovery be possible). &#0160;Given the value of the operational system, there needed to be space to perform a full restore while continuing production operations. &#0160;This would also provide for space to conduct quarterly testing of the restore processes.</p>
<p>Given the value of the data, the business was interested in the option of having two full images of the data. &#0160;This would allow for quick resumption of production operations should there be an environmental or other event at the primary location. &#0160;This was, however, a desire with limited funding, and would only be included if it could be accomplished without greatly increasing the cost of the solution.</p>
<p>Looking to the future, there would be business value in being able to extend the data retention period from 2 years to 5 or even 7 years. &#0160;The more the solution is able to scale at reasonable cost and without driving up complexity, the sooner it would be cost effective to keep the data active in the warehouse for a longer period of time.</p>
<p><strong><span style="font-size: 16px;">
</span></strong></p>
The Solution
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f31a6a5b970b-pi" onclick="window.open(this.href,&#39;_blank&#39;,&#39;scrollbars=no,resizable=yes,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0&#39;); return false" style="float: left;"><img alt="SRDF and Snap for Warehouse Backup" border="0" class="asset asset-image at-xid-6a0133f1749bb8970b0133f31a6a5b970b " src="http://applyit.typepad.com/.a/6a0133f1749bb8970b0133f31a6a5b970b-320pi" style="margin-top: 0px; margin-right: 5px; margin-bottom: 5px; margin-left: 0px;" title="SRDF and Snap for Warehouse Backup" /></a>We met this challenge by focusing on snap technology. &#0160;This type of data warehouse that has large capacity and a low change rate is a perfect candidate for storage array snaps. &#0160;We delivered a pair of <a href="http://www.emc.com/products/family/symmetrix-family.htm" target="_blank">Symmetrix</a> arrays with <a href="http://www.emc.com/products/detail/software/timefinder.htm" target="_blank">TimeFinder</a>/Snap and <a href="http://www.emc.com/products/detail/software/srdf.htm" target="_blank">SRDF</a>, each with 35 TB of usable capacity (protected with 7+1 RAID 5 on 15K 300G drives). &#0160;This provided room for the current capacity plus another year’s growth. &#0160;By including 5 TB of capacity for the save area (Symm SAV devices), multiple snaps were possible.</p>
<p>In order to be able to have the space to do full restores, building a clone in the same array was considered. &#0160;However, this would also mean that the array would need to be sized for performing a full restore and testing on that restore without significant impact to production.</p>
<p>Given that there was also business value in being able to have a separate image of the data, the solution included a second Symmetrix array connected with SRDF. &#0160;By placing the second image in a separate array, we were able to have a completely separate platform to conduct testing of restores or other operations on if needed. &#0160;And when that was not needed, it was a business continuity solution with minimal additional cost. &#0160;The two arrays were within the same campus, providing plenty of bandwidth at minimal latency.</p>
<p><span style="font-size: 16px;"><strong>The Results</strong></span></p>
<p>The customer chose to make snaps 3 times per week, and they could do more if they found value in that later. &#0160;The customer makes database snaps on Tuesday and Thursday evenings in the production (source) array, each kept for one week. &#0160;They make alternating Saturday snaps from the target system, each kept for 2 weeks.</p>
<p>Backups are run weekly from the Saturday snap copy. &#0160;After the backup, the Saturday snap is often mounted on a failover server for further data integrity testing, just to be sure. &#0160;Quarterly restore testing is conducted against the target array, validating the backup and restore processes. &#0160;Of course, this also invalidates the snaps on the target, as well as the business continuity options while this testing is conducted.</p>
<p>In the event of data problems, the various snaps are mounted to standby systems to validate the most recent alternate image that does not have the problem. &#0160;Then a decision can be made to either repair production, or to update the snap with all of the most recent data and a repaired copy of the damaged data feed. &#0160;If the repairs are done on the local snaps, the production database is shut down. &#0160;The disk restore process is almost instant, and the production database is restarted on the restored image (with the data copy completing in the background).</p>
<p>In the event that the repairs are done on the target array, the process is only a bit more involved. &#0160;The chosen snap copy is updated just like the local snap would be. &#0160;SRDF is then suspended, and snap is locally restored. &#0160;Once that restore is in process, the production database is shut down. &#0160;An SRDF restore is initiated with the ‘invalidate local updates’ option, which is required since both the source and target have changed. &#0160;SRDF then begins the differential resynchronization of the local data to match the desired target image. &#0160;Once this resynchronization has started, the production database is restarted on the restored image (and as with the local snap, the data copy continues in the background).</p>
<p>In the event that a tape restore of an even older copy is needed, the target array is used as the restore location, and then updates are applied just like they would be to a target snap copy. &#0160;Since SRDF is tracking changes between the source and target as a bit flag on each track, the restore will still be differential. &#0160;However, since all of the data will be read back from the backup copy (even if it has not changed), the SRDF restore will effectively do a full array copy. &#0160;The process remains the same, though with so much more data the copy will take longer to complete. &#0160;And production can still be restarted while the copy is in process.</p>
<p>Note that for major database changes, SRDF is suspended between the arrays. &#0160;In cases where there is a problem, SRDF can undo the updates at the storage level, ensuring that the change window can always be met even if something goes very wrong.</p>
<p>All of the devices are collected into device groups on the arrays. &#0160;The Symmetrix <a href="http://www.emc.com/collateral/hardware/white-papers/h2313-overview-grps-symmetrix-sol-enblr-env.pdf" target="_blank" title="Solutions Enabler paper, including GNS details">Group Name Service</a> (GNS) ensures that device group updates are kept consistent between the arrays, and are available to all hosts that may be used to control the replication operations. &#0160;The group operations make management of the various copies simple.</p>
<p><span style="font-size: 16px;"><strong>Additional Options</strong></span></p>
<p>The customer could run more snaps on the source or target. &#0160;However, most data corruption is fixed by making changes to production, not by restoring from a copy. &#0160;Since the daily change rate is so small and there are so few restores, there is no justification yet for performing the snaps more frequently. &#0160;Since it can take days to discover the details behind a bad data import, doing daily snaps but only keeping them a couple of days was thought to be much less useful.</p>
<p>The customer could add ATA space for a clone at the target site, allowing for testing of restores without disrupting the disaster recovery copy.</p>
<p>The backups are currently being done to actual tape for vaulting offsite. &#0160;A Data Domain backup target deduplication solution would both improve the speed of the backup and be able to replicate the changes to a remote location with minimal bandwidth. &#0160;This sort of low change rate database will get the most benefit from backup deduplication technology. &#0160;It should be very cost effective to keep 3 months of weekly backups on such a system.</p>
<p><span style="font-size: 16px;"><strong>Conclusion</strong></span></p>
<p>This customer was able to meet all of their business requirements without buying an excess of infrastructure. &#0160;The snaps give them quick recovery from multiple points in time with very little additional disk capacity. &#0160;All the data recovery options can be executed while production continues with the best available data. &#0160;SRDF provides a target system for backups and restore testing, and also provides a target for system recovery should the source site be compromised. &#0160;With the proper tools, large warehouses are easy to store and protect.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/ZXhxZd8tsC8" height="1" width="1"/>]]></content:encoded>


<category>Customer Success</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Mon, 16 Aug 2010 10:28:45 -0400</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2010/08/warehousebackup.html</feedburner:origLink></item>
<item>
<title>Enterprise Business Restart</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/AH8vPFLrCu0/enterprise-business-restart.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2010/08/enterprise-business-restart.html</guid>
<description>Recovering a single system after a disaster can be difficult.  Recovering hundreds is a mess.  Recovering them in a consistent fashion, so that their data dependencies are all correct, is a huge task.  And what if they run different operating systems?  There is an effective answer.</description>
<content:encoded><![CDATA[<p>Recovering a single system after a disaster can be difficult. &#0160;Recovering hundreds is a mess. &#0160;Recovering them in a consistent fashion, so that their data dependencies are all correct, is a huge task. &#0160;And what if they run different operating systems? &#0160;There is an effective answer.</p>
<p><strong><span style="font-size: 16px;">The Problem</span></strong></p>
<p>This customer has over 1,000 separate systems that are critical to ongoing operations and are located in a core data center. &#0160;The systems are a combination of Mainframe, UNIX variations (AIX, HP-UX, LINUX, etc.), and Windows. &#0160;They use SAN and NAS storage (the Mainframe CKD over FICON I include in the SAN), with over half a Petabyte of unique data and various additional copies. &#0160;They have SAP and other systems that have data flowing between different them, often with different operating systems, with no easy way to perform data isolation (picture an interaction of web services, shipping, billing, and warehouse data and you get the general idea). &#0160;The customer wanted to implement business restart, with clear business requirements.</p>
<p><strong><em><span style="font-size: 14px;">Recovery Point Objective (RPO)</span></em></strong></p>
<p>There is value in having the data during any site recovery be as current as possible. &#0160;These are not public finance systems, so the value of a minute of data was not measured in millions of dollars, but it has real corporate value. &#0160;The goal was set to lose less than 5 minutes of data.</p>
<p><span style="font-size: 14px;"><strong><em>
</em></strong></span></p>
Recovery Time Objective (RTO)
<p>These systems are critical to ongoing operation of customer-facing systems, including web sites. &#0160;In the event of a true site disaster, there was a desire to have the systems recover quickly. &#0160;Some systems are more customer-facing than others, so there were RTO objectives from 2-12 hours. &#0160;Of course, the &#39;commutative property of Disaster Recovery&#39; (any system that I depend on needs to be recovered as fast as I am so that I can use it) brought many systems down toward the 2 hour target. &#0160;Since this short time window was not much longer than the time required to restart most of these systems, the data needed to be ready to go when the restart decision was made. &#0160;Restoring from an alternate source was not going to be an option.</p>
<p><span style="font-size: 14px;"><strong><em>Recovery Geography Objective (RGO)</em></strong></span></p>
<p>These systems support customers across the US and, with some functions, internationally as well. &#0160;The recovery needed to be able to handle geographic issues such as the northeast power outage and still be available. &#0160;The plan had to deal with the loss of power or communications not just for a site, but for a 500 mile radius.</p>
<p><strong><span style="font-size: 16px;">The Solution</span></strong></p>
<p>We proposed a solution of <a href="http://www.emc.com/products/family/symmetrix-family.htm" target="_blank">Symmetrix</a> arrays with <a href="http://www.emc.com/products/detail/software/srdf.htm" target="_blank">SRDF</a>/A and Multi-Session Consistency (<a href="http://www.emc.com/collateral/hardware/technical-documentation/h4118-srdfa-srafa-multi-sessn-constncy-zos-tb.pdf" target="_blank">MSC</a>). &#0160;To handle this scale of unique data, there were 8 arrays at each site. &#0160;SRDF/A provided the data replication, connected with Cisco FCIP technology over 4 <a href="http://en.wikipedia.org/wiki/Optical_Carrier_transmission_rates#OC-12_.2F_STM-4x" target="_blank">OC-12</a> links. &#0160;SRDF moves the data from the (outsourced) source site to the target.</p>
<p><a href="http://applyit.typepad.com/.a/6a0133f1749bb8970b01348607e450970c-pi" style="float: left;"><img alt="SRDF/A MSC Delta Set Sync" border="0" class="asset asset-image at-xid-6a0133f1749bb8970b01348607e450970c selected " src="http://applyit.typepad.com/.a/6a0133f1749bb8970b01348607e450970c-320pi" style="margin-top: 0px; margin-right: 5px; margin-bottom: 5px; margin-left: 0px;" title="SRDF/A MSC Delta Set Sync" /></a>SRDF MSC has the unique ability not only to create multiple groups within a Symmetrix system, but also to define groups acrosssystems, and then take actions on a group as a whole without sacrificing consistency. &#0160;SRDF/A sends the differences between one point in time and the next (the &#39;delta set&#39;) from the source to the target with each cycle switch. &#0160;By default, the cycles operate within a given array and are independent of each other. &#0160;MSC has the ability to synchronize the cycle switches, so that multiple arrays will make the switch at the same point in time. &#0160;This creates a single image of all of the data that is consistent from a write-dependency point of view. &#0160;The delta sets are then transmitted to the remote site, and are either all applied (if they all make it completely) or all discarded. &#0160;In this way, the target site always has a valid, consistent view of the data on disk across the arrays.</p>
<p>The big magic of MSC is that we are not trying to get all of the writes to the target in order. &#0160;Given the level of I/O on these systems, the complexity of putting a precise enough time on each I/O without some common clock to reference would be massive. &#0160; The common Mainframe solution for this problem is to use the clock from MVS to provide the timestamp on the I/Os, so that they can be replayed at the target site in the right order even of there are multiple arrays involved. &#0160; In this case, there are many unrelated systems without the level of clock synchronization to make such a solution possible. &#0160;By managing the cycle switch times across the arrays, SRDF MSC eliminates this problem.</p>
<p>Servers are located at both the source and target sites. &#0160;People and processes are in place to deliver a quick restart after the declaration of a disaster at the source site.</p>
<p>The sites are over 600 miles apart, with separate power and telecommunications grids.</p>
<p><span style="font-size: 16px;"><strong>The Results</strong></span></p>
<p>The customer has a single, consistent replica of over 500 TBs of data at over 600 miles that is within 2 minutes of the state of the source at any given time. &#0160;While MVS is in control, the data also supports UNIX and Windows systems running a wide variety of applications. &#0160;With this design, recovery is greatly simplified since all of the interconnecting systems will recover with data from the same point in time.</p>
<p>All of the business objectives have been met, and there are no near-term limits on scale or other items that would impact the future usefulness of this solution. &#0160;They are free to add or change anything above the storage layer and maintain the needed business consistency.</p>
<p><span style="font-size: 16px;"><strong>Additional Observations</strong></span></p>
<p>Most large businesses have a DR plan. &#0160;However, many of these plans overlook critical items or assuming things that may not be accurate. &#0160;It is difficult for many businesses to understand the value of the last X minutes of data. &#0160;For a manufacturing concern, the value of the last hour of production and shipping, or even the last day, may not make a significant difference in the value of the business. &#0160;For an online business, the value of the last hour may be very significant. &#0160;For an investment or banking firm, losing the last hour of records may mean the end of the business. &#0160;Figuring out where each part of a business lives on the recovery value spectrum can be an arduous task, and shortcuts can lead to bad decisions.</p>
<p>This customer realized that they did not want to own two data centers. &#0160;So they found a good partner, and only have ownership of one.</p>
<p>They also considered the fact that the restart of the business after a disaster can actually be more difficult than ongoing operations. &#0160;And if things are bad enough that you have to shut down a data center, there are probably real problems in the primary data center geography. &#0160;Even if air travel or roads are available, employees may have family or other concerns that could prevent them from traveling to the alternate site. &#0160;So how do you get all that expertise to the target site, and make it OK for them to stay there for what may be months?</p>
<p>They decided that they needed to staff both sites. &#0160;They have an operational team for each. &#0160;And the big difference from what many customers have done is that they have the engineering team at the target site. &#0160;The primary data center is the outsourced one, supporting ongoing operations with a staff trained for that. &#0160;The alternate data center is where the ongoing changes are designed, and where all the expertise lives to handle any challenges that come up in the event of a site failover. &#0160;This is a very innovative solution to the problem of staffing the target site, and I believe more customers will follow this in the future.</p>
<p><span style="font-size: 16px;"><strong>Conclusion</strong></span></p>
<p>This customer has a long-standing relationship with EMC. &#0160;The companies have been doing business together for years, as EMC provides them with solutions that help them to meet their business needs. &#0160;The customer takes the time to meet with EMC and discuss their needs in detail, including trips to meet with our management and engineering staff to help us better understand where they are going with IT and what we might do to help. &#0160;By building a real partnership, we are able to work together to solve their ongoing challenges. &#0160;SRDF MSC is an example of a technology that EMC has built to help our customers meet their business needs.</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/AH8vPFLrCu0" height="1" width="1"/>]]></content:encoded>


<category>Customer Success</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Fri, 06 Aug 2010 14:06:47 -0400</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2010/08/enterprise-business-restart.html</feedburner:origLink></item>
<item>
<title>The Fastest I/O</title>
<link>http://feedproxy.google.com/~r/Applyit/~3/zAAksKOh48Y/fastest-io.html</link>
<guid isPermaLink="false">http://applyit.typepad.com/applyit/2010/07/fastest-io.html</guid>
<description>The fastest I/O is the one you never do.  By applying optimization, customers can dramatically cut the number of disk I/Os needed to run a database query.  This is the story of one customer’s results from a 4-day EMC Oracle Assessment, and how they were able to avoid over 1 billion I/Os per day on a single system.</description>
<content:encoded><![CDATA[<p>The fastest I/O is the one you never do. By applying optimization, customers can dramatically cut the number of disk I/Os needed to run a database query. This is the story of one customer’s results from a 4-day EMC Oracle Assessment, and how they were able to avoid over 1 billion I/Os per day on a single database.</p>
<p><span style="font-size: 16px;"><strong>The Problem</strong></span></p>
<p>The customer is a transportation company. They perform large nightly database updates. These updates have critical timelines for completion, so that the results are ready for use when other processes kick off in the morning. The updated data relates directly to the profitability of ongoing operations.&#0160;</p>
<p>The application had been running OK, though it used a lot of hardware (Sun E10K servers and a pair of DMX-4 arrays, duplicated for DR and linked with SRDF). Since the application was purchased from an outside vendor, the tuning options were limited. The customer had worked with the vendor on several occasions to optimize the nightly batch processing. Benchmarks were conducted. Extensive examination was done of the data model. It appeared that the database and application were about as tuned as they could be, and more hardware was going to be the only path to keep up with growth.</p>
<p><span style="font-size: 16px;"><strong>
</strong></span></p>
The Solution&#0160;
<p>EMC was engaged to perform an Oracle assessment. The proposed assessment provides a complete analysis of a single production database instance. It looks at everything from network traffic to CPU and memory usage to semaphores and buffer usage to file systems and disk I/O. This particular assessment is conducted in a 4-day format:&#0160;</p>
<ul>
<li>Day 1: Install the tools</li>
<li>Day 2: Gather the data</li>
<li>Day 3: Process the data</li>
<li>Day 4: Discuss the results</li>
</ul>
<p>When a system generates this level of I/O, the reports coming out can get rather large. There is also an index noting which areas are the highest priority to fix, and the risk level associated with each change.</p>
<p><strong><span style="font-size: 16px;">The Results</span></strong></p>
<p>There were several high priority suggestions that came out of this report, but one was critical. There were a few SQL statements that were shown to be generating a huge volume of I/O that was flagged as unnecessary.&#0160;</p>
<blockquote>
<p>APPLICATION Observation: Some SQL statements are performing an excessive amount of I/O. This can slow performance, since this might cause other cached data buffers to be quickly aged out of the SGA, thereby causing subsequent queries to perform physical I/O rather than logical I/O. This can be caused by over-indexing tables, joining tables in the wrong order, or using an inappropriate index, or if there is a missing index that Oracle could have otherwise used to reduce the total number of I/O&#39;s. This can also be caused by SQL statements which use numeric group functions (such as AVG, COUNT, MIN, MAX, and SUM). Queries which perform many Logical I/O&#39;s consume an excessive amount of CPU cycles and can cause latch contention problems, due to the serialization of latches. The highest number of I/O&#39;s per execution measured was 36,406,850. The total number of excessive I/O&#39;s measured was 2,505,090,174.</p>
</blockquote>
<p>With the additional data around the execution plan for these statements that was included in the detailed report, it was easy to identify what should be changed. As a result, the customer was able to take a clear set of directions back to the application vendor for a fairly simple adjustment.</p>
<p><span style="font-size: 16px;"><strong>Conclusion&#0160;</strong></span></p>
<p>The vendor was able to quickly integrate the changes and eliminate over 1 billion I/Os per day. The assessment cost less than the value generated by not missing the nightly batch window for a single night. And it may have eliminated the next hardware upgrade cycle, or at least allowed it to be completed with a much lower scale of system and storage.  This is an example of the results that are possible with a custom Oracle engagement from EMC. Obviously, most systems will not get this level of performance boost, as most systems never get near this level of I/O. So how many I/Os might your systems be able to avoid?</p><img src="http://feeds.feedburner.com/~r/Applyit/~4/zAAksKOh48Y" height="1" width="1"/>]]></content:encoded>


<category>Customer Success</category>

<dc:creator>Vince Westin</dc:creator>
<pubDate>Sat, 31 Jul 2010 23:57:35 -0400</pubDate>

<feedburner:origLink>http://applyit.typepad.com/applyit/2010/07/fastest-io.html</feedburner:origLink></item>

</channel>
</rss><!-- ph=1 -->

