<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/rss2full.xsl" type="text/xsl" media="screen"?><?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/itemcontent.css" type="text/css" media="screen"?><!-- generator="wordpress/wordpress-mu-1.0" --><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" version="2.0">
<channel>
	<title>Comments for Storage Soup</title>
	<link>http://storage.blogs.techtarget.com</link>
	<description>A SearchStorage.com blog</description>
	<pubDate>Wed, 14 May 2008 18:51:38 +0000</pubDate>
	<generator>http://wordpress.org/?v=wordpress-mu-1.0</generator>

	<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/CommentsForTheStorageBlog" type="application/rss+xml" /><item>
		<title>Comment on VendorFights: Data Deduplication Edition by Beth Pariseau</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2958</link>
		<pubDate>Wed, 14 May 2008 18:01:41 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2958</guid>
					<description>love the inflation adjustment, Jesse. i think you're right...and it's also very dependent on the environment, as we've seen in this discussion. 

still, i've at least gotten DD to cop to a single-stream transfer rate of about 150 MBps in the past. was hoping to have some kind of genuinely comparitive number like that w/r/t avamar. to at least have specific, numerical, competing *claims* would be a start...:)

EMC'ers always act shocked when i bring up that scalability thing about avamar, but they've got to know that persistent viewpoint is out there. i've tried to dig into it in the past, and haven't yet personally spoken with any avamar users with 1000s of servers protecting hundreds of TB as was mentioned above. i've spoken with one who had an overall environment that could be described that way, but had a more limited amount of avamar in production. 

tho really, at this point, all of the above might be moot in the face of the 'rip and replace' factor when it comes to EMC's ability to use / sell Avamar's software...</description>
		<content:encoded><![CDATA[<p>love the inflation adjustment, Jesse. i think you&#8217;re right&#8230;and it&#8217;s also very dependent on the environment, as we&#8217;ve seen in this discussion. </p>
<p>still, i&#8217;ve at least gotten DD to cop to a single-stream transfer rate of about 150 MBps in the past. was hoping to have some kind of genuinely comparitive number like that w/r/t avamar. to at least have specific, numerical, competing *claims* would be a start&#8230 <img src='http://storage.blogs.techtarget.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>EMC&#8217;ers always act shocked when i bring up that scalability thing about avamar, but they&#8217;ve got to know that persistent viewpoint is out there. i&#8217;ve tried to dig into it in the past, and haven&#8217;t yet personally spoken with any avamar users with 1000s of servers protecting hundreds of TB as was mentioned above. i&#8217;ve spoken with one who had an overall environment that could be described that way, but had a more limited amount of avamar in production. </p>
<p>tho really, at this point, all of the above might be moot in the face of the &#8216;rip and replace&#8217; factor when it comes to EMC&#8217;s ability to use / sell Avamar&#8217;s software&#8230;
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Kazeon and Clearwell square off in e-discovery price prizefight by Benjamin Wright</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/kazeon-and-clearwell-square-off-in-e-discovery-price-prizefight/#comment-2957</link>
		<pubDate>Wed, 14 May 2008 16:13:03 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/kazeon-and-clearwell-square-off-in-e-discovery-price-prizefight/#comment-2957</guid>
					<description>Beth:  E-discovery is such a complex topic.  As businesses create ever-growing mountains of electronic records, lawsuits fight over the records in &lt;a href="http://hack-igations.blogspot.com/2008/05/nix-smoking-gun-e-discovery.html" rel="nofollow"&gt;e-discovery and record retention&lt;/a&gt; disputes.  Knowing that litigation is inevitable, businesses can use technology proactively to render the records potentially more benign. --Ben &lt;a href="http://hack-igations.blogspot.com/2008/05/nix-smoking-gun-e-discovery.html" rel="nofollow"&gt;http://hack-igations.blogspot.com/2008/05/nix-smoking-gun-e-discovery.html&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>Beth:  E-discovery is such a complex topic.  As businesses create ever-growing mountains of electronic records, lawsuits fight over the records in <a href="http://hack-igations.blogspot.com/2008/05/nix-smoking-gun-e-discovery.html" rel="nofollow">e-discovery and record retention</a> disputes.  Knowing that litigation is inevitable, businesses can use technology proactively to render the records potentially more benign. &#8211;Ben <a href="http://hack-igations.blogspot.com/2008/05/nix-smoking-gun-e-discovery.html" rel="nofollow">http://hack-igations.blogspot.com/2008/05/nix-smoking-gun-e-discovery.html</a>
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on VendorFights: Data Deduplication Edition by Jesse</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2956</link>
		<pubDate>Wed, 14 May 2008 14:45:44 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2956</guid>
					<description>Performance evaluations are always subjective, especially when they are being done by the company who made the product.  Testing is done in a pristine environment without much regard to the real-world, but yet the results are used to justify introduction of these products into the real world.

Any product can be set up to perform well in a lab setting that may or may not be indicitave of "Production" situations.

I always take such performance claims with a grain of salt.  I've worked in R&amp;D, specifically in Qualification and Test, and I remember times when the same test run back to back produced slightly skewed results.  Not enough to invalidate the test, but definately enough to make you wonder what else is going on in the background of the software.

I'm not a big fan of compression (or de-duplication as their calling it now).  I remember having a sales guy come in to pitch Avamar to me at one of my last jobs, and the fact that they wanted to sell me a product for tens of thousands of dollars to save a few bucks on my tape footprint was just staggering.

De-Duplication is essentially the same process used in zip/tar operations.  You're taking repetitive blocks of data within a stream and replacing it with a kind of count/pointer system.  All of it takes cycles.  If it is done on the host it takes CPU cycles, if it's done in-line via an appliance it increases latency.

Just my $2.12 (adjusted for inflation)</description>
		<content:encoded><![CDATA[<p>Performance evaluations are always subjective, especially when they are being done by the company who made the product.  Testing is done in a pristine environment without much regard to the real-world, but yet the results are used to justify introduction of these products into the real world.</p>
<p>Any product can be set up to perform well in a lab setting that may or may not be indicitave of &#8220;Production&#8221; situations.</p>
<p>I always take such performance claims with a grain of salt.  I&#8217;ve worked in R&amp;D, specifically in Qualification and Test, and I remember times when the same test run back to back produced slightly skewed results.  Not enough to invalidate the test, but definately enough to make you wonder what else is going on in the background of the software.</p>
<p>I&#8217;m not a big fan of compression (or de-duplication as their calling it now).  I remember having a sales guy come in to pitch Avamar to me at one of my last jobs, and the fact that they wanted to sell me a product for tens of thousands of dollars to save a few bucks on my tape footprint was just staggering.</p>
<p>De-Duplication is essentially the same process used in zip/tar operations.  You&#8217;re taking repetitive blocks of data within a stream and replacing it with a kind of count/pointer system.  All of it takes cycles.  If it is done on the host it takes CPU cycles, if it&#8217;s done in-line via an appliance it increases latency.</p>
<p>Just my $2.12 (adjusted for inflation)
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on EDS acquisition makes HP Avis to IBM’s Hertz by Jesse</title>
		<link>http://storage.blogs.techtarget.com/2008/05/13/eds-acquisition-makes-hp-avis-to-ibms-hertz/#comment-2955</link>
		<pubDate>Wed, 14 May 2008 14:26:45 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/13/eds-acquisition-makes-hp-avis-to-ibms-hertz/#comment-2955</guid>
					<description>That's, uh, quite the metaphor.   

HP's aquisition of EDS is a last and desparate attempt to stay relevant in the enterprise market.  They are losing Unix market-share right and left, (I can't remember the last time I connected an HP server to a SAN), and they're having to fight Dell for every x86 architecture server they can.

I think that this buyout is going to prove to be another Compaq for HP....  Expensive and difficult to maintain.  (Although the fact that they are not trying to rebrand EDS as "HP Global Services" or something similar, leads me to believe they may have learned something in the process of the Compaq fiasco.

Jesse</description>
		<content:encoded><![CDATA[<p>That&#8217;s, uh, quite the metaphor.   </p>
<p>HP&#8217;s aquisition of EDS is a last and desparate attempt to stay relevant in the enterprise market.  They are losing Unix market-share right and left, (I can&#8217;t remember the last time I connected an HP server to a SAN), and they&#8217;re having to fight Dell for every x86 architecture server they can.</p>
<p>I think that this buyout is going to prove to be another Compaq for HP&#8230;.  Expensive and difficult to maintain.  (Although the fact that they are not trying to rebrand EDS as &#8220;HP Global Services&#8221; or something similar, leads me to believe they may have learned something in the process of the Compaq fiasco.</p>
<p>Jesse
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on VendorFights: Data Deduplication Edition by Beth Pariseau</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2954</link>
		<pubDate>Wed, 14 May 2008 12:41:06 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2954</guid>
					<description>no worries on monopolizing, scott, this is a good discussion and informative for everybody, i think. still, i have to look back on the original posts that sparked this disucssion and wonder...we all seem to agree that the products are apples and oranges (and avamar's performance is dependent on so many variables), so how could either side be claiming one performs better than the other? now i'm REALLY curious as to where data domain's multiple came from in their press release...</description>
		<content:encoded><![CDATA[<p>no worries on monopolizing, scott, this is a good discussion and informative for everybody, i think. still, i have to look back on the original posts that sparked this disucssion and wonder&#8230;we all seem to agree that the products are apples and oranges (and avamar&#8217;s performance is dependent on so many variables), so how could either side be claiming one performs better than the other? now i&#8217;m REALLY curious as to where data domain&#8217;s multiple came from in their press release&#8230;
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on VendorFights: Data Deduplication Edition by Storage Dork</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2952</link>
		<pubDate>Wed, 14 May 2008 01:19:52 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2952</guid>
					<description>"first with the delay in qualification of SIR by OEMs, now with the outright lack of qualification by major OEMs"
I agree and I do hope someone from falconstor steps in here and explains this for us (or an on the ball author gives them a call ;-) ).  But, even if they're late, they do offer something different from the "host-based" and "in-band" solutions out there.  There are a lot of good reasons to implement a SIR solution.</description>
		<content:encoded><![CDATA[<p>&#8220;first with the delay in qualification of SIR by OEMs, now with the outright lack of qualification by major OEMs&#8221;<br />
I agree and I do hope someone from falconstor steps in here and explains this for us (or an on the ball author gives them a call <img src='http://storage.blogs.techtarget.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> ).  But, even if they&#8217;re late, they do offer something different from the &#8220;host-based&#8221; and &#8220;in-band&#8221; solutions out there.  There are a lot of good reasons to implement a SIR solution.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on VendorFights: Data Deduplication Edition by Stephan Sebuit</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2951</link>
		<pubDate>Wed, 14 May 2008 00:40:03 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2951</guid>
					<description>In a physical server the CPU cost is probably not a big deal given most physical hosts are grossly underutilized in terms of CPU cycles. 

However, the same statement can not be made for server virtualization environment. So even if backups complete faster, the CPU cost are "more than a traditional application would too" 

What that says to me is that I have to schedule backups in VMware environments in such as way as to incur as minimal impact as possible on the physical host. 

Dealing with 1200 VMs and 95 servers in my environment this has to be a backup scheduling nightmare although the dedup aspect is rather intriguing.

Stephan</description>
		<content:encoded><![CDATA[<p>In a physical server the CPU cost is probably not a big deal given most physical hosts are grossly underutilized in terms of CPU cycles. </p>
<p>However, the same statement can not be made for server virtualization environment. So even if backups complete faster, the CPU cost are &#8220;more than a traditional application would too&#8221; </p>
<p>What that says to me is that I have to schedule backups in VMware environments in such as way as to incur as minimal impact as possible on the physical host. </p>
<p>Dealing with 1200 VMs and 95 servers in my environment this has to be a backup scheduling nightmare although the dedup aspect is rather intriguing.</p>
<p>Stephan
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Kazeon and Clearwell square off in e-discovery price prizefight by athgeek</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/kazeon-and-clearwell-square-off-in-e-discovery-price-prizefight/#comment-2949</link>
		<pubDate>Tue, 13 May 2008 21:33:02 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/kazeon-and-clearwell-square-off-in-e-discovery-price-prizefight/#comment-2949</guid>
					<description>It's a silly argument because the two vendors do completely different things -- Kazeon is a data collection/culling product that filters terabytes down to litigation-relevant gigabytes or megabytes at high speed; Clearwell excels at searching through those relevant megabytes and finding smoking guns or exculpatory evidence...true e-discovery.  So Kazeon is the bigger end of the funnel, and Clearwell is the microscope at the smaller end of it. Simple to distinguish.  Totally different value props.  The two make a great team.</description>
		<content:encoded><![CDATA[<p>It&#8217;s a silly argument because the two vendors do completely different things &#8212; Kazeon is a data collection/culling product that filters terabytes down to litigation-relevant gigabytes or megabytes at high speed; Clearwell excels at searching through those relevant megabytes and finding smoking guns or exculpatory evidence&#8230;true e-discovery.  So Kazeon is the bigger end of the funnel, and Clearwell is the microscope at the smaller end of it. Simple to distinguish.  Totally different value props.  The two make a great team.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on VendorFights: Data Deduplication Edition by Scott W</title>
		<link>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2948</link>
		<pubDate>Tue, 13 May 2008 21:19:05 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/12/vendorfights-data-deduplication-edition/#comment-2948</guid>
					<description>Beth: Chris Mellor was talking about Avamar nodes. Nodes are component pieces of the server "grid" (hate the term under the circumstances, but it is as good as any). There is no correlation between node/grid speed and client speed.

Specific client speed is, as you can imagine, dependent on the speed and power of the client. The best way I can put it (and most accurate without a specific machine sizing to refer to) is that it is "somewhat" more intensive than a typical backup client. But, as I said, it is also usually 90%+ faster to complete the backup job.

Network latency plays a surprisingly small role. That was one of my first questions too. :) Having said that, the biggest area you might think latency enters is the checking of hashes between client and server to determine if the server already has a segment. But Avamar bundles many of these hashes into a single packet, to reduce latency. About the only other specific comment I can make is the annecdotal comment that I have run an Avamar client under a wide variety of network conditions (some hotel networks are really bad) and performance and backup time did not seem to degrade much at all irrespective of the network I was on. For a typical client, so little data is transmitted, and backups require so little time to complete, the whole issue is a non issue.

Finally, not that there are a whole bunch of options on how to deploy the Avamar server: from virtual editions under VMware to full blown multi node grids, to single nodes, all with and without replication as you choose. So there is significant flexibility.

We really are discussing apples and oranges when it comes to this and DD, more on my blog in the next little while... I don't want to monopolize your space!</description>
		<content:encoded><![CDATA[<p>Beth: Chris Mellor was talking about Avamar nodes. Nodes are component pieces of the server &#8220;grid&#8221; (hate the term under the circumstances, but it is as good as any). There is no correlation between node/grid speed and client speed.</p>
<p>Specific client speed is, as you can imagine, dependent on the speed and power of the client. The best way I can put it (and most accurate without a specific machine sizing to refer to) is that it is &#8220;somewhat&#8221; more intensive than a typical backup client. But, as I said, it is also usually 90%+ faster to complete the backup job.</p>
<p>Network latency plays a surprisingly small role. That was one of my first questions too. <img src='http://storage.blogs.techtarget.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> Having said that, the biggest area you might think latency enters is the checking of hashes between client and server to determine if the server already has a segment. But Avamar bundles many of these hashes into a single packet, to reduce latency. About the only other specific comment I can make is the annecdotal comment that I have run an Avamar client under a wide variety of network conditions (some hotel networks are really bad) and performance and backup time did not seem to degrade much at all irrespective of the network I was on. For a typical client, so little data is transmitted, and backups require so little time to complete, the whole issue is a non issue.</p>
<p>Finally, not that there are a whole bunch of options on how to deploy the Avamar server: from virtual editions under VMware to full blown multi node grids, to single nodes, all with and without replication as you choose. So there is significant flexibility.</p>
<p>We really are discussing apples and oranges when it comes to this and DD, more on my blog in the next little while&#8230; I don&#8217;t want to monopolize your space!
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on IT angst by yfeefy</title>
		<link>http://storage.blogs.techtarget.com/2008/05/07/it-angst/#comment-2947</link>
		<pubDate>Tue, 13 May 2008 21:12:40 +0000</pubDate>
		<guid>http://storage.blogs.techtarget.com/2008/05/07/it-angst/#comment-2947</guid>
					<description>The 'hefty disconnect' is because the sale is often made between the vendor's sales/marketing person and the customer's CEx: neither of these is qualified in any way to perform a business transaction involving items of a highly technical nature. 

Is it any wonder the technical folks at companies want your ( as the vendor's representative) throat?</description>
		<content:encoded><![CDATA[<p>The &#8216;hefty disconnect&#8217; is because the sale is often made between the vendor&#8217;s sales/marketing person and the customer&#8217;s CEx: neither of these is qualified in any way to perform a business transaction involving items of a highly technical nature. </p>
<p>Is it any wonder the technical folks at companies want your ( as the vendor&#8217;s representative) throat?
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
