<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>BigDataCloud.com</title>
	
	<link>http://www.bigdatacloud.com</link>
	<description>Blogs about Big Data &amp; Cloud</description>
	<lastBuildDate>Tue, 24 Jan 2012 11:18:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/BigDataCloud" /><feedburner:info uri="bigdatacloud" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>BIG TALK with Micheal Dalton of ZettaSet, Inc.</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/NHPJl1-7Y-A/</link>
		<comments>http://www.bigdatacloud.com/big-talk-with-micheal-dalton-of-zettaset-inc/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 10:06:17 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://bigdatacloud.com/?p=1150</guid>
		
			<content:encoded><![CDATA[
<p><a href="http://feedads.g.doubleclick.net/~a/w-V8Ui8xoHhPcaMp8Z5xdFmJwJ0/0/da"><img src="http://feedads.g.doubleclick.net/~a/w-V8Ui8xoHhPcaMp8Z5xdFmJwJ0/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/w-V8Ui8xoHhPcaMp8Z5xdFmJwJ0/1/da"><img src="http://feedads.g.doubleclick.net/~a/w-V8Ui8xoHhPcaMp8Z5xdFmJwJ0/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/NHPJl1-7Y-A" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/big-talk-with-micheal-dalton-of-zettaset-inc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/big-talk-with-micheal-dalton-of-zettaset-inc/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=big-talk-with-micheal-dalton-of-zettaset-inc</feedburner:origLink></item>
		<item>
		<title>BIG TALK with Micheal Dalton of ZettaSet, Inc.</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/PG9mg-RpEcY/</link>
		<comments>http://www.bigdatacloud.com/new/#comments</comments>
		<pubDate>Mon, 23 Jan 2012 13:18:11 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://bigdatacloud.com/?p=1086</guid>
		<description><![CDATA[BIG TALK with Micheal Dalton of ZettaSet, Inc.]]></description>
			<content:encoded><![CDATA[<p>BIG TALK with Micheal Dalton of ZettaSet, Inc.</p>

<p><a href="http://feedads.g.doubleclick.net/~a/kjM6-xLcOAlyoMwcWhRX4GyfGDw/0/da"><img src="http://feedads.g.doubleclick.net/~a/kjM6-xLcOAlyoMwcWhRX4GyfGDw/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/kjM6-xLcOAlyoMwcWhRX4GyfGDw/1/da"><img src="http://feedads.g.doubleclick.net/~a/kjM6-xLcOAlyoMwcWhRX4GyfGDw/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/PG9mg-RpEcY" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/new/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/new/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=new</feedburner:origLink></item>
		<item>
		<title>Top Reasons for Attending the “BigDataCloud – Today!” Event on Nov 29</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/EURmMuEyEi8/</link>
		<comments>http://www.bigdatacloud.com/top-reasons-for-attending-the-bigdatacloud-today-event-on-nov-29/#comments</comments>
		<pubDate>Wed, 23 Nov 2011 18:18:08 +0000</pubDate>
		<dc:creator>Dj Das</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=989</guid>
		<description><![CDATA[1. What Can Big Data Cloud Do For You?  Whether you’re just coming into the area, or are already a seasoned professional with big data cloud solutions, &#8220;BigDataCloud &#8211; Today!&#8221; is the place to gain unique insight into how big data problems can be solved today, from both a technical and a business perspective. This [...]]]></description>
			<content:encoded><![CDATA[<p><strong>1. What Can Big Data Cloud Do For You?  </strong><br />
Whether you’re just coming into the area, or are already a seasoned professional with big data cloud solutions, &#8220;BigDataCloud &#8211; Today!&#8221; is the place to gain unique insight into how big data problems can be solved today, from both a technical and a business perspective. This will give you a competitive edge to accelerate bringing solutions to market and gain substantial in-depth expertise.</p>
<p><strong>2. Meet with Big Data Cloud Experts</strong><br />
With over 250 individuals in attendance, all of them focused on creating and leveraging Big Data Cloud solution, the &#8220;BigDataCloud &#8211; Today!&#8221; event will be THE place to meet people and companies who share your interest in solving real-world business.</p>
<p><strong>3. Don&#8217;t be Scared</strong><br />
Expand your knowledge base by jumping into technical deep dives with the most advanced big data cloud experts.</p>
<p><strong>4. Discuss your ideas 1 to 1 with the Gurus</strong><br />
In-depth training classes are available in the early part of the event. Subjects covered range from MapReduce, Hadoop, Pig, Hive and others. The trainers are Gurus on their subjects and can provide real-life, in-depth information.</p>
<p><strong>5. Expand your Professional Network:</strong><br />
This focused gathering of Big Data Cloud users, developers, industry experts, business leaders, entrepreneurs and innovative companies in leveraging &#8220;BigDataCloud &#8211; Today!&#8221; provides a unique opportunity to expand your professional network. The event is special structured to allow you plenty of opportunity to meet like minded individuals in a casual setting over snacks and wine.<br />
<strong><br />
6. Know the Future:</strong><br />
&#8220;BigDataCloud &#8211; Today!&#8221; is focused on real world solutions, and our industry experts on the panels will look into their crystal balls and forecast where the next big opportunities will lie in the future.</p>

<p><a href="http://feedads.g.doubleclick.net/~a/VYG0Kqm29H5nHQAYB9Lr6j7lNeU/0/da"><img src="http://feedads.g.doubleclick.net/~a/VYG0Kqm29H5nHQAYB9Lr6j7lNeU/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/VYG0Kqm29H5nHQAYB9Lr6j7lNeU/1/da"><img src="http://feedads.g.doubleclick.net/~a/VYG0Kqm29H5nHQAYB9Lr6j7lNeU/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/EURmMuEyEi8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/top-reasons-for-attending-the-bigdatacloud-today-event-on-nov-29/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/top-reasons-for-attending-the-bigdatacloud-today-event-on-nov-29/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=top-reasons-for-attending-the-bigdatacloud-today-event-on-nov-29</feedburner:origLink></item>
		<item>
		<title>Big Data Analytics for Industries – Join the discussions on the 8th!</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/hhdEO9nLzCo/</link>
		<comments>http://www.bigdatacloud.com/big-data-analytics-for-industries-join-the-discussions-on-the-8th/#comments</comments>
		<pubDate>Sun, 04 Sep 2011 19:11:08 +0000</pubDate>
		<dc:creator>Dj Das</dc:creator>
				<category><![CDATA[Monthly Meetup]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[BigDataCloud]]></category>
		<category><![CDATA[BigDataCloud Meetup]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Business/Finance]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Data management]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Data warehouse]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Internet activism]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Meetup.com]]></category>
		<category><![CDATA[Online social networking]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Third eye]]></category>
		<category><![CDATA[thirdeyecloud]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=983</guid>
		<description><![CDATA[This is just amazing – the kind of growth we have seen at BigDataCloud, and it wouldn’t have been possible without the help &#38; support from all of you, BigDataCloud readers/members. Thanks to all of you for making it happen! For the meetup on this Thursday, September 8th, for the first time, we are having [...]]]></description>
			<content:encoded><![CDATA[<p>This is just amazing – the kind of growth we have seen at BigDataCloud, and it wouldn’t have been possible without the help &amp; support from all of you, BigDataCloud readers/members. Thanks to all of you for making it happen!</p>
<p>For the meetup on this <strong><a href="http://www.meetup.com/BigDataCloud/events/28156671/">Thursday, September 8th</a></strong>, for the first time, we are having an industry track. We would be looking at various Big Data solutions as practiced in sprawling industries like Healthcare, Financial &amp; Social Gaming. So, join us at these lively discussions!</p>
<p>This meetup is sponsored by <strong><a href="http://zettaset.com/">ZettaSet </a></strong>. They have recently launched a new Security Data Warehouse that enables Big Data Mining for Forensic Analysis.</p>
<p>Again, thanks to all of you for your support &amp; help. If there is anything that we could do to make your experience better or enhance the value proposition of these meetups, please feel free to let me know at any time.</p>
<p>I am looking forward to seeing all of you on the 8th at 6:00 pm at the TechMart!</p>

<p><a href="http://feedads.g.doubleclick.net/~a/bv8jtRsw8r6yWGydHlqkxImXdc8/0/da"><img src="http://feedads.g.doubleclick.net/~a/bv8jtRsw8r6yWGydHlqkxImXdc8/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/bv8jtRsw8r6yWGydHlqkxImXdc8/1/da"><img src="http://feedads.g.doubleclick.net/~a/bv8jtRsw8r6yWGydHlqkxImXdc8/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/hhdEO9nLzCo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/big-data-analytics-for-industries-join-the-discussions-on-the-8th/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/big-data-analytics-for-industries-join-the-discussions-on-the-8th/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=big-data-analytics-for-industries-join-the-discussions-on-the-8th</feedburner:origLink></item>
		<item>
		<title>Data Loader for NOSQL Databases</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/XxvC4XL03FE/</link>
		<comments>http://www.bigdatacloud.com/data-loader-for-nosql-databases/#comments</comments>
		<pubDate>Sun, 04 Sep 2011 07:05:44 +0000</pubDate>
		<dc:creator>Pranab Ghosh</dc:creator>
				<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[CouchBase]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[Apache Solr]]></category>
		<category><![CDATA[Computing]]></category>
		<category><![CDATA[Cross-platform software]]></category>
		<category><![CDATA[CSV application support]]></category>
		<category><![CDATA[Free software]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[JSON]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[MySQL AB]]></category>
		<category><![CDATA[Oracle Corporation]]></category>
		<category><![CDATA[RDBMS]]></category>
		<category><![CDATA[search purpose]]></category>
		<category><![CDATA[System software]]></category>
		<category><![CDATA[Technology/Internet]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=981</guid>
		<description><![CDATA[In one of my recent projects, I had to load product data from a CSV file into HBase and also to index it for search purpose.. I decided to separate out the loader part of the project as a stand alone tool and make available as open source. Currently, it’s hosted in github. It supports [...]]]></description>
			<content:encoded><![CDATA[<p>In one of my recent projects, I had to load product data from a CSV file into HBase and also to index it for search purpose.. I decided to separate out the loader part of the project as a stand alone tool and make available as open source. Currently, it’s hosted in github. It supports HBase. I will be adding support for Cassandra soon. I am working on Solr indexing right now.</p>
<p><strong>Introduction<br />
</strong></p>
<blockquote><p>The tool is very generic and configurable. It takes a CSV file as input and writes to HBase or Cassandra.</p></blockquote>
<p>The CSV file could have been generated from queries on Oracle or MySQL. So it could be used to migrate data from RDBMS to NOSQL databases.</p>
<blockquote><p>It also takes a JSON file, which defines the the mapping between the columns in the CSV and the NOSQL column family and column along with other metadata.</p></blockquote>
<p>Here is a quick summary of the features. The terminology I am using is based on HBase.</p>
<ul>
<li>Loads data from CSV file.</li>
<li>Mapping between CSV columns and NOSQL column family and column is provided in JSON file.</li>
<li>There is many to many association between CSV column and NOSQL column family and column.</li>
<li>The row key for NOSQL could be created by concatenating multiple CSV columns.</li>
<li>Solr indexing of data as it’s being loaded.</li>
</ul>
<p>The indexing feature is not implemented yet. I will be working on it next. A CSV column could be split into multiple parts and used to populate multiple NOSQL columns. On the flip side, multiple CSV columns could be consolidated to populate one NOSQL column.</p>
<hr />
<ul>
<li><a href="http://pkghosh.wordpress.com/2011/08/31/data-loader-for-nosql-databases/"><strong><em>Follow this posting on Mawazo&#8230;</em></strong></a></li>
<li><a href="http://www.bigdatacloud.com/tag/Mawazo/"><em>Find other postings from Mawazo&#8230;</em></a></li>
</ul>

<p><a href="http://feedads.g.doubleclick.net/~a/99eMCd55-OClR_UO0oVxuf6v_SA/0/da"><img src="http://feedads.g.doubleclick.net/~a/99eMCd55-OClR_UO0oVxuf6v_SA/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/99eMCd55-OClR_UO0oVxuf6v_SA/1/da"><img src="http://feedads.g.doubleclick.net/~a/99eMCd55-OClR_UO0oVxuf6v_SA/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/XxvC4XL03FE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/data-loader-for-nosql-databases/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/data-loader-for-nosql-databases/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=data-loader-for-nosql-databases</feedburner:origLink></item>
		<item>
		<title>Puppet, Chef Ease Transition to Cloud Computing</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/-u1bJVnC_ek/</link>
		<comments>http://www.bigdatacloud.com/puppet-chef-ease-transition-to-cloud-computing/#comments</comments>
		<pubDate>Thu, 01 Sep 2011 20:00:20 +0000</pubDate>
		<dc:creator>From BusinessWeek</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Amazon.com.dedc LLC.]]></category>
		<category><![CDATA[bloomberg]]></category>
		<category><![CDATA[BusinessWeek]]></category>
		<category><![CDATA[chief executive officer]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[cloud computing revolution]]></category>
		<category><![CDATA[Computing]]></category>
		<category><![CDATA[Concurrent computing]]></category>
		<category><![CDATA[Cycle Computing]]></category>
		<category><![CDATA[Distributed computing architecture]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Inc.]]></category>
		<category><![CDATA[Harvard University]]></category>
		<category><![CDATA[Human-computer interaction]]></category>
		<category><![CDATA[Hypertext]]></category>
		<category><![CDATA[Jason Stowe]]></category>
		<category><![CDATA[Mobile Payment]]></category>
		<category><![CDATA[Northrop Grumman]]></category>
		<category><![CDATA[NORTHROP GRUMMAN CORPORATION]]></category>
		<category><![CDATA[NYSE Euronext]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[software tools]]></category>
		<category><![CDATA[Supercomputer]]></category>
		<category><![CDATA[Technology/Internet]]></category>
		<category><![CDATA[USD]]></category>
		<category><![CDATA[Web operations]]></category>
		<category><![CDATA[World Wide Web]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=986</guid>
		<description><![CDATA[Organizations as diverse as Northrop Grumman (NOC), Harvard University, Zynga, and the New York Stock Exchange (NYX) have filled job websites with requests for talented puppeteers and master chefs. A quick dig into the job listings reveals that these positions have nothing to do with office entertainment or gourmet meals. Instead, the companies want people [...]]]></description>
			<content:encoded><![CDATA[<p>Organizations as diverse as Northrop Grumman (NOC), Harvard University, Zynga, and the New York Stock Exchange (NYX) have filled job websites with requests for talented puppeteers and master chefs. A quick dig into the job listings reveals that these positions have nothing to do with office entertainment or gourmet meals. Instead, the companies want people who have mastered Puppet or Chef, competing software tools that sit at the heart of the cloud computing revolution.</p>
<p>In essence, Puppet and Chef are levers used to control data center computers in a more automated fashion. The software has helped companies tap vast stores of computing power in new ways, accelerating research in fields such as financial modeling and genetics. “This really changes the way science gets done,” says Jason Stowe, the chief executive officer of Cycle Computing, a startup that uses Chef to configure thousands of computers at a time so that clients can perform calculations at supercomputer speeds. Before adopting Chef, doing such configurations took hours or even days. “We’re down to single-digit minutes now,” Stowe says.</p>
<p>The need for such tools originated with Google (GOOG), Amazon.com (AMZN), and their peers, who have long had to deal with the burden of managing tens or even hundreds of thousands of servers to support vast Web operations. Over the years these companies developed custom tools that can quickly turn, say, a thousand new servers into machines capable of displaying Web pages or handling sales. These programs allow the companies to run enormous, $500 million computing centers with about three dozen people at each one.</p>
<hr />
<ul>
<li><a href="http://www.businessweek.com/magazine/puppet-chef-ease-transition-to-cloud-computing-09012011.html"><strong><em>Follow this posting on BusinessWeek&#8230;</em></strong></a></li>
<li><a href="http://www.bigdatacloud.com/tag/BusinessWeek/"><em>Find other postings from BusinessWeek&#8230;</em></a></li>
</ul>

<p><a href="http://feedads.g.doubleclick.net/~a/iUTo2XRIIURsRrkAIWwIl-bkuHE/0/da"><img src="http://feedads.g.doubleclick.net/~a/iUTo2XRIIURsRrkAIWwIl-bkuHE/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/iUTo2XRIIURsRrkAIWwIl-bkuHE/1/da"><img src="http://feedads.g.doubleclick.net/~a/iUTo2XRIIURsRrkAIWwIl-bkuHE/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/-u1bJVnC_ek" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/puppet-chef-ease-transition-to-cloud-computing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/puppet-chef-ease-transition-to-cloud-computing/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=puppet-chef-ease-transition-to-cloud-computing</feedburner:origLink></item>
		<item>
		<title>Best Practices for Selecting Apache Hadoop Hardware</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/j9VHLlzW1ws/</link>
		<comments>http://www.bigdatacloud.com/best-practices-for-selecting-apache-hadoop-hardware/#comments</comments>
		<pubDate>Thu, 01 Sep 2011 06:00:18 +0000</pubDate>
		<dc:creator>From HortonWorks</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Cloud infrastructure]]></category>
		<category><![CDATA[Cluster]]></category>
		<category><![CDATA[Computing]]></category>
		<category><![CDATA[Hard disk drive]]></category>
		<category><![CDATA[HortonWorks]]></category>
		<category><![CDATA[Parallel ATA]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[Personal computer hardware]]></category>
		<category><![CDATA[quality commodity equipment]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[RAM]]></category>
		<category><![CDATA[Scott Carey]]></category>
		<category><![CDATA[Serial ATA]]></category>
		<category><![CDATA[Technology/Internet]]></category>
		<category><![CDATA[worker node hardware]]></category>
		<category><![CDATA[Yahoo! Communications Europe Ltd.]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=969</guid>
		<description><![CDATA[We get asked a lot of questions about how to select Apache Hadoop worker node hardware. During my time at Yahoo!, we bought a lot of nodes with 6*2TB SATA drives, 24GB RAM and 8 cores in a dual socket configuration. This has proven to be a pretty good configuration. This year, I’ve seen systems [...]]]></description>
			<content:encoded><![CDATA[<p>We get asked a lot of questions about how to select Apache Hadoop worker node hardware. During my time at Yahoo!, we bought a lot of nodes with 6*2TB SATA drives, 24GB RAM and 8 cores in a dual socket configuration. This has proven to be a pretty good configuration. This year, I’ve seen systems with 12*2TB SATA drives, 48GB RAM and 8 cores in a dual socket configurations. We will see a move to 3TB drives this year.</p>
<p>What configuration makes sense for any given organization is driven by such ratios as the storage-to-compute ratio of your workload and other factors that cannot be answered in a generic way. Further, the hardware industry moves quickly. In this post I’ll try to outline the principles that have generally guided Hadoop hardware configuration selections over the last six years. All of these thoughts are aimed at designing medium to large Apache Hadoop clusters. Scott Carey made a good case for smaller machines for small clusters the other day on the Apache mailing list.</p>
<p>The key for Hadoop clusters is to buy quality commodity equipment. Most Hadoop purchasers are cost conscious and as your clusters grow, their cost can be significant. When thinking about cost, one needs to think about the whole system, including network, power and the extra components included in many high-end systems. Remember that Hadoop is built to handle component failure well and to scale out on low cost gear. RAID cards, redundant power supplies and other per-component reliability features are not needed. Buy error-correcting RAM and SATA drives with good MTBF numbers. Good RAM allows you to trust the quality of your computations. Hard drives are the largest source of failures, so buy decent ones.</p>
<hr />
<ul>
<li><a href="http://www.hortonworks.com/best-practices-for-selecting-apache-hadoop-hardware/"><strong><em>Follow this posting on HortonWorks&#8230;</em></strong></a></li>
<li><a href="http://www.bigdatacloud.com/tag/HortonWorks/"><em>Find other postings from HortonWorks&#8230;</em></a></li>
</ul>

<p><a href="http://feedads.g.doubleclick.net/~a/Z63yJXDUAkaT_Q22Q1jAWq3eplQ/0/da"><img src="http://feedads.g.doubleclick.net/~a/Z63yJXDUAkaT_Q22Q1jAWq3eplQ/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/Z63yJXDUAkaT_Q22Q1jAWq3eplQ/1/da"><img src="http://feedads.g.doubleclick.net/~a/Z63yJXDUAkaT_Q22Q1jAWq3eplQ/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/j9VHLlzW1ws" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/best-practices-for-selecting-apache-hadoop-hardware/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/best-practices-for-selecting-apache-hadoop-hardware/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=best-practices-for-selecting-apache-hadoop-hardware</feedburner:origLink></item>
		<item>
		<title>IBM Taps i2 for Big Data Analytics Expertise</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/89ZQMDyyEoc/</link>
		<comments>http://www.bigdatacloud.com/ibm-taps-i2-for-big-data-analytics-expertise/#comments</comments>
		<pubDate>Wed, 31 Aug 2011 06:09:17 +0000</pubDate>
		<dc:creator>From InformationWeek</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[banking]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Business/Finance]]></category>
		<category><![CDATA[Cambridge]]></category>
		<category><![CDATA[Data analysis]]></category>
		<category><![CDATA[EWeek]]></category>
		<category><![CDATA[intelligence analytics tools]]></category>
		<category><![CDATA[International Business Machines Corporation]]></category>
		<category><![CDATA[law enforcement]]></category>
		<category><![CDATA[Law/Crime]]></category>
		<category><![CDATA[Mathematical finance]]></category>
		<category><![CDATA[retail]]></category>
		<category><![CDATA[retail banks]]></category>
		<category><![CDATA[U.S. headquarters]]></category>
		<category><![CDATA[United Kingdom]]></category>
		<category><![CDATA[United States]]></category>
		<category><![CDATA[Virginia]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=973</guid>
		<description><![CDATA[IBM has signed a definitive agreement to acquire i2, a maker of intelligence analytics tools for crime and fraud prevention. IBM has announced an agreement to acquire i2 to accelerate its business analytics initiatives and help clients in the public and private sectors address crime, fraud and security threats. Financial terms of the deal were [...]]]></description>
			<content:encoded><![CDATA[<p>IBM has signed a definitive agreement to acquire i2, a maker of intelligence analytics tools for crime and fraud prevention.</p>
<p>IBM has announced an agreement to acquire i2 to accelerate its business analytics initiatives and help clients in the public and private sectors address crime, fraud and security threats.</p>
<p>Financial terms of the deal were not disclosed.</p>
<p>i2, with more than 4,500 customers in 150 countries, is a provider of intelligence analytics for crime and fraud prevention based in Cambridge, U.K., with U.S. headquarters in McLean, Va. i2&#8242;s clients span multiple sectors globally such as banking, defense, health care, insurance, law enforcement, national security and retail. i2&#8242;s solutions are currently used by 12 of the top 20 retail banks globally and eight of the top 10 largest companies in the world.</p>
<hr />
<ul>
<li><a href="http://www.eweek.com/c/a/IT-Management/IBM-Taps-i2-for-Big-Data-Analytics-Expertise-283439/"><strong><em>Follow this posting on eWeek&#8230;</em></strong></a></li>
<li><a href="http://www.eweek.com/c/a/IT-Management/IBM-Taps-i2-for-Big-Data-Analytics-Expertise-283439/"><em>Find other postings from eWeek&#8230;</em></a></li>
</ul>

<p><a href="http://feedads.g.doubleclick.net/~a/36eR5Za5syJ02V1nc_p4uPNZHmw/0/da"><img src="http://feedads.g.doubleclick.net/~a/36eR5Za5syJ02V1nc_p4uPNZHmw/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/36eR5Za5syJ02V1nc_p4uPNZHmw/1/da"><img src="http://feedads.g.doubleclick.net/~a/36eR5Za5syJ02V1nc_p4uPNZHmw/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/89ZQMDyyEoc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/ibm-taps-i2-for-big-data-analytics-expertise/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/ibm-taps-i2-for-big-data-analytics-expertise/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=ibm-taps-i2-for-big-data-analytics-expertise</feedburner:origLink></item>
		<item>
		<title>New tools driving big data analytics, survey finds</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/iRQBUal7fxQ/</link>
		<comments>http://www.bigdatacloud.com/new-tools-driving-big-data-analytics-survey-finds/#comments</comments>
		<pubDate>Thu, 25 Aug 2011 06:46:10 +0000</pubDate>
		<dc:creator>From InformationWeek</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[analyst and author]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[analytics tools]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[Business/Finance]]></category>
		<category><![CDATA[Clickstream]]></category>
		<category><![CDATA[Data analysis]]></category>
		<category><![CDATA[Data management]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Data warehouse]]></category>
		<category><![CDATA[Data Warehousing Institute]]></category>
		<category><![CDATA[data warehousing technologies]]></category>
		<category><![CDATA[Information technology management]]></category>
		<category><![CDATA[Intelligence]]></category>
		<category><![CDATA[Philip Russom]]></category>
		<category><![CDATA[Predictive analytics]]></category>
		<category><![CDATA[social media data]]></category>
		<category><![CDATA[social media marketing capabilities]]></category>
		<category><![CDATA[Technology/Internet]]></category>
		<category><![CDATA[The Data Warehousing Institute]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=976</guid>
		<description><![CDATA[New technologies are enabling companies to perform increasingly sophisticated data analytics on very large and very diverse data sets, an upcoming report from The Data Warehousing Institute (TDWI) shows. The report is based on responses from 325 IT managers, business users and consultants at small, medium and large companies. Slightly more than a third of [...]]]></description>
			<content:encoded><![CDATA[<p>New technologies are enabling companies to perform increasingly sophisticated data analytics on very large and very diverse data sets, an upcoming report from The Data Warehousing Institute (TDWI) shows.</p>
<p>The report is based on responses from 325 IT managers, business users and consultants at small, medium and large companies.</p>
<p>Slightly more than a third of the respondents said they are currently running some form of advanced analytics on big data &#8212; mostly for business intelligence, predictive analytics, data mining and statistical analysis tasks.</p>
<p>Close to 45% of those surveyed expect that big data analytics will enable more accurate business insights while 38% are looking to use the technology to better recognize sales and market opportunities better. More than 60% are hoping that big data analytics can boost their company&#8217;s social media marketing capabilities.</p>
<p>The fastest growing use case for big data analytics is advanced data visualization, according to the TDWI survey. A growing number of companies are running sophisticated analytics tools on big data sets in order to build highly complex visual representations of their data.</p>
<p>&#8220;Big data used to be a technical problem when companies were struggling to deal with the management of large volumes of data,&#8221; said Philip Russom, a TDWI analyst and author of the report. &#8220;Now, if you apply analytics to it, there is a lot that can be gained from big data, that you could not get&#8221; from traditional BI and data warehousing technologies.</p>
<p>The term &#8220;big data&#8221; refers to very large data sets, often hundreds of terabytes or petabytes in scale. Increasingly, the term is used to describe not just large volumes of structured data but also unstructured data such as weblogs, clickstream data, machine and sensor data and social media data.</p>
<hr />
<ul>
<li><a href="http://www.computerworld.com/s/article/9219487/New_tools_driving_big_data_analytics_survey_finds?taxonomyId=18"><strong><em>Follow this posting on ComputerWorld&#8230;</em></strong></a></li>
<li><a href="http://www.bigdatacloud.com/tag/ComputerWorld/"><em>Find other postings from ComputerWorld&#8230;</em></a></li>
</ul>

<p><a href="http://feedads.g.doubleclick.net/~a/s9Urm60o2__D-v0bUeppP4ZGiYc/0/da"><img src="http://feedads.g.doubleclick.net/~a/s9Urm60o2__D-v0bUeppP4ZGiYc/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/s9Urm60o2__D-v0bUeppP4ZGiYc/1/da"><img src="http://feedads.g.doubleclick.net/~a/s9Urm60o2__D-v0bUeppP4ZGiYc/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/iRQBUal7fxQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/new-tools-driving-big-data-analytics-survey-finds/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/new-tools-driving-big-data-analytics-survey-finds/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=new-tools-driving-big-data-analytics-survey-finds</feedburner:origLink></item>
		<item>
		<title>O’Reilly Strata – BigDataCloud readers get 30% off!</title>
		<link>http://feedproxy.google.com/~r/BigDataCloud/~3/QrGsJe-kPGM/</link>
		<comments>http://www.bigdatacloud.com/oreilly-strata-bigdatacloud-readers-get-30-off/#comments</comments>
		<pubDate>Thu, 25 Aug 2011 05:52:27 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Business/Finance]]></category>
		<category><![CDATA[Marriott International]]></category>
		<category><![CDATA[New York City]]></category>
		<category><![CDATA[New York Marriott Marquis]]></category>
		<category><![CDATA[technology pioneers]]></category>
		<category><![CDATA[Times Square]]></category>

		<guid isPermaLink="false">http://www.bigdatacloud.com/?p=965</guid>
		<description><![CDATA[O’Reilly Strata New York 2011 is part of a week-long series of data-driven events in New York this September 19-23, which includes: Strata Jumpstart, 9/19, New York Marriott Marquis &#8211; A daylong crash course for managers, strategists, and entrepreneurs on how to manage the data deluge that&#8217;s transforming traditional business practices. Strata Summit, 9/20-9/21, New [...]]]></description>
			<content:encoded><![CDATA[<p>O’Reilly Strata New York 2011 is part of a week-long series of data-driven events in New York this September 19-23, which includes:</p>
<ul>
<li><strong>Strata Jumpstart, 9/19, New York Marriott Marquis</strong> &#8211; A daylong crash course for managers, strategists, and entrepreneurs on how to manage the data deluge that&#8217;s transforming traditional business practices.
</li>
<li><strong>Strata Summit, 9/20-9/21, New York Marriott Marquis</strong> &#8211; Two days on the essential high-level strategies for thriving in &#8220;the harsh light of data,&#8221; delivered by the battle-tested business and technology pioneers who are leading the way.</li>
<li><strong>Strata Conference, 9/22-9/23, New York Hilton</strong> &#8211; Two days of the nuts-and-bolts needed for building a data-driven business—the latest on skills, tools, and technologies you need to make data work.
</li>
</ul>
<p><a href="http://strataconf.com/public/content/home?cmp=mp-conf-st11-bigdatacloud-event-listing">Register</a> for a Super Pass, which gives you access to the whole week of conference and evening events, at a reduced rate. </p>
<p><strong>BigDataCloud readers get a 30% discount. <a href="http://strataconf.com/public/content/home?cmp=mp-conf-st11-bigdatacloud-event-listing">Use discount code DataCloud</a>.</strong></p>

<p><a href="http://feedads.g.doubleclick.net/~a/BYOajz9ltrLfyRwtckrEDQX2d_g/0/da"><img src="http://feedads.g.doubleclick.net/~a/BYOajz9ltrLfyRwtckrEDQX2d_g/0/di" border="0" ismap="true"></img></a><br/>
<a href="http://feedads.g.doubleclick.net/~a/BYOajz9ltrLfyRwtckrEDQX2d_g/1/da"><img src="http://feedads.g.doubleclick.net/~a/BYOajz9ltrLfyRwtckrEDQX2d_g/1/di" border="0" ismap="true"></img></a></p><img src="http://feeds.feedburner.com/~r/BigDataCloud/~4/QrGsJe-kPGM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.bigdatacloud.com/oreilly-strata-bigdatacloud-readers-get-30-off/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.bigdatacloud.com/oreilly-strata-bigdatacloud-readers-get-30-off/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=oreilly-strata-bigdatacloud-readers-get-30-off</feedburner:origLink></item>
	</channel>
</rss>

