<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">
    <title>Grid Gurus</title>
    
    <link rel="hub" href="http://hubbub.api.typepad.com/" />
    <link rel="alternate" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/" />
    <id>tag:typepad.com,2003:weblog-1370854</id>
    <updated>2009-05-06T09:43:25-05:00</updated>
    
    <generator uri="http://www.typepad.com/">TypePad</generator>
    <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/typepad/QALv" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><entry>
        <title>Parsing SGE Accounting File</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/qo-WmjvkorQ/parsing-sge-accounting-file.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2009/05/parsing-sge-accounting-file.html" thr:count="6" thr:updated="2009-12-04T04:12:19-06:00" />
        <id>tag:typepad.com,2003:post-58173300</id>
        <published>2009-05-06T09:43:25-05:00</published>
        <updated>2009-05-06T09:43:33-05:00</updated>
        <summary>Anyone managing an HPC cluster has probably wondered at some point about the overall performance and usage of his/her cluster. How many jobs were completed last month, what was the average job duration time, how long were they pending in...</summary>
        <author>
            <name>Rich Wellner</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><img src="http://gridgurus.typepad.com/grid_gurus//code_000000237891Small.jpg" alt="code_000000237891Small.jpg" border="0" width="282" align="right" />Anyone managing an HPC cluster has probably wondered at some point about the overall performance and usage of his/her cluster. How many jobs were completed last month? What was the average job duration? How long were jobs pending in the queue? How many CPU slots did they require? These are all good questions, with answers buried somewhere in your DRM’s accounting files.

If you are using Grid Engine, and assuming you have the usual “default cell” installation, the relevant file is $SGE_ROOT/default/common/accounting. The command that extracts information from this file is “qacct”. As “man qacct” shows, qacct produces a summary of wall-clock, CPU and system time, grouped by categories such as hostname, queue name, owner name, etc., so there is a good chance that the information you are looking for is readily available. If, however, you are looking for something that qacct does not provide, the accounting file is formatted for easy parsing. Each line in the file corresponds to one computing task, and each line holds more than 40 different accounting fields (separated by the ‘:’ character). The meaning of each field is documented in the man pages (“man accounting”), so getting the information you need with standard UNIX tools should not be difficult.
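
As a rough sketch of that kind of parsing, the shell function below uses awk to count jobs per owner and to average their wall-clock and pending times. The field positions (4 = owner, 9 = submission_time, 10 = start_time, 14 = ru_wallclock) are assumed from the SGE 6.x accounting man page and should be checked against “man accounting” on your installation. The function name summarize_accounting is made up for illustration.

```shell
#!/bin/sh
# Summarize jobs per owner from a Grid Engine accounting file.
# Assumed field positions (verify with "man accounting"):
#   4 = owner, 9 = submission_time, 10 = start_time, 14 = ru_wallclock
#   $1 = path to the accounting file
summarize_accounting() {
  awk -F: '
    /^#/ { next }                  # skip comment/header lines, if any
    {
      jobs[$4]++                   # number of tasks per owner
      wall[$4] += $14              # total wall-clock seconds per owner
      pending[$4] += $10 - $9      # seconds between submission and start
    }
    END {
      for (o in jobs)
        printf "%s jobs=%d avg_wallclock=%.1f avg_pending=%.1f\n", o, jobs[o], wall[o] / jobs[o], pending[o] / jobs[o]
    }' "$1" | sort
}
```

It could be called as “summarize_accounting $SGE_ROOT/default/common/accounting”, printing one line per owner.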
</div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2009/05/parsing-sge-accounting-file.html</feedburner:origLink></entry>
    <entry>
        <title>The Private Clouds</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/DUd569l_njE/the-private-clouds.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/12/the-private-clouds.html" thr:count="4" thr:updated="2009-11-06T23:47:29-06:00" />
        <id>tag:typepad.com,2003:post-58541110</id>
        <published>2008-12-08T07:24:00-06:00</published>
        <updated>2008-12-08T07:24:00-06:00</updated>
        <summary>Last month I was invited to give a couple of talks about Cloud computing in the wonderful C3RS (Cisco Cloud Computing Research Symposium). The slides are available online, if you want to check. Although the audiences were quite heterogeneous, there...</summary>
        <author>
            <name>Ignacio Martin Llorente</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><p><span style="font-family: helvetica; font-size: 12px; line-height: 17px;" /></p><p style="margin: 0px; padding: 0px 0px 10px;">Last month I was invited to give a couple of talks about cloud computing at the wonderful C3RS (Cisco Cloud Computing Research Symposium).  <a href="http://www.dsa-research.org/doku.php?id=people:ruben#talks_opinion_pieces" style="margin: 0px; padding: 0px;">The slides are available online</a>, if you want to check them. Although the audiences were quite heterogeneous, one question recurs among the participants of these events: <em style="margin: 0px; padding: 0px;">How can I set up my private cloud?</em> Let me briefly summarize the motivations of the people asking this:</p><ul style="margin: 10px 0px 10px 40px; padding: 0px;"><li style="margin: 0px; padding: 0px;"><strong style="margin: 0px; padding: 0px;">Lease compute capacity from the local infrastructure</strong>. These people acknowledge the benefits of virtualizing their own infrastructure as a whole. However, they are not interested in selling this capacity over the internet, or at least it is not a priority for them. That is, they do not want to become an EC2 competitor, so they do not need to expose a cloud interface to the world.</li>
<li style="margin: 0px; padding: 0px;"><strong style="margin: 0px; padding: 0px;">Capacity in the cloud</strong>. They do not want to be the new EC2 but they want to use EC2. The ability to move some services, or part of the capacity of a service, to an external provider is very attractive to them.</li>
<li style="margin: 0px; padding: 0px;"><strong style="margin: 0px; padding: 0px;">Open Source</strong>. Current cloud solutions are proprietary and closed; they need an open source solution to play with. Also, they are using some virtualization technologies that they would like to see integrated in the final solution.</li>
</ul>
<p style="margin: 0px; padding: 0px 0px 10px; text-align: center;"><a href="http://blog.dsa-research.org/wp-content/uploads/2008/11/virtualinf.jpg" style="margin: 0px; padding: 0px; color: #6b7d8f; font-weight: bold;"><img alt="" class="size-medium wp-image-105 aligncenter " height="214" src="http://blog.dsa-research.org/wp-content/uploads/2008/11/virtualinf-300x214.jpg" style="border-width: 0px; margin: 0px; padding: 0px; vertical-align: middle;" width="300" /></a></p><p style="margin: 0px; padding: 0px 0px 10px;">I told these people to take a look at <a href="http://www.opennebula.org/" style="margin: 0px; padding: 0px;">OpenNebula</a>. OpenNebula is a distributed virtual machine manager that allows you to virtualize your infrastructure. It also features integrated management of your virtual services, including networking and image management. Additionally, it ships with EC2 plug-ins that allow you to <em style="margin: 0px; padding: 0px;">simultaneously deploy</em> virtual machines in your local infrastructure and in Amazon EC2.</p><p style="margin: 0px; padding: 0px 0px 10px;">OpenNebula is modular by design to allow its integration with any other tool, like the <a href="http://haizea.cs.uchicago.edu/" style="margin: 0px; padding: 0px; color: #6b7d8f; font-weight: bold;">Haizea lease manager</a>, or <a href="http://workspace.globus.org/" style="margin: 0px; padding: 0px; color: #6b7d8f; font-weight: bold;">Nimbus</a>, which gives you an EC2-compatible interface in case you need one.
It is healthy open source software that is being improved in several projects like <a href="http://blog.dsa-research.org/www.reservoir-fp7.eu/" style="margin: 0px; padding: 0px; color: #6b7d8f; font-weight: bold;">RESERVOIR</a>, and it has a <a href="http://www.opennebula.org/doku.php?id=ecosystem" style="margin: 0px; padding: 0px; color: #6b7d8f; font-weight: bold;">growing community</a>.<cite style="margin: 0px; padding: 0px;" /></p><p style="margin: 0px; padding: 0px 0px 10px;"><a href="http://opennebula.org/cgi-bin/opennebula/download.cgi?one-1.0" style="margin: 0px; padding: 0px;">Go here if you want to set up your private cloud!</a></p><p style="margin: 0px; padding: 0px 0px 10px;"><a href="http://www.dsa-research.org/doku.php?id=people:ruben" style="margin: 0px; padding: 0px; color: #6b7d8f; font-weight: bold;">Ruben S. Montero</a></p><p style="margin: 0px; padding: 0px 0px 10px;"><span style="color: #eeeeee; font-size: 13px; line-height: 19px;"><a href="http://blog.dsa-research.org/?p=98" style="text-decoration: underline; color: #eeeeee;" /><a href="http://blog.dsa-research.org/?p=104">Reprinted from blog.dsa-research.org</a><a href="http://dsa-research.org/doku.php?id=people:ruben" mce_href="http://dsa-research.org/doku.php?id=people:ruben" style="text-decoration: underline; color: #eeeeee;"><span style="text-decoration: underline; color: #eeeeee; font-family: 'Trebuchet MS';"> </span></a></span></p><p /><xhtml:img xmlns:xhtml="http://www.w3.org/1999/xhtml" src="http://feeds.feedburner.com/~r/typepad/QALv/~4/DUd569l_njE" height="1" width="1" /></div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/12/the-private-clouds.html</feedburner:origLink></entry>
    <entry>
        <title>Managing Resource Quotas in Grid Engine</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/eO-j_hJJpXM/managing-resource-quotas-in-grid-engine.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/12/managing-resource-quotas-in-grid-engine.html" thr:count="2" thr:updated="2009-10-28T07:42:34-05:00" />
        <id>tag:typepad.com,2003:post-58534050</id>
        <published>2008-12-02T10:36:24-06:00</published>
        <updated>2008-12-02T10:36:24-06:00</updated>
        <summary>It is often the case that cluster administrators must impose limits on using certain resources. Good example here would be preventing a particular user (or a set of users), from utilizing entire queue (or cluster) at any point. If you’ve...</summary>
        <author>
            <name>Sinisa Veseli</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><img src="http://gridgurus.typepad.com/grid_gurus//code_000000237891Small.jpg" alt="code_000000237891Small.jpg" border="0" width="282" align="right" />It is often the case that cluster administrators must impose limits on the use of certain resources. A good example would be preventing a particular user (or a set of users) from utilizing an entire queue (or cluster) at any point. If you’ve ever tried doing something like that for Grid Engine (SGE), then you know that it is not immediately obvious how to impose limits on resource usage.

SGE has a concept of “resource quota sets” (RQS), which can be used to limit maximum resource consumption by any job. The relevant qconf command line switches for manipulating resource quota sets are “-srqs” and “-srqsl” (show), “-arqs” (add), “-mrqs” (modify) and “-drqs” (delete). 

Each RQS must have the following parameters: name, description, enabled and limit. The RQS name cannot contain spaces, but its description can be an arbitrary string. The boolean “enabled” flag specifies whether the RQS is enabled or not, while the “limit” field denotes a resource quota rule that consists of an optional name, filters for a specific job request and the resource quota limit. Note that one can have multiple “limit” fields associated with a given RQS. For example, the following RQS prevents user “ahogger” from occupying more than 1 job slot in general, and it also prevents the same user from running jobs in the headnodes.q queue:

<pre>
$ qconf -srqs ahogger_job_limit
{
   name         ahogger_job_limit
   description  "limit ahogger jobs"
   enabled      TRUE
   limit        users ahogger to slots=1
   limit        users {ahogger} queues {headnodes.q} to slots=0
}
</pre>

The exact format in which RQS have to be specified is, like everything else, well documented in SGE man pages (“man sge_resource_quota”).  
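
For repeatable setups, an RQS definition can also be generated and loaded from a file rather than typed into an editor. The sketch below is only illustrative: make_user_slot_rqs is a hypothetical helper that prints a single-rule RQS in the format shown above, and loading the result assumes that your qconf supports the “-Arqs” (add RQS from file) switch, which should be confirmed with “man qconf”.

```shell
#!/bin/sh
# Hypothetical helper: print an RQS definition that caps the job slots
# available to one user.
#   $1 = RQS name (no spaces), $2 = user name, $3 = maximum number of slots
make_user_slot_rqs() {
  printf '{\n'
  printf '   name         %s\n' "$1"
  printf '   description  "limit %s to %s slots"\n' "$2" "$3"
  printf '   enabled      TRUE\n'
  printf '   limit        users %s to slots=%s\n' "$2" "$3"
  printf '}\n'
}
```

For example, the output of “make_user_slot_rqs ahogger_job_limit ahogger 1” could be redirected to a file and that file passed to “qconf -Arqs”.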
</div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/12/managing-resource-quotas-in-grid-engine.html</feedburner:origLink></entry>
    <entry>
        <title>Automating Grid Engine Monitoring</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/3X3bRYZG4sE/automating-grid.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/11/automating-grid.html" thr:count="1" thr:updated="2009-03-20T01:50:19-05:00" />
        <id>tag:typepad.com,2003:post-56733455</id>
        <published>2008-11-17T08:04:21-06:00</published>
        <updated>2008-11-17T08:04:21-06:00</updated>
        <summary>When visiting client sites I often notice various issues with the existing distributed resource management software installations. The problems usually vary from configuration issues to queues in an error state. While things like inadequate resources and queue structure usually require...</summary>
        <author>
            <name>Rich Wellner</name>
        </author>
        <category scheme="http://www.sixapart.com/ns/types#category" term="Web/Tech" />
        
        
<content type="html" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/">
&lt;div xmlns="http://www.w3.org/1999/xhtml"&gt;&lt;img src="http://gridgurus.typepad.com/grid_gurus//code_000000237891Small.jpg" alt="code_000000237891Small.jpg" border="0" width="282" align="right" /&gt;When visiting client sites I often notice various issues with the existing distributed resource management software installations. The problems usually vary from configuration issues to queues in an error state. While things like inadequate resources and queue structure usually require more analysis and better design, problems like queues in an error state are easily detectable. So, cluster administrators, who are often busy with many other duties, should try to automate monitoring tasks as much as they can. For example, if you are using Grid Engine, you can easily come up with scripts like the one below, which looks for several different kinds of problems in your SGE installation:

&lt;pre&gt;
#!/bin/sh

. /usr/local/unicluster/unicluster-user-env.sh 

explainProblem() {
  qHost=$1   # queue where the problem is found
  msg=`qstat -f -q $qHost -explain aAEc | tail -1 | sed 's?-??g' | sed '/^$/d'`
  echo $msg
}

checkProblem() {
  description=$1  # problem description
  signature=$2    # problem signature
  for q in `qconf -sql`; do
    cmd="qstat -f -q $q | grep $q | awk '{if(NF&gt;5 &amp;&amp; index(\$NF, \"$signature\")&gt;0) print \$1}'"
    qHostList=`eval $cmd`
    if [ "$qHostList" != "" ]; then
      for qHost in $qHostList; do
        msg=`explainProblem $qHost`
        echo "$description on $qHost:"
        echo "  $msg"
        echo ""
      done
    fi
  done
}

echo "Grid Engine Issue Summary"
echo "========================="
echo ""
checkProblem Error E
checkProblem SuspendThreshold A
checkProblem Alarm a
checkProblem ConfigProblem c
&lt;/pre&gt;

Note that the above script should work with Unicluster Express 3.2 installed in the default (/usr/local/unicluster) location. It can easily be modified to, for example, send email to administrators when problems that need attention are found. Although simple, such scripts usually go a long way towards ensuring that your Grid Engine installation operates smoothly.
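
As one possible way to add the email step, the sketch below wraps the script. It assumes that the script above has been saved to a file whose path is passed in, that a “mail” command accepting “-s subject recipient” is available on the host, and that a report with no issues contains only the two non-empty header lines. The names report_has_issues and mail_report_if_needed are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the report holds more than the two
# non-empty header lines that the summary script always prints.
#   $1 = report text
report_has_issues() {
  count=$(printf '%s\n' "$1" | awk 'NF { n++ } END { print n + 0 }')
  [ "$count" -gt 2 ]
}

# Hypothetical wrapper: run the monitoring script and mail its output
# only when issues were reported.
#   $1 = path to the monitoring script, $2 = recipient address
mail_report_if_needed() {
  report=$(sh "$1")
  if report_has_issues "$report"; then
    printf '%s\n' "$report" | mail -s "Grid Engine issues" "$2"
  fi
}
```

Run from cron, a call such as “mail_report_if_needed /path/to/script admin@example.com” (both arguments are placeholders) would stay silent while the cluster is healthy.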
&lt;/div&gt;
</content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/11/automating-grid.html</feedburner:origLink></entry>
    <entry>
        <title>Who Cares What's inside a Cloud?</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/iOEG7Lew3iI/who-cares-whats.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/11/who-cares-whats.html" thr:count="1" thr:updated="2009-03-20T01:51:46-05:00" />
        <id>tag:typepad.com,2003:post-56699301</id>
        <published>2008-11-06T07:57:23-06:00</published>
        <updated>2008-11-06T07:57:23-06:00</updated>
        <summary>When I consider my microwave, telephone, or television I see fairly sophisticated applications that I simply plug into service providers and get useful results. If I choose to switch between individual service providers I can do so easily (assuming certain...</summary>
        <author>
            <name>Roderick Flores</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><img src="http://gridgurus.typepad.com/grid_gurus//squarepeg_000004493197XSmall.jpg" alt="squarepeg_000004493197XSmall.jpg" border="0" width="282" align="right" /><p>When I consider my microwave, telephone, or television I see fairly sophisticated applications that I simply plug into service providers and get useful results. If I choose to switch between individual service providers I can do so easily (assuming certain levels of deregulation of utility monopolies of course). Most importantly, while I understand how these appliances work, I would never want to build one myself. Yet I am not required to do so because the providers use standardized interfaces that appliance manufacturers can easily offer: I buy my appliances as I might any other tool. Consequently, I can switch out the manufacturer or models for each of the services I use without interacting with the provider. I use these tools in a way that makes my work and life more efficient.</p>

<p>Nobody listens in on my conversations or receives services at my expense. I can use these services how I wish and, because of competition, I can expect an outstanding quality of service. At the end of the month, I get a bill from my providers for the services I used. These monetary costs are far outweighed by the convenience these services offer.</p>

<p>It is this sort of operational simplicity that motivated the first call for computational power as a utility in 1965. Like the electrical grid, a consumer would simply plug in their favorite application and use the compute power offered by a provider. Beginning in the 1990s, this effort centered around the concept of Grid computing.</p>

<p>Just like the early days of electricity services, there were many issues with providing Grid computing. The very first offerings were proprietary or narrowly focused. The parallels with the electric industry are easily recognized. Some providers might supply street lighting, whereas others would supply power for home lighting, still others for transportation, and yet another group for industrial applications. Moreover, each provider used different interfaces to deliver the power. Thus switching between providers, not a rare occurrence in a volatile industry, was no small undertaking. This was clearly very costly for the consumer. </p>

<p>It took an entrepreneur to come to the industry and unify electrical services for all applications while also creating a standardized product (see http://www.eei.org/industry_issues/industry_overview_and_statistics/history for a quick overview). Similarly, several visionaries had to step in and define what a Grid computer needed to do in order to create a widely consumable product. While these goals were largely met and several offerings became very successful, Grid computing never really became the firmly rooted utility-like service that we hoped for. Rather, it seems to have become an offering for specialized high-performance computing users. </p>

<p>This market is not the realm of service that I started thinking about early in this post. Take television service: this level of service is neither for a single viewer nor for a small business that might want to repackage a set of programs for its customers (say a sports bar). Rather it is for large-scale industries whose service requirements are unimaginable by all but a few people. I cannot even draw a parallel to television service. In telecommunication it would be the realm of a <a href="http://en.wikipedia.org/wiki/CLEC">CLEC</a>.</p>

<p>Furthermore, unlike my microwave, I am expected to customize my application to work well on a grid. I cannot simply plug it in and get better service than I can from my own PC. It would be the equivalent of choosing to reheat my food on my stove or building my own microwave. You see, my microwave, television service, and phone services are not just basic offerings of food preparation, entertainment, and communication. Instead, these are sophisticated systems that make my work and life easier. Grid computing, while very useful, does not simplify program implementation.</p>

<p>So in steps cloud computing: an emerging technology that seems to have significant overlap with grid computing while also providing simplifying services (something as a service). I may still have to assemble a microwave from pre-built pieces but everything is ready for me to use. I only have to add my personal touches to assemble a meal. It really isn't relevant whether the microwave is central to the task or just one piece of many.</p>

<p>When I approach a task that I hope to solve using a program, how might I plug that in just as easily? Let's quickly consider how services are provided for television. When I plug my application (TV) into the electricity provider as well as a broadcaster of some sort, it just works. I can change the channel to the streams that I like. I can buy packages that provide me the best set of streams. In addition, some providers will offer me on-demand programming as well as internet and telephone services. If anything breaks, I call a number and they deal with it. None of this requires anything of me. I pay my bill and I get services.</p>

<p>Okay, how would that work for a computation? Say I want to find the inverse for a matrix. I would send out my data to the channel that inverted matrices the way I like them. The provider will worry about attaining the advertised performance, reliability, scalability, security, sustainability, device/location independence, tenancy, and capital expenditure: those <a href="http://en.wikipedia.org/wiki/Cloud_computing#Key_characteristics">characteristics of the cloud</a> that I could not care less about. Additionally, the <a href="http://gridgurus.typepad.com/grid_gurus/2008/09/a-cloud-by-any.html">cloud properties</a> that Rich Wellner assembled don't interest me much either. Certainly they may be differentiators, but the actual implementation is somebody else's problem in the same way that continuous electrical service provision is not my chief concern when I turn on the TV. What I want and will get is an inverse to the matrix I submitted in the time frame I requested deposited where I requested it to be put. I may use the inverted matrix to simultaneously solve for earthquake locations and earth properties or for material stresses and strains in a two-dimensional plate. That is my recipe and my problem.</p>

<p>After all, I should get services "without knowledge of, expertise with, or control over the technology infrastructure that supports them," as the <a href="http://en.wikipedia.org/wiki/Cloud_computing">cloud computing wiki page</a> claims. Essentially the aforementioned cloud characteristics are directed towards service providers rather than to the non-expert consumer that the wiki definition highlights. Isn't the differentiator between the Cloud and the Grid the concealment of the complex infrastructure underneath? If the non-expert consumer is expected to worry about algorithm scalability, distributing data, starting and stopping resources and all of that, they certainly will need to gain some expertise quickly. Further, once they have that skill, why wouldn't they just use a mature Grid offering rather than deal with the non-standardized and chaotic clouds? Are these provider-specific characteristics not just a total rebranding of Grid?</p>

<p>As such, I suggest that several consumer-based characteristics should replace the rather inconsequential provider-internal ones that currently exist.</p>

<p>A cloud is characterized by services that:</p>

<ul><li>use a specified algorithm to solve a particular problem;</li>

<li>can be purchased for one-time, infrequent use, or regular use;</li>

<li>state their peak, expected, and minimum performances;</li>

<li>state the expected response time;</li>

<li>can be queried for changes to expected response time;</li>

<li>support asynchronous messaging. A consumer must be able to discover when things are finished;</li>

<li>use standard, open, general-purpose protocols and interfaces (clearly);</li>

<li>have specified entry-points;</li>

<li>can interact with other cloud service providers. In particular, a service should be able to send output to long-term cloud-storage providers; </li></ul>

<p>Now that sounds more like Computation-as-a-Service.</p></div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/11/who-cares-whats.html</feedburner:origLink></entry>
    <entry>
        <title>Cloud Computing: Commodity or Value Sale?</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/MU9wlBP3pdw/cloud-computing-commodity-or-value-sale.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/11/cloud-computing-commodity-or-value-sale.html" thr:count="2" thr:updated="2009-03-20T01:56:50-05:00" />
        <id>tag:typepad.com,2003:post-57956803</id>
        <published>2008-11-03T13:18:31-06:00</published>
        <updated>2008-11-03T13:18:31-06:00</updated>
        <summary>There is a controversy in the cloud community today about whether the market is going to be one based on value or price. Rephrased, will cloud computing be a commodity or an enablement technology. A poster on one of the...</summary>
        <author>
            <name>Rich Wellner</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><p><img src="http://gridgurus.typepad.com/grid_gurus//packages_000002944682XSmall.jpg" alt="packages_000002944682XSmall.jpg" border="0" width="282" align="right" />There is a controversy in the cloud community today about whether the market is going to be one based on value or price.  Rephrased, will cloud computing be a commodity or an enablement technology?</p>

<p>A poster on one of the cloud computing lists asserted that electricity would be a key component of pricing.  He was then jumped on by people saying that value would be the key.</p>

<p>It seems like folks are talking past one another.</p>

<p>His assertion is true if CC is a commodity.</p>

<p>Now that said, there are precious few commodities in IT.  Maybe internet connectivity is one.  Monitors might be another.  Maybe there are a few more.</p>

<p>But very quickly you get past swappable components that do very nearly the same job and into the realm of 'stuff' that is not easily replaceable.  Then the discussion turns to one of value.</p>

<p>Amazon recognized the commodity of books and won the war over people who were trying to sell value.  They appear to be attempting to do the same with computer time, which makes the battle they will fight over the next few years with Microsoft (and the increasing number of smaller players) extra interesting.</p>

<p>There is also the problem of making sweeping statements like "the market will figure things out".  There is no "the market".  Even on Wall Street.  The reason things happen is because different people and institutions have different investment goals.  Those goals vary over time and create growing or shrinking windows of opportunity for other people and institutions.</p>

<p>I've made my bet on how "the market" for cloud computing will shake out in the short to medium term.  Now I'm just hoping that there are enough of the people and institutions my bet is predicated on in existence.</p></div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/11/cloud-computing-commodity-or-value-sale.html</feedburner:origLink></entry>
    <entry>
        <title>Elastic Management of Computing Clusters</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/zFjh8QthAMI/elastic-managem.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/10/elastic-managem.html" thr:count="1" thr:updated="2009-03-20T01:57:32-05:00" />
        <id>tag:typepad.com,2003:post-56437631</id>
        <published>2008-10-29T11:26:05-05:00</published>
        <updated>2008-10-29T11:26:05-05:00</updated>
        <summary>Besides all the hype, clouds (i.e. a service for the on-demand provision of virtual machines, others would say IaaS) are making utility computing a reality; see, for example, the Amazon EC2 case studies. This new model, and virtualization...</summary>
        <author>
            <name>Ignacio Martin Llorente</name>
        </author>
        
        
<content type="html" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/">
&lt;div xmlns="http://www.w3.org/1999/xhtml"&gt;&lt;p&gt;Besides all the hype, clouds (i.e. a service for the on-demand
provision of virtual machines; others would say IaaS) are making
utility computing a reality; see, for example, the &lt;a mce_href="http://aws.amazon.com/solutions/case-studies/" href="http://aws.amazon.com/solutions/case-studies/"&gt;Amazon EC2 case studies&lt;/a&gt;.
This new model, and virtualization technologies in general, is also
being actively explored by the scientific community. There are quite a
few initiatives that integrate virtualization with a range of
computing platforms, from clusters to &lt;a mce_href="http://indico.cern.ch/sessionDisplay.py?sessionId=15&amp;amp;slotId=0&amp;amp;confId=32220#2008-09-25" href="http://indico.cern.ch/sessionDisplay.py?sessionId=15&amp;amp;slotId=0&amp;amp;confId=32220#2008-09-25"&gt;Grid infrastructures&lt;/a&gt;.
Once this integration is achieved, the next step is natural: jump to the
clouds and provision the VMs from an external site. For example, a
recent work from &lt;a mce_href="http://www.univaud.com/" href="http://www.univaud.com/"&gt;UNIVA UD&lt;/a&gt; has demonstrated the feasibility of supplementing a UNIVA Express cluster with EC2 resources (you can &lt;a mce_href="http://www.univaud.com/about/resources/files/wp-unicluster-amazon-ec2.pdf" href="http://www.univaud.com/about/resources/files/wp-unicluster-amazon-ec2.pdf"&gt;download the whitepaper&lt;/a&gt; to learn more).&lt;/p&gt;
&lt;p&gt;&lt;a mce_href="http://blog.dsa-research.org/wp-content/uploads/2008/10/one.png" href="http://blog.dsa-research.org/wp-content/uploads/2008/10/one.png" title="Elastic Computing Cluster"&gt;&lt;img width="500" alt="OpenNebula virtual infrastructure engine components and its integration with Amazon EC2" mce_src="http://blog.dsa-research.org/wp-content/uploads/2008/10/one.png" src="http://blog.dsa-research.org/wp-content/uploads/2008/10/one.png" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;This cloud provision model can be further integrated with the
in-house physical infrastructure when it is combined with a virtual
machine (VM) management system, like &lt;a mce_href="http://www.opennebula.org" href="http://www.opennebula.org/"&gt;OpenNebula&lt;/a&gt;.
A VM manager is responsible for the efficient management of the virtual
infrastructure as a whole, by providing basic functionality for the
deployment, control and monitoring of VMs on a distributed pool of
resources. The use of this new virtualization layer decouples the
computing cluster from the physical infrastructure, and so extends the
classical benefits of VMs to the cluster level (i.e. cluster
consolidation, cluster isolation, cluster partitioning and elastic
cluster capacity).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Architecture of an Elastic Cluster&lt;/strong&gt;&lt;br /&gt;
A computing cluster can be easily virtualized by putting the front-end
and worker nodes into VMs. In our case, the virtual cluster front-end
(SGE master host) is deployed in the local resources with Internet
connectivity to be able to communicate with Amazon EC2 VMs. This
cluster front-end also acts as the NFS and NIS server for every worker node
in the virtual cluster.&lt;/p&gt;
&lt;p&gt;The virtual worker nodes communicate with the front-end through a private local area network. The &lt;em&gt;local worker nodes&lt;/em&gt; are connected to this vLAN through a virtual bridge configured in every physical host.&amp;nbsp; The &lt;em&gt;EC2 worker nodes&lt;/em&gt;
are connected to the vLAN with an OpenVPN tunnel, which is established
between each remote node (OpenVPN clients) and the cluster front-end
(OpenVPN server). With this configuration, every worker node (either
local or remote) can communicate with the front-end and can use the
common network services transparently. The architecture of the cluster
is shown in the following figure:&lt;/p&gt;
&lt;p&gt;&lt;a mce_href="http://blog.dsa-research.org/wp-content/uploads/2008/10/infrastructure2.png" href="http://blog.dsa-research.org/wp-content/uploads/2008/10/infrastructure2.png" title="Virtual Cluster Architecture"&gt;&lt;img width="500" alt="Virtual Cluster Architecture" mce_src="http://blog.dsa-research.org/wp-content/uploads/2008/10/infrastructure2.png" src="http://blog.dsa-research.org/wp-content/uploads/2008/10/infrastructure2.png" /&gt;&lt;/a&gt;&lt;br /&gt;
&lt;em&gt;Figure courtesy of Prof. Rafael Moreno&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deploying a SGE cluster with OpenNebula and Amazon EC2&lt;/strong&gt;&lt;br /&gt;
The latest release of OpenNebula includes a driver to deploy VMs in the
EC2 cloud, and so it integrates the Amazon infrastructure with your
local resources. EC2 is managed by OpenNebula just like any other local
resource, with a configurable, fixed &lt;em&gt;size&lt;/em&gt;
that limits the cluster capacity (i.e. SGE worker nodes) that can be
allocated in the cloud. In this set-up, your resources would look
as follows:&lt;/p&gt;
&lt;pre&gt;&amp;gt;onehost list&lt;br /&gt; HID NAME&amp;nbsp; &amp;nbsp;&amp;nbsp; RVM&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;TCPU&amp;nbsp; &amp;nbsp;FCPU&amp;nbsp; &amp;nbsp;ACPU&amp;nbsp; &amp;nbsp; TMEM&amp;nbsp; &amp;nbsp; FMEM STAT&lt;br /&gt;&amp;nbsp; &amp;nbsp;0 ursa01&amp;nbsp; &amp;nbsp;&amp;nbsp; 0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 800&amp;nbsp; &amp;nbsp; 798&amp;nbsp; &amp;nbsp; 800 8387584 7663616&amp;nbsp; off&lt;br /&gt;&amp;nbsp; &amp;nbsp;1 ursa02&amp;nbsp; &amp;nbsp;&amp;nbsp; 0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 800&amp;nbsp; &amp;nbsp; 798&amp;nbsp; &amp;nbsp; 800 8387584 7663616&amp;nbsp; off&lt;br /&gt;&amp;nbsp; &amp;nbsp;2 ursa03&amp;nbsp; &amp;nbsp;&amp;nbsp; 0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 800&amp;nbsp; &amp;nbsp; 798&amp;nbsp; &amp;nbsp; 800 8387584 7663616&amp;nbsp; on&lt;br /&gt;&amp;nbsp; &amp;nbsp;3 ursa04&amp;nbsp; &amp;nbsp;&amp;nbsp; 2&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 800&amp;nbsp; &amp;nbsp; 798&amp;nbsp; &amp;nbsp; 600 8387584 6290432&amp;nbsp; on&lt;br /&gt;&amp;nbsp; &amp;nbsp;4 ursa05&amp;nbsp; &amp;nbsp;&amp;nbsp; 1&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 800&amp;nbsp; &amp;nbsp; 799&amp;nbsp; &amp;nbsp; 700 8387584 7339008&amp;nbsp; on&lt;br /&gt;&amp;nbsp; &amp;nbsp;5 ec2&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; 0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 500&amp;nbsp; &amp;nbsp; 500&amp;nbsp; &amp;nbsp; 500 8912896 8912896&amp;nbsp; on&lt;/pre&gt;
&lt;p&gt;&lt;em&gt;The last line corresponds to EC2, currently configured to host up to 5 m1.small instances.&lt;/em&gt;&lt;/p&gt;
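&lt;p&gt;As a rough illustration of how the CPU columns in the listing above translate into capacity, the following Python sketch counts how many one-CPU VMs each enabled host could still accept. It assumes that ACPU is the CPU not yet allocated to VMs and that 100 CPU units correspond to one VM with CPU = 1 (which is consistent with the 500 units shown for five m1.small instances). The HOSTS table and the free_slots helper are hypothetical names and are not part of OpenNebula.&lt;/p&gt;
&lt;pre&gt;
```python
# Hypothetical sketch: estimate free one-CPU VM slots per host from the
# "onehost list" columns shown above. Assumes 100 CPU units per VM.

# (name, ACPU, STAT) copied from the listing above
HOSTS = [
    ("ursa01", 800, "off"),
    ("ursa02", 800, "off"),
    ("ursa03", 800, "on"),
    ("ursa04", 600, "on"),
    ("ursa05", 700, "on"),
    ("ec2", 500, "on"),
]

CPU_UNITS_PER_VM = 100  # assumption: a VM with CPU = 1 reserves 100 units


def free_slots(hosts):
    """Return {host name: number of one-CPU VMs that still fit}."""
    return {
        name: acpu // CPU_UNITS_PER_VM
        for name, acpu, stat in hosts
        if stat == "on"  # disabled hosts accept no VMs
    }


slots = free_slots(HOSTS)
for name, count in slots.items():
    print(name, count)
```
&lt;/pre&gt;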
&lt;p&gt;The OpenNebula EC2 driver translates a general VM deployment file into
an EC2 instance description. The driver assumes that a suitable Amazon
machine image (AMI) has previously been packed and registered in the S3
storage service, so when a given VM is to be deployed in EC2 its AMI
counterpart is instantiated. A typical SGE worker node VM template
would look like this:&lt;/p&gt;
&lt;pre&gt;NAME&amp;nbsp; &amp;nbsp;= sge_workernode&lt;br /&gt;CPU&amp;nbsp; &amp;nbsp; = 1&lt;br /&gt;MEMORY = 128&lt;br /&gt;&lt;br /&gt;#Xen or KVM template machine, used when deploying in the local resources&lt;br /&gt;OS&amp;nbsp; &amp;nbsp;= [kernel=&amp;quot;/vmlinuz&amp;quot;,initrd= &amp;quot;/initrd.img&amp;quot;,root=&amp;quot;sda1&amp;quot; ]&lt;br /&gt;DISK = [source=&amp;quot;/imges/sge/workernode.img&amp;quot;,target=&amp;quot;sda&amp;quot;,readonly=&amp;quot;no&amp;quot;]&lt;br /&gt;DISK = [source=&amp;quot;/imges/sge/workernode.swap&amp;quot;,target=&amp;quot;sdb&amp;quot;,readonly=&amp;quot;no&amp;quot;]&lt;br /&gt;NIC&amp;nbsp; = [bridge=&amp;quot;eth0&amp;quot;]&lt;br /&gt;&lt;br /&gt;#EC2 template machine, used when submitting this VM to EC2&lt;br /&gt;EC2 = [ AMI=&amp;quot;ami-d5c226bc&amp;quot;,&lt;br /&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; KEYPAIR=&amp;quot;gsg-keypair&amp;quot;,&lt;br /&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; AUTHORIZED_PORTS=&amp;quot;22&amp;quot;,&lt;br /&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; INSTANCETYPE=m1.small]&lt;/pre&gt;
&lt;p&gt;Once deployed, the cluster would look like this (SGE master, 2 local worker nodes and 2 EC2 worker nodes):&lt;/p&gt;
&lt;pre&gt;&amp;gt;onevm list&lt;br /&gt;&amp;nbsp; ID&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;NAME STAT CPU&amp;nbsp; &amp;nbsp;&amp;nbsp; MEM&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; HOSTNAME&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; TIME&lt;br /&gt;&amp;nbsp; 27&amp;nbsp; sgemast runn 100 1232896&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; ursa05 00 00:41:57&lt;br /&gt;&amp;nbsp; 28&amp;nbsp; sgework runn 100 1232896&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; ursa04 00 00:31:45&lt;br /&gt;&amp;nbsp; 29&amp;nbsp; sgework runn 100 1232896&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; ursa04 00 00:32:33&lt;br /&gt;&amp;nbsp; 30&amp;nbsp; sgework runn&amp;nbsp; &amp;nbsp;0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; ec2 00 00:23:12&lt;br /&gt;&amp;nbsp; 31&amp;nbsp; sgework runn&amp;nbsp; &amp;nbsp;0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; 0&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp; &amp;nbsp; ec2 00 00:21:02&lt;/pre&gt;
&lt;p&gt;&lt;em&gt;You can get additional info from your EC2 VMs, like the IP, using the onevm show command&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;So, it is easy to manage your virtual cluster with OpenNebula and
EC2, but what about efficiency? Besides the inherent overhead induced
by virtualization (around 10% for processing), the average deployment
time of a remote EC2 worker node is 23.6s, while a local one takes only
3.3s. Moreover, when executing an HTC workload, the overhead induced by
using EC2 (VPN and a slower network connection) can be neglected.&lt;/p&gt;
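&lt;p&gt;To put the deployment times in perspective, the small Python sketch below computes the fraction of a worker node's lifetime that is spent on deployment, using the 23.6s and 3.3s averages quoted above. The one-hour lifetime is an arbitrary example value, and deploy_fraction is a hypothetical helper introduced only for this illustration.&lt;/p&gt;
&lt;pre&gt;
```python
# Sketch: share of a worker node's lifetime spent on deployment.
# Deployment times are the averages quoted above; the lifetime is an
# arbitrary example value.

EC2_DEPLOY_S = 23.6
LOCAL_DEPLOY_S = 3.3


def deploy_fraction(deploy_s, useful_s):
    """Fraction of total time (deployment + useful work) spent deploying."""
    return deploy_s / (deploy_s + useful_s)


one_hour = 3600.0
ec2 = deploy_fraction(EC2_DEPLOY_S, one_hour)
local = deploy_fraction(LOCAL_DEPLOY_S, one_hour)
print("EC2:   {:.2%}".format(ec2))
print("local: {:.2%}".format(local))
```
&lt;/pre&gt;
&lt;p&gt;For a node that stays up for an hour, deployment accounts for well under 1% of its lifetime in both cases.&lt;/p&gt;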
&lt;p&gt;&lt;a mce_href="http://dsa-research.org/doku.php?id=people:ruben" href="http://dsa-research.org/doku.php?id=people:ruben"&gt;Ruben S. Montero&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;This is a joint work with &lt;a mce_href="http://dsa-research.org/doku.php?id=people:moreno" href="http://dsa-research.org/doku.php?id=people:moreno"&gt;Rafael Moreno&lt;/a&gt; and &lt;a mce_href="http://dsa-research.org/doku.php?id=people:llorente" href="http://dsa-research.org/doku.php?id=people:llorente"&gt;Ignacio M. Llorente&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://blog.dsa-research.org/?p=98"&gt;Reprinted from blog.dsa-research.org&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;
</content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/10/elastic-managem.html</feedburner:origLink></entry>
    <entry>
        <title>Auditing the Cloud</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/t9k4FZ3Uk5Q/auditing-the-cl.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/10/auditing-the-cl.html" thr:count="1" thr:updated="2009-03-20T01:58:10-05:00" />
        <id>tag:typepad.com,2003:post-57303139</id>
        <published>2008-10-20T14:25:28-05:00</published>
        <updated>2008-10-20T14:25:28-05:00</updated>
        <summary>I've written here about the importance of SLAs for useful cloud computing platforms on a few occasions in the past. The idea behind clouds, that you can get access to resources on demand, is an appealing one. However, it is...</summary>
        <author>
            <name>Rich Wellner</name>
        </author>
        
        
<content type="html" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/">
&lt;div xmlns="http://www.w3.org/1999/xhtml"&gt;&lt;p&gt;&lt;img src="http://gridgurus.typepad.com/grid_gurus//patronize_000003227641XSmall.jpg" alt="patronize_000003227641XSmall.jpg" border="0" height="282" align="right" /&gt;I've written here about the importance of SLAs for useful cloud computing platforms on a few occasions in the past.  The idea behind clouds, that you can get access to resources on demand, is an appealing one.  However, it is only part of the total picture.  Without an ability to state what you want and go to bed, there isn't much value in the cloud.&lt;/p&gt;

&lt;p&gt;Think about that for a minute.  With the cloud computing offerings currently available there are no meaningful SLAs written down anywhere.  Yet every day people run their production applications on an implicit SLA, internalized as something like "Amazon is going to give me N units of work for M price".&lt;/p&gt;

&lt;p&gt;There are two problems with this.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Amazon doesn't scale your resources.  Your demand may have spiked and you are still running on the resource you signed up for.&lt;/li&gt;
&lt;li&gt;There is no audit capability on EC2.&lt;/li&gt;
&lt;/ul&gt;
In the &lt;a href="http://wiki.cloudcommunity.org/wiki/CloudComputing:Bill_of_Rights"&gt;Cloud Computing Bill of Rights&lt;/a&gt; we wrote about three important attributes that need to be available to do an audit.

&lt;ul&gt;
&lt;li&gt;Events -- The state changes and other factors that affected your system availability.&lt;/li&gt;
&lt;li&gt;Logs -- Comprehensive information about your application and its runtime environment.&lt;/li&gt;
&lt;li&gt;Monitoring -- Should not be intrusive and must be limited to what the cloud provider reasonably needs in order to run their facility.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The idea here is that you should not simply accept the bill your cloud provider sends you at the end of the month. The world of cloud computing is complex enough that a reasonable set of runtime information must be made available to substantiate the provider's claim for compensation.&lt;/p&gt;
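
&lt;p&gt;As a minimal sketch of what such substantiation could look like, the Python snippet below recomputes billable instance-hours from a list of start and stop events and compares the result with the figure on the bill. The event format, the rule of rounding each run up to a full hour, and all names are assumptions made for illustration; they do not describe the interface or billing rules of any real provider.&lt;/p&gt;

&lt;pre&gt;
```python
# Hypothetical sketch: recompute instance-hours from an event log and
# compare them with the billed amount. Event format and rounding rule
# are illustrative assumptions.
import math
from datetime import datetime

# (instance id, event, timestamp) as a provider might expose them
EVENTS = [
    ("i-1", "start", datetime(2008, 10, 1, 9, 0)),
    ("i-1", "stop", datetime(2008, 10, 1, 11, 30)),
    ("i-2", "start", datetime(2008, 10, 1, 10, 0)),
    ("i-2", "stop", datetime(2008, 10, 1, 10, 20)),
]


def billable_hours(events):
    """Sum the hours between start and stop, rounding each run up."""
    started = {}
    total = 0
    for instance, kind, when in events:
        if kind == "start":
            started[instance] = when
        elif kind == "stop":
            seconds = (when - started.pop(instance)).total_seconds()
            total += math.ceil(seconds / 3600)
    return total


billed_hours = 5  # figure taken from the (hypothetical) monthly bill
audited = billable_hours(EVENTS)
print("audited:", audited, "billed:", billed_hours)
print("discrepancy:", billed_hours - audited)
```
&lt;/pre&gt;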

&lt;p&gt;This is particularly true in the world of SLAs.  If my infrastructure is regularly scaling up, out, down or in to meet demands it is essential to be able to verify that the infrastructure is reacting the way that was contracted.  Without that, it will be very hard to get people to trust the cloud.&lt;/p&gt;&lt;/div&gt;
</content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/10/auditing-the-cl.html</feedburner:origLink></entry>
    <entry>
        <title>Using GridWay With Unicluster Express - Part II</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/PPgNOOANu10/using-gridway-w.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/10/using-gridway-w.html" thr:count="1" thr:updated="2009-03-20T01:58:56-05:00" />
        <id>tag:typepad.com,2003:post-55728016</id>
        <published>2008-10-20T14:05:13-05:00</published>
        <updated>2008-10-20T14:05:13-05:00</updated>
        <summary>In my previous post I described how to build and install GridWay metascheduler on top of Unicluster Express 3.2 (UCE). However, before one can actually use the software, there are several configuration steps that have to be completed. For the...</summary>
        <author>
            <name>Sinisa Veseli</name>
        </author>
        <category scheme="http://www.sixapart.com/ns/types#category" term="Web/Tech" />
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><img src="http://gridgurus.typepad.com/grid_gurus//code_000000237891Small.jpg" alt="code_000000237891Small.jpg" border="0" width="282" align="right" />In my previous post I described how to build and install <a href="http://www.gridway.org">GridWay</a> metascheduler on top of <a href="http://www.grid.org/download">Unicluster Express 3.2</a> (UCE). However, before one can actually use the software, there are several configuration steps that have to be completed. 

For the notes below I will assume that GridWay is installed in the /opt/gw directory, and that UCE is installed in its default location (/usr/local/unicluster), with the default “ucluster” administrative account. Note that you will need root access on your GridWay machine.

1)	Create a new UNIX group (e.g., gwusers). Your UCE administrative account (ucluster) and all users that will be submitting jobs to GridWay must be members of that group. 

2)	Make sure that your GridWay installation directory is owned by the UCE administrative account. Assuming GridWay is installed in /opt/gw, invoking something like “chown -R ucluster.gwusers /opt/gw” would do the trick.

3)	Edit the /etc/sudoers file, and add the following entries:

<pre>
...
# User alias specification
...
Runas_Alias     GW_USERS = %gwusers
...
# Defaults specification
Defaults&gt;GW_USERS env_keep="GW_LOCATION GLOBUS_LOCATION"
...
# GridWay entries.
ucluster ALL=(GW_USERS) NOPASSWD: /opt/gw/bin/gw_em_mad_ws *
ucluster ALL=(GW_USERS) NOPASSWD: /opt/gw/bin/gw_tm_mad_ftp *
</pre>


4)	Configure GridWay. At minimum you must edit the GridWay daemon configuration file /opt/gw/etc/gridway/gwd.conf in order to add the following entries appropriate for UCE:

<pre>
IM_MAD = mds4:gw_im_mad_mds4:-s petruchio.univaud.com:gridftp:ws
EM_MAD = ws:gw_em_mad_ws::rsl2
TM_MAD = gridftp:gw_tm_mad_ftp:
</pre>

The only entry that you will need to change in the above example is the host that is running your UCE container, which was set to petruchio.univaud.com in my case. Other files that you might want to inspect are /opt/gw/etc/gridway/sched.conf (contains GridWay scheduler configuration), /opt/gw/etc/gridway/job_template.default (default values for job templates), and /opt/gw/etc/gridway/gwrc (default environment variables for GridWay's so-called middleware access drivers, or MADs).
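
As an illustration of how these colon-separated entries are laid out, the following Python sketch splits each value into its fields and pulls the UCE host out of the IM_MAD arguments. It is not part of GridWay: parse_mad is a hypothetical helper, and treating the third field as the argument string is an assumption based on the entries above that should be checked against the GridWay documentation.

<pre>
```python
# Hypothetical helper: split gwd.conf MAD entries into colon-separated
# fields. The meaning given to each field is an assumption.

GWD_CONF = """
IM_MAD = mds4:gw_im_mad_mds4:-s petruchio.univaud.com:gridftp:ws
EM_MAD = ws:gw_em_mad_ws::rsl2
TM_MAD = gridftp:gw_tm_mad_ftp:
"""


def parse_mad(conf_text):
    """Return {key: list of fields} for every KEY = value line."""
    entries = {}
    for line in conf_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        entries[key.strip()] = value.strip().split(":")
    return entries


mads = parse_mad(GWD_CONF)
# Third field of IM_MAD carries the arguments, including the UCE host.
uce_host = mads["IM_MAD"][2].split()[-1]
print(uce_host)
```
</pre>

Note that a real gwd.conf may list several entries with the same key, which this simple dictionary would overwrite.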

5)	Edit the UCE configuration file
/usr/local/unicluster/etc/globus_wsrf_mds_usefulrp/gluerp.xml and enable the Ganglia information provider by uncommenting the following line: 

<pre>
&lt;defaultProvider&gt;java org.globus.mds.usefulrp.glue.GangliaElementProducer&lt;/defaultProvider&gt;
</pre>

6)	Edit UCE configuration file for the SGE GRAM service to use ganglia information provider (/usr/local/unicluster/etc/gram-service-SGE/gluerp-config.xml) and add the following xml excerpt after the "&lt;ns1:resourcePropertyImpl&gt;
org.globus.mds.usefulrp.rpprovider.GLUEResourceProperty&lt;/ns1:resourcePropertyImpl&gt;" element:

<pre>
&lt;ns1:resourcePropertyElementProducers&gt;
  &lt;ns1:className&gt;org.globus.mds.usefulrp.glue.GangliaElementProducer&lt;/ns1:className&gt;
  &lt;ns1:arguments&gt;localhost&lt;/ns1:arguments&gt;
  &lt;ns1:arguments&gt;8649&lt;/ns1:arguments&gt;
  &lt;ns1:period&gt;300&lt;/ns1:period&gt;
  &lt;ns1:transformClass&gt;org.globus.mds.usefulrp.rpprovider.transforms.GLUEComputeElementTransform&lt;/ns1:transformClass&gt;
&lt;/ns1:resourcePropertyElementProducers&gt;
</pre>

7)	Restart the UCE container (as root, run “/etc/rc.d/init.d/unicluster-container restart”).

8)	Start the GridWay daemon (gwd) under the ucluster account:

<pre>
source /usr/local/unicluster/unicluster-user-env.sh
export JAVA_HOME=/opt/jdk
export GW_LOCATION=/opt/gw
export PATH=$JAVA_HOME/bin:$GW_LOCATION/bin:$PATH
gwd
</pre>

The GridWay daemon should now be able to get information from your UCE container and you should be able to see your available SGE resources using the gwhost command. Note that the gwd command requires the “-m” flag for the multi-user mode.

9)	Create a simple job template file for your testing (sample job template files can be found in the /opt/gw/test/jt directory), acquire a grid proxy using unicluster-grid-logon, and submit your test job via the gwsubmit command. 

Note that most of the configuration steps I outlined above are described in more detail in the GridWay <a href="http://www.gridway.org/documentation/stable5.4/admin/">System Administrator’s Guide</a>. If anything goes wrong, GridWay log files located in the /opt/gw/var directory might help your troubleshooting efforts.


</div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/10/using-gridway-w.html</feedburner:origLink></entry>
    <entry>
        <title>Cloud and Grid are Complementary Technologies</title>
        <link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/typepad/QALv/~3/DRy1SlzI-ZM/cloud-and-grid.html" />
        <link rel="replies" type="text/html" href="http://gridgurus.typepad.com/grid_gurus/2008/10/cloud-and-grid.html" thr:count="1" thr:updated="2009-05-24T11:18:30-05:00" />
        <id>tag:typepad.com,2003:post-56435091</id>
        <published>2008-10-13T08:34:28-05:00</published>
        <updated>2008-10-13T08:34:28-05:00</updated>
        <summary>There is a growing number of posts and articles trying to show how cloud computing is a new paradigm that supersedes Grid computing by extending its functionality and simplifying its exploitation, even announcing that Grid computing is dead. It seems...</summary>
        <author>
            <name>Ignacio Martin Llorente</name>
        </author>
        
        
<content type="xhtml" xml:lang="en-US" xml:base="http://gridgurus.typepad.com/grid_gurus/"><div xmlns="http://www.w3.org/1999/xhtml"><img src="http://gridgurus.typepad.com/grid_gurus//bolts_000000884335XSmall.jpg" alt="bolts_000000884335XSmall.jpg" border="0" width="282" align="right" /><p>There is a growing number of posts and articles trying to show how
cloud computing is a new paradigm that supersedes Grid computing by
extending its functionality and simplifying its exploitation, even
announcing that <a href="http://elasticvapor.com/2008/08/grid-is-dead.html" mce_href="http://elasticvapor.com/2008/08/grid-is-dead.html">Grid computing is dead</a>.
It seems that new technologies and paradigms must always have the
mission of replacing existing ones. Some of these contributions do
not fully understand what grid computing is, focusing their comparative
analysis on <a href="http://ianfoster.typepad.com/blog/2008/09/a-critique-of-u.html" mce_href="http://ianfoster.typepad.com/blog/2008/09/a-critique-of-u.html">simplicity of interfaces</a>, implementation details or basic computing aspects. Other posts define <a href="http://ianfoster.typepad.com/blog/2008/08/cloud-grid-what.html" mce_href="http://ianfoster.typepad.com/blog/2008/08/cloud-grid-what.html">Cloud in the same terms as Grid</a> or create a <a href="http://peterlaird.blogspot.com/2008/09/visual-map-of-cloud-computingsaaspaas.html" mce_href="http://peterlaird.blogspot.com/2008/09/visual-map-of-cloud-computingsaaspaas.html">taxonomy which includes Grid and cluster computing technologies</a>.</p>



<p><strong>Grid is an interoperability technology</strong>, enabling
the integration and management of services and resources in a
distributed, heterogeneous environment. The technology provides support
for the deployment of different kinds of infrastructures joining
resources which belong to different administrative domains. In the
special case of a Compute Grid infrastructure, such as <a href="http://www.eu-egee.org/" mce_href="www.eu-egee.org/">EGEE</a> or <a href="http://www.teragrid.org/" mce_href="http://www.teragrid.org/">TeraGrid</a>,
Grid technology is used to federate computing resources spanning
multiple sites for job execution and data processing. There are many
success cases demonstrating that Grid technology provides the support
required to fulfill the demands of several collaborative scientific and
business processes.</p>

<p>On the other hand, I do not think there is a single definition for <strong>cloud computing</strong>, as it has <a href="http://knol.google.com/k/sam-johnston/cloud-computing/1ndljz4ypn6xa/2#" mce_href="http://knol.google.com/k/sam-johnston/cloud-computing/1ndljz4ypn6xa/2#">multiple meanings for different communities</a> (<a href="http://www.webguild.org/2008/07/cloud-computing-basics.php" mce_href="http://www.webguild.org/2008/07/cloud-computing-basics.php">SaaS, PaaS, IaaS...</a>). In my view, the only new feature offered by cloud systems is the provision of virtualized resources as a service, with <strong>virtualization</strong> being the enabling technology. In other words, <strong>the relevant contribution of cloud computing is the Infrastructure as a Service (IaaS) model</strong>.
Virtualization, rather than less significant issues such as the
interfaces, is the key advance. At this point, I should remark that <a href="http://gridgurus.typepad.com/grid_gurus/2008/03/grid-vs-clouds.html" mce_href="http://gridgurus.typepad.com/grid_gurus/2008/03/grid-vs-clouds.html">virtualization was already used by the Grid community before the arrival of the "Cloud"</a>.</p>

<p>Now that I have stated my position on Cloud and Grid, let me
show how I see Cloud (with virtualization as its enabling technology) and
Grid as complementary technologies that will coexist and cooperate at
different levels of abstraction in future infrastructures.</p>
<h2><strong>There will be a Grid on top of the Cloud</strong></h2>
<p>Before explaining the role of cloud computing as resource provider
for Grid sites, we should understand the benefits of the virtualization
of the local infrastructure (<a href="http://gridgurus.typepad.com/grid_gurus/2008/09/a-cloud-by-any.html" mce_href="http://gridgurus.typepad.com/grid_gurus/2008/09/a-cloud-by-any.html">Enterprise or Local Cloud?</a>). How can I access a cloud provider on demand if I have not previously virtualized my local infrastructure?</p>
<p><a href="http://blog.dsa-research.org/?p=39" mce_href="http://blog.dsa-research.org/?p=39">Existing virtualization technologies allow a full separation of resource provisioning from service management</a>.
A new virtualization layer between the service and the infrastructure
layers decouples a server not only from the underlying physical
resource but also from its physical location, without requiring any modification within the service layers from either the service administrator or the end-user perspective. <strong>Such decoupling is the key to supporting
the scale-out of an infrastructure in order to supplement local
resources with cloud resources to satisfy peak or fluctuating demands. </strong></p>
<p mce_style="text-align: center;" style="text-align: center;"><a href="http://blog.dsa-research.org/wp-content/uploads/2008/10/untitled2.png" mce_href="http://blog.dsa-research.org/wp-content/uploads/2008/10/untitled2.png"><img width="500" height="141" class="alignnone size-full wp-image-100 aligncenter" title="untitled2" src="http://blog.dsa-research.org/wp-content/uploads/2008/10/untitled2.png" mce_src="http://blog.dsa-research.org/wp-content/uploads/2008/10/untitled2.png" /></a></p>
<p>Getting back to the Grid computing case, the virtualization of a Grid site provides several benefits, which <strong>overcome many of the technical barriers for Grid adoption</strong>:</p>
<ul><li>Easy support for VO-specific worker nodes</li>

<li>Reduced gridification cycles</li>

<li>Dynamic balance of resources between VOs</li>

<li>Fault tolerance of key infrastructure components</li>

<li>Easier deployment and testing of new middleware distributions</li>

<li>Distribution of pre-configured components</li>

<li>Cheaper development nodes</li>

<li>Simplified deployment of training machines</li>

<li>Performance partitioning between local and grid services</li>

<li>On-demand access to cloud providers</li></ul>
<p>If you are interested in more details about how virtualization
and cloud computing can support compute Grid infrastructures you can
have a look at my presentation <a href="http://indico.cern.ch/sessionDisplay.py?sessionId=15&amp;slotId=0&amp;confId=32220#2008-09-25" mce_href="http://indico.cern.ch/sessionDisplay.py?sessionId=15&amp;slotId=0&amp;confId=32220#2008-09-25">"An Introduction to Virtualization and Cloud Technologies to Support Grid Computing"</a> (<a href="http://egee08.eu-egee.org/" mce_href="http://egee08.eu-egee.org/">EGEE08</a>). I also recommend the report <a href="http://news.eu-egee.com/news-detail/article/clouds-v-grid-5.html" mce_href="http://news.eu-egee.com/news-detail/article/clouds-v-grid-5.html">"An EGEE Comparative study: Clouds and grids - evolution or revolution?"</a>. </p>
<p>Technology that supports the above use case already exists. The <a href="http://www.opennebula.org/" mce_href="http://www.opennebula.org">OpenNebula engine</a>
enables the dynamic deployment and re-allocation of virtual machines on
a pool of physical resources, and supports on-demand access to <a href="http://aws.amazon.com/ec2/" mce_href="http://aws.amazon.com/ec2/">Amazon EC2 resources</a>. Meanwhile, <a href="http://workspace.globus.org/" mce_href="http://workspace.globus.org/">Globus Nimbus</a>
provides a free, open source infrastructure for remote deployment and
management of virtual machines, allowing you to create compute clouds.</p>
<h2><strong>There will be a Grid under the Cloud</strong></h2>
<p>There is a growing interest in the <strong>federation of cloud sites</strong>. Cloud providers are opening new infrastructure centers at different geographical locations (see <a href="http://www-03.ibm.com/press/us/en/pressrelease/25196.wss" mce_href="http://www-03.ibm.com/press/us/en/pressrelease/25196.wss">IBM</a> or<a href="http://developer.amazonwebservices.com/connect/entry.jspa?externalID=1347" mce_href="http://developer.amazonwebservices.com/connect/entry.jspa?externalID=1347"> Amazon Availability Zones</a>)
and it is clear that no single facility/provider can create a seemingly
infinite infrastructure capable of serving massive amounts of users at
all times, from all locations. <a href="http://en.wikipedia.org/wiki/David_Wheeler_%28computer_scientist%29" mce_href="http://en.wikipedia.org/wiki/David_Wheeler_(computer_scientist)">David Wheeler</a> once said, <em>"Any problem in computer science can be solved with another layer of indirection… But that usually will create another problem"</em>.
In the same vein, federation of cloud sites involves many technological
and research challenges, but the good news is that some of them are not
new, and have already been studied and solved by the Grid community.</p>
<p>As stated above, Grid is not only about computing. <strong>Grid is a technology for federation</strong>.
In recent years, there has been a huge investment in research and
development of technological components for sharing resources across
sites. Several middleware components for file transfer, SLA
negotiation, QoS, accounting, monitoring... are available, most of them
open source. As also predicted by Ian Foster in his post "<a href="http://ianfoster.typepad.com/blog/2008/01/theres-grid-in.html" mce_href="http://ianfoster.typepad.com/blog/2008/01/theres-grid-in.html">There's Grid in them thar Clouds",</a>
those will be the components that could enable the federation of cloud
sites. On the other hand, other components have to be defined and
developed from scratch, mainly those related to the efficient
management of virtual machines and services within and across
administrative domains. That is exactly the aim of the <a href="http://www.reservoir-fp7.eu/" mce_href="http://www.reservoir-fp7.eu/">Reservoir project</a>, the European initiative in Cloud Computing.</p>
<h2><strong>Conclusions</strong></h2>
<p>In order to conclude this post let me <strong>venture some predictions about the coexistence of Grid and Cloud computing in future infrastructures</strong>:</p>
<ul><li>Virtualization, cloud, grid and cluster are complementary
technologies that will coexist and cooperate at different levels of
abstraction</li>

<li>Although there are early adopters of virtualization in the
Grid/cluster/HPC community, its full potential has not been exploited
yet</li>

<li>In a few years, the separation of job management from resource
management through a virtualized infrastructure will be a common
practice</li>

<li>Emerging open-source VM managers, such as OpenNebula, will contribute to speed up the adoption</li>

<li>Grid/cluster/HPC infrastructures will maintain a resource base
scaled to meet the average workload demand and will transparently
access cloud providers to meet peak demands</li>

<li>Grid technology will be used for the federation of clouds</li></ul>
<p>In summary, let's try to forget about the hype and concentrate on the
complementary functionality provided by both paradigms. My message to
the user community: the relevant issue is to evaluate which technology
meets your requirements. It is unlikely that a <em>single technology will meet</em> all
needs. My message to the Grid community: please do not see Cloud as a
threat. Virtualization and Cloud are needed to overcome many of the
technical barriers to wider Grid adoption. My message to the Cloud
community: please try to take advantage of the research and development
performed by the Grid community in the last decade.</p>
<p><a href="http://dsa-research.org/doku.php?id=people:llorente" mce_href="http://dsa-research.org/doku.php?id=people:llorente">Ignacio Martín Llorente</a></p>

<p><a href="http://blog.dsa-research.org/?p=84">Reprinted from blog.dsa-research.org</a> </p>

</div></content>


    <feedburner:origLink>http://gridgurus.typepad.com/grid_gurus/2008/10/cloud-and-grid.html</feedburner:origLink></entry>
 
</feed>
