<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0">
<channel>
  <title>What's That Noise?! [Ian Kallen's Weblog]</title>
  <link>http://www.arachna.com/roller/spidaman/</link>
      
    <description>percolations | ruminations | illustrations | inspirations</description>
  <language>en-us</language>
  <copyright>Copyright 2009</copyright>
  <lastBuildDate>Thu, 10 Sep 2009 12:50:12 -0700</lastBuildDate>
  <generator>Apache Roller (incubating) 4.0 (20071120033321:dave)</generator>
        <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/spidaman" type="application/rss+xml" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/hub_a_dub_bub_feed</guid>
    <title>Hub-a-dub-bub, feed clouds in a tub</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/hub_a_dub_bub_feed</link>
        <pubDate>Thu, 10 Sep 2009 12:35:37 -0700</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;

&lt;a href="/roller/spidaman/entry/hub_a_dub_bub_feed" title="Pubsubhubbub and rssCloud"&gt;
&lt;img src="http://www.arachna.com/images/push_rsscloud.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="171" height="52" /&gt;&lt;/a&gt;
While at Technorati, I observed a distinct shift around summer of 2005 in the flow of pings from largely worthwhile pings to increasingly worthless ones. A ping is a simple message with a URL. Invented by Dave Winer, it was originally implemented with XML-RPC but RESTful variants also emerged. The intention of the message is "this URL updated." For a blog, the URL is the main page of the blog; it's not the feed URL, a post URL ("permalink") or any other resource on the site - it's just the main page. Immediately, that narrow scope has a problem; as a service that should &lt;em&gt;do something&lt;/em&gt; with the ping, the notification has no more information about &lt;em&gt;what&lt;/em&gt; changed on the site. A new post? Multiple new posts? Um, what are the new post URL(s)? Did the blogroll change? And so on. Content fetching and analysis is cheap; network latencies, parsing content and doing interesting stuff with it is easy at low volumes but is difficult to scale.
&lt;/p&gt;
&lt;p&gt;
In essence, a ping is a cheap message to prompt an expensive analysis to detect change. It's an imbalanced market of costs and benefits. Furthermore, even if the ping had a richer payload of information, pings lack another very important component: authenticity. These cheap messages can be produced by anybody for any URL and the net result I observed was that lots of people produced pings for lots of things, most of which weren't representative of real blogs (other types of sites or just spam) or real changes on blogs, just worthless events that were resource intensive to operate on. When I left Technorati in March (2009), we were getting around 10 million pings a day, roughly 95% of them of no value. Pings are the SMTP of the blogosphere; weak identity systems, spammy and difficult to manage at scale.
&lt;/p&gt;
&lt;p&gt;
Besides passively waiting for pings, the other method to find things that have changed is to poll sites. But &lt;a href="http://code.google.com/p/pubsubhubbub/wiki/WhyPollingSucks"&gt;polling is terribly inefficient&lt;/a&gt;. To address those inefficiencies, FriendFeed implemented &lt;a href="http://code.google.com/p/simpleupdateprotocol/"&gt;Simple Update Protocol&lt;/a&gt;. SUP is reminiscent of another Dave Winer invention, &lt;a href="http://oldweblogscomblog.scripting.com/changesXml"&gt;changes.xml&lt;/a&gt; however SUP accounts for discovery and offers a more compact format.
&lt;/p&gt;
&lt;p&gt;
But SUP wasn't the first effort to address the aforementioned deficiencies, 2005 saw a lot of activity around something called "feedmesh." Unfortunately, the activity degenerated into a lot of babble; noise I attribute to unclear objectives and ego battles but I'm sure others have their own interpretation. The feedmesh discussion petered out and little value beyond ping-o-matic and other ping relayers emerged from it. Shortly thereafter SixApart created their &lt;a href="http://www.sixapart.com/labs/update/"&gt;atom stream&lt;/a&gt;, I think spearheaded by Brad Fitzpatrick. The atom stream is essentially a long lived HTTP connection that streams atom elements to the client. The content flow was limited to SixApart's hosted publishing platforms (TypePad, LiveJournal and Vox) and the reliability wasn't that great, at least in the early days, but it was a big step in the right direction for the blog ecosystem. The atom stream was by far the most efficient way to propogate content from the publishing platform to interested parties such as Technorati operating search, aggregation and analytic systems. It eliminates the heavyweight chatter of cheap ping messages and the heavyweight process that follows: fetch page, discover feed, fetch feed, inspect contents and do stuff with it.
&lt;/p&gt;
&lt;p&gt;
So here we are in 2009 and it feels like deja-vu all over again. This time Dave is promoting &lt;a href="http://rsscloud.org/walkthrough.html"&gt;rssCloud&lt;/a&gt;. rssCloud does a waltz of change notification; the publisher notifies a hub, the hub broadcasts notifications to event subscribers and the subscriber does the same old content fetch and analysis cycle above. rssCloud seems fundamentally flawed in that it is dependent on IP addresses as identifiers. Notification receivers who must use new a IP address must re-subscribe. I'm not sure how an aggregator should distribute notificaton handling load across a pool of IP addresses. The assumption is that notification receivers will have stable, singular IP addresses; rssCloud appears scale limited and a support burden.  The focus on specification simplicity has its merits, I think we all hate gratuitous complexity (observe the success of JSON over SOAP). However, Dave doesn't regard system operations as an important concern; he'll readily evangelize a message format and protocol but the real world operability is Other Peoples Problem.
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://code.google.com/p/pubsubhubbub/"&gt;Pubsubhubub&lt;/a&gt; (hey, Brad again) has a similar waltz but eliminates the content fetch and analysis cycle ergo it's fundamentally more efficient than rssCloud's waltz. Roughly, Pubsubhubub is to the SixApart atom stream what rssCloud is to old school XML-RPC ping. If I were still at Technorati (or working on event clients like Seesmic, Tweetdeck, etc), I would be implementing Pubsubhubbub and taking a pass on rssCloud.  With both systems, I'd be concerned with a few issues. How can the authenticity of the event be trusted? Yes, we all like simplicity but looking at SMTP is apt; now mail systems must be aware of SPF, DKIM, SIDF, blahty blah blah. Its common for mail clients to access MTA's over cryptographically secure connections. Mail is a mess but it works, albeit with a bunch of junk overlaid. I guess this is why &lt;a href="http://www.waveprotocol.org/"&gt;Wave Protocol&lt;/a&gt; has gathered interest. Anyway, Pubsubhubbub has a handshake during the subscription processes, though that looks like a malicious party could still spin up endpoints with bogus POSTs. I'd like to see an OAuth or digest authentication layer in the ecosystem. Yea, it's a little more complicated, but nothing onerous... suck it up. Whatever. Brad knows the authenticity rap, I mean he also invented OpenID (BTW, we adopted &lt;a href="http://technorati.com/weblog/2006/10/144.html"&gt;OpenID at Technorati back in 2007&lt;/a&gt;). At Technorati we had to implement ping throttling to combat extraneous ping bots; either daemons or cron jobs that just ping continuously, whether the URL has changed or not. You can't just blacklist that URL, we don't know it was the author generating those pings, there's no identity authenticity there. We resorted to blocking IP addresses but that scales poorly and creates other support problems. We had 1000's of domain names blocked and whole class C networks blocked but it was always a wackamole exercise; so much for an open blogosphere, a whitelist is the only thing that scales within those constraints. Meanwhile, back to rssCloud and Pubsubhubbub, what event delivery guarantees can be made? If a data center issue or something else interrupts event flow, are events spooled so that event consumption can be caught up? How can subscribers track the authenticity of the event originators? How can publishers keep track of who their subscribers are? Well, I'm not at Technorati anymore so I'm no longer losing any sleep over these kinds of concerns but do I care about the ecosystem. For more, &lt;a href="http://grack.com/blog/"&gt;Matt Mastracci&lt;/a&gt; is posting interesting stuff about Pubsubhubbub and a sound comparison was posted to &lt;a href="http://www.techcrunch.com/2009/09/09/rsscloud-vs-pubsubhubbub-why-the-fat-pings-win/" title="RSSCloud Vs. PubSubHubbub: Why The Fat Pings Win"&gt;Techcrunch&lt;/a&gt; yesterday.
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://technorati.com/tag/rsscloud" rel="tag"&gt;rsscloud&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/pubsubhubbub" rel="tag"&gt;pubsubhubbub&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/ping" rel="tag"&gt;ping&lt;/a&gt; &amp;nbsp;
&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/business_models_in_consulting_contracting</guid>
    <title>Business Models in Consulting, Contracting and Training</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/business_models_in_consulting_contracting</link>
        <pubDate>Sun, 2 Aug 2009 20:57:08 -0700</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
Since leaving &lt;a href="http://technorati.com/"&gt;Technorati&lt;/a&gt; last spring, I've been working independently with a few entrepreneurs on their technical platforms. This has mostly entailed working with tools around Infrastructure-As-A-Service (&lt;a href="http://aws.amazon.com/"&gt;AWS&lt;/a&gt;), configuration management (&lt;a href="http://wiki.opscode.com/display/chef/Home"&gt;Chef&lt;/a&gt; and &lt;a href="http://www.rightscale.com/"&gt;RightScale&lt;/a&gt;), search (&lt;a href="http://lucene.apache.org/solr/"&gt;Solr&lt;/a&gt;) and learning a lot about programming and going into production with ruby and rails. I've spoken to a number of friends and acquaintances who are working as consultants, contractors, technical authors and trainers. Some are working as lone-wolves and others working within or have founded larger organizations. I'm always sniffing for where the upside opportunities are and the question that comes to my mind is: how do such businesses scale?
&lt;/p&gt;
&lt;p&gt;
A number of technology services companies that I've taken notice of have been funded in the last year or so including &lt;a href="http://www.opscode.com/"&gt;OpsCode&lt;/a&gt;, &lt;a href="http://reductivelabs.com/"&gt;Reductive Labs&lt;/a&gt;, &lt;a href="http://cloudera.com/"&gt;Cloudera&lt;/a&gt; and &lt;a href="http://www.lucidimagination.com/"&gt;Lucid Imagination&lt;/a&gt;. I think all of these guys are in great positions; virtual infrastructure (which is peanut butter to the chocolate of IaaS), big data and information retrieval technologies provide the primordial goo that will support new mobile, real time and social software applications. They are all working in rapid innovation spaces that hold high value potentials but also new learning and implementation challenges that rarefy specialized knowledge.
&lt;/p&gt;
&lt;p&gt;
Years ago when I was working with Covalent Technologies, we tried to build a business around "enterprise open source" with Apache, Tomcat, etc as the basis. Packaging and selling free software is difficult. On the one hand, offering a proven technology stack configuration to overcome the usual integration and deployment challenges as well as providing a support resource is really valuable to Fortune 500's and such. However, my experience there and observations of what's happened with similar businesses (such as Linuxcare and SpikeSource) has left me skeptical how big the opportunity is. After all, while you're competing with the closed-source proprietary software vendors, you're also competing with Free. 
&lt;/p&gt;
&lt;p&gt;
The trend I'm noticing is the branching out away from the packaging and phone support and into curriculum. Considering that most institutional software technology education, CS degrees, extended programs, etc have curricula that are perpetually behind the times, it makes sense that the people who possess specialized knowledge on the bleeding edge lead the educational charge. &lt;a href="http://www.lucidimagination.com/How-We-Can-Help/Training/Online-Training-Schedule"&gt;Lucid Imagination&lt;/a&gt;, &lt;a href="http://www.cloudera.com/hadoop-training"&gt;Cloudera&lt;/a&gt; and &lt;a href="http://www.scaleunlimited.com/courses"&gt;Scale Unlimited&lt;/a&gt; are illustrating this point. While on-premise training can be lucrative, I think online courseware may provide a good answer to the business scale question.
&lt;/p&gt;
&lt;p&gt;
For myself, I'm working with and acquiring knowledge in these areas tactically. Whatever my next startup will be, it should be world-changing and lucrative. And I'll likely be using all of these technologies. Thank goodness these guys are training the workforce of tomorrow!
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://technorati.com/tag/startups" rel="tag"&gt;startups&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/scalability" rel="tag"&gt;scalability&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/entrepreneuring" rel="tag"&gt;entrepreneuring&lt;/a&gt; &amp;nbsp;
&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/configuring_web_clouds_with_chef</guid>
    <title>Configuring Web Clouds with Chef</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/configuring_web_clouds_with_chef</link>
        <pubDate>Thu, 21 May 2009 12:30:07 -0700</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://code.fotonauts.com/blog/2009/05/introducing-casserole.html" title="Casserole"&gt;
&lt;img src="http://www.arachna.com/images/casserole-icon-small-725869.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="128" height="128" /&gt;&lt;/a&gt;
I'm not generally passionate about network and system operations, I prefer to focus my attention and creativity on system and software architectures. However, infrastructure provisioning, application deployment, monitoring and maintenance are facts of life for online services. When those basic functions aren't functioning well, then I get passionate about them. When service continuity is impacted and operations staff are overworked, it really bothers me; it tells me that I or other developers I'm working with are doing a poor job of delivering resilient software. I've had many conversations with folks who've accepted as a given that development teams and operations teams have friction between them; some even suggest that they &lt;em&gt;should&lt;/em&gt;. After all, so goes that line of thinking, the developers are graded on how rapidly they implement features and fix bugs whereas the operators are graded on service availability and performance. Well, you can sell that all you want but I won't buy it. 
&lt;/p&gt;
&lt;p&gt;
In my view, developers need to deliver software that can be operated smoothly and operators need to provide feedback on how smoothly the software is operating; dev and ops must collaborate. I accept as a given that developers
&lt;ul&gt;
&lt;li&gt;Use source control&lt;/li&gt;
&lt;li&gt;Write unit tests (after the fact or before/during &lt;a href="http://en.wikipedia.org/wiki/Test-driven_development"&gt;TDD&lt;/a&gt; style)&lt;/li&gt;
&lt;li&gt;Write functional and integration tests&lt;/li&gt;
&lt;li&gt;Maintain a build system for running test harnesses and packaging code&lt;/li&gt;
&lt;li&gt;Document internal architecture and operating interfaces&lt;/li&gt;
&lt;li&gt;Plan for change with respect to scale charactistics and functionality&lt;/li&gt;
&lt;/ul&gt;
Conversely, I accept as a given that operators
&lt;ul&gt;
&lt;li&gt;Use configuration management&lt;/li&gt;
&lt;li&gt;Automate infrastructure provisioning, code deployment and rollback&lt;/li&gt;
&lt;li&gt;Monitor infrastructure and application metrics&lt;/li&gt;
&lt;/ul&gt;
I don't want to oversimplify, there's more to the obligations that dev and ops have to each other in order to collaborate effectively. What I've noticed though is that a lot of operators might be skilled with configuring specific server infrastructure or performing OS analysis but configuration management and automation requires using really good tools that they lack. I've seen situations where the available tools are perceived as too complicated and so tools are developed that usually consist of a lot of specialized shell scripts (or perhaps it's just plain old NIH). &lt;a href="http://www.cfengine.org/"&gt;Cfengine&lt;/a&gt; is a good start but the reports I have is that it's difficult to work with and, if you're not very careful, may automatically manage to &lt;em&gt;mis&lt;/em&gt;configure your systems. &lt;a href="http://reductivelabs.com/products/puppet/"&gt;Puppet&lt;/a&gt; was developed to be a more powerful system for configuration management but the feedback I've seen on it is adding new functionality is hard; it has its own configuration language and when you want to extend it you have to deal with a lot of complicated mechanisms. &lt;a href="http://wiki.opscode.com/display/chef/Home"&gt;Chef&lt;/a&gt; was developed to answer that frustration; by making the configuration language a &lt;a href="http://en.wikipedia.org/wiki/Domain-specific_programming_language"&gt;DSL&lt;/a&gt; on top of an already widely used scripting language (ruby), the chef code base provides an easier way to extend it and the chef codebase itself is reportedly an order of magnitude smaller and simpler than puppet's (cavaet: I generally distrust &lt;a href="http://en.wikipedia.org/wiki/Source_lines_of_code"&gt;SLoC&lt;/a&gt; metrics but just sayin').
&lt;/p&gt;
&lt;p&gt;
So I've been giving Chef a test-drive for this infrastructure-on-EC2 management project that's been cooking. The system implemented the following use cases:
&lt;ol&gt;
&lt;li&gt;Launch web app servers on EC2 with Apache, &lt;a href="http://www.modrails.com/"&gt;Passenger&lt;/a&gt;, RoR (+other gems) and overlay a set of rails apps out of git&lt;/li&gt;
&lt;li&gt;Launch a pair of reverse proxies (with ha-proxy) in front of the app servers - and reconfigure them when the set of app servers is expanded or contracted&lt;/li&gt;
&lt;li&gt;Configure the proxy for failover with &lt;a href="http://www.linux-ha.org/HeartbeatProgram"&gt;heartbeat&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Add new rails apps to the set of app servers&lt;/li&gt;
&lt;li&gt;Updating/rolling back rails apps&lt;/li&gt;
&lt;/ol&gt;
The system is enabled through a combination of the EC2 API (via &lt;a href="http://rubyforge.org/projects/rightaws/"&gt;RightAWS&lt;/a&gt;) and Chef's REST API as well as using &lt;a href="http://github.com/ezmobius/chef-deploy/tree/master"&gt;chef-deploy&lt;/a&gt; (think: Capistrano run by a system provisioning agent) to augment Chef's functionality. So far, it seems to be working great!
&lt;/p&gt;
&lt;p&gt;
 There's a lot of energy in the Chef community (check out &lt;a href="http://code.fotonauts.com/blog/2009/05/introducing-casserole.html"&gt;Casserole&lt;/a&gt;), combined with monitoring, log management and cloud technologies, I think there's a lot of IT streamlining ahead. Perhaps the old days of labor and communication intensive operations will give way to a new era of autonomic computing. I'll post further about some of the mechanics of working with ruby, rails, chef, EC2, chef-deploy and other tools in the weeks ahead (particularly now that EC2 has native &lt;a href="http://developer.amazonwebservices.com/connect/ann.jspa?annID=446"&gt;load balancing, monitoring and auto-scaling&lt;/a&gt; capabilities). I'll also talk a bit about this stuff at a &lt;a href="http://en.oreilly.com/velocity2009/public/cfp/66"&gt;Velocity BoF&lt;/a&gt;. If you're thinking about attending Velocity, O'Reilly is offering &lt;a href="http://en.oreilly.com/velocity2009"&gt;30% off to the first 30 people to register today with the code vel09d30&lt;/a&gt; &lt;strong&gt;today&lt;/strong&gt; (no I'm not getting any kinduva kickback from O'Reilly). And you can catch &lt;a href="http://en.oreilly.com/velocity2009/public/schedule/detail/8324"&gt;Infrastructure in the Cloud Era&lt;/a&gt; with &lt;em&gt;Adam Jacob (Opscode), Ezra Zygmuntowicz (EngineYard) &lt;/em&gt; to learn more about Chef and cloud management.
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://technorati.com/tag/chef" rel="tag"&gt;chef&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/puppet" rel="tag"&gt;puppet&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/cfengine" rel="tag"&gt;cfengine&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/ec2" rel="tag"&gt;ec2&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/aws" rel="tag"&gt;aws&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/ruby" rel="tag"&gt;ruby&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/cloud+computing" rel="tag"&gt;cloud computing&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/velocity" rel="tag"&gt;velocity&lt;/a&gt; &amp;nbsp;
&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/cloning_vmware_machines</guid>
    <title>Cloning VMware Machines</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/cloning_vmware_machines</link>
        <pubDate>Tue, 31 Mar 2009 09:12:22 -0700</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://www.vmware.com/" title="VMware"&gt;
&lt;img src="http://www.arachna.com/images/vmware_black_logo.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="187" height="72" /&gt;&lt;/a&gt;
I bought a copy of &lt;a href="http://www.vmware.com/products/fusion/"&gt;VMware Fusion&lt;/a&gt; on special from &lt;a href="http://www.smithmicro.com/"&gt;Smith Micro&lt;/a&gt; (icing on the cake: they had a 40% off special that week) specifically so I could simulate a network of machines on my local MacBook Pro. While I've heard good things about &lt;a href="http://www.virtualbox.org/"&gt;Virtual Box&lt;/a&gt;, one of the other key capabilities I was looking for from MacIntosh virtualization software was the ability convert an existing Windows installation to a virtual machine. VMware reportedly has the best tools for that kind of thing. I have an aging Dell with an old XP that I'd like to preserve when I finally decide to get rid of the hardware; when it's time to Macify, I'll be good to go.
&lt;/p&gt;
&lt;p&gt;
I started building my virtual network very simply, by &lt;a href="http://lists.centos.org/pipermail/centos-virt/2008-February/000203.html"&gt;creating a CentOS VM&lt;/a&gt;. Once I had my first VM running, I figured I could just grow the network from there; I was expecting to find a "clone" item in the Fusion menus but alas, no joy. So, it's time to hack. Looking around at the artifacts that Fusion created, a bunch of files in a directory named for the VM, I started off by copying the directory, the files it contained that had the virtual machine name as components of the file name and edited the metadata files ({vm name}.vmdk/.vmx/.vmxf). Telling Fusion to launch that machine, it prompted if this was a copy or a moved VM - I told it that it was copied and the launch continued. Both launched VM's could ping each other so voila: my virtual network came into existence. 
&lt;/p&gt;
&lt;p&gt;
I've since found &lt;a href="http://communities.vmware.com/docs/DOC-5611"&gt;another procedure to create "linked clones" in VMware Fusion&lt;/a&gt;. It looks like this will be really useful for my next scenario of having two different flavors of VM's running on my virtual network. The setup I want to get to is one where I can have "manager" host (to run provisioning, monitoring and other management applications) and cookie-cutter "worker" hosts (webservers, databases, etc). Ultimately, this setup will help me tool up for cloud platform operations; I have more Evil Plans there.
&lt;/p&gt;
&lt;p&gt;
So all of this has me wondering: why doesn't VMware support this natively? Where's that menu option I was looking for? Is there an alternative to this hackery that I just overlooked?
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://technorati.com/tag/vmware" rel="tag"&gt;vmware&lt;/a&gt; &amp;nbsp; 
&lt;a href="http://technorati.com/tag/virtualization" rel="tag"&gt;virtualization&lt;/a&gt; &amp;nbsp; 
&lt;a href="http://technorati.com/tag/centos" rel="tag"&gt;centos&lt;/a&gt; &amp;nbsp; 
&lt;a href="http://technorati.com/tag/vmware+fusion" rel="tag"&gt;vmware fusion&lt;/a&gt; &amp;nbsp; 
&lt;a href="http://technorati.com/tag/vm+cloning" rel="tag"&gt;vm cloning&lt;/a&gt; &amp;nbsp; 

&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/going_to_metallica_s_rock</guid>
    <title>Going to Metallica's Rock and Roll Hall of Fame Induction</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/going_to_metallica_s_rock</link>
        <pubDate>Thu, 12 Mar 2009 12:22:51 -0700</pubDate>
    <category>Music</category>
            <description>&lt;p&gt;
&lt;a href="http://www.rockhall.com/" title="Metallica - Rock and Roll Hall of Fame"&gt;
&lt;img src="http://www.arachna.com/images/metallica_rrhof_invite.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="295" height="137" /&gt;&lt;/a&gt;
Those &lt;a href="http://www.time.com/time/arts/article/0,8599,1877187,00.html"&gt;25 things you should know about me&lt;/a&gt; memes circulating rarely interest me (honestly, I don't care that you have a collection of rare El Salvadoran currency). However, one thing that my friends know but regular readers may not is that I have a fairly eclectic background. Did you know that I used to hang around the &lt;a href="http://art.sfsu.edu/"&gt;art department's hot glass studio in college&lt;/a&gt; to blow glass? Did you know that I learned to program in &lt;a href="http://en.wikipedia.org/wiki/Pascal_(programming_language)"&gt;Pascal&lt;/a&gt; when I was in college and hated it? Yea, yea, I don't care much anymore either. But anyway, back in the 80's I was friends with this Danish dude from LA who shared my interest in the underground heavy metal scene that was burgeoning, particularly in Britain ("New Wave of British Heavy Metal" AKA NWOBHM) and Europe. We used to trade records and demos (the first &lt;em&gt;Def Leppard&lt;/em&gt; 3 song EP on 9" vinyl, I was tired of it so I traded him for a bunch of &lt;em&gt;Tygers of Pan Tang&lt;/em&gt; and other crap I didn't own already). I think he, like myself, used to pick up copies of Melody Maker and Sounds at the local record store to read about what was going on overseas. Eventually, &lt;a href="http://www.kerrang.com/"&gt;Kerrang!&lt;/a&gt; came out providing fuller coverage of the metal scene, complete with glossy pictures. But in the meantime back in San Francisco, I helped a friend of mine (Ron Quintana) operate his fanzine &lt;em&gt;Metal Mania&lt;/em&gt; (don't be confused, the name was re-appropriated by various larger publishing concerns at different times in the years since but none of them had any relationship to the original gangstas). 
&lt;/p&gt;
&lt;p&gt;
Back in the day, &lt;a href="http://downwithtyranny.blogspot.com/"&gt;Howie Klein&lt;/a&gt; was a muckety muck in the music industry, haunting the local clubs like &lt;em&gt;The Old Waldorf&lt;/em&gt; and &lt;em&gt;Mabuhay Gardens&lt;/em&gt;. Howie hooked us up with a show on &lt;a href="http://kusf.org/"&gt;KUSF&lt;/a&gt;. I dubbed the show &lt;a href="http://www.rampageradio.org/"&gt;Rampage Radio&lt;/a&gt;, it ran in the wee hours every Saturday night (right after &lt;a href="http://www.bigrick.fm/"&gt;Big Rick Stuart&lt;/a&gt; finished up his late night reggae show with those dudes from Green Apple Records on Clement Street). In between hurling insults at "album oriented rock" and big-hair metal bands (posers!), we played a lot of stuff you couldn't hear anywhere else. Among the many obscure noises we aired were demos from East Bay metalheads &lt;a href="http://www.exodusattack.com/"&gt;Exodus&lt;/a&gt;. Amazingly, Rampage Radio is &lt;em&gt;still on the air&lt;/em&gt;. Well, that Danish kid and one of the guys I befriended from Exodus were Lars Ulrich and Kirk Hammett, respectively. In short order, they would be playing together in a band Lars named &lt;a href="http://www.metallica.com/"&gt;Metallica&lt;/a&gt; (after haggling with Ron about &lt;em&gt;not&lt;/em&gt; taking that name for the 'zine).
&lt;/p&gt;
&lt;p&gt;
I eventually lost interest in the metal &lt;em&gt;scene&lt;/em&gt; (not enough innovation, too much sound-alike derivatives to keep me listening); even though the music from then is still on my playlist, my repertoire has broadened widely (talk to me about gypsy style string jazz, please). I've been peripherally in touch with friends from back then. Over the years, I'd go to a few Metallica shows but the guys are always mobbed at the backstage parties, there's not much of an opportunity to actually talk about anything. Anyway, we have little in common now. I develop software and &lt;a href="http://technorati.com/"&gt;crazy assed online services&lt;/a&gt;; they tour the world to perform in front of throngs. And I don't drink Jaegermeister anymore. In 2000, I introduced one of the friends I've stayed in touch with, &lt;a href="http://haggisbuffet.blogspot.com/"&gt;Brian Lew&lt;/a&gt;, who also had a fanzine Back In The Day, to editors at &lt;a href="http://www.salon.com/"&gt;salon.com&lt;/a&gt; (where I was working at the time). He contributed a great article expressing a sentiment that I shared, &lt;a href="http://archive.salon.com/ent/log/2000/05/09/metallica_fan/index.html" title="Metallica, how could you?"&gt;dismay at Metallica's war on Napster&lt;/a&gt;. I don't think I've actually talked to Lars in 15 years. After seeing news coverage of him ranting about how people (his most valued asset: his fans) where ripping him off, I'm not sure I wanted to. But I think we're all over that now, let's just play &lt;a href="http://www.rockband.com/news/metallica_lets_fans_dig_deeper"&gt;Rock Band&lt;/a&gt; and fuhgedaboutit.
&lt;/p&gt;
&lt;p&gt;
So here we are decades later and Metallica hasn't just warped the music industry, they &lt;em&gt;are&lt;/em&gt; the industry. They're up there with Elvis and the Beatles and all of that (except, barring Cliff Burton, they're not dead). Last week, Brian pings me that Q-Prime (Metallica's management company in New York) is trying to reach me. After a few phone calls, it turns out that Metallica is honoring a handful of us old-schoolers by inviting us to a big shindig in Cleveland for their &lt;a href="http://www.rockhall.com/"&gt;Rock and Roll Hall of Fame&lt;/a&gt; induction next month. How cool is that?! I'm still kind blown away that this is really happening (am I being punkd??). 
&lt;/p&gt;
&lt;p&gt;
So, I may be &lt;a href="http://www.arachna.com/roller/spidaman/entry/more_changes_at_technorati_this"&gt;leaving Technorati&lt;/a&gt; but I'm going to the Rock and Roll Hall of Fame! w00t! That tune keeps humming through my conscience, "...living in sin with a safety pin, Cleveland rocks! Cleveland rocks!" but the way it sounds in my head, it's ganked up, roaring from a massive PA and a wall of &lt;a href="http://www.marshallamps.com/"&gt;Marshall&lt;/a&gt; stacks. So now you know what my plans will be in a few weeks and now you've learned a dozen or so things about me (if not 25) that you may not have known before.
&lt;/p&gt;
&lt;p&gt;
&lt;em&gt;
&lt;a href="http://technorati.com/tag/metallica" rel="tag"&gt;metallica&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/rock+and+roll+hall+of+fame" rel="tag"&gt;rock and roll hall of fame&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/metal" rel="tag"&gt;metal&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/kusf" rel="tag"&gt;kusf&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/cleveland" rel="tag"&gt;cleveland&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/nwobhm" rel="tag"&gt;nwobhm&lt;/a&gt; &amp;nbsp;

&lt;/em&gt;
&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/more_changes_at_technorati_this</guid>
    <title>More Changes At Technorati (this time, it's personal)</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/more_changes_at_technorati_this</link>
        <pubDate>Tue, 10 Mar 2009 14:06:20 -0700</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://technorati.com/" title="Technorati"&gt;
&lt;img src="http://www.arachna.com/images/technorati_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="110" height="15" /&gt;&lt;/a&gt;
My post &lt;a href="http://www.arachna.com/roller/spidaman/entry/new_crawlers_at_technorati" title="New Crawlers At Technorati"&gt;last week&lt;/a&gt; focused on some of the technology changes that I've been spearheading at &lt;a href="http://technorati.com/"&gt;Technorati&lt;/a&gt; but this time, I have a personal change to discuss. When I joined Technorati in 2004, the old world of the web was in shambles. The 1990's banner-ads-on-a-CPM-basis businesses were collapsed. The editorial teams using big workflow-oriented content management system (CMS) infrastructure (which I worked on in the 90's) were increasingly eclipsed by the ecosystem of blogs. Web 2.0 wasn't yet the word on everyone's lips. But five years ago, &lt;a href="http://www.sifry.com/alerts"&gt;Dave Sifry&lt;/a&gt;'s infectious vision for providing "connective tissue" for the blog ecosystem, tapping the attention signals and creating an emergent distributed meta-CMS helped put it there. Being of service to bloggers just sounded too good, so I jumped aboard. 
&lt;/p&gt;
&lt;p&gt;
Through many iterations of blogospheric expansion, building data flow, search and discovery applications, dealing with data center outages (and migrations) and other adventures, it's been a long strange trip. I've made a lot of fantastic friends, contributed a lot of insight and determination and learned a great deal along the way. I am incredibly proud of what we've built over the last five years. However today it's time for me to move on, my last day at Technorati will be next week.
&lt;/p&gt;
&lt;p&gt;
Technorati has a lot of great people, technology and possibilities. The aforementioned crawler rollout provides the technology platform with a better foundation that I'm sure &lt;a href="http://dorion.blogspot.com/"&gt;Dorion&lt;/a&gt; and the rest of the team will build great things on. The ad platform will create an abundance of valuable opportunities for bloggers and other social media. I know from past experiences what a successful media business looks like and under &lt;a href="http://www.jalichandra.com/" title="Richard Jalichandra"&gt;Richard Jalichandra&lt;/a&gt;'s leadership, I see all of the right things happening.  The ad platform will leverage Technorati's social media data assets with the publisher and advertiser tools that will make Technorati an ad delivery powerhouse. I'm going to remain a friend of the company's and do what I can to help its continued success, but I will be doing so from elsewhere.
&lt;/p&gt;
&lt;p&gt;
I want to take a moment to thank all of my colleagues, past and present, who have worked with me to get Technorati this far. The brainstorms, the hard work, the arguments and the epiphanies have been tremendously valuable to me. &lt;em&gt;Thank You!&lt;/em&gt;
&lt;/p&gt;
&lt;p&gt;
I'm not sure what's next for me. I feel strongly that the changes afoot in cloud infrastructure, open source data analytics, real time data stream technologies, location based services (specifically, GPS ubiquity) and improved mobile devices are going to build on Web 2.0. These social and technology shifts will provide primordial goo out of which new innovations will spring. And I intend to build some of them, so brace yourself for Web 3.0. It's times like these when the economy is athrash that the best opportunities emerge and running for cover isn't my style. The next few years will see incumbent players in inefficient markets crumble and more powerful paradigms take their place. I'm bringing my hammer.
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt;
&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/welcome_to_the_technorati_top</guid>
    <title>Welcome to the Technorati Top 100, Mr. President</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/welcome_to_the_technorati_top</link>
        <pubDate>Wed, 4 Mar 2009 22:59:16 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://technorati.com/blogs/www.whitehouse.gov/blog" title="White House Blog"&gt;
&lt;img src="http://www.arachna.com/images/white_house_blog_98.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="352" height="347" /&gt;&lt;/a&gt;
Since its inception just 6 weeks ago, the &lt;a href="http://www.whitehouse.gov/blog"&gt;White House Blog&lt;/a&gt; has had a tremendous center of gravity. I noted the &lt;a href="http://www.arachna.com/roller/spidaman/entry/inbloguration_one_week_later"&gt;volume of links&lt;/a&gt; coming in to the White House Blog after the first week.  This is an existential moment for the blogosphere because today the White House Blog has 3216 links from 2323 blogs. And so it's official: the White House Blog has reached the &lt;a href="http://technorati.com/pop/blogs"&gt;Technorati Top 100&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
I find myself reflecting on what the top 100 looked like four years ago, after the prior presidential inauguration, and what it looks like today; the blogosphere is a very different place. Further down memory lane, who recalls when &lt;a href="http://web.archive.org/web/20030401084126/http://www.technorati.com/cosmos/top100.html"&gt;Dave Winer and Instapundit were among the top blogs&lt;/a&gt;? Yep, most of the small publishers have been displaced by those with big businesses behind them. Well, at least &lt;a href="http://www.boingboing.net/"&gt;BoingBoing&lt;/a&gt; endures but Huffpo and Gizmo better watch out, here comes Prezbo.
&lt;/p&gt;
&lt;p style="clear:both;"&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/white+house" rel="tag"&gt;white house&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/inauguration" rel="tag"&gt;inauguration&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/blog" rel="tag"&gt;blog&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/new_crawlers_at_technorati</guid>
    <title>New Crawlers At Technorati</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/new_crawlers_at_technorati</link>
        <pubDate>Wed, 4 Mar 2009 20:31:16 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://www.flickr.com/photos/kitchenmage/1395622544/" title="Spider Web"&gt;
&lt;img src="http://www.arachna.com/images/spiderweb.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="128" height="134" /&gt;&lt;/a&gt;

A lot of changes are afoot at Technorati. Over the last year or so, we've been looking inward at the infrastructure and asking ourselves, "How can we do this better?". The data spigot that Technorati builds on was the first thing to focus on, it's a critical part in one leg of the back-end infrastructure tripod. The tripod consists of data acquisition, search and analytics Technorati; while the ping handling and queuing are relatively simple affairs the crawler is the most sophisticated of the data acquisition subsystems. It's proper functioning is critical to the functioning of the other legs; when it doesn't function well, search and analytics don't either (GIGO="garbage in/garbage out").
&lt;/p&gt;
&lt;p&gt;
As &lt;a href="http://dorion.blogspot.com/"&gt;Dorion&lt;/a&gt; mentioned recently, &lt;a href="http://technorati.com/weblog/2009/02/473.html"&gt;we're retiring the old crawler&lt;/a&gt;. Why are we giving the old crawler getting an engraved watch and showing it to the door? Well, old age is one reason. The original spider is a technology that dates back to 2003,  the blogosphere has changed a lot since then and we have a much better developed understanding of the requirements. The original spider code has presented a sufficient number of GIGO-related and code maintenance challenges to warrant a complete re-thinking. It contrasts starkly with the replacement.
&lt;dl&gt;
&lt;dt&gt;Data model&lt;/dt&gt;
&lt;dd&gt;There are a lot of ways to derive structural information out of the pages and feeds that a blog presents. The old spider used event driven parses, building a complex state as it went with flat data structures (lists and hashes). The new one uses the composed web documents to populate a well-defined object model; all crawls normalize the semi-structured data found on the web to that model.&lt;/dd&gt;
&lt;dt&gt;Crawl persistence&lt;/dt&gt;
&lt;dd&gt;The old spider was hard-wired to persist the aforementioned data structure elements to relational databases (sharded MySQL instances) while it was parsing, so that the flow of saving parsed data was closely coupled with parsing events, forsaking transactional integrity and consuming costly resources. The new spider composes and saves its parse result as a big discreet object (not collections of little objects in an RDBMS). This reduced the hardware footprint by an order of magnitude.
&lt;/dd&gt;
&lt;dt&gt;Operational visibility&lt;/dt&gt;
&lt;dd&gt;Whether a blog's page structure was understood (or not), the feed was  well formed (or not) or any of the many other things that determine the success or quality of a blog's crawl was opaque under the old spider. With the new spider, detailed metadata and metrics are tracked during the crawl cycles. This better enables the team to support bloggers and extend the system's capabilities.&lt;/dd&gt;
&lt;dt&gt;Unit tests&lt;/dt&gt;
&lt;dd&gt;Wherever you have complex, critical software you want to have unit tests. The old spider had almost no unit tests and was developed in a way that made testing the things that mattered most exceptionally difficult. The new spider was developed with a test harness upfront, it now has hundreds of tests that validate thousands of aspects of the code. The test are uniformly invoked by the developers and automatically whenever the code is updated (AKA under &lt;a href="http://martinfowler.com/articles/continuousIntegration.html"&gt;continuous integration&lt;/a&gt;).
&lt;/dd&gt;
&lt;/dl&gt;
The old spider didn't leverage packages to logically separate the different concerns (fetching, parsing, validation, change determination, etc), the aforementioned flat data structures, mingling of concerns and absence of unit tests made changing it exceedingly difficult. Now, we have a whole that is greater than the sum of the parts; having a well defined data model, sensible persistence, operational visibility and unit tests has added up to an order of magnitude improvement across several dimensions. The real benefits are seen when we've shown that the system is easy to change, I mentioned this several weeks ago when I noted the ease with which we could &lt;a href="http://www.arachna.com/roller/spidaman/entry/inbloguration_one_week_later" title="Inbloguration: One Week Later"&gt;adapt custom requirements to crawl the White House blog&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
Another change that we've made is to the legacy assumption that everything that pings is a blog. That assumption proved to be increasingly untenable as the ping meme spread amongst those who didn't really understand the difference between some random page and a blog, nefarious publishers (spammers) and other perpetrators of &lt;a href="http://en.wikipedia.org/wiki/Sping"&gt;spings&lt;/a&gt;. Over 90% of the pings hitting Technorati are rejected outright because they've been identified as invalid pings. A large portion of the remainder are later determined to be invalid but we now have a rigorous system in place for filtering out the noise.  We've reduced the spam level considerably (as mentioned in a  &lt;a href="http://www.arachna.com/roller/spidaman/entry/social_media_backlash_against_cheaters" title="Social Media Backlash Against Cheaters and Fleshmongers"&gt;prior post&lt;/a&gt;). For instance, there's a whole genre of &lt;a href="http://en.wikipedia.org/wiki/Spam_blog"&gt;splogs&lt;/a&gt; that are pornography focused (hardcore pictures, paid affiliate links, etc) that previously plagued our data; now we've eliminated a lot of that nonsense from the index.
&lt;/p&gt;
&lt;p&gt;
Here are a pair of charts showing the daily occurrence of a particular porn term in the index.
&lt;div style="float:left; width:48%; height: 220px;"&gt;
6 month retrospective as of November 3rd, 2008: &lt;br/&gt;
&lt;img src="http://www.arachna.com/images/blowjob_180_day_20081103.png" width="300" height="180" style="bottom:10px; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand; vertical-align:bottom;" border="0" /&gt;
&lt;/div&gt;

&lt;div style="float:right; width:48%; height: 220px;"&gt;
6 month retrospective today, 5 months later: &lt;br/&gt;
&lt;img src="http://www.arachna.com/images/blowjob_180_day_20090303.png" width="300" height="180" style="bottom:10px; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" /&gt;
&lt;/div&gt;
&lt;/p&gt;
&lt;p style="clear:both;"&gt;
As you can see, that's an order of magnitude reduction; 90% of the occurrences of that term was spam. 
&lt;/p&gt;
&lt;p style="clear:both;"&gt;
So what's next for the crawler? We've got some stragglers on the old spider, we're going to migrate them over in the next few days. There are still a lot of issues to shake out, as with any new software (for instance, there are still some error recovery scenarios to deal with). But it's getting better all of the time (love that song). We'll be rolling out new tools internally for identifying where improvements are needed, ultimately we'd like to enable bloggers to help themselves to publish, get crawled, be found and recognized more effectively. And there are more changes afoot, stay tuned.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/web+crawling" rel="tag"&gt;web crawling&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/software" rel="tag"&gt;software&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/spam" rel="tag"&gt;spam&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/splogs" rel="tag"&gt;splogs&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/code_is_social</guid>
    <title>Code Is Social</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/code_is_social</link>
        <pubDate>Fri, 27 Feb 2009 18:26:15 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://www.amazon.com/exec/obidos/ASIN/B0019D8HTI/wwwarachnacom-20" title="Social Code"&gt;
&lt;img src="http://www.arachna.com/images/social_code.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="115" height="115" /&gt;&lt;/a&gt;
Like many software developers, I confess that I have found myself on occasion coming back to old code that I wrote asking, "What was I thinking?" Where it's my code, this &lt;a href="http://www.urbandictionary.com/define.php?term=WWIT"&gt;WWIT&lt;/a&gt; question doesn't happen very often these days but in general I can't count the number of times I've encountered and had to work on code that was not written to be read. I think sometimes programmers write terse code and regard its brevity as a badge of honor, "If I'm wizardly enough to write this, then only True Wizards will read it." Or maybe it's just laziness or hurriedness, these code mysteries are akin to omitting comments, API documentation and other communication artifacts. When I see non-descriptive variable names, gratuitous indirection, excessive right indenting, monkey patching or unnecessary &lt;a href="http://en.wikipedia.org/wiki/Cyclomatic_complexity"&gt;cyclomatic complexity&lt;/a&gt;, it's almost anti-social behavior; it's a communication fail more than a functional one. Likewise, gratuitous verbosity stifles communication in the opposite manner; this isn't pre-school - grasp of the ABC's is assumed. So spelling out what code is doing in this belabored fashion is just silly:
&lt;pre&gt;
# an array to collect permalinks
permalinks = []
# loop over the feed entries
for entry in feed.entries:
    permalinks.append(entry.link)
&lt;/pre&gt;
Whereas this is clear
&lt;pre&gt;
# extract an array of permalinks from the feed entries
permalinks = [ entry.link for entry in feed.entries ]
&lt;/pre&gt;
OK, I'm assuming the reader knows what a Python &lt;a href="http://en.wikipedia.org/wiki/List_comprehension"&gt;list comprehension&lt;/a&gt; does. The first one is using a lot of vertical space to satisfy a very simple intent. I often find the opposite problem, excessive brevity, is authored by those enamored with their language's idioms. Software written with scripting languages often exhibit this; Perl is famous for &lt;a href="http://en.wikipedia.org/wiki/There_is_more_than_one_way_to_do_it"&gt;expressiveness&lt;/a&gt; (I say this lovingly as a repentant x-Perl Wizard) but even the languages with adherents claiming their tongue is more "readable" have those same users donning wizard hats, trying to be clever. Ever tried to maintain Python code riddled with nested list comprehensions containing lambdas? Ruby, similar idiomatic norms abound, 'nuf said.
&lt;/p&gt;
&lt;p&gt;
I've appreciated celebrations of wizardry (see &lt;a href="http://olabini.com/blog/2009/01/a-folding-language/"&gt;A folding language&lt;/a&gt;) but there's more to wizardry than meta-programming and brevity. Coding like a wizard doesn't mean being so clever that only other wizards can collaborate. In my view, a true wizard has the wisdom to steer clear of verbose indulgences &lt;em&gt;and&lt;/em&gt; terse spells; the wizard walks the middle path of clarity so that the code is not dumbed-down but the apprentice will grasp the intent. The wizard's code should read as poetry.
&lt;/p&gt;
&lt;p&gt;
When code is unsocial (or anti-social), the quality suffers. Complex software needs a gene pool - lots of eyeballs, lots of variant perspectives and experiences. A small gene pool leads to in-bred ideas. Thus code from a lone wolf (even a kick-ass wolf) will usually be of lower quality than code developed by a plurality (unless its a plurality of novices, then all bets are off).
&lt;/p&gt;
&lt;p&gt;
My plea to fellow crafters of bits: please code for clarity. Don't be so brief that your intentions are unclear. And don't be so garrulous that your intention is lost in the verbiage. Again, I'm not claiming innocence of these sins of code. But over the years I've become considerably more &lt;em&gt;aware&lt;/em&gt; of the costs and benefits in the choices between brevity and verbosity. Perhaps clarity is in the eye of beholder or perhaps more narrowly, in the eye of the author. But I try to look at my own code objectively and ask, "If I don't see this code for six months and then come back to it to do some maintenance, will today's intent be clear?" I hope the code I write will be approachable by those who come behind me to work on it, especially if it's me lest I ask myself the WWIT question.
&lt;/p&gt;
&lt;p&gt;
Sigh, I'm venting because I just got side tracked refactoring some program code (and its single test) that lacked clarity. Thanks for indulging me this far. I'm gonna go listen to some old &lt;a href="http://www.socialdistortion.com/"&gt;Social Distortion&lt;/a&gt; now, have a great weekend!
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/programming" rel="tag"&gt;programming&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/wizardry" rel="tag"&gt;wizardry&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/perl" rel="tag"&gt;perl&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/python" rel="tag"&gt;python&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/ruby" rel="tag"&gt;ruby&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;

</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/inbloguration_one_week_later</guid>
    <title>Inbloguration: One Week Later</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/inbloguration_one_week_later</link>
        <pubDate>Tue, 27 Jan 2009 12:41:39 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://www.whitehouse.gov/blog" title="The White House Blog"&gt;
&lt;img src="http://www.arachna.com/images/whitehouse_logo.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="267" height="115" /&gt;&lt;/a&gt;
As mentioned last week on the &lt;a href="http://technorati.com/weblog/2009/01/466.html"&gt;Technorati blog&lt;/a&gt;, Technorati was crawling the new &lt;a href="http://www.whitehouse.gov/blog" title="The White House Blog"&gt;White House blog&lt;/a&gt; within a day of its launch. Most of the blogosphere doesn't require individual customization in our crawling framework but in some special cases, it must be done. The White House blog is a prime example of why this is so but I'm pleased to report that Technorati's new crawling technology makes what was impossible with our old crawler easy with the new one.
&lt;/p&gt;
&lt;p&gt;
Given the volume of moderation that'd be required, it doesn't surprise me that the posts don't take comments. But there are other more basic blogging practices of concern:
&lt;ul&gt;
&lt;li&gt;It doesn't have clear feed discovery. There are multiple &lt;code&gt;rel="alternate"&lt;/code&gt; elements in the &lt;code&gt;head&lt;/code&gt; section of the HTML document.&lt;/li&gt;
&lt;li&gt;The permalinks are irregular; the first few were under the blog URL, the rest have been under other paths on the whitehouse.gov site.&lt;/li&gt;
&lt;li&gt;The HTML doesn't have any of the common CSS attributes that indicate structural semantics; if I get to vote on this, I'll opt for hAtom&lt;/li&gt;
&lt;/ul&gt;
The absence of full content feeds or full content on the front page isn't a problem for Technorati's new crawler; it can crawl the post pages and extract content to index from those.
&lt;/p&gt;
&lt;p&gt;
So how has the blog done in the last week? Well, it emerged in the top 1000 just &lt;em&gt;five days&lt;/em&gt; after its inception. Keep an eye on the blog's &lt;a href="http://technorati.com/blogs/www.whitehouse.gov/blog"&gt;Technorati blog info page&lt;/a&gt; - it's currently ranked 882 (1,876 links from 1440 blogs). Many of the links are to main blog page, citing its existence. But the majority of the links were to the kick-off post (date line reads: "Tuesday, January 20th, 2009 at 12:01 pm") and the inaugural address. Here are the posts and their link counts:
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog/change_has_come_to_whitehouse-gov/"&gt;Change has come to WhiteHouse.gov&lt;/a&gt; - 619 links from 552 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog/inaugural-address/"&gt;President Barack Obama's Inaugural Address&lt;/a&gt; - 217 links from 194 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog/a_national_day_of_renewal_and_reconciliation/"&gt;A National Day of Renewal and Reconciliation&lt;/a&gt; - 146 links from 137 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/president-obama-delivers-your-weekly-address/"&gt;President Obama delivers Your Weekly Address&lt;/a&gt; - 103 links from 97 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/statement-released-after-the-president-rescinds/"&gt;Statement released after the President rescinds "Mexico City Policy"&lt;/a&gt; - 34 links from 33 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/executive-orders-to-date/"&gt;Executive Orders to date&lt;/a&gt; - 27 links from 27 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog/the_whistle_stop_tour/"&gt;The Whistle Stop Tour&lt;/a&gt; - 26 links from 24 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog/dr_king_service/"&gt;Honoring Dr. King's Legacy and Serving America&lt;/a&gt;  - 22 links from 20 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog_post/Fromperiltoprogress/"&gt;From peril to progress&lt;/a&gt; - 17 links from 17 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/now-comes-lilly-ledbetter/"&gt;Now Comes Lilly Ledbetter&lt;/a&gt; - 15 links from 13 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/58_years_of_Indian_democracy/"&gt;58 years of Indian democracy&lt;/a&gt; - 4 links from 4 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/the_year_of_the_ox/"&gt;The year of the ox&lt;/a&gt; - 3 links from 3 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/Timothy_Geithner_sworn_in/"&gt;Timothy Geithner sworn in as Secretary of the Treasury&lt;/a&gt; - 3 links from 3 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/blog_post/PressBriefingHighlights/"&gt;Press Briefing Highlights&lt;/a&gt; - 3 links from 3 blogs&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.whitehouse.gov/president-obama-on-the-selection-of-kirsten-gillibrand/"&gt;President Obama on the Selection of Kirsten Gillibrand&lt;/a&gt; - 2 links from 2 blogs&lt;/li&gt;
&lt;/ul&gt;
&lt;/p&gt;
&lt;p&gt;
It's really encouraging to see the White House, specifically &lt;a href="http://www.bluestatedigital.com/"&gt;Macon Phillips&lt;/a&gt;, posting updates about President Obama's policies and political activities. Going forward, I hope to see more consistent publishing practices. For instance, should slug words be separated by underscores or hyphens? Should the proper names in URL slugs be mixed case (e.g. Timothy_Geithner_sworn_in) or down cased (e.g. now-comes-lilly-ledbetter)? It's less important which one but just pick one and stick to it! Also, the White House Blog should use durable URLs for permalinks: there's a posting with the path &lt;code&gt;/blog_post/PressBriefingHighlights/&lt;/code&gt;, another with &lt;code&gt;/president-obama-delivers-your-weekly-address&lt;/code&gt; - what are they going to use for the next post with highlights from a press briefing or next week's weekly address? If Mr. Phillips needs any further advise on effective blog publishing technology, I'm easy to find and happy to be of service.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/whitehouse" rel="tag"&gt;whitehouse&lt;/a&gt; &amp;nbsp; 
&lt;a href="http://technorati.com/tag/white+house+blog" rel="tag"&gt;white house blog&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/blogging" rel="tag"&gt;blogging&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/blog+technology" rel="tag"&gt;blog+technology&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/obama" rel="tag"&gt;obama&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/inbloguration" rel="tag"&gt;inbloguration&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;


</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/downturn_what_downturn</guid>
    <title>Downturn, what downturn?</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/downturn_what_downturn</link>
        <pubDate>Thu, 25 Dec 2008 13:20:14 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;

&lt;a href="http://en.wikipedia.org/wiki/Data_center" title="Data centers"&gt;
&lt;img src="http://www.arachna.com/images/data_center_rack.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="127" height="132" /&gt;&lt;/a&gt;

Just the other day, &lt;em&gt;Data Center Knowledge&lt;/em&gt; asked &lt;a href="http://www.datacenterknowledge.com/archives/2008/12/23/are-colocation-prices-heading-higher/"&gt;Are Colocation Prices Heading Higher?&lt;/a&gt; My immediate reaction was, that's a stupid question: last time VC funding went into hibernation, data center space was suddenly cheap and abundant. The article suggested that companies operating their own data centers will run to the colos as a cost cutting measure. Maybe, but I'm not so sure. Data center migrations can be expensive, risky operations. Methinks that the F500's inclined to undertake a migration would have done so already. The article cited a report emphasizing a shift from capital expenses to operating expenses.
&lt;blockquote cite="http://www.datacenterknowledge.com/archives/2008/12/23/are-colocation-prices-heading-higher/"&gt;
Tier 1 says demand for data center space grew 14 percent over the past 12 months, while supply grew by just 6 percent, "exacerbating an already lopsided supply/demand curve."
&lt;/blockquote&gt;
However, &lt;a href="http://www.t1r.com/press_release.php?pr=709"&gt;Tier 1 attributed the demand&lt;/a&gt;, "especially, (to) the primacy of the Internet as a vehicle for service and application delivery." With the litany of Techcrunch &lt;a href="http://search.techcrunch.com/query.php?y=%2Ftc_eng_id%2Fsearch%2Fv1%2Fquery%2Fdeadpool%3Fcategory_id%3DTechCrunch%2520Search%26sort%3Ddate%26client%3Dtechcrunch"&gt;deadpool&lt;/a&gt;
reports, I'm finding it difficult to believe that the data center space supply/demand will continue skewing.
&lt;/p&gt;
&lt;p&gt;
Sure, it's not all bad news. Fred Wilson reports that Union Square Ventures will be &lt;a href="http://www.avc.com/a_vc/2008/12/investing-in-th.html"&gt;Investing In Thick and Thin&lt;/a&gt;. Acknowledging that, "it is easier to invest in thin times. The difficult business climate starts to separate the wheat from the chaff and the strong companies are revealed." Wilson goes on to say
&lt;blockquote cite="http://www.avc.com/a_vc/2008/12/investing-in-th.html"&gt;
I don't feel that its possible, or wise, or prudent to attempt to time these (venture investment) cycles.
&lt;br/&gt;
Our approach is to manage a modest amount of capital (in our case less than $300 million across two active funds) and deploy it at roughly $40 million per year, year in and year out no matter what part of the cycle we are in.
&lt;br/&gt;
That way we'll be putting out money at the top of the market but also at the bottom of the market and also on the way up and the way down. The valuations we pay will average themselves out and this averaging allows us to invest in the underlying value creation process and not in the market per se.
&lt;/blockquote&gt;
Yes, the economy is gyrating in pain, but the four horsemen aren't galloping nearby. So take a pill, catch your breath and deal with it: the sun will come out, just don't bother trying to time it too carefully.
&lt;/p&gt;
&lt;p&gt;
Now, there's no shortage of reasons for gloom and doom: mega-ponzi schemes collapsing, banks and real estate combusting, the big 3 in various states of failure, yet &lt;a href="http://www.autonews.com/article/20081224/ANA05/812249975/1078"&gt;BMW North America will raise list prices 0.7%&lt;/a&gt;. Before the complete credit breakdown, real estate volume was actually rising in a lot of places (ergo: prices were aligning supply and demand). I was at a William-Sonoma store in Albuquerque the other day, the place was mobbed. My point is that while the economy is retrenching (or the country is &lt;a href="http://www.nytimes.com/2008/12/24/opinion/24friedman.html"&gt;rebooting&lt;/a&gt;), the detritus will be separated (Wilson's wheat from chaff) and data center space should be cheap and abundant. Everything seems fine to me. At least until the next bubble.
&lt;/p&gt;
&lt;p&gt;
For those of you observing that sort of thing, Merry Christmas!
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/colocation" rel="tag"&gt;colocation&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/data+centers" rel="tag"&gt;data centers&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/economy" rel="tag"&gt;economy&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;


</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/another_hadoop_backended_database_cloudbase</guid>
    <title>Another Hadoop-backended Database: CloudBase</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/another_hadoop_backended_database_cloudbase</link>
        <pubDate>Tue, 23 Dec 2008 16:02:21 -0800</pubDate>
    <category>Java</category>
            <description>&lt;p&gt;

&lt;a href="http://hadoop.apache.org/" title="Hadoop"&gt;
&lt;img src="http://www.arachna.com/images/hadoop_logo.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="300" height="71" /&gt;&lt;/a&gt;

This post to one of the Hadoop mailing lists caught my eye, &lt;a href="http://mail-archives.apache.org/mod_mbox/hadoop-core-user/200812.mbox/%3Ce75c02ef0812221514y575264a7i63adef15d37ab2a2@mail.gmail.com%3E"&gt;Announcing CloudBase-1.1 release&lt;/a&gt;. Wait, wasn't Cloudbase the embedded database company that IBM acquired several years back but ended up donating the product to the Apache Software Foundation as &lt;a href="http://db.apache.org/derby/"&gt;Derby&lt;/a&gt;? No, not that Cloudbase. This is apparently another project that aims to provide data warehousing on top of Hadoop.
&lt;/p&gt;
&lt;p&gt;
I've been watching the emergence of &lt;a href="http://hadoop.apache.org/hbase/"&gt;HBase&lt;/a&gt;, &lt;a href="http://hypertable.org/"&gt;Hypertable&lt;/a&gt; and most recently &lt;a href="http://mail-archives.apache.org/mod_mbox/incubator-general/200812.mbox/browser"&gt;the proposed incubation of Facebook's Cassandra&lt;/a&gt; with great interest. The first two are modeled from Google's &lt;a href="http://labs.google.com/papers/bigtable.html"&gt;BigTable&lt;/a&gt; but all are essentially horizontally scalable column oriented databases. The developers of these systems explicitly steer away having their technologies pegged as relational databases, with the refrain: "We don't do joins." What the &lt;a href="http://cloudbase.sourceforge.net/"&gt;CloudBase&lt;/a&gt; project aims to do is &lt;em&gt;not&lt;/em&gt; model themselves on BigTable but to explicitly support joins between tables built on top of an HDFS cluster. It looks like they've posted extensive documentation and have released a JDBC driver, pretty cool! This is the most interesting database initiative I've seen since &lt;a href="http://www.greenplum.com/resources/mapreduce/"&gt;GreenPlum announced their support for mapreduce&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
Yes, as far as scale-out data analytics, we live in interesting times.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/mapreduce" rel="tag"&gt;mapreduce&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/hadoop" rel="tag"&gt;hadoop&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/hbase" rel="tag"&gt;hbase&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/hypertable" rel="tag"&gt;hypertable&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/jdbc" rel="tag"&gt;jdbc&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/cloudbase" rel="tag"&gt;cloudbase&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/bigtable" rel="tag"&gt;bigtable&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/derby" rel="tag"&gt;derby&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/greenplum" rel="tag"&gt;greenplum&lt;/a&gt; &amp;nbsp;

&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/python_svn_bindings_trac_and</guid>
    <title>Python SVN Bindings, Trac and mod_python</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/python_svn_bindings_trac_and</link>
        <pubDate>Sun, 21 Dec 2008 14:55:24 -0800</pubDate>
    <category>LAMP</category>
            <description>&lt;p&gt;
&lt;a href="http://trac.edgewall.org/" title="Trac"&gt;
&lt;img src="http://www.arachna.com/images/trac_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="214" height="61" /&gt;&lt;/a&gt;

I have some code I want to noodle on outside of work. Since I'm on a holiday break, I'm doing a bit of that (yes, this &lt;em&gt;is&lt;/em&gt; what I do for fun, so?). In the past, I had used my own private CVS server for those kinds of things but these days, I could just as well live without CVS. I decided to roll a subversion server into my Apache build (the latest Apache + other modules aren't in the yum repositories for my distro, so I roll my own). While I'm putting a subversion server up, why not &lt;a href="http://trac.edgewall.org/"&gt;trac&lt;/a&gt;, too? Heh, that's where things got stuck.
&lt;/p&gt;
&lt;p&gt;
When I installed the subversion dependencies (specifically, &lt;a href="http://www.webdav.org/neon/"&gt;neon&lt;/a&gt;), I just used vanilla build params. After installing subversion, I was surprised that Trac couldn't access it. It turns out that the litmus test was this:
&lt;pre&gt;
$ python
Python ...
&amp;gt;&amp;gt;&amp;gt; from svn import core
&lt;/pre&gt;
...it failed miserably. Various recompile efforts seemed to move the problem around. I saw a variety of the symptoms described in the Trac-Subversion integration docs &lt;a href="http://trac.edgewall.org/wiki/TracSubversion#Troubleshooting"&gt;troubleshooting&lt;/a&gt; section. The missing &lt;code&gt;gss_delete_sec_context&lt;/code&gt; symbol error was apparently the telltale critical one, it originated from neon having been compiled without SSL support. The neon compile config that led to success was
&lt;pre&gt;
./configure --enable-shared --enable-static --with-ssl=openssl 
&lt;/pre&gt;
Then the real key was to completely start over with the subversion compile, not just the &lt;a href="http://www.swig.org/"&gt;swig&lt;/a&gt; python bindings.
&lt;pre&gt;
make clean
./configure \
 --with-berkeley-db=/usr/local/BerkeleyDB.4.7 \
 --prefix=/usr/local \ 
 --with-apxs=/usr/local/httpd2.2.11/bin/apxs \
 --with-apr=/usr/local/apr \
 --with-apr-util=/usr/local/apr
make
make swig-py
make check-swig-py
make install
make install-swig-py
ldconfig
&lt;/pre&gt;
Only then did the litmus test above pass. One of the things about this setup that is kind of a nuisance is that the python bindings didn't get installed into &lt;code&gt;site-packages&lt;/code&gt;, therefore &lt;a href="http://www.modpython.org/"&gt;mod_python&lt;/a&gt; was quite unhappy. Also, trac seemed to want to put its eggs in the root directory. So the Apache server's &lt;code&gt;envvars&lt;/code&gt; script has these variables exported to work around those issues
&lt;pre&gt;
PYTHONPATH=/usr/local/lib/svn-python
LD_LIBRARY_PATH=/usr/local/lib/svn-python/libsvn
PYTHON_EGG_CACHE=/data1/egg_cache
&lt;/pre&gt;
The result (including the requisite &lt;code&gt;httpd.conf&lt;/code&gt; tweaks) is a working subversion 1.54 and trac 0.11 setup. It was more fiddling for the evening than I'd hoped for and I'm not sure my foibles and remedies were optimal (clearly, I missed an RTFM somewhere) but I hope this resolution helps at least one reader. 
&lt;/p&gt;
&lt;p&gt;
Happy Hannukah and winter solstice!
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/apache" rel="tag"&gt;apache&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/mod_python" rel="tag"&gt;mod_python&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/trac" rel="tag"&gt;trac&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/neon" rel="tag"&gt;neon&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/subversion" rel="tag"&gt;subversion&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/swig" rel="tag"&gt;swig&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/python" rel="tag"&gt;python&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/world_leader_reflex_tests</guid>
    <title>World Leader Reflex Tests</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/world_leader_reflex_tests</link>
        <pubDate>Mon, 15 Dec 2008 21:35:52 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;img src="http://www.arachna.com/images/bush_vs_shoe.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="124" height="83" /&gt;
Will Chuck-The-Shoe-At-The-World-Leader be an Olympic sport in the years ahead? Since finishing with dinner this evening, I've found no less than three flash games and a compendium of animated GIF satires.
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://kotaku.com/5110547/shoe-attack-on-president-already-turned-into-a-crappy-flash-game"&gt;Shoe Attack On President Already Turned Into A Crappy Flash Game&lt;/a&gt; (Kotaku)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://sockandawe.com/"&gt;Sock and Awe&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.kroma.no/2008/bushgame/"&gt;The Bush Game&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.boingboing.net/2008/12/15/iraq-shoe-tosser-guy.html"&gt;Iraq Shoe Tosser Guy: The Animated Gifs&lt;/a&gt; (Boing Boing)&lt;/li&gt;
&lt;/ul&gt;
I bet Jon Stewart is pissed that this had to happen two days before his last show of the year. There's still a month left for more Dubya foibles, cut the vacation short Stewart! In the meantime,  keep it safe kids.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/shoe+toss" rel="tag"&gt;shoe toss&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/bush" rel="tag"&gt;bush&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/humor" rel="tag"&gt;humor&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/openedge_vs_net_neutrality_vs</guid>
    <title>OpenEdge vs. Net Neutrality vs. CDN</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/openedge_vs_net_neutrality_vs</link>
        <pubDate>Sun, 14 Dec 2008 22:52:25 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://online.wsj.com" title="Wall Street Journal"&gt;
&lt;img src="http://www.arachna.com/images/north_american_internet_traffic_chart.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="184" height="276" /&gt;&lt;/a&gt;
The Wall Street Journal reported today that &lt;a href="http://online.wsj.com/article/SB122929270127905065.html"&gt;Google Wants Its Own Fast Track on the Web&lt;/a&gt;, describing it as an example of the decline of support for &lt;a href="http://en.wikipedia.org/wiki/Network_neutrality"&gt;net neutrality&lt;/a&gt; amongst the The Powers That Be (the usual suspects: Google, Yahoo, Microsoft, Amazon). Plenty of deals have been getting struck anyway between TPTB and data carriers (most prominently AT&amp;amp;T + Yahoo DSL) but outright transit preference doesn't seem to be an issue here. What Google appears to be getting into, called &lt;em&gt;OpenEdge&lt;/em&gt;, sounds like an arrangement that amounts to co-locating their gear in the major carrier's datacenters. This would move serving capacity closer to the end-users of their services and thereby accelerate the user experience. Since it doesn't concern transit per se, this actually doesn't sound like a net neutrality issue at all, it sounds like another form of datacenter dispersion.
&lt;/p&gt;
&lt;p&gt;
So what exactly is the big deal? All of the TPTB and loads of other online services have &lt;a href="http://en.wikipedia.org/wiki/Content_delivery_network"&gt;content delivery network&lt;/a&gt; (CDN) deals. Yahoo, Amazon, Facebook... they all operate or partner with a CDN in some shape or form (full disclosure: I've been working on a CDN evaluation for &lt;a href="http://technorati.com/"&gt;Technorati&lt;/a&gt;). With a CDN, publishers pay specifically to have their content cached at points-of-presence (PoP) around the intertubes that, through some DNS and routing magic, enables web content to get to end-users more quickly. The next step beyond a CDN is to put equipment in the carrier's datacenter. Here's what WSJ said
&lt;blockquote cite="http://online.wsj.com/article/SB122929270127905065.html"&gt;
Google's proposed arrangement with network providers, internally called OpenEdge, would place Google servers directly within the network of the service providers, according to documents reviewed by the Journal. The setup would accelerate Google's service for users. Google has asked the providers it has approached not to talk about the idea, according to people familiar with the plans.
&lt;br/&gt;
Asked about OpenEdge, Google said only that other companies such as Yahoo and Microsoft could strike similar deals if they desired. But Google's move, if successful, would give it an advantage available to very few.
&lt;/blockquote&gt;
It seems perfectly logical, actually.
&lt;/p&gt;
&lt;p&gt;
Nonetheless, I am concerned about wavering support for net neutrality. Lawrence Lessig, fresh off of his &lt;a href="http://joi.ito.com/weblog/2008/12/12/big-news-from-l.html"&gt;Big News&lt;/a&gt; post concerning setting up shop at Harvard Law School, is quoted as saying
&lt;blockquote  cite="http://online.wsj.com/article/SB122929270127905065.html"&gt;
There are good reasons to be able to prioritize traffic. If everyone had to pay the same rates for postal service, than you wouldn't be able to differentiate between sending a greeting card to your grandma versus sending an overnight letter to your lawyer.
&lt;/blockquote&gt;
But the counter argument says that there's a big difference.  Grandma isn't trying to compete with your attorney (at least, not usually). If the big guys are paying more to be faster, who will be able afford to challenge them? The intertubularly rich will get richer, the poor will be stay poor. The TPTB will ensconce themselves as dynastic media walking on paths paved with gold while all of us commoners walk in the gutter.
&lt;/p&gt;
&lt;p&gt;
The dumb pipes should stay dumb. If an internet service wants to operate out of multiple datacenters, lease dedicated pipes to accelerate their inter-datacenter data distribution and peer with the carrier's PoPs proximate to their datacenters, mazel tov. This can be augmented with CDNs. It can even be taken to the next step by directly installing the carrier's datacenters. But at the network exchanges and pipes connecting them, everyone's packets should remain equal.
&lt;/p&gt;
&lt;p&gt;
&lt;strong&gt;UPDATE&lt;/strong&gt;
&lt;a href="http://gigaom.com/2008/12/14/google-turns-its-back-on-network-neutrality/"&gt;GigaOM&lt;/a&gt; posted about a clarification from Google which says that the WSJ was &amp;quot;confused&amp;quot;. The hubbub in that article really was misplaced, it's a CDN deal.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href="http://technorati.com/tag/google" rel="tag"&gt;google&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/cdn" rel="tag"&gt;cdn&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/content+delivery+network" rel="tag"&gt;content delivery network&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/net+neutrality" rel="tag"&gt;net neutrality&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/wall+street+journal" rel="tag"&gt;wall street journal&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/cloud_hype_an_amazon_web</guid>
    <title>Cloud Hype, An Amazon Web Services Post-Mortem</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/cloud_hype_an_amazon_web</link>
        <pubDate>Wed, 10 Dec 2008 23:53:19 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://aws.amazon.com/" title="Amazon Web Services"&gt;
&lt;img src="http://www.arachna.com/images/aws_logo.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="164" height="60" /&gt;&lt;/a&gt;
In the last few years, the scope of Amazon Web Services (AWS) has broadened to cover a range of infrastructure capabilities and has emerged as a game changer.   The hype around AWS isn't all wrong, a whole ecosystem of tools and services has developed around AWS that makes the offering compelling. However, the hype isn't all right either.  At Technorati, we used AWS this year to develop and put in production a new crawler and a system that produces the web page screenshot thumbnails now seen on search result pages. But now that that chapter is coming to a close, it's time to retrospect.
&lt;/p&gt;
&lt;p&gt;
&lt;img src="http://www.arachna.com/images/variable_capacity.png" style="float:left; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="357" height="106" /&gt;
There's a prevailing myth that using the elasticity of EC2 makes it cheaper to operate than fixed assets. The theory is that by shutting down unneeded infrastructure during the lulls, you're saving money. In a purely fixed infrastructure model, Technorati's data aquisition systems must be provisioned for their maximum utilization capacity threshold. When utilization ebbs, a lot of that infrastructure sits relatively idle. That much is true but the reality is that flexible capacity is only saving money relative to the minimum requirements. So the theory only holds if your variability is high compared to your minimum. That is, if the difference between your minimum and maximum capacity is large or you're not operating a 365/7/24 system but episodically using a lot of infrastructure and then shutting it down. Neither is true for us. The normal operating mode of Technorati's data acquisition systems follows the ebb and flow of the blogosphere, which varies a lot but is always on. The sketch to the left shows the minimum capacity and the variable capacity distinguished.
&lt;/p&gt;
&lt;p&gt;
In response to some of the fallacies posted on an O'Reilly blog the other day (by George Reese), &lt;a href="http://broadcast.oreilly.com/2008/12/why-i-dont-like-cloud-auto-scaling.html"&gt;On Why I Don't Like Auto-Scaling in the Cloud&lt;/a&gt;, Don MacAskill from SmugMug wrote a really great post yesterday about his SkyNet system, &lt;a href="http://blogs.smugmug.com/don/2008/12/09/on-why-auto-scaling-in-the-cloud-rocks/"&gt;On Why Auto-Scaling in the Cloud Rocks&lt;/a&gt;. Don also emphasizes SmugMugs modest requirements for operations staff. In an application with sufficient simplicity and automation around it, it's easy to imagine a 365/7/24 service having meager ops burdens. I think we should surmise that the cost of operating SmugMug with autonomic de/provisioning works because it fits their operating model. I understand Reese's concern, that folks may not do the hard work of really understanding their capacity requirements if they're too coddled by automation. However, that concern comes off as a shill for John Allspaw's capacity planning book (which I'm sure is great, can't wait to read it). Bryan Duxbury from RapLeaf describes their use of AWS and how the numbers work out in his post, &lt;a href="http://blog.rapleaf.com/dev/?p=38"&gt;Rent or Own: Amazon EC2 vs. Colocation Comparison for Hadoop Clusters&lt;/a&gt;. Since the target is to serve a Hadoop infrastructure,  AWS must get a thumbs down in their case. Hadoop's performance is impaired by poor rack locality and the latencies of Amazon's I/O systems clearly drags it down. If you're going to be running Hadoop on a continuous basis, use your own racks, with your own switches and your own disk spindles.
&lt;/p&gt;
&lt;p&gt;
At Technorati, we're migrating the crawl infrastructure from AWS to our colo. While I love the flexibility that AWS provides and it's been great using it as a platform to  ramp up on , the bottom line is that Technorati has a pre-existing investment in machines, racks and colo infrastructure. As much as I'd like our colo infrastructure to operate with lower labor and communication overhead, running on AWS has amounted to additional costs that we must curtail.
&lt;/p&gt;
&lt;p&gt;
Cloud computing (or &lt;a href="http://gigaom.com/2008/02/28/how-cloud-utility-computing-are-different/"&gt;utility computing&lt;/a&gt; or &lt;a href="http://www.arachna.com/roller/spidaman/entry/can_we_just_call_it"&gt;flex computing&lt;/a&gt; or whatever its called) &lt;em&gt;is&lt;/em&gt; a game changer. So when do I recommend you use AWS? Ideally: anytime. If your application is architected to expand and contract its footprint with the demands put upon it, provision your minimum capcacity requirements in your colo and use AWS to "burst" when your load demands it. Another case where using AWS is a big win is for a total green field. If you don't have a colo, are still determining the operating charactersics of your applications and need machines provisioned, AWS is an incredible resource. However, I think the flexibility vs. economy imperatives will always lead you to optimize your costs by provisioning your minimum capacity in infrastructure that you own and operate.
&lt;/p&gt;
&lt;p&gt;
There's also another option: instead of buying and operating your own machines and racks, you may be able to optimize costs by renting machines provisioned to your specs in a contract from the services that have established themselves in that market (Rackspace, Server Beach, ServePath, LayeredTech, etc). Ultimately, I'm looking forward to the emergence of a compute market place where the decisions to incur capital expense, rent by the hour or rent under a contract will be easier to traverse.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/amazon+web+services" rel="tag"&gt;amazon web services&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/aws" rel="tag"&gt;aws&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/cloud+computing" rel="tag"&gt;cloud computing&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/rapleaf" rel="tag"&gt;rapleaf&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/smugmug" rel="tag"&gt;smugmug&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/hadoop" rel="tag"&gt;hadoop&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/oreilly" rel="tag"&gt;oreilly&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/data+centers" rel="tag"&gt;data centers&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/capacity+planning" rel="tag"&gt;capacity planning&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/the_solar_decade</guid>
    <title>The Solar Decade</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/the_solar_decade</link>
        <pubDate>Tue, 9 Dec 2008 23:55:24 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;SCRIPT TYPE="text/javascript"&gt;
&lt;!--
function popup(mylink, windowname)
{
if (! window.focus)return true;
var href;
if (typeof(mylink) == 'string')
   href=mylink;
else
   href=mylink.href;
window.open(href, windowname, 'width=720,height=630,scrollbars=yes');
return false;
}
//--&gt;
&lt;/SCRIPT&gt;
&lt;p&gt;
&lt;a href="http://www.arachna.com/images/solar_the_graph.gif" onClick="return popup(this, 'The Graph')" title="Fast Company""&gt;
&lt;img src="http://www.arachna.com/images/solar_the_graph_thumb.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="227" height="198" /&gt;&lt;/a&gt;
Ten years ago, you might have been advised that solar energy, while sounding nice, was a bad investment. The installations were failure prone and not cost effective. I don't know if I bought that then, I know of solar panels in San Francisco installed in the 80's that paid for themselves, just slowly. But what we're seeing isn't your father's solar panel. From &lt;a href="http://blog.makezine.com/archive/2007/06/google_solar_panel_projec.html"&gt;Google's solar panels&lt;/a&gt; to residential rooftops, it seems pretty clear that the &lt;a href="http://www.smartmoney.com/spending/for-the-home/economics-of-solar-power-are-looking-brighter-21444/"&gt;
Economics of Solar Power Are Looking Brighter&lt;/a&gt;. Fast Company is running an article &lt;a href="http://www.fastcompany.com/magazine/131/solar-goes-supernova.html"&gt;The Solar Industry Gains Ground&lt;/a&gt; that sounds a chord that we're hearing a lot of. Solar energy is getting more and more cost effective. What's projected is that the cost of solar power may share up-and-to-the-right properties of &lt;a href="http://en.wikipedia.org/wiki/Moore's_law"&gt;Moore's Law&lt;/a&gt;. The fabs that make the silicon enabling you to read this may also enable an energy giant leap forward. The Germans have their own "Solar Valley" and their industry projection &lt;a href="http://www.arachna.com/images/solar_the_graph.gif" onClick="return popup(this, 'The Graph')"&gt;graph&lt;/A&gt; appears pretty Moorish (look at the large yellow area).
&lt;/p&gt;
&lt;P&gt;
The big lift off is 10 years away but the investment that has been made in the area and the advances being made seem to put the benefits close at hand. But the big win, when dependence on fossil fuels are on a clear decline, is at leat 10 and 20 years out. But I think it can happen, I think the solar decade is coming. It should be the coming decade. However, it will require an Apollo-mission like focus from the Obama administration to succeed. And I hope we can make it a reality.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/solar+energy" rel="tag"&gt;solar energy&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/obama" rel="tag"&gt;obama&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/good_bye_perl</guid>
    <title>Good Bye, Perl</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/good_bye_perl</link>
        <pubDate>Mon, 8 Dec 2008 00:01:24 -0800</pubDate>
    <category>LAMP</category>
            <description>&lt;p&gt;
&lt;a href="http://www.perl.org/" title="Perl"&gt;
&lt;img src="http://www.arachna.com/images/perl-camel-logo.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="72" height="81" /&gt;&lt;/a&gt;
The other day, I was patching some Perl code. There I was, in the zone, code streaming off of my finger tips. But wait, I was writing Python in the middle of a Perl subroutine. Um. I found the bare word and missing semi-colon errors invoking &lt;code&gt;perl -cw dingleberry.pl&lt;/code&gt; amusing. How did &lt;em&gt;that&lt;/em&gt; happen?
&lt;/p&gt;
&lt;p&gt;
Truth is, I find myself rarely using Perl anymore. I spent years building applications with Perl. Having made extensive use of mod_perl APIs and various CPAN modules, studied the Talmudic wisdom of Damian Conway, struggled with the double-edged sword of &lt;a href="http://en.wikipedia.org/wiki/There_is_more_than_one_way_to_do_it"&gt;TMTOWTDI&lt;/a&gt; and rolled my eyes at the Perl haters for their failure to appreciate the strange poetry that is Perl it'd seem like a safe bet that Perl would remain on my top shelf. A lot of the complaints of Perl haters seem superficial ("ewe, all of those punctuation characters", shaddup). Yet, Perl has been long in the tooth, for a long time. I recall 5 years ago thinking that Perl 6 wasn't &lt;em&gt;too&lt;/em&gt; far away (after all, O'Reilly published &lt;a href="http://www.amazon.com/Perl-6-Essentials-Allison-Randal/dp/0596004990/"&gt;Perl 6 Essentials&lt;/a&gt; in June 2003). I'm sorry, my dear Perl friends, insistence that &lt;a href="http://london.pm.org/pipermail/london.pm/Week-of-Mon-20081208/015866.html"&gt;Perl is Alive&lt;/a&gt; rings hollow, now.
&lt;/p&gt;
&lt;p&gt;
I still find it heartening to hear of people doing cool things with Perl. David McLaughlin's uplifting &lt;a href="http://www.dmclaughlin.com/2008/12/07/how-i-learnt-to-love-perl/"&gt;How I learnt to love Perl&lt;/a&gt;, waxing on about &lt;a href="http://www.iinteractive.com/moose/"&gt;Moose&lt;/a&gt; and other "modern" Perl frameworks (but come on dude, everyone knows that PHP, sucks, heh). Brad Fitzpatrick released &lt;a href="http://brad.livejournal.com/2388824.html"&gt;Perl for Google App Engine&lt;/a&gt; today. But Perl, I'm sorry. It's just too little, too late.
&lt;/p&gt;
&lt;p&gt;
I'm just weary of the difficulties achieving team adherence to disciplined coding practices (or even appreciate why they're especially necessary in the TMTOWTDI world of Perl). The reliance on Wizardry is high with Perl; the path from novice to master requires grasping a wide range of arcana. Is it too much to ask for less magic in favor of easier developer ramp up? Perl's flexibility and expressiveness, it's high virtues, also comprises the generous reel of rope that programmers routinely hang themselves with. On top of idiomatic obscurities are the traps people fall into with dynamic typing and errors that only make themselves evident at runtime. Good testing practices are usually the anecdote to the woes of dynamic typing and and yet writing a good test harness for a Perl project is often a lot of work compared to the amount of work required to write the application code.
&lt;/p&gt;
&lt;p&gt;
I well understand the security that programmers feel using &lt;a href="http://teddziuba.com/2008/12/python-makes-me-nervous.html"&gt;static typing&lt;/a&gt; but I'm not saying the static typing is the cure to any ills. The compiler is the most basic test that your code can be understood and it gives your IDE a lotta help. That's great but static typing is also an anchor dragging on your time. From what I can tell, Java is the new C/C++ and Jython, JRuby. Groovy and friends (Scala, Clojure, etc) are the ways that people program the JVM with higher productivity. I'm not saying fornever to Perl (that's a long time). But I am saying Hasta La Vista, for now. I've been quite productive lately with Python (I know Perl friends, heresy!) and plan on pushing ahead with that, as well as with Java and other JVM languages. And where necessary, using Thrift to enable the pieces to work together. Python is certainly not perfect, it's quirks are many, too. But I've seen recent success with collaborative software development with Python that would have been difficult to accomplish with Perl. I'm not trying to stoke any language war at all, I'm just reflecting on how I've drifted from Perl. Amongst people I know and things I've read elsewhere, &lt;a href="http://journal.dedasys.com/2008/12/07/python-surpasses-perl"&gt;I'm not the only one&lt;/a&gt;. Don't fret, Perl, I'm sure I'll see you around.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/perl" rel="tag"&gt;perl&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/python" rel="tag"&gt;python&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/java" rel="tag"&gt;java&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/programming" rel="tag"&gt;programming&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/mysql_vs_postgresql_again</guid>
    <title>MySQL vs. PostgreSQL (again)</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/mysql_vs_postgresql_again</link>
        <pubDate>Sat, 6 Dec 2008 12:08:20 -0800</pubDate>
    <category>LAMP</category>
            <description>&lt;p&gt;
&lt;a href="http://www.postgresql.org/" title="PostgreSQL"&gt;
&lt;img src="http://www.arachna.com/images/postgresql_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="175" height="68" /&gt;&lt;/a&gt;

&lt;a href="http://www.mysql.com/" title="MySQL"&gt;
&lt;img src="http://arachna.com/images/mysql_logo.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="114" height="68" /&gt;&lt;/a&gt;

If &lt;a href="http://feedblog.org/"&gt;Kevin Burton&lt;/a&gt; wanted to draw attention to &lt;a href="http://feedblog.org/2008/12/04/destroying-mysql/"&gt;MySQL's inattention to scalability concerns&lt;/a&gt;, it looks like he's &lt;a href="http://www.reddit.com/r/programming/comments/7hiv5/destroying_mysql/"&gt;succeeded&lt;/a&gt; (126 comments on &lt;a href="http://www.reddit.com/"&gt;Reddit&lt;/a&gt; in the last day and climbing). I totally understand why he feels that the &lt;a href="http://www.mysql.com/"&gt;MySQL&lt;/a&gt; folks need to be provoked into action. I'll confess to having serious "MySQL fatigue" after years of struggling with InnoDB quirks (we use MySQL extensively at &lt;a href="http://technorati.com/"&gt;Technorati&lt;/a&gt;), stupid query plans and difficult to predict performance inflection points (there's a calculus behind table row count, row width, number of indices, update rate and query rate -- but AFAIK nobody has a reliable formula to predict response times against those variables). Frankly, I was really surprised when Sun acquired MySQL (for such a hefty sum, too), I was expecting them to build up a &lt;a href="http://www.postgresql.org/"&gt;PostgreSQL&lt;/a&gt;-based platform by rolling up acquisitions of &lt;a href="http://www.greenplum.com/"&gt;Greenplum&lt;/a&gt; and &lt;a href="http://truviso.com/"&gt;Truviso&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
Kevin is totally correct that to find solid innovation with MySQL, don't look to the MySQL corporation. Instead, the consultants and third party shops specializing in MySQL are where the action's at (&lt;a href="http://blog.palominodb.com/"&gt;Palomino DB&lt;/a&gt;, &lt;a href="http://www.percona.com/"&gt;Percona&lt;/a&gt;, &lt;a href="https://launchpad.net/drizzle"&gt;Drizzle&lt;/a&gt;, etc). It's kinda sad, both Sun and MySQL have at various times been home to hot-beds of innovation. Sun has &lt;a href="http://blogs.sun.com/downstream/"&gt;great people&lt;/a&gt;  groundbreaking with cloud computing, impressive CPU performance per watt improvements and the Java ecosystem. But as far as MySQL goes, look to the outside practitioners.
&lt;/p&gt;
&lt;p&gt;
Kevin's post update cites the pluggable back-ends that MySQL supports as a feature but I'm not so sure. I don't have any evidence of this but my intuition is that it's exactly this feature that makes the overall stability and performance such a crap-shoot (or sometimes, just outright crap). I'm working on a personal project that uses &lt;a href="http://postgis.refractions.net/"&gt;PostGIS&lt;/a&gt; (PostgreSQL + GIS), nothing is live yet so I haven't had to scale it.  But I have a good deal of confidence in the platform.  &lt;a href="https://developer.skype.com/SkypeGarage/DbProjects/SkypePostgresqlWhitepaper"&gt;Skype&lt;/a&gt; and &lt;a href="http://pandora.com/"&gt;Pandora&lt;/a&gt; look like good case studies. The PostgreSQL people have been focused on MVCC concurrency, procedure languages, UDFs and data integrity semantics for years. In those realms, the MySQL people are Johnny-come-slowlies (and buggily). On the other hand, if you want the append-only characteristics of logging to a database, MyISAM and merge tables have performance properties that PostgreSQL just can't match.
&lt;/p&gt;
&lt;p&gt;
Maybe &lt;a href="http://en.wikipedia.org/wiki/David_Duffield"&gt;David Duffield&lt;/a&gt; will look beyond &lt;a href="http://workday.com/"&gt;enterprise app services&lt;/a&gt; and acquire, roll-up and market the PostgreSQL platforms that Sun didn't. Combining big data and event data with Greenplum and Truviso as a way to blow some smoke in Larry Ellison's eye, &lt;em&gt;that&lt;/em&gt; would be funny (and smart).
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/postgresql" rel="tag"&gt;postgresql&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/mysql" rel="tag"&gt;mysql&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/greenplum" rel="tag"&gt;greenplum&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/truviso" rel="tag"&gt;truviso&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/sun" rel="tag"&gt;sun&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/workday" rel="tag"&gt;workday&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/postgis" rel="tag"&gt;postgis&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/it_s_only_the_biggest</guid>
    <title>It's Only The Biggest Country In the World</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/it_s_only_the_biggest</link>
        <pubDate>Fri, 5 Dec 2008 18:12:19 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://en.wikipedia.org/wiki/China" title="China"&gt;
&lt;img src="http://www.arachna.com/images/chinese_flag.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="124" height="83" /&gt;&lt;/a&gt;
Should the confirmed reports that &lt;a href="http://www.ifgogo.com/251/technorati-is-blocked-in-china/"&gt;Technorati is banned in China&lt;/a&gt; be worn as a badge honor? I understand the Chinese authorities value stability but these kinds of things, treating billions of people like little children that need to be sheltered, will ultimately destabilize them.
&lt;/p&gt;
&lt;P&gt;
We've waited 18 years for &lt;a href="http://www.amazon.com/exec/obidos/ASIN/B001L4MHBE/wwwarachnacom-20"&gt;Chinese Democracy&lt;/a&gt;, isn't that long enough? (sorry, couldn't resist the joke)
&lt;/p&gt;
&lt;p&gt;
Best wishes to the Chinese people. At least most of you.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/chinese+democracy" rel="tag"&gt;chinese democracy&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/china" rel="tag"&gt;china&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/censorship" rel="tag"&gt;censorship&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/guns-n-roses" rel="tag"&gt;guns n roses&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/technorati_releases</guid>
    <title>Technorati Releases Fixes Some UI Peeves</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/technorati_releases</link>
        <pubDate>Thu, 4 Dec 2008 18:43:39 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://technorati.com/" title="Technorati"&gt;
&lt;img src="http://www.arachna.com/images/technorati_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="110" height="15" /&gt;&lt;/a&gt;
In general, I regard successful user interfaces as the ones that provide the least amount of hunting and astonishment. Noone is delighted when the things they're looking for aren't obvious, the data displayed requires lots of explanation and the paths through an application are click-heavy. In this regard, Technorati was long saddled with a user interface that I regarded as delightless. However, I see that changing now and I'm delighted to see that!
&lt;p&gt;
&lt;a href="http://technorati.com/weblog/2008/12/457.html"&gt;Technorati's front end was released today&lt;/a&gt; with a handful of significant improvements. One is a long standing peeve of mine: the tag pages we're conflated with keyword search. That meant that if your post was about the president-elect and you tagged it "obama", your expectation that the the posts aggregated at &lt;a href="http://technorati.com/tag/obama"&gt;http://technorati.com/tag/obama&lt;/a&gt; would also be tagged "obama" would be disappointed; there would also be a bunch of keyword matches mixed in. That came out of last year's attempt to "simplify" the experience by making keyword search and tag browsing the same thing; which was, in all honesty, a George Bush level failure. Sure there are folks who don't know, "What's this 'tag' thing you're talking about?" But for the folks who &lt;em&gt;do&lt;/em&gt; know what the difference is between browsing blog posts grouped by tags and keyword search results, the mix wasn't received as a simplification but as a software defect.
&lt;blockquote&gt;
I tagged my post "obama" but all of these other posts aren't tagged "obama", what's going on?
&lt;/blockquote&gt;
I'm glad we've gone back to keeping search and tags distinct.
&lt;/p&gt;
&lt;p&gt;
The other failed aspect of the prior design was the demotion of the search box. The form input to type in your search was sized down and moved to the right, as if it were a "site search" feature. Yes, we'd like folks to explore our discovery features but the navigation for those features weren't great and the de-emphasis on search was again a source of more puzzlement than anything. The release today puts the search box back where it should be: bigger and right in the middle of the of the top third of the page, yay!
&lt;/p&gt;
&lt;p&gt;
Oh, and earlier today Technorati Media released its Engage platform to &lt;a href="https://beta.technoratimedia.com/"&gt;beta&lt;/a&gt;. This is a major step in opening up the ad market place for the blogosphere.
&lt;/p&gt;
&lt;p&gt;
So far, the feedback I've seen on these releases have been thumbs-up. Check 'em out, there's some more goodies in the works but these things only get better with your feedback. And yes, we know there's still more to do, I'm certainly busy with the backend stuff with our cloud platform, ping systems and crawlers (but did you notice the screenshot thumbnails on the search result and tag pages? I need to shake out the latencies producing and refreshing those). Kudos to Dave White, the front end team and the ad platform team for getting these releases out. Onward and upward!
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/folksonomy" rel="tag"&gt;folksonomy&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/tagging" rel="tag"&gt;tagging&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/user+interface" rel="tag"&gt;user interface&lt;/a&gt; &amp;nbsp; 
&lt;a href="http://technorati.com/tag/blogging" rel="tag"&gt;blogging&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/ad+platform" rel="tag"&gt;ad platform&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/can_we_just_call_it</guid>
    <title>Can We Just Call It "Flex Computing"?</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/can_we_just_call_it</link>
        <pubDate>Wed, 3 Dec 2008 22:44:46 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://en.wikipedia.org/wiki/Cloud_computing" title="Cloud?"&gt;
&lt;img src="http://www.arachna.com/images/question_cloud.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="80" height="127" /&gt;&lt;/a&gt;
The moniker &amp;quot;cloud computing&amp;quot; has been overloaded to mean to so many things, it's beginning to mean nothing. When someone refers to it generically, you have to ask them to dismbiguate; which of these are they referring to?
&lt;ol&gt;
&lt;li&gt;IT infrastructure offered as a services&lt;/li&gt;
&lt;li&gt;Hosted application functionality&lt;/li&gt;
&lt;li&gt;A virtualized server deployment&lt;/li&gt;
&lt;/ol&gt;
InfoWorld wrote &lt;a href="http://www.infoworld.com/article/08/04/07/15FE-cloud-computing-reality_1.html"&gt;What cloud computing really means&lt;/a&gt; last spring to help clarify the distinctions but still, I often have to stop folks when the use the C-word just make sure we're talking about the same things.
&lt;/p&gt;
&lt;p&gt;
Examples of the first definition are services like &lt;a href="http://aws.amazon.com/"&gt;Amazon Web Services&lt;/a&gt; (AWS) or &lt;a href="http://gogrid.com/"&gt;GoGrid&lt;/a&gt;. They provide metered virtual machines, you pay for what you use and have full access (root) on the machines while you use them. Additional goodies such as load balanced clusters, storage facilities and so forth are part of the deal too. Capacity can be scaled up or down on demand and typically, in very rapid fashion. When &lt;a href="http://www.infoworld.com/infoworld/article/08/07/21/30TC-cloud-reviews_8.html"&gt;Peter Wayner reviewed these guys last summer in InfoWorld&lt;/a&gt;, he was enamored with the GUI front ends. Call me old fashioned (or a dyed in the wool geek) but unless they're really saving me a lot of time, I have an aversion to the slick GUI's. For his part, Wayner complained about the AWS command line utilities. Actually, when I use AWS, I use a GUI for an overview of running instances, it's a Firefox plugin (&lt;a href="http://developer.amazonwebservices.com/connect/entry.jspa?externalID=609"&gt;Elasticfox&lt;/a&gt;) but what I &lt;em&gt;really&lt;/em&gt; like about AWS is programmatic access. Integrating application deployment with command and control functionality is very powerful, my tool of choice is &lt;a href="http://code.google.com/p/boto/"&gt;boto&lt;/a&gt;, a Python API for AWS.
&lt;/p&gt;
&lt;p&gt;
The second definition refers to hosted application functionality, in years gone by they were referred to as &lt;em&gt;Application Service Providers&lt;/em&gt; (ASP). The more modern label is &lt;em&gt;Software as a Service&lt;/em&gt; (SaaS). However, these services have to provide more than a console for functionality, they have to provide web service API's that enable them to be integrated into other applications. SalesForce.com was an early leader in this space (remember the red cross-outs, "No Software"), their example and the proliferation of RSS is really want inspired the proliferation of &lt;a href="http://www.programmableweb.com/"&gt;APIs and mashups&lt;/a&gt; we see today.
&lt;/p&gt;
&lt;p&gt;
The last definition refers to VMWare, Xen and so forth. By themselves, those aren't really cloud computing in my book. However, you can use them to create your own "private cloud" with tools like &lt;a href="http://www.enomaly.com/"&gt;Enomaly&lt;/a&gt; and &lt;a href="http://eucalyptus.cs.ucsb.edu/"&gt;Eucalyptus&lt;/a&gt;. This is an area of great interest to me.
&lt;/p&gt;
&lt;p&gt;
In his review, Wayner pointed out how very different all of the services are. I don't know why he included Google's App Engine at all in his write-up. Don't misunderstand, GAE is a great service but it more closely resembles an application container than infrastructure services.
&lt;/p&gt;
&lt;p&gt;
I'm imagining IT infrastructure management interfaces coalescing around standards (de-facto ones, not ones fashioned out of IETF meetings). Eucalyptus is a good discussion point. Eucalyptus provides an EC2 "work-alike" interface on top of a Xen virtual server platform. So picture this: if the Rackspaces, ServePaths, Server Beaches and LayeredTechs of this world were to provide a compatible interfaces built on top of Eucalyptus, buying compute power by the hour would become more like buying gasoline. There may be pros and cons to this station or that but fundamentally, if you don't like the pumps at one gas station or the prices are too high, you can go to the gas station across the street. Given compatible interfaces, management of the infrastructure, be it with boto, Elasticfox or using services such as &lt;a href="http://rightscale.com/"&gt;RightScale&lt;/a&gt; can be as dynamic as the server deployments in those clouds. Such a compute market place would unleash new rounds of innovation as it eases starting up and scaling online services.
&lt;/p&gt;
&lt;p&gt;
The Eucalyptus folks will be the first to fess up that their project is more academic in nature that industrial strength. However, it is the harbinger of AWS as a standard. Yes, I'm referring to AWS as a standard because of the level of adoption its enjoyed, the comprehensive set of APIs it provides and the rich ecosystem around it. What I foresee is that the first vendor to embrace and commoditize a standard interface for infrastructure management changes the game. The game becomes one of a meta-cloud because computing capacity will be truly fluid, flexibly shrinking and growing with hosted clouds, private clouds and migrating between clouds.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/cloud+computing" rel="tag"&gt;cloud computing&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/saas" rel="tag"&gt;saas&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/virtualization" rel="tag"&gt;virtualization&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/aws" rel="tag"&gt;aws&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/xen" rel="tag"&gt;xen&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/social_media_backlash_against_cheaters</guid>
    <title>Social Media Backlash Against Cheaters and Fleshmongers</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/social_media_backlash_against_cheaters</link>
        <pubDate>Tue, 2 Dec 2008 23:42:34 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://en.wikipedia.org/wiki/Cheating" title="Cheating"&gt;
&lt;img src="http://www.arachna.com/images/no_cheaters.jpg" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="116" height="116" /&gt;&lt;/a&gt;
As long as there is any media, pornographers will figure out how use it to purvey their wares. The other week, I mentioned on the &lt;a href="http://technorati.com/weblog/2008/10/455.html"&gt;Technorati blog&lt;/a&gt; that I'd been focusing on some spam scrubbing efforts, including removing porn. Apparently we're not the only social media service taking a look at the bottom line impact of miscreant activities. A few related items of interest percolated recently.
&lt;/p&gt;
&lt;p&gt;
Social network service provider Ning announced their &lt;a href="http://blog.ning.com/2008/12/the-end-of-the-red-light-district.html"&gt;End of the Red Light District&lt;/a&gt;. The high infrastructure costs, lack of revenue and administrative burdens (DMCA actions) were among the reasons cited. Sounds very familiar, we get our share of that kind of pointless nonsense at Technorati too.
&lt;/p&gt;
&lt;p&gt;
Today, YouTube posted that they were going to &lt;a href="http://www.youtube.com/blog?entry=AEX3_7h40mk"&gt;crack down or reduce the visibility of porny videos&lt;/a&gt;.  YouTube's measures include
&lt;ul&gt;
&lt;li&gt;Stricter standard for mature content&lt;/li&gt;
&lt;li&gt;Demotion of sexually suggestive content and profanity&lt;/li&gt;
&lt;li&gt;Improved thumbnails&lt;/li&gt;
&lt;/ul&gt;
&lt;/p&gt;
&lt;p&gt;
As expected in these cases, the trolls come out to cry foul. But this isn't about free speech or puritan ethics, the issue more closely resembles &lt;a href="http://en.wikipedia.org/wiki/Tragedy_of_the_commons"&gt;the tragedy of the commons&lt;/a&gt;. It's really very simple: these parasitic uses consume a lot of resources but bring no benefits to the host and degrade the service for other users.
&lt;/p&gt;
&lt;p&gt;
Also today, &lt;a href="http://mashable.com/2008/12/02/digg-bans-company-that-blatantly-sells-diggs/"&gt;Digg Bans Company That Blatantly Sells Diggs&lt;/a&gt; was reported by Mashable. Apparently Digg has directed a cease-and-desist at USocial.net's practice of selling diggs.
&lt;/p&gt;
&lt;p&gt;
It seems to be an accepted truism that social media oft demonstrates, &lt;a href="http://itc.conversationsnetwork.org/shows/detail461.html"&gt;All Complex Ecosystems Have Parasites&lt;/a&gt;.
Yep, I've talked to folks from &lt;a href="http://sixapart.com/"&gt;Six Apart&lt;/a&gt;, &lt;a href="http://wordpress.com/"&gt;Wordpress&lt;/a&gt;, &lt;a href="http://tumblr.com/"&gt;Tumblr&lt;/a&gt;, &lt;a href="http://twitter.com/"&gt;Twitter&lt;/a&gt; and elsewhere. We're all feeling the pains of success. Over the past month at Technorati, we've purged about 80% of the porn that was active in the search index. Sure, we're not spam free yet but the index is getting a &lt;em&gt;lot&lt;/em&gt; cleaner.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/digg" rel="tag"&gt;digg&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/youtube" rel="tag"&gt;youtube&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/ning" rel="tag"&gt;ning&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/porn" rel="tag"&gt;porn&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/spam" rel="tag"&gt;spam&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/system_gaming</guid>
    <title>System Gaming and Its Consequences</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/system_gaming</link>
        <pubDate>Mon, 1 Dec 2008 22:40:34 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://technorati.com/" title="Technorati"&gt;
&lt;img src="http://www.arachna.com/images/technorati_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="110" height="15" /&gt;&lt;/a&gt;
Technorati's &lt;a href="http://support.technorati.com/faq/topic/71"&gt;authority&lt;/a&gt; metric is based on a real simple concept: the count of the unique set of blogs linking to you in the trailing 180 days constitutes your authority. By its very nature, it's a volatile metric. The &lt;a href="http://technorati.com/pop/blogs"&gt;top 100&lt;/a&gt; of a few years ago bears little resemblance to the one today. When some folks observe their authority rising, they twitter w00ts of joy; when it's falling they complain bitterly that Technorati is "downgrading" them.
&lt;/p&gt;
&lt;p&gt;
Authority is not a perfect metric (crawl coverage variations, etc) nor the only important measurement of a blog (traffic and comments are other metrics we'd like to measure), however it is one that Technorati has been objectively calculating for years.
&lt;/p&gt;
&lt;p&gt;
What I find surprising is the surprise (or denial) that some people find when they learn there are consequences to gaming the system. On a fairly regular basis, someone comes up with the wholly unoriginal idea, "Hey, add your URL to my list of links, re-post it and urge others to follow suit to make your Technorati authority &lt;strong&gt;explode&lt;/strong&gt;!" Or some variant of a viral link exchanging scheme. Some folks take the news graciously, "Oh, that's not OK? I had no idea. It won't happen again." But some of these folks get downright hostile, as if the blog authority metric is their god given right to game.  These are probably the same people who expect appreciation on their home's property value to be a god given right. News flash: it's not. Since it's (apparently for some) not obvious: the attention you garner in the blogosphere and the price someone will pay for your house are driven by market forces. If your authority is dropping, create posts that are link-worthy. There's no shortcut. Blogs engaging in viral linking schemes stand a good chance that indexing will be suspended or the blog removed altogether from Technorati's index.
&lt;/p&gt;
&lt;p&gt;
Use the blogosphere to converse, to entertain, to teach and to learn. We'll do our best to measure it and to build applications with those measurements. If you want to play games, get a Wii.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &lt;/em&gt;&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/big_is_the_problem</guid>
    <title>Big Is The Problem</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/big_is_the_problem</link>
        <pubDate>Sun, 30 Nov 2008 18:05:51 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
I usually don't rant about economics but I wasn't shopping on "Black Friday" (nor will I be tomorrow on "Cyber Monday") - I'm trying to figure out how to tighten my belt. How it is that I, someone outside of the real estate, finance and auto industries that are so problem plagued, am getting caught in our economy's downdraft? Well, let's see.
&lt;/p&gt;
&lt;p&gt;
Last January, Business Week raised the question "When is an institution &lt;a href="http://www.businessweek.com/magazine/content/08_05/b4069032985454.htm"&gt;too big too fail&lt;/a&gt;?" Until September of this year, the financial industry's downward spiral meandered along, like a persistent flu. There were bank failures but the conventional wisdom seemed to be that this was the market at work, winnowing the weak. The bad news ebbed and flowed: mortgage failures, rising oil prices and the weak dollar were countered by stimulus package checks, housing sales leveling off or even rising (where prices crossed their local tipping points) and vibrant web 2.0 and green enterprises. There had been bank failures this year but it took the evaporation of really Big institutions, &lt;a href="http://en.wikipedia.org/wiki/Lehman_Brothers"&gt;Lehman Brothers&lt;/a&gt; and &lt;a href="http://en.wikipedia.org/wiki/Merrill_Lynch"&gt;Merrill Lynch&lt;/a&gt; to put Business Week's question on everyones lips. To free market purists, the answer is obvious: whatever may come, let the failures fail. But the reality is that when an enterprise is so big that its failure disrupts significant portions of the overall national and global economy, whatever may come of its failure won't be good. Everyone suffers and bigness &lt;em&gt;is&lt;/em&gt; the problem. When these companies become indispensable institutions, we should be afraid.
&lt;/p&gt;
&lt;p&gt;
It seems for years there's been a breakdown in accountability. Loan originators could resell their loans and write new ones, no harm no foul. Right? But one of the key problems with that system is that the originators don't have any skin in the game. The have a money merry-go-round and whoever is left holding the paper (big institutions and their investors) draws the short straw. It's total madness. To date, all of the bank failures have resulted in consolidation in some form or another. Lehman is absorbed by Barclays. Merrill by BoA (which already absorbed Countrywide). The big are getting bigger as the competitive field shrinks. Ironically, this perpetuates the problem: bigness. What happens when Barclays or BoA start wobbling next? Now we have yet bigger institutions that are again too big to fail.
&lt;/p&gt;
&lt;p&gt;
Among the remedies dismissed by free-market adherents is one of the Federal Government investing, taking an ownership stake in the banking, insurance and auto giants who have exposed themselves to risk that has subsequently blown up in their faces. "The government won't know any better how these companies should be run" goes the admonishment. But as if it isn't clear by now, the executives paid the big bucks to know how they should be run apparently don't either. As Newsweek explains in &lt;a href="http://www.newsweek.com/id/161199"&gt;The Monster That Ate Wall Street - How Credit Default Swaps Became a Timebomb&lt;/a&gt;, the financial industry had no shortage of creativity when it came skirting the liquidity requirements imposed on them in the years following the S&amp;amp;L crisis. Is it really such a surprise? Michael Lewis (Liar's Poker, Moneyball) recounts in &lt;a href="http://www.portfolio.com/news-markets/national-news/portfolio/2008/11/11/The-End-of-Wall-Streets-Boom"&gt;The End of Wall Streets Boom&lt;/a&gt; (Portfolio Magazine / December 2008), there were those calling Bullshit but things were just going too damned well for those alarms to be heeded.
&lt;/p&gt;
&lt;p&gt;
It's unescapably clear now that the old adage applies, "if it's too good to be true, it probably is."  Until recently, I thought this was only impacting me with the difficulty I had getting my mortgage. But no, the cavalier rating agencies ("the fox was guarding the hen house"), excessively leveraged financial arrangements and detached accountability have led us down this financial rabbit hole into what some now describe as a death spiral.  It's not just a Wall Street problem, it's spillover to Main street has cacaded down Sandhill Road. Here's the ominous and infamous &lt;a href="http://www.slideshare.net/eldon/sequoia-capital-on-startups-and-the-economic-downturn-presentation?type=powerpoint"&gt;slide deck from Doug Leone and friends at Sequoia Capital&lt;/a&gt;:

&lt;div style="width:425px;text-align:left" id="__ss_648808"&gt;&lt;a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/eldon/sequoia-capital-on-startups-and-the-economic-downturn-presentation?type=powerpoint" title="Sequoia Capital on startups and the economic downturn"&gt;Sequoia Capital on startups and the economic downturn&lt;/a&gt;&lt;object style="margin:0px" width="425" height="355"&gt;&lt;param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=sequoia-1223625495238287-9&amp;stripped_title=sequoia-capital-on-startups-and-the-economic-downturn-presentation" /&gt;&lt;param name="allowFullScreen" value="true"/&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;embed src="http://static.slideshare.net/swf/ssplayer2.swf?doc=sequoia-1223625495238287-9&amp;stripped_title=sequoia-capital-on-startups-and-the-economic-downturn-presentation" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;"&gt;View SlideShare &lt;a style="text-decoration:underline;" href="http://www.slideshare.net/eldon/sequoia-capital-on-startups-and-the-economic-downturn-presentation?type=powerpoint" title="View Sequoia Capital on startups and the economic downturn on SlideShare"&gt;presentation&lt;/a&gt; or &lt;a style="text-decoration:underline;" href="http://www.slideshare.net/upload?type=powerpoint"&gt;Upload&lt;/a&gt; your own. (tags: &lt;a style="text-decoration:underline;" href="http://slideshare.net/tag/depression"&gt;depression&lt;/a&gt; &lt;a style="text-decoration:underline;" href="http://slideshare.net/tag/recession"&gt;recession&lt;/a&gt;)&lt;/div&gt;&lt;/div&gt;
These slides were cited during Technorati's company meeting last week around the &lt;a href="http://technorati.com/weblog/2008/11/456.html"&gt;layoffs and salary cuts&lt;/a&gt;.  That really dropped the dark cloud of what's happening in the broader economy close to home.
&lt;/p&gt;
&lt;p&gt;
As bummed as I am about seeing colleagues depart and seeing my paycheck shrink, I'm actually optimistic about the future. Valuations on real estate seem to be reaching reality: they're hitting thresholds that people can afford with conventional financing. Technology continues to fuel innovation and innovation holds the potential to re-shape markets. Come Inauguration Day, it looks like Obama is coming into office surrounding himself with a team of &lt;a href="http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2008/11/25/MN8U14B2BK.DTL"&gt;economic advisors&lt;/a&gt; who are committed to preserving free markets but are also not so steeped in ideology that they're paralyzed about how to intervene.
&lt;/p&gt;
&lt;p&gt;
I'm looking forward to this cloud lifting. That's my rant.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;a href="http://technorati.com/tag/economy" rel="tag"&gt;economy&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/banking" rel="tag"&gt;banking&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/layoffs" rel="tag"&gt;layoffs&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/finance" rel="tag"&gt;finance&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/economics" rel="tag"&gt;economics&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/obama" rel="tag"&gt;obama&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/death+spiral" rel="tag"&gt;death spiral&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/venture+capital" rel="tag"&gt;venture capital&lt;/a&gt; &amp;nbsp;

&lt;/em&gt;&lt;/p&gt; 
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/getting_past_bad_checksums_in</guid>
    <title>Getting Past Bad Checksums in MacPorts</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/getting_past_bad_checksums_in</link>
        <pubDate>Sat, 29 Nov 2008 15:10:44 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://www.macports.org/" title="MacPorts"&gt;
&lt;img src="http://www.arachna.com/images/macports_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="183" height="70" /&gt;&lt;/a&gt;
Back in the 1990's I used FreeBSD fairly extensively. One of my favorite things about the FreeBSD project was the "ports and packages" system for installing libraries and application software. Since Mac OS X is, essentially, BSD with a lot of updated chrome, it's not surprising that there's a well functioning "ports and packages" system for it, &lt;a href="http://www.macports.org/"&gt;MacPorts&lt;/a&gt;. While it's not perfect, MacPorts seems to function and dovetail nicely with everything I use my Mac for, more so than &lt;a href="http://www.finkproject.org/"&gt;Fink&lt;/a&gt;. Sure, dpkg/apt-get seems to work OK on Debian, every effort I've encountered to apply that model elsewhere has left me disappointed... anyway, yum seems to work well enough, I don't expect to use Debian again. 
&lt;/p&gt;
&lt;p&gt;
Recently I found myself with a port that would not install, 
&lt;pre&gt;port install postgis&lt;/pre&gt; 
would bomb out:
&lt;pre&gt;
"Target org.macports.checksum returned: Unable to verify file checksums" postgis
&lt;/pre&gt;
It's not a very helpful error message. After some RTFM ("man port", imagine that), I figured there musta been some cruft in the way, so I did this:
&lt;pre&gt;
port -d selfupdate
port clean --all postgis
port install postgis
&lt;/pre&gt;
And I'm in business with the latest version of &lt;a href="http://postgis.refractions.net/"&gt;PostGIS&lt;/a&gt;. Yes, I coulda installed all of that stuff by hand but MacPorts generally has just what I need in a time-saving way. Note, I do all my MacOS X system administration as root so I'm not typing "sudo" all of the time.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/macports" rel="tag"&gt;macports&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/freebsd" rel="tag"&gt;freebsd&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/postgresql" rel="tag"&gt;postgresql&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/postgis" rel="tag"&gt;postgis&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/macosx" rel="tag"&gt;macosx&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/fink" rel="tag"&gt;fink&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/debian" rel="tag"&gt;debian&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/yum" rel="tag"&gt;yum&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/package+management" rel="tag"&gt;package management&lt;/a&gt; &amp;nbsp;

&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/redistributing_the_karma</guid>
    <title>Redistributing the Karma</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/redistributing_the_karma</link>
        <pubDate>Fri, 28 Nov 2008 21:28:29 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://www.arachna.com/" title="PayPal"&gt;
&lt;img src="http://www.arachna.com/images/paypal_logo.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="200" height="50" /&gt;&lt;/a&gt;
Since Technorati announced pay cuts for the staff earlier this week, I've been a little worried.  The mortgage, an upcoming bat mitzvah (nothing opulent, really), doctor bills... the world won't wait for the economy's doldrums to turn around. I think I'll find ways to to tighten our belts (bag lunches, cancel the gym membership, etc) but if you're currently more fortunate than I am and so inclined, this PayPal &lt;em&gt;Donate&lt;/em&gt; button is a way you can help.
&lt;br/&gt;
&lt;form action="https://www.paypal.com/cgi-bin/webscr" method="post"&gt;
&lt;input type="hidden" name="cmd" value="_s-xclick"&gt;
&lt;input type="hidden" name="encrypted" value="-----BEGIN PKCS7-----MIIHLwYJKoZIhvcNAQcEoIIHIDCCBxwCAQExggEwMIIBLAIBADCBlDCBjjELMAkGA1UEBhMCVVMxCzAJBgNVBAgTAkNBMRYwFAYDVQQHEw1Nb3VudGFpbiBWaWV3MRQwEgYDVQQKEwtQYXlQYWwgSW5jLjETMBEGA1UECxQKbGl2ZV9jZXJ0czERMA8GA1UEAxQIbGl2ZV9hcGkxHDAaBgkqhkiG9w0BCQEWDXJlQHBheXBhbC5jb20CAQAwDQYJKoZIhvcNAQEBBQAEgYADYND0ZI12yx7LpmehTdBmE4oWqsvNSoXLFEOy6tQdiRBv6ahbjkuvKU/QCHk9IT8k3/17Md64UnYXNysbg+QdHWf1b212UZaGgEO14WItHutGYVuc0bdXUuwMa0U8HX+fgjJKfV0nM3s0ZmiInWkqO8CLCbLdvnVBcEUjkDeV+TELMAkGBSsOAwIaBQAwgawGCSqGSIb3DQEHATAUBggqhkiG9w0DBwQIu51lQe/V+z2AgYiyTnEwXSr943h4jzD/ITFw0d8BuA4PnrHoqK66m2Y/Xs5nzfdVgLFuVF9i/g9I8ZviKh/ePMGEUk3gkGB68PDGpc3nmXUPz1CpFlVI7rFa0t+E3gD2QrFywxDTly+LF9ewRDXEJpMWzMtkYpvZbkEEKL4twuwXH3jGBB+tmIrk9cUAbS4bNRgQoIIDhzCCA4MwggLsoAMCAQICAQAwDQYJKoZIhvcNAQEFBQAwgY4xCzAJBgNVBAYTAlVTMQswCQYDVQQIEwJDQTEWMBQGA1UEBxMNTW91bnRhaW4gVmlldzEUMBIGA1UEChMLUGF5UGFsIEluYy4xEzARBgNVBAsUCmxpdmVfY2VydHMxETAPBgNVBAMUCGxpdmVfYXBpMRwwGgYJKoZIhvcNAQkBFg1yZUBwYXlwYWwuY29tMB4XDTA0MDIxMzEwMTMxNVoXDTM1MDIxMzEwMTMxNVowgY4xCzAJBgNVBAYTAlVTMQswCQYDVQQIEwJDQTEWMBQGA1UEBxMNTW91bnRhaW4gVmlldzEUMBIGA1UEChMLUGF5UGFsIEluYy4xEzARBgNVBAsUCmxpdmVfY2VydHMxETAPBgNVBAMUCGxpdmVfYXBpMRwwGgYJKoZIhvcNAQkBFg1yZUBwYXlwYWwuY29tMIGfMA0GCSqGSIb3DQEBAQUAA4GNADCBiQKBgQDBR07d/ETMS1ycjtkpkvjXZe9k+6CieLuLsPumsJ7QC1odNz3sJiCbs2wC0nLE0uLGaEtXynIgRqIddYCHx88pb5HTXv4SZeuv0Rqq4+axW9PLAAATU8w04qqjaSXgbGLP3NmohqM6bV9kZZwZLR/klDaQGo1u9uDb9lr4Yn+rBQIDAQABo4HuMIHrMB0GA1UdDgQWBBSWn3y7xm8XvVk/UtcKG+wQ1mSUazCBuwYDVR0jBIGzMIGwgBSWn3y7xm8XvVk/UtcKG+wQ1mSUa6GBlKSBkTCBjjELMAkGA1UEBhMCVVMxCzAJBgNVBAgTAkNBMRYwFAYDVQQHEw1Nb3VudGFpbiBWaWV3MRQwEgYDVQQKEwtQYXlQYWwgSW5jLjETMBEGA1UECxQKbGl2ZV9jZXJ0czERMA8GA1UEAxQIbGl2ZV9hcGkxHDAaBgkqhkiG9w0BCQEWDXJlQHBheXBhbC5jb22CAQAwDAYDVR0TBAUwAwEB/zANBgkqhkiG9w0BAQUFAAOBgQCBXzpWmoBa5e9fo6ujionW1hUhPkOBakTr3YCDjbYfvJEiv/2P+IobhOGJr85+XHhN0v4gUkEDI8r2/rNk1m0GA8HKddvTjyGw/XqXa+LSTlDYkqI8OwR8GEYj4efEtcRpRYBxV8KxAW93YDWzFGvruKnnLbDAF6VR5w/cCMn5hzGCAZowggGWAgEBMIGUMIGOMQswCQYDVQQGEwJVUzELMAkGA1UECBMCQ0ExFjAUBgNVBAcTDU1vdW50YWluIFZpZXcxFDASBgNVBAoTC1BheVBhbCBJbmMuMRMwEQYDVQQLFApsaXZlX2NlcnRzMREwDwYDVQQDFAhsaXZlX2FwaTEcMBoGCSqGSIb3DQEJARYNcmVAcGF5cGFsLmNvbQIBADAJBgUrDgMCGgUAoF0wGAYJKoZIhvcNAQkDMQsGCSqGSIb3DQEHATAcBgkqhkiG9w0BCQUxDxcNMDgxMTI5MDUwMzAzWjAjBgkqhkiG9w0BCQQxFgQUt6IhXky63eVMOWE7+CrTIKvN9TMwDQYJKoZIhvcNAQEBBQAEgYAp8dRlsFx1odHXT1MN+CGjaQdYb6HRHwpbrfk+wc6XvNskVQYEmhh0tZVVJf8a2PrrVfLfLyPueDKYvvVa/zaxzCIWsdD9NY23/1pWP1BGffrq42RK/rhwnnqxw1hhp0W0C1vUf6Q8Jl2VBgm7pwZdnY15ijnOo/gAmpEXmgdaZg==-----END PKCS7-----
"&gt;
&lt;input type="image" src="https://www.paypal.com/en_US/i/btn/btn_donateCC_LG.gif" border="0" name="submit" alt=""&gt;
&lt;img alt="" border="0" src="https://www.paypal.com/en_US/i/scr/pixel.gif" width="1" height="1"&gt;
&lt;/form&gt;
&lt;/p&gt;
&lt;p&gt;
If I end up with more than needed, I'll simply donate the excess to a worthy charity.
&lt;br/&gt;
Thanks!
&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/topic_clustering_visualized_in_library</guid>
    <title>Topic Clustering Visualized in Library Search</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/topic_clustering_visualized_in_library</link>
        <pubDate>Thu, 27 Nov 2008 11:43:13 -0800</pubDate>
    <category>What's That Noise?</category>
    <category>jazz</category>
    <category>search</category>
    <category>guitar</category>
    <category>django</category>
    <category>programming</category>
    <category>python</category>
            <description>&lt;p&gt;
&lt;a href="http://ccclib.org/" title="Contra Costa County Library"&gt;
&lt;img src="http://www.arachna.com/images/ccclogo.gif" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="48" height="48" /&gt;&lt;/a&gt;
Public service announcement: your low-tech dowdy public libraries have slicked up high-tech. The old days of long searches through card catalogs and filling out forms in triplicate are gone. Since moving to the east bay several years ago, I've been impressed with the &lt;a href="http://ccclib.org/"&gt;Contra Costa County Library&lt;/a&gt;'s online catalog that searches all of the branches in the country, online reservations and inter-branch transfers. One of my favorite features is the visual topic clustering.
&lt;/p&gt;
&lt;p style="clear: both;"&gt;
&lt;div style="width: 640px; align: center; clear: left;"&gt;
&lt;a href="http://www.arachna.com/images/search_django.png" title="Screen grab of search for &amp;quot;django&amp;quot;"&gt;
&lt;img src="http://www.arachna.com/images/search_django_thumb.png" style="float:left; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="237" height="266" /&gt;&lt;/a&gt;
When searching for "&lt;a href="http://catalog.ccclib.org/?q=django" title="Contra Costa County Library catalog search for &amp;quot;django&amp;quot;"&gt;django&lt;/a&gt;", a hub-and-spoke is displayed with related nodes such as "reinhardt" and "guitar" as well as misspell candidates. The search results are pretty good too, the first result is for a Gypsy jazz guitar (Django Reinhardt's signature style) instructional video by the main guy from &lt;a href="http://hcsf.com/"&gt;Hot Club San Francisco&lt;/a&gt; (Paul Mehling can often be found gigging here in the east bay at the &lt;a href="http://leftbank.com/locations/pleasant_hill.php?location=3"&gt;Left Bank in Pleasant Hill&lt;/a&gt;, good stuff). Overall, the selection of books, CD's and videos matching "django" was what I expected.
&lt;/div&gt;
&lt;/p&gt;
&lt;p style="clear: both;"&gt;
&lt;div style="width: 640px; align: center; clear: right;"&gt;
&lt;a href="http://www.arachna.com/images/search_python_django.png" title="Screen grab of search for &amp;quot;python django&amp;quot;"&gt;
&lt;img src="http://www.arachna.com/images/search_python_django_thumb.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="244" height="265" /&gt;&lt;/a&gt;
As fond as I am of Gypsy jazz, I'm also interested in the web application framework written in the &lt;a href="http://python.org/"&gt;Python programming language&lt;/a&gt;. Changing my query to "&lt;a href="http://catalog.ccclib.org/?q=python+django" title="Contra Costa County Library catalog search for &amp;quot;python django&amp;quot;"&gt;python django&lt;/a&gt;" brings up a different visual cluster with some of the same cluster terms ("reinhardt" and "guitar") but adds some new ones "monty", "boa" and "computer". The search results were exactly what I wanted: &lt;a href="http://www.amazon.com/exec/obidos/ASIN/1590597257/wwwarachnacom-20"&gt;The Definitive Guide to Django: Web Development Done Right by Adrian Holovaty and Jacob Kaplan-Moss&lt;/a&gt; and &lt;a href="http://www.amazon.com/exec/obidos/ASIN/067232959X/wwwarachnacom-20"&gt;Sams Teach Yourself Django in 24 Hours by Brad Dayley&lt;/a&gt;. I'm planning on using &lt;a href="http://www.djangoproject.com/"&gt;django&lt;/a&gt; (the python web app framework) for a project (not &lt;a href="http://technorati.com/"&gt;work&lt;/a&gt; related) and, while the online docs are pretty good, having a book (or two) to refer to is definitely welcomed.
&lt;/div&gt;
&lt;/p&gt;
&lt;p style="clear: both;"&gt;
All said, I'm a fan of the search and clustering technology enabled by &lt;a href="http://www.aquabrowser.com/"&gt;AquaBrowser&lt;/a&gt; that the CCC library is using, it's had me wondering how well it would perform against the more volatile data set flowing through &lt;a href="http://technorati.com/"&gt;Technorati&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/django" rel="tag"&gt;django&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/python" rel="tag"&gt;python&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/programming" rel="tag"&gt;programming&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/gypsy+jazz" rel="tag"&gt;gypsy jazz&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/search+clustering" rel="tag"&gt;search clustering&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/library" rel="tag"&gt;library&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;
</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/wordpress_security_revisited</guid>
    <title>Wordpress Security Revisited</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/wordpress_security_revisited</link>
        <pubDate>Wed, 26 Nov 2008 07:09:11 -0800</pubDate>
    <category>What's That Noise?</category>
    <category>wordpress</category>
            <description>&lt;p&gt;
&lt;a href="http://wordpress.org/" title="WordPress"&gt;
&lt;img src="http://www.arachna.com/images/wordpress_logo.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="189" height="45" /&gt;&lt;/a&gt;
The incidence of Wordpress compromises I wrote of in the spring is still high but the rate of new infections has dropped considerably. A lot of people learned of their blogs' affliction because they were not getting indexed by Technorati. Props to the folks from Google and the Wordpress team for getting the message out too.
&lt;/p&gt;
&lt;p&gt;
Yesterday's release of Wordpress 2.6.5 doesn't target SQL injection or XML-RPC vulnerabilities, this time it's a cross site scripting vulnerability.
&lt;blockquote cite="http://wordpress.org/development/2008/11/wordpress-265/"&gt;
The security issue is an XSS exploit discovered by Jeremias Reith that fortunately only affects IP-based virtual servers running on Apache 2.x. If you are interested only in the security fix, copy wp-includes/feed.php and wp-includes/version.php from the 2.6.5 release package.
&lt;br/&gt;
2.6.5 contains three other small fixes in addition to the XSS fix. The first prevents accidentally saving post meta information to a revision. The second prevents XML-RPC from fetching incorrect post types. The third adds some user ID sanitization during bulk delete requests. For a list of changed files, consult the full changeset between 2.6.3 and 2.6.5.
&lt;br/&gt;
&lt;a href="http://wordpress.org/development/2008/11/wordpress-265/"&gt;read the full post&lt;/a&gt;
&lt;/blockquote&gt;
So jump on it Wordpress users, time to update!
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/wordpress" rel="tag"&gt;wordpress&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/security" rel="tag"&gt;security&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/technorati" rel="tag"&gt;technorati&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/blogging" rel="tag"&gt;blogging&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
    <item>
    <guid isPermaLink="true">http://www.arachna.com/roller/spidaman/entry/fifteen_hiccups_of_fame</guid>
    <title>Fifteen Hiccups Of Fame</title>
    <dc:creator>spidaman</dc:creator>
    <link>http://www.arachna.com/roller/spidaman/entry/fifteen_hiccups_of_fame</link>
        <pubDate>Tue, 25 Nov 2008 14:12:45 -0800</pubDate>
    <category>What's That Noise?</category>
            <description>&lt;p&gt;
&lt;a href="http://technorati.com/search/%22Sarah+Palin%22" title="Search for &amp;quot;Sarah Palin&amp;quot;"&gt;
&lt;img src="http://www.arachna.com/images/sarah_palin_100d_chart.png" style="float:right; margin:5px 5px 5px 5px;cursor:pointer; cursor:hand;" border="0" width="410" height="250" /&gt;&lt;/a&gt;
It's been a long time since I've felt hopeful about the outcome of an election. I remember well the civil rights anti-war marches of my childhood. The recent years have felt like a return of profound delusional corruption and polarization that marked the Nixon era. The most peculiar aspect of it is how it came to a head with desperate gasp of Sarah Palin's VP nomination. That the questions about her qualifications were questioned at all demonstrates the height of delusion. While the republican fringe ran to embrace her, those with a brain could only ask "WTF?" and cross party lines. If she had been equipped otherwise between those legs, it would never have happened. It's like reverse sexism, if it had been a man with that background and view points, he would have been laughed out the door as a naive hillbilly. (Yep, I am a reasonably educated urban elitist, so?) Instead, we were treated to tragic comedy in slow mo. The graphic included here is a chart of the blogosphere's mention trajectory for Sarah Palin over the prior 100 days. 
&lt;/p&gt;
&lt;p&gt;
The timeline starts off with the igloo phase: practically nobody has heard of her and nobody is talking about her. Then there's the nomination at the republican convention. Followed by interviews, the Tina Fey phenom, the election and ...back to the igloo. So long, Caribou Barbie.
&lt;/p&gt;
&lt;p&gt;
All of the talk about her PR representation, book deals, etc are for naught; she's proven at every opportunity that she has nothing to say that is meaningful and contributing to moving our society forward. Nonetheless, it's amusing to read the conservative bloggers who talk about Sarah Palin as "the future of the republican party." As long as they stick to that meme, they're assuring themselves falling further adrift of where this country is going. Bon voyage, don't let that iceberg hit you in the butt on the way out! 
&lt;/p&gt;
&lt;p&gt;&lt;em&gt;
&lt;a href="http://technorati.com/tag/Sarah+Palin" rel="tag"&gt;Sarah Palin&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/Election" rel="tag"&gt;Election&lt;/a&gt; &amp;nbsp;
&lt;a href="http://technorati.com/tag/Politics" rel="tag"&gt;Politics&lt;/a&gt; &amp;nbsp;
&lt;/em&gt;&lt;/p&gt;</description>          </item>
  </channel>
</rss>
