<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0"><channel><description>Official blog of Shalin Shekhar Mangar</description><title>Shalin Says...</title><generator>Tumblr (3.0; @shalinmangar)</generator><link>http://shal.in/</link><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/ShalinSays" /><feedburner:info uri="shalinsays" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://tumblr.superfeedr.com/" /><item><title>Apache Lucene 3.1.0 and Apache Solr 3.1.0</title><description>&lt;a href="http://www.lucidimagination.com/blog/2011/03/31/apache-lucene-3-1-0-and-apache-solr-3-1-0/"&gt;Apache Lucene 3.1.0 and Apache Solr 3.1.0&lt;/a&gt;: &lt;p&gt;This is the first release bringing Lucene and Solr release versions in sync. There are numerous bug fixes, optimizations and new features. Download from &lt;a href="http://www.apache.org/dyn/closer.cgi/lucene/java"&gt;here&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/PcwTI9USb3g" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/PcwTI9USb3g/4256013716</link><guid isPermaLink="false">http://shal.in/post/4256013716</guid><pubDate>Fri, 01 Apr 2011 16:24:59 +0530</pubDate><feedburner:origLink>http://shal.in/post/4256013716</feedburner:origLink></item><item><title>My Android phone - Samsung Galaxy S Review</title><description>&lt;p&gt;I had been holding out on buying a more internet friendly phone for some time now, waiting for 3G service to start in India. After &lt;a target="_self" href="http://shal.in/post/735399522/the-ipad-experience"&gt;my iPad experience&lt;/a&gt;, it was clear to me that I couldn’t be happy with an iPhone but it was also obvious that none of the available Android phones were good enough.&lt;/p&gt;
&lt;p&gt;Enter the new &lt;a href="http://galaxys.samsungmobile.com/"&gt;Samsung Galaxy S&lt;/a&gt; with Android 2.1, awesome 4” display, light weight and a good (enough) battery life. A little market research showed that Samsung’s &lt;a href="http://en.wikipedia.org/wiki/Super_AMOLED"&gt;Super AMOLED&lt;/a&gt; technology was best in class and a better phone, the &lt;a href="http://www.slashgear.com/samsung-galaxy-s2-i9200-tipped-2ghz-720p-super-amoled-display-and-more-0592814/"&gt;Samsung Galaxy S2&lt;/a&gt;, will only be available next year. With 3G to be introduced (supposedly) around October, the stage was set. So, one fine July evening, I bought the Samsung Galaxy S.&lt;/p&gt;
&lt;p&gt;&lt;img src="http://androidandme.com/wp-content/uploads/2010/06/samsung-galaxy-s-international.jpg" alt="Samsung Galaxy S" width="540" height="429"/&gt;&lt;/p&gt;
&lt;p&gt;First of all, a message to people who are surprised upon hearing the price - the Samsung Galaxy S is not a phone; it is a &lt;a href="http://galaxys.samsungmobile.com/specification/spec.html?ver=low"&gt;pocket sized computer&lt;/a&gt; that also happens to be a phone. And what have I been upto with this device? Here’s a list of ten things (in no particular order) that I have used it to:&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;Get email alerts via &lt;a href="http://code.google.com/p/k9mail/"&gt;K9 Mail&lt;/a&gt;, read blogs, check twitter, buzz and facebook feeds&lt;/li&gt;
&lt;li&gt;Take photographs and upload it to Facebook (darn you Airtel, I could never get MMS to work properly)&lt;/li&gt;
&lt;li&gt;Watch movies on my desktop monitor and phone through VLC Remote and &lt;a href="http://www.gmote.org/"&gt;Gmote&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Stream music from &lt;a href="http://listen.grooveshark.com/"&gt;GrooveShark&lt;/a&gt; - Yay! freedom from syncing my music&lt;/li&gt;
&lt;li&gt;Play &lt;a href="http://www.youtube.com/watch?v=QsFQMeHCBao"&gt;Asphalt5&lt;/a&gt; while sitting at the back of an autorickshaw - something ironic about it.&lt;/li&gt;
&lt;li&gt;Use AIM to answer questions from our QA team while having lunch. Note to myself, use a spoon next time if you want to use the phone.&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.geeky-gadgets.com/road-sms-lets-you-walk-around-while-texting-12-07-10/"&gt;SMS while walking down the road&lt;/a&gt; without the fear of banging into an obstacle.&lt;/li&gt;
&lt;li&gt;Read a book on the &lt;a href="http://www.amazon.com/gp/feature.html?ie=UTF8&amp;docId=165849822"&gt;Kindle&lt;/a&gt; app.&lt;/li&gt;
&lt;li&gt;Navigate to Kolar and back with Google Maps&lt;/li&gt;
&lt;li&gt;Impress people with the &lt;a href="http://www.youtube.com/watch?v=9phlk78qcro"&gt;live wallpapers&lt;/a&gt;, &lt;a href="http://gesturesearch.googlelabs.com/"&gt;gesture search&lt;/a&gt; and some not so useful tidbits such as a lie detector and a &lt;a href="http://www.google.com/mobile/skymap/"&gt;sky map&lt;/a&gt;.&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;Google integration means that all my contacts are backed up on their servers. I am still surprised that something as simple as backing up contacts has been so hard for phones till now. Another cool feature that I loved was that I could link phone numbers with Facebook contacts together. Now when a friend calls, I see his Facebook photo automatically. The device has a decent battery life; with on and off wifi use, it lasts for a little more than a day. The device has a 5 megapixel camera and can record a 720p video. And, by the way, flash sites works too.&lt;/p&gt;
&lt;p&gt;Before I bought the phone, I wasn’t sure how easy it would be to use a touch keyboard. Well, it wasn’t too hard with the default keyboard but ever since I switched to the &lt;a href="http://www.youtube.com/watch?v=mRUoWUhcRlE&amp;feature=fvw"&gt;Swype keyboard&lt;/a&gt;, I’m insanely fast. I’d recommend Swype to everybody. Good thing that Samsung provides this keyboard application for free.&lt;/p&gt;
&lt;p&gt;There are a few downsides though. The phone feels sluggish when more than a couple of applications are running. The “Advanced Task Killer” application is essential for a decent experience. There is no flash light so one must depend on external light source being available to avoid dark photos. Sometimes (and this happens rarely), after moving out of a wifi zone, it won’t automatically switch to a GPRS/EDGE connection unless I restart the phone.&lt;/p&gt;
&lt;p&gt;All in all, I’m happy with the device and eagerly waiting for an Android 2.2 (froyo) upgrade and Airtel’s 3G service.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/7rHv-qUPuyk" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/7rHv-qUPuyk/909119227</link><guid isPermaLink="false">http://shal.in/post/909119227</guid><pubDate>Fri, 06 Aug 2010 02:15:00 +0530</pubDate><category>Product Review</category><category>Android</category><feedburner:origLink>http://shal.in/post/909119227</feedburner:origLink></item><item><title>Solr 1.4.1 Released</title><description>&lt;a href="http://www.lucidimagination.com/blog/2010/06/28/announce-solr-1-4-1-released/"&gt;Solr 1.4.1 Released&lt;/a&gt;: &lt;p&gt;From the mailing list announcement:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;span&gt;Apache Solr 1.4.1 has been released and is now available for public&lt;br/&gt;download!&lt;br/&gt;&lt;a href="http://www.apache.org/dyn/closer.cgi/lucene/solr/"&gt;&lt;a href="http://www.apache.org/dyn/closer.cgi/lucene/solr/"&gt;http://www.apache.org/dyn/closer.cgi/lucene/solr/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;span&gt;&lt;span&gt;Solr 1.4.1 is a bug fix release. See the &lt;a href="http://svn.apache.org/repos/asf/lucene/solr/tags/release-1.4.1/CHANGES.txt"&gt;change log&lt;/a&gt; for more details.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/kPQKQHsMoqw" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/kPQKQHsMoqw/748932725</link><guid isPermaLink="false">http://shal.in/post/748932725</guid><pubDate>Tue, 29 Jun 2010 13:57:23 +0530</pubDate><category>Apache Solr</category><feedburner:origLink>http://shal.in/post/748932725</feedburner:origLink></item><item><title>The iPad experience</title><description>&lt;p&gt;&lt;img height="690" width="570" alt="Apple iPad" src="http://images.apple.com/ipad/home/images/hero7_20100621.png"/&gt;&lt;/p&gt;
&lt;p&gt;Now that I have had the chance to play with the Apple iPad, I thought I’d put down some observations and opinions about the device. I know that I’m late to the party but hey, they don’t sell these devices here in India.&lt;/p&gt;
&lt;p&gt;First of all, I love that I do not have to sit before a desk to use this. There is a real need in the world for a device like this and even though many have tried in the past, hats off to Apple for pulling this off. No company other than Apple, without its army of fan boys, would have been able to make this kind of a product successful. But now that Apple has, we can all expect that the new shiny competing devices will be able to do at least as much as what the iPad does. So, thank you Apple!&lt;/p&gt;
&lt;p&gt;The model that I have tried is the 32GB iPad. The device is slim, the touch experience is intuitive and the display is sharp. It is great as a browsing device. I guess it would have been easier to use if it were a little more lighter - which is not too much to expect from later versions. The dimensions at 9.56 x 7.47 inches feels a little too big at times. A 7-8 inches display may be more handy.&lt;/p&gt;
&lt;p&gt;Typing on the touch keyboard takes a little getting used to but practice makes perfect. Still, without my favorite set of browser plugins, I find it hard to share stuff on twitter/buzz. I guess the iPad is designed for people who are mostly consumers of data rather than creators. The device does not have a USB port which is a bummer. A Linux user like me cannot transfer songs or movies from my laptop to the device because one must use iTunes. I had to borrow a friend’s laptop to use Windows(!) so that I can transfer some media into the device. But to do that, you need to convert your divx media to the format iPad can play. The App Store is not open to anyone outside the US so I couldn’t use it. &lt;/p&gt;
&lt;p&gt;As a geek, one gets bored with the device very soon. There’s just too many things that I want to do with it but am not able to because Apple won’t let me. I am looking forward to the &lt;a href="http://www.engadget.com/2010/06/14/samsungs-7-inch-tab-to-run-android-2-2-on-1-2ghz-a8-processor/"&gt;Samsung Android based tablet&lt;/a&gt; which is supposed to be coming out in Q3. No one can beat Apple in the wow factor category but I am hoping that the Android devices may allow me to do much more.&lt;/p&gt;
&lt;p&gt;[Update] - I used the excellent &lt;a href="http://handbrake.fr/"&gt;Handbrake&lt;/a&gt; application to convert between media formats and I’d highly recommend it.&lt;/p&gt;
&lt;p&gt;[Update 2] - &lt;a href="http://www.google.com/buzz/shalinmangar/gKFGWuvqJWT/The-iPad-experience-Shalin-Says"&gt;An interesting discussion on Google Buzz on this topic&lt;/a&gt;.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/L_TiQ2LJpsk" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/L_TiQ2LJpsk/735399522</link><guid isPermaLink="false">http://shal.in/post/735399522</guid><pubDate>Sat, 26 Jun 2010 00:18:00 +0530</pubDate><feedburner:origLink>http://shal.in/post/735399522</feedburner:origLink></item><item><title>Apache Mahout 0.3 Released</title><description>&lt;p&gt;&lt;a&gt;Apache Mahout&lt;/a&gt; 0.3 has been released. Apache Mahout is a project which attempts to make machine learning both scalable and accessible. It is a sub-project of the excellent &lt;a&gt;Apache Lucene&lt;/a&gt; project which provides open source search software.&lt;/p&gt;
&lt;p&gt;From the project website:&lt;/p&gt;
&lt;blockquote&gt;The Apache Lucene project is pleased to announce the release of Apache Mahout 0.3. Highlights include:&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;ul&gt;&lt;li&gt;New: math and collections modules based on the high performance Colt library&lt;/li&gt;
&lt;li&gt;Faster Frequent Pattern Growth(FPGrowth) using FP-bonsai pruning&lt;/li&gt;
&lt;li&gt;Parallel Dirichlet process clustering (model-based clustering algorithm)&lt;/li&gt;
&lt;li&gt;Parallel co-occurrence based recommender&lt;/li&gt;
&lt;li&gt;Parallel text document to vector conversion using LLR based ngram generation&lt;/li&gt;
&lt;li&gt;Parallel Lanczos SVD (Singular Value Decomposition) solver&lt;/li&gt;
&lt;li&gt;Shell scripts for easier running of algorithms, utilities and examples&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;.. and much much more: code cleanup, many bug fixes and performance improvements&lt;/p&gt;
&lt;p&gt;Details on what’s included can be found in the &lt;a&gt;release notes&lt;/a&gt;. Downloads are available from the &lt;a&gt;Apache Mirrors&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Congratulations and thanks to all Mahout developers!&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/vQ8cYR4Jhw0" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/vQ8cYR4Jhw0/456339651</link><guid isPermaLink="false">http://shal.in/post/456339651</guid><pubDate>Thu, 18 Mar 2010 14:58:29 +0530</pubDate><category>Apache Mahout</category><feedburner:origLink>http://shal.in/post/456339651</feedburner:origLink></item><item><title>Merging Lucene and Solr</title><description>&lt;p&gt;A couple of weeks back, Apache Lucene Committer and PMC member, Michael McCandless started a discussion on &lt;a href="http://search-lucene.com/m/kZkk3CXObA"&gt;factoring out a shared, standalone Analysis package for Lucene, Solr and Nutch&lt;/a&gt;. During the discussions, Yonik Seeley, Solr Creator, &lt;a href="http://search-lucene.com/m/jISUj1CXObA"&gt;proposed merging the development of Lucene and Solr&lt;/a&gt;. After &lt;a href="http://search-lucene.com/m/kZkk3CXObA/Factor+out+a+standalone%252C+shared+analysis+package+for+Nutch%252FSolr%252FLucene%255C%253F/v=threaded"&gt;intense&lt;/a&gt; discussions and &lt;a href="http://lucene.markmail.org/thread/w7wj4yla7tdytgw3"&gt;multiple&lt;/a&gt; &lt;a href="http://markmail.org/thread/lwzravbw23gxqwje"&gt;rounds&lt;/a&gt; of &lt;a href="http://lucene.markmail.org/thread/cmbek34vfycb6sfc"&gt;voting&lt;/a&gt;, the following changes are being put into effect:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Merging the developer mailing lists into a single list.&lt;/li&gt;
&lt;li&gt;Merging the set of committers.&lt;/li&gt;
&lt;li&gt;When any change is committed (to a module that “belongs to” Solr or to Lucene), all tests must pass.&lt;/li&gt;
&lt;li&gt;Release details will be decided by dev community, but, Lucene may release without Solr.&lt;/li&gt;
&lt;li&gt;Modularize the sources: pull things out of Lucene’s core (break out query parser, move all core queries &amp; analyzers under their contrib counterparts), pull things out of Solr’s core (analyzers, queries).&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;The following things do not change:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Besides modularizing (above), the source code would remain factored into separate dirs/modules the way it is now.&lt;/li&gt;
&lt;li&gt;Issue tracking remains separate (SOLR-XXX and LUCENE-XXX issues).&lt;/li&gt;
&lt;li&gt;User’s lists remain separate.&lt;/li&gt;
&lt;li&gt;Web sites remain separate.&lt;/li&gt;
&lt;li&gt;Release artifacts/jars remain separate.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;So what does it mean for Lucene/Solr users? Nothing much, really. Except that you should see tighter co-ordination between Lucene and Solr development. New Lucene features should reach Solr faster and releases should be more frequent. Solr features may also be made available to Lucene users who do not want to setup Solr use the RESTy APIs.&lt;/p&gt;
&lt;p&gt;Already, Solr has been upgraded to use Lucene trunk (in branches/solr) and should soon become the new Solr trunk. There is talk of &lt;a href="http://markmail.org/thread/3dy35xfy4gpizjkp"&gt;re-organizing the source structure&lt;/a&gt; to better fit the new model. Things are moving fast!&lt;/p&gt;
&lt;p&gt;Personally, I feel that this merge is a good thing for both Lucene and Solr:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Solr users get the latest Lucene improvements faster and releases get streamlined.&lt;/li&gt;
&lt;li&gt;Lucene users get access to Solr features such as faceting.&lt;/li&gt;
&lt;li&gt;The in-sync trunk allows new features to make their way into the right place (Lucene vs Solr) more easily and duplication is minimized.&lt;/li&gt;
&lt;li&gt;Bugs are caught earlier by the huge combined test suite.&lt;/li&gt;
&lt;li&gt;More number of committers means more ideas and hands available to the projects&lt;/li&gt;
&lt;li&gt;Other Lucene based projects can benefit too because many Solr features will be made available through Java APIs.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;There are a couple of things to be worked out. For example, we need to decide where the integrated sources should live and whether or not to &lt;a href="http://markmail.org/thread/saaljcvmq6os6rpv"&gt;sync Solr’s version with Lucene’s&lt;/a&gt;. All this will take some time but I am confident that our combined community will manage the transition well.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/Bxn_vL5JO4s" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/Bxn_vL5JO4s/452753260</link><guid isPermaLink="false">http://shal.in/post/452753260</guid><pubDate>Wed, 17 Mar 2010 01:18:00 +0530</pubDate><category>Apache Solr</category><category>Apache Lucene</category><feedburner:origLink>http://shal.in/post/452753260</feedburner:origLink></item><item><title>Apache Lucene Java 3.0.1 and 2.9.2 Released</title><description>&lt;p&gt;&lt;img src="http://lucene.apache.org/images/lucene_green_300.gif" alt="Apache Lucene Logo" width="300" height="46"/&gt;&lt;/p&gt;
&lt;p&gt;Congratulations to the Apache Lucene team on releasing Lucene Java 3.0.1 and 2.9.2. Both of these are bug fix releases and are backwards compatible with Lucene Java 3.0.0 and 2.9.1 respectively.&lt;/p&gt;
&lt;p&gt;From the &lt;a href="http://bit.ly/bf8vlD"&gt;official announcement&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Hello Lucene users,&lt;/p&gt;
&lt;p&gt;On behalf of the Lucene development community I would like to announce the release of Lucene Java versions 3.0.1 and 2.9.2:&lt;/p&gt;
&lt;p&gt;Both releases fix bugs in the previous versions:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;2.9.2 is a bugfix release for the Lucene Java 2.x series, based on Java 1.4&lt;/li&gt;
&lt;li&gt;3.0.1 has the same bug fix level but is for the Lucene Java 3.x series, based on Java 5&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;New users of Lucene are advised to use version 3.0.1 for new developments, because it has a clean, type-safe API.&lt;/p&gt;
&lt;p&gt;Important improvements in these releases include:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;An increased maximum number of unique terms in each index segment.&lt;/li&gt;
&lt;li&gt;Fixed experimental CustomScoreQuery to respect per-segment search. This introduced an API change!&lt;/li&gt;
&lt;li&gt;Important fixes to IndexWriter: a commit() thread-safety issue, lost document deletes in near real-time indexing.&lt;/li&gt;
&lt;li&gt;Bugfixes for Contrib’s Analyzers package.&lt;/li&gt;
&lt;li&gt;Restoration of some public methods that were lost during deprecation removal.&lt;/li&gt;
&lt;li&gt;The new Attribute-based TokenStream API now works correctly with different class loaders.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;Both releases are fully compatible with the corresponding previous versions. We strongly recommend upgrading to 2.9.2 if you are using 2.9.1 or 2.9.0; and to 3.0.1 if you are using 3.0.0.&lt;/p&gt;
&lt;p&gt;See core changes at:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://lucene.apache.org/java/3_0_1/changes/Changes.html"&gt;http://lucene.apache.org/java/3_0_1/changes/Changes.html&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://lucene.apache.org/java/2_9_2/changes/Changes.html"&gt;http://lucene.apache.org/java/2_9_2/changes/Changes.html&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;and contrib changes at:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://lucene.apache.org/java/3_0_1/changes/Contrib-Changes.html"&gt;http://lucene.apache.org/java/3_0_1/changes/Contrib-Changes.html&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://lucene.apache.org/java/2_9_2/changes/Contrib-Changes.html"&gt;http://lucene.apache.org/java/2_9_2/changes/Contrib-Changes.html&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;Binary and source distributions are available at &lt;a title="Download Lucene Java Releases"&gt;&lt;a href="http://www.apache.org/dyn/closer.cgi/lucene/java/"&gt;http://www.apache.org/dyn/closer.cgi/lucene/java/&lt;/a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Lucene artifacts are also available in the Maven2 repository at &lt;a title="Apache Lucene Java artifacts in Maven"&gt;&lt;a href="http://repo1.maven.org/maven2/org/apache/lucene/"&gt;http://repo1.maven.org/maven2/org/apache/lucene/&lt;/a&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/welB2TeaP-M" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/welB2TeaP-M/415145683</link><guid isPermaLink="false">http://shal.in/post/415145683</guid><pubDate>Sat, 27 Feb 2010 15:39:00 +0530</pubDate><category>Apache Lucene</category><feedburner:origLink>http://shal.in/post/415145683</feedburner:origLink></item><item><title>Microsoft dropping FAST search for Linux, Unix</title><description>&lt;p&gt;According to a &lt;a href="http://blogs.msdn.com/enterprisesearch/archive/2010/02/04/innovation-on-linux-and-unix.aspx"&gt;blog post&lt;/a&gt; from Microsoft Distinguished Engineer and CTO, FAST Bjørn Olstad, the 2010 products will be the last to have a search core that runs on Linux and UNIX.&lt;/p&gt;
&lt;p&gt;Being involved in &lt;a href="http://lucene.apache.org/solr"&gt;Apache Solr&lt;/a&gt; and the newly formed &lt;a href="http://incubator.apache.org/connectors/"&gt;Lucene Connectors Framework&lt;/a&gt; (LCF) project, I’m very interested in the implications. Undoubtedly, at least some FAST customers will not be happy with this decision and will want to switch to something which can still run on Linux/UNIX.&lt;/p&gt;
&lt;p&gt;I believe that this is a great opportunity for the Apache Solr/LCF combo. Perhaps, the newly proposed &lt;a href="http://wiki.apache.org/incubator/SpatialProposal"&gt;Apache Spatial Information Systems (SIS)&lt;/a&gt; will help as well. Of course, this is big news for &lt;a href="http://www.lucidimagination.com"&gt;Lucid Imagination&lt;/a&gt;, &lt;a href="http://www.sematext.com"&gt;Sematext&lt;/a&gt; and other companies as well who offer consultancy, training and support for Lucene/Solr.&lt;/p&gt;
&lt;p&gt;I’d like to ask people who have used FAST in the past, what would it take for Lucene/Solr/LCF to fill the gap?&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/ZOOkaxdKLn4" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/ZOOkaxdKLn4/376632778</link><guid isPermaLink="false">http://shal.in/post/376632778</guid><pubDate>Mon, 08 Feb 2010 02:09:47 +0530</pubDate><category>Apache Solr</category><category>Enterprise Search</category><feedburner:origLink>http://shal.in/post/376632778</feedburner:origLink></item><item><title>Saw a Kurkure advertisement on a website titled “No...</title><description>&lt;img src="http://25.media.tumblr.com/tumblr_kwr0qkZQPf1qaxqqlo1_400.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;p&gt;Saw a Kurkure advertisement on a website titled “No plastic in Kurkure”. ROTFL!&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/r0JSnLQvczI" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/r0JSnLQvczI/350623416</link><guid isPermaLink="false">http://shal.in/post/350623416</guid><pubDate>Sun, 24 Jan 2010 16:48:20 +0530</pubDate><category>Funny</category><feedburner:origLink>http://shal.in/post/350623416</feedburner:origLink></item><item><title>Solr In Action Case Studies</title><description>&lt;p&gt;Well, the cat is out of the bag. I’ve been working with &lt;a href="http://www.jroller.com/otis/"&gt;Otis&lt;/a&gt; on &lt;a title="Solr In Action" href="http://www.manning.com/catalog/undercontract.html"&gt;Solr In Action&lt;/a&gt;. We’re looking for a couple of contributors to write case studies for the book describing how they have used Solr. Otis just posted this to his &lt;a href="http://www.jroller.com/otis/entry/contributors_for_solr_in_action"&gt;blog&lt;/a&gt; and to the Solr mailing list as well.&lt;/p&gt;
&lt;p&gt;&lt;b&gt;So, if you are are using Apache Solr in some clever, interesting or unusual way, or deal with large indexes or large number of cores or distributed search and are willing to share this information with the world, please get in touch. We are looking for between five to ten pages (soft limits) per case study.&lt;/b&gt;&lt;/p&gt;
&lt;p&gt;You can contact either me or Otis by leaving a comment on this post with your contact info or contact &lt;a href="http://twitter.com/shalinmangar"&gt;@shalinmangar&lt;/a&gt; or &lt;a href="http://twitter.com/otisg"&gt;@otisg&lt;/a&gt; on Twitter or email me on shalin at apache and we’ll get back to you right away.&lt;/p&gt;
&lt;p&gt;It’d be great if you can share this message around too.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/ihu7eZsXmJ8" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/ihu7eZsXmJ8/335570903</link><guid isPermaLink="false">http://shal.in/post/335570903</guid><pubDate>Fri, 15 Jan 2010 14:55:38 +0530</pubDate><category>Apache Solr</category><category>Solr In Action</category><feedburner:origLink>http://shal.in/post/335570903</feedburner:origLink></item><item><title>The Total Growth of Open Source</title><description>&lt;a href="http://dirkriehle.com/publications/2008/the-total-growth-of-open-source/"&gt;The Total Growth of Open Source&lt;/a&gt;: &lt;p&gt;Amit Desh­pande and Dirk Riehle from SAP Labs have conducted and published a research on the growth of open source software.&lt;/p&gt;
&lt;p&gt;The data has been culled from &lt;a href="http://www.ohloh.net"&gt;Ohloh.net&lt;/a&gt; and is based on the stats and activity of around 5000 open source projects written in 30 different languages and 103 open source licenses.&lt;/p&gt;
&lt;p&gt;Some interesting quotes from the publication:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Suc­cess­ful open source projects like Linux, Apache, Post­greSQL and many oth­ers are grow­ing super-linearly. Pre­vi­ous research showed that lin­ear and qua­dratic growth is the dom­i­nant growth pat­tern of open source soft­ware projects&lt;/p&gt;
&lt;/blockquote&gt;
&lt;blockquote&gt;
&lt;p&gt;Our work shows that the addi­tions to open source projects, the total project size (mea­sured in source lines of code), the num­ber of new open source projects, and the total num­ber of open source projects are grow­ing at an expo­nen­tial rate. The total amount of source code and the total num­ber of projects dou­ble about every 14 months.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Open Source has taken off handsomely and continues to thrive. It is not just about philosophy any more, it is good business sense.&lt;/p&gt;
&lt;p&gt;In case you are interested about Solr stats, see the &lt;a href="https://www.ohloh.net/p/solr"&gt;Solr project page at Ohloh&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;i&gt;Update - The report is quite old but I just discovered it now :)&lt;/i&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/rqYJExhu8oo" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/rqYJExhu8oo/330194280</link><guid isPermaLink="false">http://shal.in/post/330194280</guid><pubDate>Tue, 12 Jan 2010 13:01:00 +0530</pubDate><feedburner:origLink>http://shal.in/post/330194280</feedburner:origLink></item><item><title>SolrMarc vs DIHMarc</title><description>&lt;p&gt;Erik has written about &lt;a href="http://www.lucidimagination.com/blog/2010/01/08/lucid-library-love/"&gt;Solr’s usage in libraries&lt;/a&gt; on the &lt;a href="http://www.lucidimagination.com/blog"&gt;Lucid Imagination Blog&lt;/a&gt;. Solr has found its way into many libraries and quite rightly so. However, one of the main things that Erik talks about in that blog post is the performance of &lt;a href="http://wiki.apache.org/solr/DataImportHandler"&gt;DataImportHandler&lt;/a&gt; vs &lt;a href="http://code.google.com/p/solrmarc/"&gt;SolrMarc&lt;/a&gt; (the indexing library used by both &lt;a href="http://vufind.org/"&gt;VUFind&lt;/a&gt; and &lt;a href="http://projectblacklight.org/"&gt;Blacklight&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Quoting from &lt;a href="http://groups.google.com/group/solrmarc-tech/browse_thread/thread/b9ba2ed86f5da979"&gt;Erik’s email&lt;/a&gt; to the &lt;a href="http://groups.google.com/group/solrmarc-tech"&gt;solrmarc-tech&lt;/a&gt; google group:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;The difference in speeds:&lt;/p&gt;
SolrMarc: 22 docs / s &lt;br/&gt; DIHmarc: 1,745 docs / s&lt;/blockquote&gt;
&lt;p&gt;W00t!&lt;/p&gt;
&lt;p&gt;Well, I don’t know much about SolrMarc but I’ve seen DataImportHandler instances with comparable (or even better) throughput many times. There is something fishy inside SolrMarc for sure and I have a feeling that fixing it would be a low hanging fruit. However, Erik’s opinion is that DataImportHandler is a better way to index and that he will devote a portion of his time to helping the Solr using library community (thanks Erik!).&lt;/p&gt;
&lt;p&gt;DIH has really taken off since it debuted in Solr 1.3 and it would be safe to call it the de-facto standard for indexing data into Solr. It may not be the most elegant way to index data but it is quick and it works great. With the &lt;a href="https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&amp;&amp;pid=12310230&amp;component=12312438&amp;fixfor=12313566&amp;resolution=-1&amp;sorter/field=priority&amp;sorter/order=DESC"&gt;planned features for DataImportHandler in Solr v1.5&lt;/a&gt;, it will continue to improve and it makes sense to base VUFind and Blacklight’s indexing infrastructure on top of it. I’m very excited to see this happening.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/j3Hgyn2mFao" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/j3Hgyn2mFao/327246264</link><guid isPermaLink="false">http://shal.in/post/327246264</guid><pubDate>Mon, 11 Jan 2010 00:01:51 +0530</pubDate><category>Apache Solr</category><feedburner:origLink>http://shal.in/post/327246264</feedburner:origLink></item><item><title>Exploding mobile web usage in India</title><description>&lt;a href="http://www.opera.com/smw/2009/11/"&gt;Exploding mobile web usage in India&lt;/a&gt;: &lt;p&gt;Opera published a study titled &lt;a href="http://www.opera.com/smw/2009/11/"&gt;State of the Mobile Web, November 2009&lt;/a&gt; which I found through &lt;a href="http://www.techcrunch.com/2009/12/22/opera-state-mobile-web-november/"&gt;TechCrunch&lt;/a&gt;. I can’t help but notice the tremendous growth in web usage through mobile phones in India. Page views have grown by 228.5% Y/Y and unique users have grown by 208.4% but if you look at metrics like page views per user or the amount of data transferred per user, you’ll see that they are quite small.&lt;/p&gt;
&lt;p&gt;One of the reason is that in India we don’t have 3G (actually we do but it is limited to only one provider - BSNL) and browsing the Internet is painfully slow on GPRS connections. With 3G finally coming next year, I’m quite sure that the mobile web usage in India will just explode.&lt;/p&gt;
&lt;p&gt;Right now the only kind of mobile applications that can work in India are SMS based. The increase in download speeds will make more people use the Internet and SMS will be less relevant, though that may take more time than a year. If 3G indeed is introduced by mid next year by prominent providers like Airtel and Vodaphone, I won’t be surprised to see Y/Y growth rate exceeding 500%.&lt;/p&gt;
&lt;p&gt;Companies building products/services around SMS should start thinking about a mobile web strategy now.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/k9cmRjAoDY0" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/k9cmRjAoDY0/295048962</link><guid isPermaLink="false">http://shal.in/post/295048962</guid><pubDate>Tue, 22 Dec 2009 19:12:46 +0530</pubDate><feedburner:origLink>http://shal.in/post/295048962</feedburner:origLink></item><item><title>Migrating from Blogger to Tumblr</title><description>&lt;p&gt;I had been thinking about moving away from &lt;a target="_self" href="http://shalinsays.blogspot.com"&gt;Blogger&lt;/a&gt; to my own &lt;a href="http://shal.in"&gt;domain&lt;/a&gt;. Finally, I decided to give in and I was fortunate enough to buy this domain. Blogger has been a simple service but I wanted to try the new kids &lt;a href="http://www.tumblr.com"&gt;Tumblr&lt;/a&gt; or &lt;a href="http://www.posterous.com"&gt;Posterous&lt;/a&gt;. After spending some time fiddling with both of them, I decided to go with Tumblr.&lt;/p&gt;
&lt;p&gt;It took me some time to figure out the right way to move from Blogger. I used the &lt;a target="_self" href="http://terrymhung.com/jtran/tumblr/import-blogger-to-tumblr.php"&gt;import script&lt;/a&gt; written by &lt;a href="http://jonnytran.com/"&gt;Jonathan Tron&lt;/a&gt; to import my old posts. Sadly, there is no way to import comments from blogger. The least I could do was to import them into &lt;a target="_self" href="http://www.disqus.com"&gt;disqus&lt;/a&gt; and link the same disqus account into tumblr; which actually does not help much.&lt;/p&gt;
&lt;p&gt;The bigger issue was to migrate without leaving RSS subscribers in the lurch. The slightly lesser issue was to preserve my earlier blog’s Google page rank.&lt;/p&gt;
&lt;p&gt;The first issue was solved easily. You can do the same with the following sequence of steps.&lt;/p&gt;
&lt;ol&gt;&lt;li&gt; Create a &lt;a target="_self" href="http://www.feedburner.com"&gt;Feedburner&lt;/a&gt; account and import your blogger feed.&lt;/li&gt;
&lt;li&gt;Set your blogger feed to redirect to Feedburner.&lt;/li&gt;
&lt;li&gt;Edit your tumblr theme and replace the RSS link to point to feedburner&lt;/li&gt;
&lt;li&gt;Edit your feedburner to import from tumblr’s rss&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;The second was slightly more tricky. The right way to move a website to a different domain is to use permanent redirects from the old page to the new page. However, Blogger (obviously) does not allow you to do that. Thankfully, Google recently announced &lt;a target="_self" href="http://googlewebmastercentral.blogspot.com/2009/02/specify-your-canonical.html"&gt;support for specifying  canonical links&lt;/a&gt; which can point to the preferred version of a URL. So, I hacked up a script to match pages of blogger’s RSS with tumblr’s RSS and generate conditional blogger template snippets which let me specify the canonical (tumblr) URL for each page on my Blogger account.&lt;/p&gt;
&lt;p&gt;I couldn’t redirect so I had to fall back on meta-refresh to redirect anyone visiting an old page to the new page. I hate to break the back button like this but that was the only possible way in this case. This is what it looked like:&lt;/p&gt;
&lt;p&gt;
&lt;script src="http://gist.github.com/258970.js?file=blogger-canonical.xml"&gt;&lt;/script&gt;&lt;/p&gt;
&lt;p&gt;I used to think that blogging is a solved problem. After doing all this and trying out many service, I don’t believe that anymore.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/3uxyhQvlH-8" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/3uxyhQvlH-8/287833987</link><guid isPermaLink="false">http://shal.in/post/287833987</guid><pubDate>Fri, 18 Dec 2009 01:31:14 +0530</pubDate><feedburner:origLink>http://shal.in/post/287833987</feedburner:origLink></item><item><title>AOL’s new logos, which one do you like?</title><description>&lt;img src="http://28.media.tumblr.com/tumblr_kuscumHmRF1qaxqqlo1_500.jpg"/&gt;&lt;br/&gt; &lt;br/&gt;&lt;img src="http://29.media.tumblr.com/tumblr_kuscumHmRF1qaxqqlo2_500.jpg"/&gt;&lt;br/&gt; &lt;br/&gt;&lt;img src="http://25.media.tumblr.com/tumblr_kuscumHmRF1qaxqqlo3_500.jpg"/&gt;&lt;br/&gt; &lt;br/&gt;&lt;img src="http://27.media.tumblr.com/tumblr_kuscumHmRF1qaxqqlo4_500.jpg"/&gt;&lt;br/&gt; &lt;br/&gt;&lt;img src="http://25.media.tumblr.com/tumblr_kuscumHmRF1qaxqqlo5_500.jpg"/&gt;&lt;br/&gt; &lt;br/&gt;&lt;p&gt;AOL’s new logos, which one do you like?&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/25Q5aOnVOQI" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/25Q5aOnVOQI/287323865</link><guid isPermaLink="false">http://shal.in/post/287323865</guid><pubDate>Thu, 17 Dec 2009 15:06:25 +0530</pubDate><category>AOL</category><feedburner:origLink>http://shal.in/post/287323865</feedburner:origLink></item><item><title>AOL lists on NYSE</title><description>&lt;p&gt;AOL listed on the New York Stock Exchange on 10th December 2009. This has been in the works for a long time and I’m glad we’re finally here. Things are changing around the company and I’m happy to be a part of this change.&lt;/p&gt;
&lt;p&gt;AOL has a new logo (and yes, it is still to be written as AOL). I loved the new brand videos, watch them on youtube - &lt;a&gt;&lt;a href="http://www.youtube.com/watch?v=YlSL7svbooY"&gt;http://www.youtube.com/watch?v=YlSL7svbooY&lt;/a&gt;&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;&lt;a title="Seed.com" target="_blank" href="http://www.seed.com"&gt;Seed.com&lt;/a&gt; was also launched a few days back. It is a new spin on crowd sourcing content which, I believe, is a great idea.&lt;br/&gt;&lt;br/&gt;We’re just getting started!&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/USO_6ey-0HI" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/USO_6ey-0HI/287282974</link><guid isPermaLink="false">http://shal.in/post/287282974</guid><pubDate>Thu, 17 Dec 2009 14:06:00 +0530</pubDate><feedburner:origLink>http://shal.in/post/287282974</feedburner:origLink></item><item><title>Apache Lucene Java 3.0 Released</title><description>&lt;p&gt;&lt;a style="font-size: 24px; line-height: 28.5px;" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://lucene.apache.org/images/lucene_green_300.gif"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 300px; height: 46px; font-size: 24px; line-height: 28.5px;" src="http://lucene.apache.org/images/lucene_green_300.gif" alt="" height="69" width="450" border="0"/&gt;&lt;/a&gt;&lt;br/&gt;Apache Lucene Java 3.0.0 has been released. Lucene Java 3.0.0 is mostly a clean-up release without any new features. It paves the path for refactoring and adding new features without the shackles of backwards compatibility. All APIs deprecated in Lucene 2.9 have been removed and Lucene Java has officially moved to Java 5 as the minimum requirement.&lt;br/&gt;&lt;br/&gt;See the &lt;a href="http://markmail.org/message/ujyx3najm7krbz56"&gt;announcement email&lt;/a&gt; for more details. Congratulations Lucene Devs!&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/9767426-4290516244163129044?l=shalinsays.blogspot.com" alt=""/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/sL_FOm9bgAQ" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/sL_FOm9bgAQ/285909711</link><guid isPermaLink="false">http://shal.in/post/285909711</guid><pubDate>Thu, 26 Nov 2009 05:45:00 +0530</pubDate><category>Apache Lucene</category><feedburner:origLink>http://shal.in/post/285909711</feedburner:origLink></item><item><title>Apache Mahout 0.2 Released</title><description>&lt;p&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="font-family:arial;font-size:100%;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br/&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-size:100%;"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://lucene.apache.org/mahout/images/Mahout-logo-82x100.png"&gt;&lt;img src="http://lucene.apache.org/mahout/images/Mahout-logo-82x100.png" alt="" style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 82px; height: 100px;" border="0"/&gt;&lt;/a&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family:arial;font-size:100%;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br/&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;a href="http://lucene.apache.org/mahout/"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Apache Mahout&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family:arial;font-size:100%;"&gt;&lt;span class="Apple-style-span"&gt; 0.2 has been released. Apache Mahout is a project which attempts to make machine learning both scalable and accessible. It is a sub-project of the excellent &lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;a href="http://lucene.apache.org/"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Apache Lucene&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family:arial;font-size:100%;"&gt;&lt;span class="Apple-style-span"&gt; project which provides open source search software.&lt;br/&gt;&lt;br/&gt;From the project website:&lt;br/&gt;&lt;/span&gt;&lt;/span&gt;&lt;blockquote style="color: rgb(0, 0, 0);"&gt;&lt;span class="Apple-style-span" style=";font-family:Verdana,Helvetica,sans-serif;font-size:100%;"&gt;&lt;p style="line-height: 15px; text-align: left; margin-top: 0.5em; margin-bottom: 1em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;The Apache Lucene project is pleased to announce the release of Apache Mahout 0.2.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="line-height: 15px; text-align: left; margin-top: 0.5em; margin-bottom: 1em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Highlights include:&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;ul style="margin: 0px; padding: 0px 25px;"&gt;&lt;li style="padding: 0px 5px; margin-top: 0.5em; margin-bottom: 0.5em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Significant performance increase (and API changes) in collaborative filtering engine&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="padding: 0px 5px; margin-top: 0.5em; margin-bottom: 0.5em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;K-nearest-neighbor and SVD recommenders&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="padding: 0px 5px; margin-top: 0.5em; margin-bottom: 0.5em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Much code cleanup, bug fixing&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="padding: 0px 5px; margin-top: 0.5em; margin-bottom: 0.5em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Random forests, frequent pattern mining using parallel FP growth&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="padding: 0px 5px; margin-top: 0.5em; margin-bottom: 0.5em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Latent Dirichlet Allocation&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="padding: 0px 5px; margin-top: 0.5em; margin-bottom: 0.5em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Updates for Hadoop 0.20.x&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p style="line-height: 15px; text-align: left; margin-top: 0.5em; margin-bottom: 1em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Details on what’s included can be found in the&lt;/span&gt;&lt;/span&gt;&lt;a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12313278&amp;styleName=Html&amp;projectId=12310751" style="color: rgb(0, 153, 153);"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt; release notes&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="line-height: 15px; text-align: left; margin-top: 0.5em; margin-bottom: 1em;"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Downloads are available from the &lt;/span&gt;&lt;/span&gt;&lt;a href="http://www.apache.org/dyn/closer.cgi/lucene/mahout/" style="color: rgb(0, 0, 255);"&gt;&lt;span class="Apple-style-span" style="font-family:arial;"&gt;&lt;span class="Apple-style-span"&gt;Apache Mirrors&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;&lt;/span&gt;&lt;/blockquote&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/9767426-3600431917631094270?l=shalinsays.blogspot.com" alt=""/&gt;&lt;/div&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/XHFud03za54" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/XHFud03za54/285909705</link><guid isPermaLink="false">http://shal.in/post/285909705</guid><pubDate>Wed, 18 Nov 2009 05:54:00 +0530</pubDate><category>Apache Mahout</category><feedburner:origLink>http://shal.in/post/285909705</feedburner:origLink></item><item><title>Apache Solr 1.4 Released</title><description>&lt;p&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://lucene.apache.org/solr/images/solr.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 283px; height: 156px;" src="http://lucene.apache.org/solr/images/solr.jpg" alt="" border="0"/&gt;&lt;/a&gt;&lt;br/&gt;From the official announcement:&lt;br/&gt;&lt;br/&gt;Apache Solr 1.4 has been released and is now available for public download!&lt;br/&gt;&lt;a href="http://www.apache.org/dyn/closer.cgi/lucene/solr/" target="_blank"&gt;&lt;a href="http://www.apache.org/dyn/"&gt;http://www.apache.org/dyn/&lt;/a&gt;&lt;wbr&gt;closer.cgi/lucene/solr/&lt;/wbr&gt;&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of&lt;br/&gt;many of the world’s largest internet sites.&lt;br/&gt;&lt;br/&gt;Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat.  Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr’s powerful external configuration allows it to be tailored to almost any type of application  without Java coding, and it has an extensive plugin architecture when more  advanced customization is required.&lt;br/&gt;&lt;br/&gt;New Solr 1.4 features include&lt;br/&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Major performance enhancements in indexing, searching, and faceting&lt;/li&gt;&lt;li&gt;Revamped all-Java index replication that’s simple to configure and can replicate configuration files&lt;/li&gt;&lt;li&gt;Greatly improved database integration via the DataImportHandler&lt;/li&gt;&lt;li&gt;Rich document processing (Word, PDF, HTML) via Apache Tika&lt;/li&gt;&lt;li&gt;Dynamic search results clustering via Carrot2&lt;/li&gt;&lt;li&gt;Multi-select faceting (support for multiple items in a single category to be selected)&lt;/li&gt;&lt;li&gt;Many powerful query enhancements, including ranges over arbitrary functions, and nested queries of different syntaxes&lt;/li&gt;&lt;li&gt;Many other plugins including Terms for auto-suggest, Statistics, TermVectors, Deduplication&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:130%;"&gt;Performance Enhancements&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://www.lucidimagination.com/blog/2009/09/22/contrived-fieldcache-load-test-lucene-2-4-vs-lucene-2-9/"&gt;A simple FieldCache load test&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://yonik.wordpress.com/2009/05/27/filtered-query-performance-increases-for-solr-1-4/"&gt;Filtered query performance increases&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://yonik.wordpress.com/2008/12/01/solr-scalability-improvements/"&gt;Solr scalability improvements&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://yonik.wordpress.com/2008/11/25/solr-faceted-search-performance-improvements/"&gt;Solr faceted search performance improvements&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://shalinsays.blogspot.com/2009/04/inside-solr-improvements-in-faceted.html"&gt;Improvements in Solr Faceting Search&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Revamped All-Java Replication&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/SolrReplication"&gt;SolrReplication wiki page&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Works on Microsoft Windows Platforms too!&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;DataImportHandler improvements&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://shalinsays.blogspot.com/2009/09/whats-new-in-dataimporthandler-in-solr.html"&gt;What’s new in DataImportHandler in Solr&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/DataImportHandler"&gt;DataImportHandler wiki page&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Rich document processing&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/ExtractingRequestHandler"&gt;ExtractingRequestHandler Wiki page&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.lucidimagination.com/blog/2009/09/14/posting-rich-documents-to-apache-solr-using-solrj-and-solr-cell-apache-tika/"&gt;Posting Rich Documents to Apache Solr using SolrJ and Solr Cell&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Dynamic Search Results Clustering&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/ClusteringComponent"&gt;ClusteringComponent Wiki page&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.lucidimagination.com/blog/2009/09/28/solrs-new-clustering-capabilities/"&gt;Solr’s new Clustering Capabilities&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Multi-select Faceting&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/SimpleFacetParameters#LocalParams_for_faceting"&gt;Local params for faceting&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://shalinsays.blogspot.com/2009/04/tagging-and-excluding-filters.html"&gt;Tagging and excluding filters&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Query Enhancements&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://yonik.wordpress.com/2009/07/06/ranges-over-functions-in-solr-1-4/"&gt;Ranges over functions&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Nested query support for any type of query parser (via QParserPlugin). Quotes will often be necessary to encapsulate the nested query if it contains reserved characters. &lt;span class="anchor" id="line-41"&gt;&lt;/span&gt;Example: &lt;tt&gt;_query_:”{!dismax qf=myfield}how now brown cow”&lt;br/&gt;&lt;/tt&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;New Plugins&lt;/span&gt;&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/TermsComponent"&gt;TermsComponent&lt;/a&gt; (can be used for auto-suggest)&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/TermVectorComponent"&gt;TermVectorComponent&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/StatsComponent"&gt;Statistics&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/Deduplication"&gt;Deduplication&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;SolrJ - Java client&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;Faster, more efficient Binary Update format&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/Solrj#Directly_adding_POJOs_to_Solr"&gt;Javabean (POJO) binding support&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Fast multi-threaded updates through &lt;a href="http://lucene.apache.org/solr/api/solrj/org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer.html"&gt;StreamingUpdateSolrServer&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Simple round-robin load balancing client - &lt;a href="http://lucene.apache.org/solr/api/solrj/org/apache/solr/client/solrj/impl/LBHttpSolrServer.html"&gt;LBHttpSolrServer&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update"&gt;Stream documents through an Iterator API&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Many performance optimizations&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Miscellaneous&lt;/span&gt;&lt;br/&gt;&lt;ol&gt;&lt;li&gt;Rollback command in UpdateHandler&lt;/li&gt;&lt;li&gt;More configurable logging through the use of SLF4J library&lt;/li&gt;&lt;li&gt;‘commitWithin’ parameter on add document command allows setting a per-request auto-commit time limit.&lt;/li&gt;&lt;li&gt;TokenFilter factories for Arabic language&lt;/li&gt;&lt;li&gt;Improved Thai language tokenization (&lt;a href="https://issues.apache.org/jira/browse/SOLR-1078"&gt;SOLR-1078&lt;/a&gt;)&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.apache.org/solr/MergingSolrIndexes"&gt;Merge multiple indexes&lt;/a&gt;&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Expunge Deletes command&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-size:130%;"&gt;Upgrade instructions&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;Although Solr 1.4 is backwards-compatible with previous releases, users are encouraged to read the upgrading notes in the &lt;a href="http://svn.apache.org/repos/asf/lucene/solr/tags/release-1.4.0/CHANGES.txt"&gt;Solr Change Log&lt;/a&gt;.&lt;br/&gt;&lt;br/&gt;There are so many more new features, optimizations, bug fixes and refactorings that it is not possible to cover them all in a single blog post.&lt;br/&gt;&lt;br/&gt;A large amount of effort has gone into this release. Many congratulations to the entire Solr community for making this happen!&lt;br/&gt;&lt;br/&gt;Great things are planned for the next release and it is a great time to get involved. See &lt;a href="http://wiki.apache.org/solr/HowToContribute"&gt;&lt;a href="http://wiki.apache.org/solr/HowToContribute"&gt;http://wiki.apache.org/solr/HowToContribute&lt;/a&gt;&lt;/a&gt; for how to get started.&lt;br/&gt;&lt;br/&gt;Enjoy Solr 1.4 and let us know on the &lt;a href="http://lucene.apache.org/solr/mailing_lists.html"&gt;mailing lists&lt;/a&gt; if you have any questions!&lt;div class="blogger-post-footer"&gt;&lt;img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/9767426-7399170086924132370?l=shalinsays.blogspot.com" alt=""/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/FZfIe3DvBxw" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/FZfIe3DvBxw/285909701</link><guid isPermaLink="false">http://shal.in/post/285909701</guid><pubDate>Tue, 10 Nov 2009 07:15:00 +0530</pubDate><category>Apache Solr</category><feedburner:origLink>http://shal.in/post/285909701</feedburner:origLink></item><item><title>Why you should contribute to Open Source</title><description>&lt;p&gt;&lt;span style="font-size:78%;"&gt;Note: The following material and presentation was prepared for students of the Indian Institute of Information Technology (IIIT), Allahabad. The aim was to get them excited about contributing to open source projects and in particular about Apache Lucene, Solr and Hadoop. The first talk was titled “&lt;a href="http://www.slideshare.net/shalinmangar/why-you-should-contribute-to-open-source"&gt;Why you should contribute to Open Source&lt;/a&gt;” and was aimed at freshmen and has no technical content. The second was titled “&lt;a href="http://www.slideshare.net/shalinmangar/get-involved-with-the-apache-software-foundation"&gt;Get involved with the Apache Software Foundation&lt;/a&gt;” and was given to sophomore, junior and senior students and it goes into some basic technical information on Apache Lucene, Solr and Hadoop projects. The following post comprises of some notes that I put together for the talks.&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;&lt;span style="font-size:130%;"&gt;Work on what you like, when you like&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;Everybody wants to work on “cool” products. However, the reality is that most of you will get stuck in a job which although may pay well, it will hardly be about the things you wanted to work on. In your course, you will learn about algorithms, distributed systems, natural language processing, information retrieval, bio-informatics and other areas of computer science and its applications but in real life, the majority of the work done in software companies will have little direct application of things you will learn in your course.&lt;br/&gt;&lt;br/&gt;Most of the times you will be using things built by others and writing glue code to build things needed by your company’s business. This is not to say that all that knowledge will go waste; it will definitely help you become a better programmer and you should learn it but there’s a fair chance that it may not be used directly in your job.&lt;br/&gt;&lt;br/&gt;Open Source projects offer you a chance to work on something that you want rather than something that others want you to work on. It is a great opportunity to work on something that is both cool and useful as well as to associate with a well known brand and all the publicity and goodwill it brings. You are free to pick and choose between the thousands of open source projects out there. Moreover, you are free to decide on how much you want to contribute. You won’t have a boss and you won’t have the pressure of deadlines and schedules.&lt;br/&gt;&lt;br/&gt;&lt;span style="font-size:130%;"&gt;Development in the “real” world&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;Academic projects are insufficient to impart many of the skills that you’d need once you start developing software full-time. Many of these skills are “social” rather than technical in nature but are at least as important.&lt;br/&gt;&lt;br/&gt;Most academic projects are “toy” projects. By that, I mean that their whole life cycle revolves around you. You are the designer, developer, tester and also the user. As a result, there are few key things missing in those projects.&lt;br/&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;No build system - Makefiles? Ant? Maven? Very few students are familiar with using them. Don’t even ask about creating a build from scratch. “Hey! Just open those files in a text editor or an IDE and hack away” is not an unusual thing to be heard&lt;/li&gt;&lt;li&gt;No source control - CVS? SVN? Git? A single person writing all the code or &gt;80% of the code is very common&lt;/li&gt;&lt;li&gt;No bug tracker - “It is never going to be used after we demo it to the professors”&lt;/li&gt;&lt;li&gt;No user documentation - maybe you will write a research paper detailing your findings but there is little or no documentation written for “other” people&lt;/li&gt;&lt;li&gt;No mailing lists or forums for support - Nobody but you is going to use it&lt;/li&gt;&lt;/ul&gt;Moreover, under these circumstances, you never learn how to:&lt;br/&gt;&lt;ul&gt;&lt;li&gt;Discuss technical design or issues in writing&lt;/li&gt;&lt;li&gt;Resolve conflicts in matters of design, architecture and a project’s road map.&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Build usable interfaces (whether command line options or a GUI or an API)&lt;/li&gt;&lt;li&gt;Write proper error handling and logging code&lt;/li&gt;&lt;li&gt;Identify hooks for monitoring systems in production&lt;/li&gt;&lt;li&gt;Think about backup and recovery&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Identify components which can be extended or replaced to add or modify functionality of the system&lt;/li&gt;&lt;/ul&gt;Open source projects are the real deal. If you are involved for long enough, you will either see or be a part of many such discussions and conflicts. All of the above skills are things you will need when you get around to software development in the real world.&lt;br/&gt;&lt;br/&gt;&lt;span style="font-size:130%;"&gt;Learn from the best&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;How many great developers do you know about? How many of them work or have worked on an open source project? I bet there are many names common to both the lists.&lt;br/&gt;&lt;br/&gt;Open Source development will help you observe how experienced developers work and their various ways of designing, coding and discussing solutions. You will learn new ideas and new ways of solving problems. The second and probably more important part is that many smart programmers will be looking over your code and will provide review comments which will help you improve yourself. You will learn more efficient or shorter (or both) ways to solve the same problem. That kind of feedback is invaluable to a budding programmer.&lt;br/&gt;&lt;br/&gt;I know that I’ve learned a great deal since I got involved in &lt;a href="http://lucene.apache.org/solr"&gt;Apache Solr&lt;/a&gt;.&lt;br/&gt;&lt;br/&gt;&lt;span style="font-size:130%;"&gt;Build a publicly verifiable resume&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;What you tell in your resume are things like contact information, performance in academia, programming languages you know, projects you’ve worked on and other such stuff. There is very little in this document which can be verified easily. This is a problem for you as well as for the prospective employer because:&lt;br/&gt;&lt;ol&gt;&lt;li&gt;It may not represent you, your skills and your hard work sufficiently enough&lt;/li&gt;&lt;li&gt;It makes hiring a game of chance for the prospective employer and prevents them from making more informed decisions&lt;br/&gt;&lt;/li&gt;&lt;/ol&gt;The best thing about contributing to an open source project is that everything you do is public. So you can say things like the following:&lt;br/&gt;&lt;ul&gt;&lt;li&gt;I have worked on this project for the last two years&lt;br/&gt;&lt;/li&gt;&lt;li&gt;I wrote features X, Y and Z on Project P&lt;br/&gt;&lt;/li&gt;&lt;li&gt;I have over two hundred posts on the user forum or mailing list&lt;/li&gt;&lt;li&gt;I have commit access to the project&lt;/li&gt;&lt;li&gt;I am the expert because “I wrote it”&lt;/li&gt;&lt;/ul&gt;And your prospective employer can search and verify such things easily. Congratulations, you have just landed on top of the stack of resumes!&lt;br/&gt;&lt;br/&gt;&lt;span style="font-size:130%;"&gt;Companies will find you&lt;/span&gt;&lt;br/&gt;&lt;br/&gt;When a company evaluates that an open source Project X can save them a lot of money, it is likely that they will hire a few people who have experience on project X and can support its use internally. Many such companies also allow their developers to work on the project either part-time or full-time. And who else is more qualified to work on the project but you - an existing contributor!&lt;br/&gt;&lt;br/&gt;More and more companies are starting up around providing training, consulting and support for open source projects. Many such companies exclusively hire existing contributors.&lt;br/&gt;&lt;br/&gt;Even if an open source project is not used directly inside the company, many tech companies hire open source contributors because:&lt;br/&gt;&lt;ol&gt;&lt;li&gt;Hiring popular open source developers makes them more cool in the eyes of other developers&lt;/li&gt;&lt;li&gt;Developers who contribute to open source projects are good programmers&lt;/li&gt;&lt;/ol&gt;&lt;br/&gt;I’m sure there are many more reasons other than the ones I’ve given here. In the end, contributing to an open source project is a good investment of your time and it may well be your big ticket to finding that great job. Good Luck!&lt;div class="blogger-post-footer"&gt;&lt;img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/9767426-6174317903363690680?l=shalinsays.blogspot.com" alt=""/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ShalinSays/~4/g8Jou6W7_oA" height="1" width="1"/&gt;</description><link>http://feedproxy.google.com/~r/ShalinSays/~3/g8Jou6W7_oA/285909694</link><guid isPermaLink="false">http://shal.in/post/285909694</guid><pubDate>Mon, 26 Oct 2009 15:12:00 +0530</pubDate><feedburner:origLink>http://shal.in/post/285909694</feedburner:origLink></item></channel></rss>

