<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Federated Search</title>
	
	<link>http://federatedsearchblog.com</link>
	<description>Covers topics related to federated search and the deep web</description>
	<pubDate>Tue, 31 Aug 2010 14:43:38 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/Federatedsearchblogcom" /><feedburner:info uri="federatedsearchblogcom" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:emailServiceId>Federatedsearchblogcom</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>Avi Rappoport on federated vs. aggregated search architectures</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/wdf2oBPukrs/</link>
		<comments>http://federatedsearchblog.com/2010/08/31/avi-rappoport-on-federated-vs-aggregated-search-architectures/#comments</comments>
		<pubDate>Tue, 31 Aug 2010 14:40:15 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1647</guid>
		<description><![CDATA[In May, search consultant Avi Rappoport delivered a presentation at the Enterprise Search Summit: Federated vs. Aggregated Search Architectures.

Avi Rappoport is an enterprise search consultant, helping companies improve search engine functionality for websites and intranets. She has a degree from UC Berkeley’s (then) School of Library and Information Science and spent 10 years in software [...]]]></description>
			<content:encoded><![CDATA[<p><img align="right" src="http://federatedsearchblog.com/images/avi.jpg">In May, search consultant <a href="http://searchtools.com/about/bio.html">Avi Rappoport</a> delivered a presentation at the Enterprise Search Summit: Federated vs. Aggregated Search Architectures.</p>
<blockquote><p>
Avi Rappoport is an enterprise search consultant, helping companies improve search engine functionality for websites and intranets. She has a degree from UC Berkeley’s (then) School of Library and Information Science and spent 10 years in software development before becoming a search consultant. She is the editor of <a href="http://searchtools.com">SearchTools.com</a> and a frequent speaker and author, providing a strong focus on search usability in the broadest sense and sharing her conviction that search engines can always be better.
</p></blockquote>
<p>Avi created <a href="http://searchtools.com/slides/ess10/index.html">a web page</a> with a summary of and links to a couple of versions of her presentation.</p>
<p>I greatly appreciate Avi&#8217;s consideration of the pluses and minuses of federation vs. aggregation (i.e., discovery services) in a world that is often polarized about one approach being better in all cases.</p>
<blockquote><p>
My research for this presentation indicated that each is useful in specific circumstances (I know, no surprise there). Many data sources are obviously best accessed by one or the other, but it&#8217;s the corner cases that are tricky. Aspects to consider include:</p>
<ul>
<li>size of the content in the source
<li>how often your users need that content
<li>content change rate
<li>importance of real-time access control permissions changes
<li>content licensing rules
<li>available tools for indexing / querying
<li>difficulty of extracting and indexing
<li>quality of the internal search engine
<li>difficulty of sending queries and receiving results
</ul>
</blockquote>
<p>The final slide has some sage advice:</p>
<blockquote><p>
Be open-minded, analyze the benefits of each approach for each data source.
</p></blockquote>
<p>One size does NOT fit all.</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/08/31/avi-rappoport-on-federated-vs-aggregated-search-architectures/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/08/31/avi-rappoport-on-federated-vs-aggregated-search-architectures/</feedburner:origLink></item>
		<item>
		<title>Sharing Data Leads to Progress</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/leqvcxibUiw/</link>
		<comments>http://federatedsearchblog.com/2010/08/26/sharing-data-leads-to-progress/#comments</comments>
		<pubDate>Thu, 26 Aug 2010 13:55:58 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1636</guid>
		<description><![CDATA[[ Editor's Note: This is a very touching article by Nena Moss first published in the OSTI Blog. My dad suffered with Alzheimer's for a number of years before he died so I can relate to Nena's experience. Disclaimer: I have been paid to support OSTI in a number of capacities for the past eight [...]]]></description>
			<content:encoded><![CDATA[<p>[ <em>Editor's Note: This is a very touching <a href="http://www.osti.gov/ostiblog/sharing-data-leads-progress">article by Nena Moss</a> first published in the <a href="http://www.osti.gov/ostiblog">OSTI Blog</a>. My dad suffered with Alzheimer's for a number of years before he died so I can relate to Nena's experience. Disclaimer: I have been paid to support <a href="http://www.osti.gov">OSTI</a> in a number of capacities for the past eight years.</em> ]</p>
<p><img align="right" src="http://www.osti.gov/ostiblog/sites/www.osti.gov.ostiblog/files/pictures/picture-22.jpg">My mother died in March 2010 after a 15-year battle with Alzheimer’s, so I pay particular attention to news about this dreadful disease. A recent New York Times article caught my eye: “<a href="http://www.nytimes.com/2010/08/13/health/research/13alzheimer.html?_r=1">Sharing of Data Leads to Progress on Alzheimer&#8217;s</a>.”</p>
<p>How did sharing data lead to progress on Alzheimer’s?  A collaborative effort, the <a href="http://www.adni-info.org/">Alzheimer’s Disease Neuroimaging Initiative</a>, was formed to find the biological markers that show the progression of Alzheimer’s disease in the human brain. The key was to share all the data, making every finding public immediately – “available to anyone with a computer anywhere in the world.”</p>
<p><span id="more-1636"></span>Alzheimer’s research is an enormous task with limited returns. Dr. Michael W. Weiner of the San Francisco Department of Veterans Affairs said, “Different people using different methods on different subjects in different places were getting different results, which is not surprising. What was needed was to get everyone together and to get a common data set.” Numerous entities were willing to shoulder the burden and work together on the project, sharing their information for the good of all.</p>
<p>According to Dr. John Q. Trojanowski, an Alzheimer’s researcher at the University of Pennsylvania, “It’s not science the way most of us have practiced it in our careers. But we all realized that we would never get biomarkers unless all of us parked our egos and intellectual-property noses outside the door and agreed that all of our data would be public immediately.” The National Institutes of Health served as an “honest broker, between the pharmaceutical industry and academia.”</p>
<p> The effort has produced “a wealth of recent scientific papers on the early diagnosis of Alzheimer’s using methods like PET scans and tests of spinal fluid. More than 100 studies are under way to test drugs that might slow or stop the disease.” The collaboration has become a “model for similar efforts against Parkinson’s disease.”</p>
<p>This model matches OSTI’s mission. We share scientific data in an effort to encourage progress. In a <a href="http://www.juggle.com/ostiblog-top-government-blog">recent interview with Juggle.com</a>, Dr. Walter Warnick, OSTI Director, noted “It is common knowledge that science can advance only if it is shared. The OSTI Corollary to this is:  Accelerating the sharing of scientific knowledge accelerates the advancement of science.”</p>
<p>That’s the idea behind ScienceAccelerator.gov – developed to advance discovery and to deliver science information. You can find information about current Alzheimer’s research by searching Alzheimer’s 2010 at <a href="http://www.scienceaccelerator.gov/">ScienceAccelerator.gov</a>.</p>
<p>Share the data!</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/08/26/sharing-data-leads-to-progress/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/08/26/sharing-data-leads-to-progress/</feedburner:origLink></item>
		<item>
		<title>On federated fetching</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/qB9BewsNU-Q/</link>
		<comments>http://federatedsearchblog.com/2010/08/21/on-federated-fetching/#comments</comments>
		<pubDate>Sat, 21 Aug 2010 14:08:29 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[viewpoints]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1628</guid>
		<description><![CDATA[&#8220;Federated fetching&#8221; is a new term to me. I discovered it at Srinivas Reddy&#8217;s Weblog, referencing the O&#8217;Reilly book, ]]></description>
			<content:encoded><![CDATA[<p>&#8220;Federated fetching&#8221; is a new term to me. I discovered it at <a href="http://srinivasreddy.wordpress.com/2010/08/21/emerging-information-architectures/">Srinivas Reddy&#8217;s Weblog</a>, referencing the O&#8217;Reilly book, <a href="http://oreilly.com/catalog/9780596157128"><em>Beautiful Data</em></a>:</p>
<blockquote><p>
When we deal with web scale data &#8216;discoverability&#8217; of information is key. While &#8216;web search&#8217; provides a lot of value today what we really need is to enable &#8216;data find data&#8217;.  I like the differentiation in the book between &#8216;federated search&#8217; and &#8216;federated fetch&#8217;. The latter needs adaptive systems that can discover new data correlations based on user context and new data collected.
</p></blockquote>
<p>This reference got me curious. Was the Web buzzing with discussion of federated search vs. federated fetch? Not exactly. According to Google, there are <a href="http://www.google.com/search?client=safari&#038;rls=en-us&#038;q=%22federated+fetch%22&#038;ie=UTF-8&#038;oe=UTF-8">740 references</a> to the phrase, but only 24 of them are considered unique enough for Google to display. Interestingly enough, the first reference is to Jeff Jonas&#8217;s &#8220;<a href="http://federatedsearchblog.com/2010/07/26/huh/">When Federated Search Bites</a>&#8221; article, which I wrote about a month ago.</p>
<blockquote><p>
Once a directory reveals a pointer, you can go fetch it.  <em>Federated fetch</em> does scale.
</p></blockquote>
<p><a href="http://books.google.com/books?id=zxNglqU1FKgC&#038;lpg=PA113&#038;ots=DBHSL8humE&#038;dq=%22federated%20fetch%22&#038;pg=PA113#v=onepage&#038;q=%22federated%20fetch%22&#038;f=false">Google Books</a> provides the term in the context of the <em>Beautiful Data</em> book:</p>
<p><img src="http://federatedsearchblog.com/images/Federated Fetch.png"></p>
<p>So, <em>federated fetch</em> is the &#8220;end game,&#8221; if I understand the concept correctly. It&#8217;s what you get when, for example, a link resolver gets you to the full text copy of a book you can actually read.</p>
<p>There you have it, a new phrase I learned today.</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/08/21/on-federated-fetching/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/08/21/on-federated-fetching/</feedburner:origLink></item>
		<item>
		<title>Review: Search Patterns</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/iWWuQiro970/</link>
		<comments>http://federatedsearchblog.com/2010/08/09/review-search-patterns/#comments</comments>
		<pubDate>Mon, 09 Aug 2010 19:55:13 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1608</guid>
		<description><![CDATA[
Early this year O&#8217;Reilly published Search Patterns, by Peter Morville and Jeffery Callender. This is Morville&#8217;s fourth information/search-related book. Search Patterns addresses the intersection of user interface and search. 
Search Patterns is an absolutely outstanding book. I don&#8217;t get excited about search-related books very often but this one totally captivated me. O&#8217;Reilly sent me a [...]]]></description>
			<content:encoded><![CDATA[<p><center><img src="https://images-na.ssl-images-amazon.com/images/I/41EMBSY8w7L._SL110_.jpg"><img src="https://images-na.ssl-images-amazon.com/images/I/510Aj%2BI-VNL._SL110_.jpg"><img src="https://images-na.ssl-images-amazon.com/images/I/51JY05HVVBL._SL110_.jpg"><img src="https://images-na.ssl-images-amazon.com/images/I/413ByHBnVHL._SL110_.jpg"></center><br />
Early this year O&#8217;Reilly published <a href="http://searchpatterns.org/">Search Patterns</a>, by Peter Morville and Jeffery Callender. This is Morville&#8217;s fourth information/search-related book. <em>Search Patterns</em> addresses the intersection of user interface and search. </p>
<p><em>Search Patterns</em> is an absolutely outstanding book. I don&#8217;t get excited about search-related books very often but this one totally captivated me. O&#8217;Reilly sent me a review copy some months ago. It sat in a pile until I started seeing reviews and references to the book on the Web. The press prompted me to open the book.</p>
<p>The first thing I noticed in flipping through the book was the many high-quality color screen shots and illustrations. Plus, <em>Search Patterns</em> is printed on glossy paper to enhance the visual elements of the book.</p>
<p>At 173 pages (plus index) and with a nice balance of text and images, <em>Search Patterns</em> is, on the surface, a quick read. But there are numerous gems throughout the book, so allow yourself plenty of time to read (and reread) the sections that draw you in.</p>
<p><span id="more-1608"></span>I have to admit that I had a very difficult time planning for this review. That&#8217;s because I&#8217;m used to reading books that are very logical, very left-brained. This book speaks frequently in metaphor, and connections between sections are not always clear. But, paradoxically, the writing and the logic are absolutely brilliant. As I hinted in the previous paragraph, I recommend you pick the sections of the book you find interesting and read them as many times as you need to (along with the related illustrations) to absorb the material.</p>
<p><em>Search Patterns</em> has a preface and six chapters. </p>
<ul>
<li><strong>Chapter 1: Pattern Recognition.</strong> Provides an introduction to the search problem.
<li><strong>Chapter 2: The Anatomy of Search.</strong> Dissects search into its components: users, interface, engine, content, creators, and context.
<li><strong>Chapter 3: Behavior.</strong> A consideration of how users interact with search and how that should influence search engine design.
<li><strong>Chapter 4: Design Patterns.</strong> A catalog of different mechanisms that support search - autocomplete, best first, federated search, faceted navigation, advanced search, personalization, pagination, and others.
<li><strong>Chapter 5: Engines of Discovery.</strong> How content is organized.
<li><strong>Chapter 6: Tangible Futures.</strong> Musings about the future.
</ul>
<p>My favorite things about the book are its many gems and its many examples of search done right and of search done poorly.</p>
<p>Chapter 1 has a great set of gems. Titled <em><a href="http://tm.durusau.net/?p=602">A Mapmaker&#8217;s Manifesto</a></em>, this list of twenty items enumerates the authors&#8217; beliefs and principles.</p>
<p>Zachary Spencer blogged some notes on <a href="http://www.zacharyspencer.com/2010/07/peter-morville-on-search-and-information-architecture/">a talk Morville recently gave</a>. There are lots of gems in those notes.</p>
<p>Here&#8217;s a great <a href="http://johnnyholland.org/2010/03/29/search-patterns-an-interview-with-peter-morville/">interview with Peter Morville</a> at JohnnyHolland.org.</p>
<p>This advance praise quote by Dave Gray (founder and Chairman, XPLANE) sums up the essence of the book particularly well:</p>
<blockquote><p>
&#8220;It&#8217;s not often I come across a book that asks profound questions about a fundamental human activity, and then proceeds to answer those questions with practical observations and suggestions. <em>Search Patterns</em> is an expedition into the heart of the Web and human cognition, and for me it was a delightful journey that delivered scores of insights.&#8221;
</p></blockquote>
<p>I highly recommend <em>Search Patterns</em> to anyone who is designing a search engine, consulting on its development, or just wanting to understand why a particular search application is easy or hard to use. The book clarified so many issues I&#8217;ve seen (and continue to see) with search applications, federated or otherwise.</p>
<p>Finally, here is a YouTube video with some of Morville&#8217;s thoughts on federated search, Google search, and the conversational search experience:</p>
<div id="vvq4c8728b6ea448" class="vvqbox vvqyoutube" style="width:425px;height:355px;">
<p><a href="http://www.youtube.com/watch?v=oDJRn4-vI4c">http://www.youtube.com/watch?v=oDJRn4-vI4c</a></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/08/09/review-search-patterns/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/08/09/review-search-patterns/</feedburner:origLink></item>
		<item>
		<title>Most “adorable” federated search?</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/DqMGN8BWbNk/</link>
		<comments>http://federatedsearchblog.com/2010/07/30/most-adorable-federated-search/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 13:32:57 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[fun]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1567</guid>
		<description><![CDATA[
The U.S. Census Bureau has a federated search tool in development, DataFerrett. 

The (Beta)DataFerrett helps you locate and retrieve the data you need across the Internet to your desktop or system, regardless of where the data resides.
DataFerrett is a unique data mining and extraction tool. (Beta)DataFerrett allows you to select a databasket full of [...]]]></description>
			<content:encoded><![CDATA[<p><center><img src="http://dataferrett.census.gov/TheDataWeb/images/type.gif"></center><br />
The U.S. Census Bureau has a <a href="http://dataferrett.census.gov/TheDataWeb/index.html">federated search tool</a> in development, DataFerrett.</p>
<blockquote><p>
The (Beta)DataFerrett helps you locate and retrieve the data you need across the Internet to your desktop or system, regardless of where the data resides.</p>
<p>DataFerrett is a unique data mining and extraction tool. (Beta)DataFerrett allows you to select a databasket full of variables and then recode those variables as you need. You can then develop and customize tables. Selecting your results in your table you can create a chart or graph for a visual presentation into an html page. Save your data in the databasket and save your table for continued reuse.
</p></blockquote>
<p>I have no idea how useful the tool is but their mascot sure is cute!</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/07/30/most-adorable-federated-search/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/07/30/most-adorable-federated-search/</feedburner:origLink></item>
		<item>
		<title>Huh?</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/dxOSakQG8FA/</link>
		<comments>http://federatedsearchblog.com/2010/07/26/huh/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 11:09:23 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[viewpoints]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1571</guid>
		<description><![CDATA[Jeff Jonas recently published an article, &#8220;When Federated Search Bites.&#8221; If this article is meant to be link bait, I&#8217;m not biting. You can get a link from Google.
I certainly don&#8217;t know everything about federated search but I know enough to recognize what&#8217;s not federated search, at least not what most of us think to [...]]]></description>
			<content:encoded><![CDATA[<p>Jeff Jonas recently published an article, &#8220;When Federated Search Bites.&#8221; If this article is meant to be link bait, I&#8217;m not biting. You can get a link <a href="http://www.google.com/search?client=gmail&#038;rls=gm&#038;q=when%20federated%20search%20bites">from Google</a>.</p>
<p>I certainly don&#8217;t know everything about federated search but I know enough to recognize what&#8217;s not federated search, at least not what most of us think to be federated search. </p>
<p>The article, really a rant, starts off reasonably enough:</p>
<blockquote><p>
Federated search: conducting a search against &#8220;n&#8221; source systems via a broadcast mechanism without the benefit or guidance of an index.</p>
<p>I am speaking specifically about environments where the systems in the federation are heterogeneous, are physically dispersed, were not engineered for federation a priori, and are not managed by a common command and control system.
</p></blockquote>
<p>Here&#8217;s another reasonable statement:</p>
<blockquote><p>
Most organizations have some obligation to make sense of what they know.  For example, the airline should know if the person added to the watch list is already an employee or already has a flight reservation.  Ideally, the moment such facts become knowable, someone or some system should be notified.  Think of this as &#8220;the data speaks to itself.&#8221;  I call this data finds data.
</p></blockquote>
<p>Yes, having new data trigger analysis is a good idea. But, IT&#8217;S NOT FEDERATED SEARCH.</p>
<p>So, the entire basis of the rant is: &#8220;Federated search is not this advanced analysis system I want, therefore it sucks.&#8221; That&#8217;s like saying that my oven doesn&#8217;t analyze the food I put in it and automatically cook it perfectly, therefore my oven &#8220;bites.&#8221;</p>
<p>There may be a discussion about the challenges of analyzing federated data vs. indexed data but that has nothing to do with what federated search does.</p>
<p>What do you think? Does the article make sense to you?</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/07/26/huh/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/07/26/huh/</feedburner:origLink></item>
		<item>
		<title>History of the federal agency that pioneered federated search in the government</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/zJvZ2OU3VKA/</link>
		<comments>http://federatedsearchblog.com/2010/07/23/history-of-the-federal-agency-that-pioneered-federated-search-in-the-government/#comments</comments>
		<pubDate>Fri, 23 Jul 2010 10:29:39 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[viewpoints]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1589</guid>
		<description><![CDATA[&#8220;Federated search as a transformational technology enabling knowledge discovery: the role of WorldWideScience.org&#8221; is by far the best historical paper I&#8217;ve read about DOE&#8217;s Office of Scientific and Technical Information (OSTI), and I consult for the agency.  
OSTI has created a number of search portals (WorldWideScience.org, Science.gov, DOE ScienceAccelerator, DOE Energy Citations Database, and [...]]]></description>
			<content:encoded><![CDATA[<p>&#8220;<a href="http://www.osti.gov/ILDS_38_2Warnick2010.pdf">Federated search as a transformational technology enabling knowledge discovery: the role of WorldWideScience.org</a>&#8221; is by far the best historical paper I&#8217;ve read about DOE&#8217;s Office of Scientific and Technical Information (<a href="http://www.osti.gov">OSTI</a>), and I consult for the agency.  </p>
<p>OSTI has created a number of search portals (<a href="http://worldwidescience.org">WorldWideScience.org</a>, <a href="http://science.gov">Science.gov</a>, <a href="http://scienceaccelerator.gov">DOE ScienceAccelerator</a>, <a href="http://www.osti.gov/energycitations/">DOE Energy Citations Database</a>, and <a href="http://www.osti.gov/bridge/">DOE Information Bridge</a>, to name a few), but few know about the history of the agency that created them.</p>
<blockquote><p>
OSTI grew out of the post-World War II initiative to make the scientific research of the Manhattan Project as freely available to the public as possible. On November 17, 1944, President Roosevelt wrote Vannevar Bush, then the Director of the Office of Scientific Research and Development, to request his counsel on how to capitalize on the experience of the United States&#8217; R&#038;D war efforts &#8212; most of which was done in utter secrecy &#8212; in the days of peace to come.
</p></blockquote>
<p>OSTI Director <a href="http://www.osti.gov/bios/warnick.html">Dr. Walter Warnick</a> tells the story of the development of OSTI, its role in advancing science, and how federated search serves that role in ways that Google can&#8217;t. </p>
<p>The paper, at 23 pages, covers the subject with a good deal of depth. </p>
<p><span id="more-1589"></span>More than 60 years ago OSTI began advancing science.</p>
<blockquote><p>
Long before the Internet came along, OSTI advanced science by making research information widely available. OSTI annually responded to upwards of 50,000 requests for information and during the 1977 &#8220;energy crisis&#8221; fielded more than 150,000 requests. OSTI operated one of the few federal printing plants in the United States, and in 1948 began an almost 30-year production of the world-famous Nuclear Science Abstracts, which greatly expanded access to nuclear science information. OSTI shouldered a lead role in providing materials to the Atoms for Peace Geneva Conferences, envisioned by President Dwight D. Eisenhower to pool nuclear information for sharing with peaceful nations. OSTI was instrumental in establishing the International Nuclear Information System (INIS), which promotes nuclear information exchange between 110 countries.
</p></blockquote>
<p>OSTI has capitalized on the power of the Web since the early days of the Web:</p>
<blockquote><p>
In 1994, OSTI created the first DOE home page, and it has made significant strides into the Information Age ever since, defining new electronic exchange formats, creating collections of digitized scientific and technical information, serving researchers directly, and developing an energy science and technology virtual library. OSTI today hosts three major collections of scientific and technical information: Science Accelerator, which features DOE R&#038;D resources; Science.gov, which provides access to STI from federal science agencies through the U.S. government; and WorldWideScience.org, which offers resources from more than 60 nations around the world.
</p></blockquote>
<p>What&#8217;s next for OSTI? Here is one direction:</p>
<blockquote><p>
What is next? There is no inherent reason that a single tool cannot rely upon both a crawled index and a live federated search in parallel. Indeed, OSTI&#8217;s largest product does just that. It is the Eprint Network (<a href="http://www.osti.gov/eprints">http://www.osti.gov/eprints</a>/). All in parallel, it searches 1.5 million eprints that have been crawled, plus an additional 5 million eprints hosted in 50 eprint databases, comprising in all about 100 million pages. As far as we know, there is no other tool in the world that virtually integrates such a quantity of eprints. Further, we are not aware of another publicly available search tool that searches federated databases and crawled indexes in parallel.
</p></blockquote>
<p>[ Disclaimer: I consult not only for OSTI but also for <a href="http://www.deepwebtech.com">Deep Web Technologies</a>, which sponsors this blog and built the search engines behind a number of the OSTI federated search applications. ]</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/07/23/history-of-the-federal-agency-that-pioneered-federated-search-in-the-government/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/07/23/history-of-the-federal-agency-that-pioneered-federated-search-in-the-government/</feedburner:origLink></item>
		<item>
		<title>Google acquires Metaweb - another step towards semantic search</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/wb8D3QINs7g/</link>
		<comments>http://federatedsearchblog.com/2010/07/19/google-acquires-metaweb-another-step-towards-semantic-search/#comments</comments>
		<pubDate>Mon, 19 Jul 2010 13:20:59 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[industry news]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1580</guid>
		<description><![CDATA[Google just announced that they would buy ITA Software, regulators permitting. Here&#8217;s another Google purchase that would take Google deeper into smarter searching.
Semantic processing is taking a big step forward.
From Mashable:

Google Acquires Metaweb to Improve Search
Google has acquired semantic web and real world database company Metaweb, a move the company says will help them &#8220;improve [...]]]></description>
			<content:encoded><![CDATA[<p>Google just announced that they would buy <a href="http://federatedsearchblog.com/2010/07/12/sometimes-less-is-more/">ITA Software</a>, regulators permitting. Here&#8217;s another Google purchase that would take Google deeper into smarter searching.</p>
<p>Semantic processing is taking a big step forward.</p>
<p>From <a href="http://mashable.com/2010/07/16/google-acquires-metaweb/">Mashable</a>:</p>
<blockquote>
<h2>Google Acquires Metaweb to Improve Search</h2>
<p>Google has acquired semantic web and real world database company Metaweb, a move the company says will help them &#8220;improve search and make the web richer and more meaningful for everyone.&#8221;</p>
<p>We wrote about Metaweb back in 2008 when they received a significant chunk of funding to the tune of $42 million, on top of their first round of $15 million back in 2006. Since then the company has built its Freebase open database into a collection of over 12 million items from entertainment (movies, books, TV shows) to locations, celebrities, companies and other &#8220;real world&#8221; objects. Google says the plan is to preserve and further develop the database and hope to enlist other companies to make use of and contribute to the data.</p>
<p>In addition to fleshing out Freebase, Google also hopes to leverage Metaweb to enhance its efforts with features like rich snippets and search answers, both of which aim to give back &#8220;smarter&#8221; and more immediate results to specific queries. Right now, simpler requests like &#8220;Barack Obama birthday&#8221; and &#8220;events in San Jose&#8221; can spawn relevant answers right at the top of the search results page, but Google hopes to take this initiative further by feeding in more facts about the real world from Metaweb&#8217;s data repository.
</p></blockquote>
<p>Resource Shelf has some <a href="http://www.resourceshelf.com/2010/07/16/news-and-a-few-thoughts-even-some-history-as-google-adds-some-structure-with-acquistion-of-metaweb-freebase/">very insightful thoughts</a> on the acquisition. </p>
<p>Here&#8217;s a good video on what Metaweb is about:</p>
<div id="vvq4c8728b7042ef" class="vvqbox vvqyoutube" style="width:425px;height:355px;">
<p><a href="http://www.youtube.com/watch?v=TJfrNo3Z-DU">http://www.youtube.com/watch?v=TJfrNo3Z-DU</a></p>
</div>
<p>Here&#8217;s Google&#8217;s <a href="http://googleblog.blogspot.com/2010/07/deeper-understanding-with-metaweb.html">announcement of the acquisition</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/07/19/google-acquires-metaweb-another-step-towards-semantic-search/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/07/19/google-acquires-metaweb-another-step-towards-semantic-search/</feedburner:origLink></item>
		<item>
		<title>Hope Leman on multilingual federated search</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/KnR88-Cajto/</link>
		<comments>http://federatedsearchblog.com/2010/07/15/hope-leman-on-multilingual-federated-search/#comments</comments>
		<pubDate>Thu, 15 Jul 2010 14:10:35 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[viewpoints]]></category>

		<category><![CDATA[federated search]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1555</guid>
		<description><![CDATA[
Hope Leman is one of my favorite people. I know of very few individuals who are as passionate about anything as is Hope. Hope won second place in our second Federated Search Blog contest and I commented on her passionate review of WorldWideScience.org in 2008.
Hope wrote again about WorldWideScience.org. Her article is at her blog, [...]]]></description>
			<content:encoded><![CDATA[<p><center><img src="http://federatedsearchblog.com/images/SignificantScience.png"></center><br />
Hope Leman is one of my favorite people. I know of very few individuals who are as passionate about anything as Hope is. <a href="http://federatedsearchblog.com/2010/03/08/hope-leman-wins-2nd-place-in-federated-search-blog-contest/">Hope won second place</a> in our second Federated Search Blog contest, and I commented on her <a href="http://federatedsearchblog.com/2008/11/19/science-portals-its-about-diversity-and-hope/">passionate review of WorldWideScience.org</a> in 2008.</p>
<p>Hope wrote again about <a href="http://worldwidescience.org">WorldWideScience.org</a>. Her article is at her blog, <a href="http://significantscience.com/2010/07/06/multilingual-worldwidescience-accelerating-scientific-research-empowering-researchers/">Significant Science</a>. Hope is a research information technologist for a health network in Oregon. She is also Web administrator of the free online grants and scholarship listing service, <a href="http://www.scangrants.com/">ScanGrants</a>, and of the free online search platform, <a href="http://www.researchraven.com/">ResearchRaven</a>. From several conversations with Hope I know that ScanGrants is a labor of love and a good demonstration of Hope&#8217;s passion for helping researchers.</p>
<p>In <a href="http://significantscience.com/2010/07/06/multilingual-worldwidescience-accelerating-scientific-research-empowering-researchers/">Multilingual WorldWideScience: Accelerating Scientific Research, Empowering Researchers</a>, Hope reminds us of the key role that search plays in research, especially in the world of free science and foreign-language science.</p>
<p>Hope&#8217;s message is personal, and I love that:</p>
<blockquote><p>
As someone who grew up in a family that housed students who had left home and family in China, Japan, Iran, Korea and other countries to study engineering, chemistry, physics, biochemistry and so on at Oregon State University here in my hometown of Corvallis, Oregon I know what brilliant people there are in many countries who have so much to offer and what a boon it will be that the work of researchers worldwide will become useable to each of them and benefit the rest of us.
</p></blockquote>
<p>This update on Hope&#8217;s friend who suffered from ALS is even more touching:</p>
<blockquote><p>
I have recently lost a friend to amyotrophic lateral sclerosis and I would often sadly reflect as I bicycled home from her house about the glacial pace of progress on research on that disease and others like it. That is why I find Dr. Warnick&#8217;s enthusiasm and practical accomplishments so very admirable and the best possible case for paying one&#8217;s taxes with a minimal amount of grumbling. He is putting federal funds to exemplary use
</p></blockquote>
<p>Dr. Warnick, Director of <a href="http://www.osti.gov">OSTI</a>, conceived WorldWideScience, and his agency hosts and manages the search portal.</p>
<p>Databases and search engines aren&#8217;t about getting one&#8217;s job done. At the noblest level, they&#8217;re about solving important problems, and saving lives when we can.</p>
<p>[ Disclaimer: OSTI is one of my consulting clients. Deep Web Technologies, which built the single- and multiple-language search engines behind WorldWideScience.org and sponsors this blog, is another of my clients. ]</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/07/15/hope-leman-on-multilingual-federated-search/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/07/15/hope-leman-on-multilingual-federated-search/</feedburner:origLink></item>
		<item>
		<title>Sometimes less is more</title>
		<link>http://feedproxy.google.com/~r/Federatedsearchblogcom/~3/GT_tBz8piqo/</link>
		<comments>http://federatedsearchblog.com/2010/07/12/sometimes-less-is-more/#comments</comments>
		<pubDate>Mon, 12 Jul 2010 19:24:00 +0000</pubDate>
		<dc:creator>Sol</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://federatedsearchblog.com/?p=1548</guid>
		<description><![CDATA[I don&#8217;t write about metasearch engines very often but I think that Google&#8217;s proposed purchase of ITA Software is worth commenting on. Here&#8217;s some info from Google:

On July 1, 2010, Google announced an agreement to acquire ITA Software, a Cambridge, Massachusetts flight information software company, for $700 million, subject to adjustments.
Google&#8217;s acquisition of ITA Software [...]]]></description>
			<content:encoded><![CDATA[<p>I don&#8217;t write about metasearch engines very often but I think that <a href="http://www.google.com/press/ita/">Google&#8217;s proposed purchase</a> of <a href="http://www.itasoftware.com/">ITA Software</a> is worth commenting on. Here&#8217;s some info from Google:</p>
<blockquote><p>
On July 1, 2010, Google announced an agreement to acquire ITA Software, a Cambridge, Massachusetts flight information software company, for $700 million, subject to adjustments.</p>
<p>Google&#8217;s acquisition of ITA Software will create a new, easier way for users to find better flight information online, which should encourage more users to make their flight purchases online.</p>
<p>The acquisition will benefit passengers, airlines and online travel agencies by making it easier for users to comparison shop for flights and airfares and by driving more potential customers to airlines&#8217; and online travel agencies&#8217; websites. Google won&#8217;t be setting airfare prices and has no plans to sell airline tickets to consumers.</p>
<p>Because Google doesn&#8217;t currently compete against ITA Software, the deal will not change existing market shares. We are very excited about ITA Software&#8217;s QPX business, and we&#8217;re looking forward to working with current and future customers. Google will honor all existing agreements, and we&#8217;re also enthusiastic about adding new partners.
</p></blockquote>
<p><span id="more-1548"></span>The deal is subject to regulatory approval.</p>
<p>Travel news site <a href="http://www.tnooz.com/2010/07/12/news/google-ita-software-deal-why-the-airlines-could-be-very-happy/">tnooz</a> made an interesting comment on Google&#8217;s new &#8220;troogle&#8221; service: </p>
<blockquote><p>
A user goes to native search in a Google search box, for example, and types the following query:</p>
<p>&nbsp;&nbsp;&nbsp;&nbsp;Lowest fare to NYC on Sept 12th back on Sept 15th 2 people.</p>
<p>Google may respond with a display that shows a results box and the user selects from the list and, Voila, the user is now deep inside the workflow of the airline&#8217;s booking engine.</p>
<p><strong>No messy metasearch</strong>, no expensive online travel, no blood curdling GDS control.</p>
<p>The price is guaranteed to be the lowest, so no surprises for the customer who now has a trustworthy result.
</p></blockquote>
<p>Note the emphasis, mine, on &#8220;No messy metasearch.&#8221; That got my attention. While I&#8217;m not a super frequent flyer, I fly often enough to be annoyed at how difficult it is to find the best combination of flights with the least total travel time, the fewest hops, the most convenience, and the lowest cost. If troogle can really eliminate this exercise in frustration and drop me &#8220;deep inside the workflow of the airline&#8217;s booking engine&#8221; at the lowest price, then they&#8217;ll earn my business.</p>
<p>The take-away for me here is that less search can be better. Wasn&#8217;t it Roy Tennant who said that only librarians like to search, everyone else likes to find? Don&#8217;t you think that when the computer knows exactly what you want, without your needing to provide it with detailed context for every search &#8212; i.e., when search becomes ubiquitous and invisible &#8212; search will travel to new heights?</p>
<p>What do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://federatedsearchblog.com/2010/07/12/sometimes-less-is-more/feed/</wfw:commentRss>
		<feedburner:origLink>http://federatedsearchblog.com/2010/07/12/sometimes-less-is-more/</feedburner:origLink></item>
	</channel>
</rss>
