<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" version="2.0">
<channel>
	<title>Comments for Federated Search</title>
	
	<link>http://federatedsearchblog.com</link>
	<description>Covers topics related to federated search and the deep web</description>
	<pubDate>Fri, 03 Jul 2009 21:47:21 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/CommentsForFederatedSearch" type="application/rss+xml" /><feedburner:emailServiceId xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">CommentsForFederatedSearch</feedburner:emailServiceId><feedburner:feedburnerHostname xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">http://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>Comment on Science source selection by Peter Noerr</title>
		<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/comment-page-1/#comment-28652</link>
		<dc:creator>Peter Noerr</dc:creator>
		<pubDate>Fri, 03 Jul 2009 21:09:53 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=739#comment-28652</guid>
		<description>There is generally another aspect to the "accessibility" of sources other than the technical, which is what Jonathan talks about. And that is commercial.

The commercial conditions for getting a set of records into an aggregated search environment seem to often be more onerous than allowing a broadcast search of the native source. Part of the problem lies in perception of the aggregated environment as being a competitor to the source of the records. This may be a true perception, a false one, or a wishful one.</description>
		<content:encoded><![CDATA[<p>There is generally another aspect to the &#8220;accessibility&#8221; of sources other than the technical, which is what Jonathan talks about. And that is commercial.</p>
<p>The commercial conditions for getting a set of records into an aggregated search environment seem to often be more onerous than allowing a broadcast search of the native source. Part of the problem lies in perception of the aggregated environment as being a competitor to the source of the records. This may be a true perception, a false one, or a wishful one.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on “Query-free” federated search by Peter Noerr</title>
		<link>http://federatedsearchblog.com/2009/06/01/query-free-federated-search/comment-page-1/#comment-28651</link>
		<dc:creator>Peter Noerr</dc:creator>
		<pubDate>Fri, 03 Jul 2009 20:59:32 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=541#comment-28651</guid>
		<description>As a historical note I would point readers to a product now called "ClickSurge" from www.mediariver.com. This started life in 2006 as a tool called Watson, and the company as Open Road which morphed to Intellext, and then its current name. 

The technology in this product does exactly what is described in Sol's post - create searches from the typings of the user.

Another product performing the same functionality, though in a different use case, is gClick from DowJones. This provides contextually relevant company and people information for a retrieved web page.

Of course both of these are commercial products, so Andy would need to ask about how they can be used in a library web site.</description>
		<content:encoded><![CDATA[<p>As a historical note I would point readers to a product now called &#8220;ClickSurge&#8221; from <a href="http://www.mediariver.com" rel="nofollow">http://www.mediariver.com</a>. This started life in 2006 as a tool called Watson, and the company as Open Road which morphed to Intellext, and then its current name. </p>
<p>The technology in this product does exactly what is described in Sol&#8217;s post - create searches from the typings of the user.</p>
<p>Another product performing the same functionality, though in a different use case, is gClick from DowJones. This provides contextually relevant company and people information for a retrieved web page.</p>
<p>Of course both of these are commercial products, so Andy would need to ask about how they can be used in a library web site.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on “Query-free” federated search by Andy</title>
		<link>http://federatedsearchblog.com/2009/06/01/query-free-federated-search/comment-page-1/#comment-28605</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Thu, 02 Jul 2009 16:11:05 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=541#comment-28605</guid>
		<description>I would love to add this tool to our library.  Any idea if Los Alamos plans to release the code?</description>
		<content:encoded><![CDATA[<p>I would love to add this tool to our library.  Any idea if Los Alamos plans to release the code?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Science source selection by Sol</title>
		<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/comment-page-1/#comment-28604</link>
		<dc:creator>Sol</dc:creator>
		<pubDate>Thu, 02 Jul 2009 14:53:52 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=739#comment-28604</guid>
		<description>Jonathan,

I think the answer is a hybrid service. Index what you can (discovery service), federate what you can, harvest what you can, etc. A good federated search framework can integrate access to content that has been obtained by a number of different technologies.

In other words, lead with the question of what sources are important not with the question of what technology is the most convenient. Then find ways to get access to each of them. I've never said that federated search makes discovery services or crawling or harvesting obsolete.</description>
		<content:encoded><![CDATA[<p>Jonathan,</p>
<p>I think the answer is a hybrid service. Index what you can (discovery service), federate what you can, harvest what you can, etc. A good federated search framework can integrate access to content that has been obtained by a number of different technologies.</p>
<p>In other words, lead with the question of what sources are important not with the question of what technology is the most convenient. Then find ways to get access to each of them. I&#8217;ve never said that federated search makes discovery services or crawling or harvesting obsolete.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Science source selection by Jonathan Rochkind</title>
		<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/comment-page-1/#comment-28602</link>
		<dc:creator>Jonathan Rochkind</dc:creator>
		<pubDate>Thu, 02 Jul 2009 13:51:30 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=739#comment-28602</guid>
		<description>Sol, I've said this before, but I don't understand where you're coming from on the "it's fine to only search content you provide access to" point, because as far as I can tell, _federated search is exactly the same thing_.  Even with broadcast search, you can only search content that is accessible to broadcast search by your tool, which is not ALL content!

With either broadcast search or an aggregated index, you only have access to what you have access to. And in either case, providing access to any arbitrary resource may or may not be possible, and will take a non-zero amount of 'development' time to add.  With either case, the user using the tool only has access to searching content included in the tool, unless they leave the tool, which they can do in either case. 

It seems to me you are assuming that broadcast search will have a lower barrier to including additional resources, and end up having more coverage?  This to me is something that needs to be seen in practice, not assumed theoretically.  The key thing will be comparing two actual tools, and seeing which tool includes more resources, and more relevant and quality resources, for some particular context (audience and use cases). 

If SerSol's aggregated index comes up short, then that will be to it's detriment.  But I don't see how that can be assumed without an actual evaluation of two actual deployed tools side by side.</description>
		<content:encoded><![CDATA[<p>Sol, I&#8217;ve said this before, but I don&#8217;t understand where you&#8217;re coming from on the &#8220;it&#8217;s fine to only search content you provide access to&#8221; point, because as far as I can tell, _federated search is exactly the same thing_.  Even with broadcast search, you can only search content that is accessible to broadcast search by your tool, which is not ALL content!</p>
<p>With either broadcast search or an aggregated index, you only have access to what you have access to. And in either case, providing access to any arbitrary resource may or may not be possible, and will take a non-zero amount of &#8216;development&#8217; time to add.  With either case, the user using the tool only has access to searching content included in the tool, unless they leave the tool, which they can do in either case. </p>
<p>It seems to me you are assuming that broadcast search will have a lower barrier to including additional resources, and end up having more coverage?  This to me is something that needs to be seen in practice, not assumed theoretically.  The key thing will be comparing two actual tools, and seeing which tool includes more resources, and more relevant and quality resources, for some particular context (audience and use cases). </p>
<p>If SerSol&#8217;s aggregated index comes up short, then that will be to it&#8217;s detriment.  But I don&#8217;t see how that can be assumed without an actual evaluation of two actual deployed tools side by side.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Federated search: A wonder or a waste? by Peter Noerr</title>
		<link>http://federatedsearchblog.com/2009/06/24/federated-search-a-wonder-or-a-waste/comment-page-1/#comment-28274</link>
		<dc:creator>Peter Noerr</dc:creator>
		<pubDate>Fri, 26 Jun 2009 00:48:33 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=703#comment-28274</guid>
		<description>Walt,

As the "odd one out" on sol's panel (according to his post above), I should point out that my background is in libraries as well. 

However, we have taken a piece of software (our Muse Platform) which powers a few library marketplace Federated Search systems, and expanded to pastures new. I'm not going to name names (look on our web site for those), but we have partners in publishing, medical information, enterprise software, corporate consultancy, and government systems. As you say federated access (not just search) is applicable across the board, and I hope to be able to bring some of our experience of that diversity to the panel.

I would also hope that the audience will keep us honest and make sure we keep the discussion to the fundamentals and not wander off into library minutiae.</description>
		<content:encoded><![CDATA[<p>Walt,</p>
<p>As the &#8220;odd one out&#8221; on sol&#8217;s panel (according to his post above), I should point out that my background is in libraries as well. </p>
<p>However, we have taken a piece of software (our Muse Platform) which powers a few library marketplace Federated Search systems, and expanded to pastures new. I&#8217;m not going to name names (look on our web site for those), but we have partners in publishing, medical information, enterprise software, corporate consultancy, and government systems. As you say federated access (not just search) is applicable across the board, and I hope to be able to bring some of our experience of that diversity to the panel.</p>
<p>I would also hope that the audience will keep us honest and make sure we keep the discussion to the fundamentals and not wander off into library minutiae.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Federated search: A wonder or a waste? by Sol</title>
		<link>http://federatedsearchblog.com/2009/06/24/federated-search-a-wonder-or-a-waste/comment-page-1/#comment-28235</link>
		<dc:creator>Sol</dc:creator>
		<pubDate>Thu, 25 Jun 2009 13:40:12 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=703#comment-28235</guid>
		<description>Walt,

While 2 of the 3 panelists are in the library space I believe that there will be applicability to the enterprise space. The enterprise search community needs to deal with many of the same issues as the library community - accessing subscription content, relevance ranking, ease of use, requirements gathering, installation, maintenance, and performance to name some of the major concerns.</description>
		<content:encoded><![CDATA[<p>Walt,</p>
<p>While 2 of the 3 panelists are in the library space I believe that there will be applicability to the enterprise space. The enterprise search community needs to deal with many of the same issues as the library community - accessing subscription content, relevance ranking, ease of use, requirements gathering, installation, maintenance, and performance to name some of the major concerns.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Federated search: A wonder or a waste? by Walt Warnick</title>
		<link>http://federatedsearchblog.com/2009/06/24/federated-search-a-wonder-or-a-waste/comment-page-1/#comment-28234</link>
		<dc:creator>Walt Warnick</dc:creator>
		<pubDate>Thu, 25 Jun 2009 13:32:09 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=703#comment-28234</guid>
		<description>Sol,


The panel you are moderating at the ENTERPRISE SEARCH SUMMIT WEST seems to focus exclusively on the application of federated search for online library services.  Of course, federated search has enterprise applications that go beyond online library services.  Further, federated search has application beyond enterprise applications, such as enabling WorldWideScience.org to search the huge national science databases of over 50 countries around the world.  

As your panel intends to debate whether federated search is "A Wonder or a Waste," it seems inappropriate to limit attention on but one application of federated search.

How about helping me out with a panel to debate whether my Buick LeSabre is a wonder or a waste?  The panel will limit attention to the application of my LeSabre for pulling stumps from my vacation mountain land.

Walt Warnick</description>
		<content:encoded><![CDATA[<p>Sol,</p>
<p>The panel you are moderating at the ENTERPRISE SEARCH SUMMIT WEST seems to focus exclusively on the application of federated search for online library services.  Of course, federated search has enterprise applications that go beyond online library services.  Further, federated search has application beyond enterprise applications, such as enabling WorldWideScience.org to search the huge national science databases of over 50 countries around the world.  </p>
<p>As your panel intends to debate whether federated search is &#8220;A Wonder or a Waste,&#8221; it seems inappropriate to limit attention on but one application of federated search.</p>
<p>How about helping me out with a panel to debate whether my Buick LeSabre is a wonder or a waste?  The panel will limit attention to the application of my LeSabre for pulling stumps from my vacation mountain land.</p>
<p>Walt Warnick</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on The “lowest common denominator” myth by Peter Noerr</title>
		<link>http://federatedsearchblog.com/2009/06/22/the-lowest-common-denominator-myth/comment-page-1/#comment-28215</link>
		<dc:creator>Peter Noerr</dc:creator>
		<pubDate>Thu, 25 Jun 2009 01:39:21 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=683#comment-28215</guid>
		<description>Since I started "LCD", I'll have a go.

As an operation I agree with the comments by Dave, Sol and Jonathan. LCD to me is literally where the Fed Search system issues a search of the same functionality (but possibly different syntax) to each Source. That implies that the functionality of the search is limited to that available from the least functional Source. I have deliberately used "functionality" to be as inclusive in the definition as possible. Indices, operators, relations, limits, even vocabularies are all examples of "functionality".

Thus Dave's A, B, C case is a perfect example of what I would call an LCD situation - in the second instance. Here the search is issued only as a full text search, because that is the only functionality supported by all three Sources. The other instance (where only A &amp; B are searched) is dealt with below.

A couple of comments need to attach to this. Firstly it is interesting to ponder that the Fed search system must know enough about the different Sources to be able to determine what the "common" functions are. If it can do this, it is halfway(-ish) towards being able to handle Source Specific Searches (SSS). So why dumb down? Who knows? I don't.

The second point is that the first of Dave's instances (search A&amp;B only) uses what we call a "strict" mode for the search. That is: if the Source can't handle the search in it's entirety, then fail for that Source, and return 0 results. This is not LCD, but rather a strange form of "HCF" (if we want to stick with basic arithmetic acronyms) where only those Sources which meet ALL the requirements (support all the functionality) are sorted. Different category, but a problem none-the-less for certain types of searches.

Where non-LCD searching is possible (SSS as used above) then it is possible to switch to a "relaxed" mode and allow some portion of the search to be processed and produce at least some results. (You want details - I knew you would. OK, the most obvious example is where a Source does not support a particular index and any terms for that Source are mapped to one it does support - in 99% of cases this is "keyword" or its functional equivalent. )

This mapped and relaxed operation does allow results from Sources which would be "0"s to a strict search and, in our experience, most people would rather get something back which is somewhat like what they asked for, rather than nothing. Often because they weren't too sure about the query in the first place. Again a good result for some case, but not all. However take the union of the two cases and you cover what virtually everybody would want - and what you would get from the native Sources - always an important touchstone. (And this whole comment raise a host more issues for Sol to tease out into other posts - like the issue of notifying users of what the search has done.)</description>
		<content:encoded><![CDATA[<p>Since I started &#8220;LCD&#8221;, I&#8217;ll have a go.</p>
<p>As an operation I agree with the comments by Dave, Sol and Jonathan. LCD to me is literally where the Fed Search system issues a search of the same functionality (but possibly different syntax) to each Source. That implies that the functionality of the search is limited to that available from the least functional Source. I have deliberately used &#8220;functionality&#8221; to be as inclusive in the definition as possible. Indices, operators, relations, limits, even vocabularies are all examples of &#8220;functionality&#8221;.</p>
<p>Thus Dave&#8217;s A, B, C case is a perfect example of what I would call an LCD situation - in the second instance. Here the search is issued only as a full text search, because that is the only functionality supported by all three Sources. The other instance (where only A &amp; B are searched) is dealt with below.</p>
<p>A couple of comments need to attach to this. Firstly it is interesting to ponder that the Fed search system must know enough about the different Sources to be able to determine what the &#8220;common&#8221; functions are. If it can do this, it is halfway(-ish) towards being able to handle Source Specific Searches (SSS). So why dumb down? Who knows? I don&#8217;t.</p>
<p>The second point is that the first of Dave&#8217;s instances (search A&amp;B only) uses what we call a &#8220;strict&#8221; mode for the search. That is: if the Source can&#8217;t handle the search in it&#8217;s entirety, then fail for that Source, and return 0 results. This is not LCD, but rather a strange form of &#8220;HCF&#8221; (if we want to stick with basic arithmetic acronyms) where only those Sources which meet ALL the requirements (support all the functionality) are sorted. Different category, but a problem none-the-less for certain types of searches.</p>
<p>Where non-LCD searching is possible (SSS as used above) then it is possible to switch to a &#8220;relaxed&#8221; mode and allow some portion of the search to be processed and produce at least some results. (You want details - I knew you would. OK, the most obvious example is where a Source does not support a particular index and any terms for that Source are mapped to one it does support - in 99% of cases this is &#8220;keyword&#8221; or its functional equivalent. )</p>
<p>This mapped and relaxed operation does allow results from Sources which would be &#8220;0&#8243;s to a strict search and, in our experience, most people would rather get something back which is somewhat like what they asked for, rather than nothing. Often because they weren&#8217;t too sure about the query in the first place. Again a good result for some case, but not all. However take the union of the two cases and you cover what virtually everybody would want - and what you would get from the native Sources - always an important touchstone. (And this whole comment raise a host more issues for Sol to tease out into other posts - like the issue of notifying users of what the search has done.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on The “lowest common denominator” myth by Sol</title>
		<link>http://federatedsearchblog.com/2009/06/22/the-lowest-common-denominator-myth/comment-page-1/#comment-28203</link>
		<dc:creator>Sol</dc:creator>
		<pubDate>Wed, 24 Jun 2009 19:53:18 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=683#comment-28203</guid>
		<description>Everyone,

I think of the myth as the blanket statement that federated search engines rollover/suck because some sources are not easy to search well. 

But, I'm interested to know what you all think LCD means. Abe and I discussed it for a while and realized that we weren't quite sure what it means. So, please tell me.

At the same time there's a different discussion about how some federated search engines handle tricky sources better than others. I'll do what I can to find examples of smart connectors searching sources particularly well.</description>
		<content:encoded><![CDATA[<p>Everyone,</p>
<p>I think of the myth as the blanket statement that federated search engines rollover/suck because some sources are not easy to search well. </p>
<p>But, I&#8217;m interested to know what you all think LCD means. Abe and I discussed it for a while and realized that we weren&#8217;t quite sure what it means. So, please tell me.</p>
<p>At the same time there&#8217;s a different discussion about how some federated search engines handle tricky sources better than others. I&#8217;ll do what I can to find examples of smart connectors searching sources particularly well.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
