<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>The Findability blog</title>
	
	<link>http://findabilityblog.se</link>
	<description>the search and findability blog</description>
	<lastBuildDate>Wed, 15 Feb 2012 13:50:44 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/TheFindabilityBlog" /><feedburner:info uri="thefindabilityblog" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Searching for Zebras: Doing More with Less</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/U20pHFlJioA/</link>
		<comments>http://findabilityblog.se/searching-for-zebras-doing-more-with-less/#comments</comments>
		<pubDate>Wed, 15 Feb 2012 13:50:44 +0000</pubDate>
		<dc:creator>Paula Petcu</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Text Analytics]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2876</guid>
		<description><![CDATA[There is a very controversial and highly cited 2006 British Medical Journal (BMJ) article called &#8220;Googling for a diagnosis &#8211; use of Google as a diagnostic aid: internet based study&#8221; which concludes that, for difficult medical diagnostic cases, it is often useful to use Google Search as a tool for finding a diagnosis. Difficult medical [...]]]></description>
			<content:encoded><![CDATA[<p>There is a very controversial and highly cited 2006 British Medical Journal (BMJ) article called &#8220;<a href="http://www.bmj.com/content/early/2005/12/31/bmj.39003.640567.AE.abstract">Googling for a diagnosis &#8211; use of Google as a diagnostic aid: internet based study</a>&#8221; which concludes that, for difficult medical diagnostic cases, it is often useful to use Google Search as a tool for finding a diagnosis. Difficult medical cases are often represented by rare diseases, which are diseases with a very low prevalence.</p>
<p>The authors use 26 diagnostic cases published in the New England Journal of Medicine (NEJM) in order to compile a short list of symptoms describing each patient case, and use those keywords as queries for Google. The authors, blinded to the correct disease (a rare diseases in 85% of the cases), select the most &#8216;prominent&#8217; diagnosis that fits each case. In 58% of the cases they succeed in finding the correct diagnosis.</p>
<p>Several other articles also point to Google as a tool often used by clinicians when searching for medical diagnoses.</p>
<p>But is that so convenient, is that enough, or can this process be easily improved? Indeed, two major advantages for Google are the clinicians&#8217; familiarity with it, and its fresh and extensive index. But how would a vertical search engine with focused and curated content compare to Google when given the task of finding the correct diagnosis for a difficult case?</p>
<p>Well, take an open-source search engine such as <a href="http://lemurproject.com">Indri</a>, index around 300,000 freely available medical articles describing rare or genetic diseases, use an off-the-shelf retrieval model, and there you have <a href="http://findzebra.com">Zebra</a>. In medicine, the term &#8220;zebra&#8221; is a slang for a surprising diagnosis. In comparison with a search on Google, which often returns results that point to unverified content from blogs or content aggregators, the documents from this vertical search engine are crawled from 10 web resources containing only rare and genetic disease articles, and which are mostly maintained by medical professionals or patient organizations.</p>
<p>Evaluating on a set of 56 queries extracted in a similar manner to the one described above, Zebra easily beats Google. Zebra finds the correct diagnosis in top 20 results in 68% of the cases, while Google succeeds in 32% of them. And this is only the performance of the Zebra with the baseline relevance model — imagine how much more could be done (for example, displaying results as a network of diseases, clustering or even ranking by diseases, or automatic extraction and translation of electronic health record data).</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/U20pHFlJioA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/searching-for-zebras-doing-more-with-less/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/searching-for-zebras-doing-more-with-less/</feedburner:origLink></item>
		<item>
		<title>Search in the Digital Workplace</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/kqp2ajqtK-4/</link>
		<comments>http://findabilityblog.se/search-in-the-digital-workplace/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 12:19:43 +0000</pubDate>
		<dc:creator>Caroline Abrahamsson</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Digital Workplace]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2879</guid>
		<description><![CDATA[Last week we (Caroline Abrahamsson and Kristian Norling) had the opportunity to act as moderators for a conference on the Digital Workplace in Stockholm. Amongst the many good presentations, the keynote by Jane McConell was a gem. The Digital Workplace Trends report by Jane gives many insights into the intranet world, or as Jane and many others [...]]]></description>
			<content:encoded><![CDATA[<p>Last week we (Caroline Abrahamsson and Kristian Norling) had the opportunity to act as moderators for a <a href="http://www.abilitypartner.se/framtidens-digitala-sociala-och-kollaborativa-arbetsplats.aspx">conference on the Digital Workplace in Stockholm</a>. Amongst the many good presentations, the keynote by <a href="http://netjmc.com/about" target="_blank">Jane McConell</a> was a gem. <a href="http://www.digital-workplace-trends.com/">The Digital Workplace Trends</a> report by Jane gives many insights into the intranet world, or as Jane and many others prefer to call it, the Digital Workplace (Participants in the survey receives a free copy of the report, highly recommended!). One of  the most interesting parts for us was the four different future scenarios that Jane described during her session and that the survey participants had voted on (on a scale with low, medium or high business value):</p>
<blockquote>
<ul>
<li>“My apps” &#8211; The intranet is a set of highly customized apps. People select what they need to do their jobs and build their own “intranet” like on an iPad.</li>
<li>“Smartsystems”-The userexperience is efficient and relevant because information is delivered in meaningful ways based on past behavior and context.</li>
<li>“People-centric” &#8211; Social networking, social tagging, location awareness, presence indicators and other technologies are integrated into processes and how people work daily.</li>
<li>“Super search” &#8211; Various search technologies come together to offer people greater relevance and control over vast amounts of information from inside and outside the enterprise.</li>
</ul>
<p style="text-align: right;">p. 19 <a href="http://www.digital-workplace-trends.com/">Digital Workplace Trends 2012</a></p>
</blockquote>
<p>When Jane asked the audience at the conference if they thought Super Search had &#8220;high potential value”, a whopping 100% answered yes! In the Digital Workplace Trends report 70% of the participants considered Super Search to have a &#8220;high potential value”, and 20% of the leadership group has started implementing it.</p>
<p><a href="http://www.infocentricresearch.com/Research/Publications/The-Digital-Workplace.aspx">The Digital Workplace: Redefining Productivity in the Information Age</a> by  <a href="http://www.infocentricresearch.com/">Infocentric Research</a> is another excellent (and free) source on the current state of the Digital Workplace. Also in this report good search is mentioned as very important for getting work done in the digital workplace:</p>
<blockquote><p>&#8220;Imagine that each and every employee in your organization would spend 1 to 2 full working hours per day surfing the web and social media sites (such as Facebook, YouTube and Twitter) purely for private pleasure. Would that be acceptable for you? And even more important: would it leave your bottom line results unaffected?</p>
<p>The answer to both questions of course is clear “No”. But the bad news is that your employees spend just that amount of time for something even worse. And they do so with full allowance by management and in accordance to accepted work practices in your organization. What they do, what you do as well, is looking for information they need to do their job and ineffectively working with that information.&#8221;</p>
<p style="text-align: right;">p. 4 <a href="http://www.infocentricresearch.com/Research/Publications/The-Digital-Workplace.aspx">The Digital Workplace: Redefining Productivity in the Information Age</a></p>
</blockquote>
<p>When reading these excellent reports it is quite obvious to us that the need for a &#8220;Super Search&#8221;, i.e. an Enterprise Search solution that can reach all types of information, is very much in demand. Many organizations have worked extensively with search for many years understand that this is actually a never-ending task. But search is still a very cost-effective and hands-on solution for many information and knowledge intensive tasks.</p>
<blockquote><p>&#8220;Information based work is driven and determined by having the right information to perform the task at hand. For this, the information has to be there when needed. Looking for the right information to do something therefore constitutes one of the most relevant of all tasks. In fact, “searching” in all its forms is the most ubiquitous activity that information workers perform in their jobs&#8221;.</p>
<p style="text-align: right;">p. 15 <a href="http://www.infocentricresearch.com/Research/Publications/The-Digital-Workplace.aspx">The Digital Workplace: Redefining Productivity in the Information Age</a></p>
</blockquote>
<p>To conclude, the new digital workplace in transforming the way we work, interact and communicate. The discussions during the conference showed that almost all organizations were in a transformation phase where the traditional intranet (with static pages updated by editors) is being complemented (and in some cases replaced altoghether) with collaboration areas and flexible worktools.  We look forward to this years development and hope to share some good cases with you, especially with regard to search, collaboration and mobility..</p>
<h2>More reading on the Digital Workplace</h2>
<p><a href="http://intranet-pioneer.com/tag/digital-workplace/">Intranet Pioneer Mark Morell</a></p>
<p><a href="http://connaxions.wordpress.com/category/digital-workplace/">Connaxions / Martin Risgaard </a></p>
<p><a href="http://www.ibforum.com/services/digital-workplace/">The Intranet Benchmarking Forum</a></p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/kqp2ajqtK-4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/search-in-the-digital-workplace/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/search-in-the-digital-workplace/</feedburner:origLink></item>
		<item>
		<title>Search stuffed up with GIS</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/Nw9RFtGTIgg/</link>
		<comments>http://findabilityblog.se/search-stuffed-up-with-gis/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 10:01:53 +0000</pubDate>
		<dc:creator>Pawel Wroblewski</dc:creator>
				<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Information seeking behaviour]]></category>
		<category><![CDATA[Market trends]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Watch]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2865</guid>
		<description><![CDATA[When I browsed through marketing brochures of GIS (Geographic Information System) vendors I noticed that the message is quite similar to search analytics. It refers in general to integration of various separate sources into analysis based on geo-visualizations. I have recently seen quite nice and powerful combination of search and GIS technologies and so I [...]]]></description>
			<content:encoded><![CDATA[<p>When I browsed through marketing brochures of GIS (<a href="http://en.wikipedia.org/wiki/Geographic_information_system">Geographic Information System)</a> vendors I noticed that the message is quite similar to search analytics. It refers in general to integration of various separate sources into analysis based on geo-visualizations. I have recently seen quite nice and powerful combination of search and GIS technologies and so I would like to describe it a little bit. Let us start from the basic things.</p>
<h2><strong>Search result visualization</strong></h2>
<p>It is quite obvious to use a map instead of simple list of results to visualize what was returned for an entered query. This technique is frequently used for plenty of online search applications especially in directory services like yellow pages or real estate web sites. The list of things that are required to do this is pretty short:</p>
<p>- geoloalization of items  &#8211; it means to assign accurate geo coordinates to location names, addresses, zip codes or whatever expected to be shown in the map; geo localization services are given more less for free by Google or Bing maps.</p>
<p>- backgroud map &#8211; this is necessity and also given by Google or Bing; there are also plenty of vendors for more specialized mapping applications</p>
<p>- returned results with geo-coordinates  as metadata &#8211; to put them in the map</p>
<p>Normally this kind of basic GIS visualisation delivers basic map operations like zooming, panning, different views and additionally some more data like traffic, parks, shops etc. Results are usually pins [Bing] or drops [Google].</p>
<h2><strong>Querying / filtering with the map</strong></h2>
<p>The step further of integration between search and GIS would be utilizing the map as a tool for definition of search query. One way is to create area of interest that could be drawn in the map as circle, rectangle or polygon. In simple way it could be just the current window view on the map as the area of query. In such an approach full text query is refined to include only results belonging to area defined.</p>
<p>Apart from map all other query refinement tools should be available as well, like date-time sliders or any kind of navigation and fielded queries.</p>
<p><a href="http://searchmatics.files.wordpress.com/2011/08/esri1.jpg"><img src="http://searchmatics.files.wordpress.com/2011/08/esri1.jpg" alt="" width="468" height="464" /></a></p>
<h2><strong>Simple geo-spatial analysis</strong></h2>
<p>Sometimes it is important to sort query results by distance from a reference point in order to see all the nearest Chinese restaurant in the neighborhood.  I would also categorize as simple geo-spatial analysis grouping of search result into a GIS layers like e.g. density heatmap, hot spots using geographical and other information stored in results metadata etc.</p>
<h2><strong>Advanced geo-spatial analysis</strong></h2>
<p>More advance query definition and refinement would involve geo-spatial computations. Basing on real needs it could be possible for example to refine search results by an area of sight line from a picked reference point or select filtering areas like those inside specific borders of cities, districts, countries etc.</p>
<p>So the idea is to use relevant output from advanced GIS analysis as an input for query refinement. In this way all the power of GIS can be used to get to the unstructured data through a search process.</p>
<p><a href="http://searchmatics.files.wordpress.com/2011/08/esri2.jpg"><img src="http://searchmatics.files.wordpress.com/2011/08/esri2.jpg" alt="" width="468" height="349" /></a></p>
<p>What kind of applications do you think could get advantage of search stuffed with really advanced GIS? Looking forward to your comments on this post.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/Nw9RFtGTIgg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/search-stuffed-up-with-gis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/search-stuffed-up-with-gis/</feedburner:origLink></item>
		<item>
		<title>How to Index and Search XML Content in Solr</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/NzMxSVUC0TE/</link>
		<comments>http://findabilityblog.se/how-to-index-and-search-xml-content-in-solr/#comments</comments>
		<pubDate>Wed, 25 Jan 2012 14:20:39 +0000</pubDate>
		<dc:creator>Xiaodong Shen</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2819</guid>
		<description><![CDATA[Indexing XML Content In solr, there is an xml update request handler which can be used to update xml formatted data. For example, &#60;add&#62; &#60;doc&#62; &#60;field name="employeeId"&#62;05991&#60;/field&#62; &#60;field name="office"&#62;Bridgewater&#60;/field&#62; &#60;field name="skills"&#62;Perl&#60;/field&#62; &#60;field name="skills"&#62;Java&#60;/field&#62; &#60;/doc&#62; [&#60;doc&#62; ... &#60;/doc&#62;[&#60;doc&#62; ... &#60;/doc&#62;]] &#60;/add&#62; However when a field itself should contain xml formatted data, the xml update handler will [...]]]></description>
			<content:encoded><![CDATA[<h2>Indexing XML Content</h2>
<p>In solr, there is an xml update request handler which can be used to update xml formatted data.</p>
<p>For example,</p>
<p><code>&lt;add&gt;</code><br />
<code>&lt;doc&gt;</code><br />
<code>&lt;field name="employeeId"&gt;05991&lt;/field&gt;</code><br />
<code>&lt;field name="office"&gt;Bridgewater&lt;/field&gt;</code><br />
<code>&lt;field name="skills"&gt;Perl&lt;/field&gt;</code><br />
<code>&lt;field name="skills"&gt;Java&lt;/field&gt;</code><br />
<code>&lt;/doc&gt;</code><br />
<code>[&lt;doc&gt; ... &lt;/doc&gt;[&lt;doc&gt; ... &lt;/doc&gt;]]</code><br />
<code>&lt;/add&gt;</code></p>
<p>However when a field itself should contain xml formatted data, the xml update handler will fail to import. Because, xml update handler parse the import data with xml parser, it will try to get direct child text under &#8216;field&#8217; node, which is empty if a field&#8217;s direct child is xml tag.</p>
<p>What we can do is to use json update handler. For example:</p>
<p><code>[</code><br />
<code>  {</code><br />
<code>    "id" : "MyTestDocument",</code><br />
<code>    "title" : "&lt;root p=\"cc\"&gt;test \\ node&lt;/root&gt;"</code><br />
<code>  }</code><br />
<code>]</code></p>
<p><code></code>There are two things to notice,</p>
<ol start="1">
<li>Both &#8216;<code>”</code>&#8216; and &#8216;<code>\</code>&#8216; characters should be escaped</li>
<li>The xml content should be kept as a single line</li>
</ol>
<p>Json import data can be loaded into Solr by the curl command,</p>
<p><code>curl 'http://localhost:8983/solr/update/json?commit=true' --data-binary @books.json -H 'Content-type:application/json'</code></p>
<p>Or, by using solrj:</p>
<p><code>CommonsHttpSolrServer server = new CommonsHttpSolrServer(serverpath);</code><br />
<code>server.setMaxRetries(1);</code><br />
<code>ContentStreamUpdateRequest csureq = new ContentStreamUpdateRequest("/update/json");</code><br />
<code>csureq.addFile(file);</code><br />
<code>NamedList&lt;Object&gt; result = server.request(csureq);</code><br />
<code>NamedList&lt;Object&gt; responseHeader = (NamedList&lt;Object&gt;) result.get("responseHeader");</code></p>
<p><code>Integer status = (Integer) responseHeader.get("status");</code></p>
<h1>Stripping out xml tags in Schema definition</h1>
<p>When querying xml content, we most likely will not be interested in xml tags. So we need to strip out xml tags before indexing the xml text. We can do that by applying <code>HTMLStripCharFilter</code> to the xml content.<br />
<code>            &lt;analyzer type="index"&gt;</code><br />
<code>                ...</code><br />
<code>                &lt;charFilterSpellE"&gt;solr.HTMLStripCharFilterFactory"/&gt;</code><br />
<code>                &lt;tokenizerSpellE"&gt;solr.StandardTokenizerFactory"/&gt;</code><br />
<code>                &lt;filterSpellE"&gt;solr.LowerCaseFilterFactory"/&gt;</code><br />
<code>                ...</code><br />
<code>            &lt;/analyzer&gt;</code><br />
<code>            &lt;analyzer type="query"&gt;</code><br />
<code>                ...</code><br />
<code>                &lt;charFilterSpellE"&gt;solr.HTMLStripCharFilterFactory"/&gt;</code><br />
<code>                &lt;tokenizerSpellE"&gt;solr.StandardTokenizerFactory"/&gt;</code><br />
<code>                &lt;filterSpellE"&gt;solr.LowerCaseFilterFactory"/&gt;</code><br />
<code>                ...</code><br />
<code>            &lt;/analyzer&gt;</code></p>
<h1>Search XML Content</h1>
<p>Xml content search does not differ much from text content search. However, if people want to search for xml attributes, there requires some special tweak.</p>
<p><code>HTMLStripCharFilter</code> we mentioned earlier will filter out all xml tags including attributes, in order to index attributes, we need to find a way to make <code>HTMLStripCharFilter</code> keep the attribute text.</p>
<p>For example if we have original xml content as following,</p>
<p><code>&lt;sample attr=”key_o2_4”&gt;find it &lt;/sample&gt;</code><br />
After applying <code>HTMLStripCharFilter</code>, we want to have,</p>
<p><code>key_o2_4    find it</code><br />
One way we can do is to add assistance xml instruction tags in original xml content such as,</p>
<p><code>&lt;sample attr=”key_o2_4”&gt;&lt;?solr key_o2_4?&gt;find it&lt;/sample&gt;</code></p>
<p>And apply <code>Solr.PatternReplaceCharFilterFactory</code> to it as shown in following schema fieldtype definition.</p>
<p><code>&lt;analyzer type="index"&gt;</code><br />
<code>...</code><br />
<code>&lt;charFilter pattern="&amp;lt;\?solr ([A-Z0-9_-]*)\?&amp;gt; " replacement="       $1  " maxBlockChars="10000000"/&gt;</code><br />
<code>&lt;charFilter/&gt;</code><br />
<code>...</code><br />
<code>&lt;/analyzer&gt;</code></p>
<p>Which will make replace <code>&lt;?solr key_o2_4?&gt;</code> with 7 leading empty spaces + key_o2_4 + 2 ending empty spaces in order to keep the original offset,</p>
<p>With this technique, we can do a search on <code>attr</code> attribute and get a hit.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/NzMxSVUC0TE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/how-to-index-and-search-xml-content-in-solr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/how-to-index-and-search-xml-content-in-solr/</feedburner:origLink></item>
		<item>
		<title>Search-driven Navigation and Content</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/TB_zq55l2a4/</link>
		<comments>http://findabilityblog.se/search-driven-navigation-and-content/#comments</comments>
		<pubDate>Thu, 19 Jan 2012 08:05:37 +0000</pubDate>
		<dc:creator>Maggie Michnik</dc:creator>
				<category><![CDATA[FAST]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Interaction Design]]></category>
		<category><![CDATA[User Experience]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2823</guid>
		<description><![CDATA[In the beginning of October I attended Microsoft SharePoint Conference 2011 in Anaheim, USA. There were a lot of interesting and useful topics that were discussed. One really interesting session was Content Targeting with the FAST Search Web Part by Martin Harwar. Martin Harwar talked about how search can be used to show content on [...]]]></description>
			<content:encoded><![CDATA[<p>In the beginning of October I attended <a href="http://www.mssharepointconference.com/Pages/default.aspx">Microsoft SharePoint Conference</a> 2011 in Anaheim, USA. There were a lot of interesting and useful topics that were discussed. One really interesting session was <em><a href="http://www.mssharepointconference.com/pages/sessiondetailsShort.aspx?sessionguid=0604e9e1-ba23-426f-abda-58e917fdd1fa">Content Targeting with the FAST Search Web Part</a></em> by Martin Harwar.</p>
<p>Martin Harwar talked about how search can be used to show content on a web page. The most common search-driven content is of course the traditional search. But there are a lot more content that can be retrieved by search. One of them is to have search-driven navigation and content. The search-driven navigation means that instead of having static links on a page we can render them depending on the query the user typed in. If a user is for example on a health care site and had recently done a search on “ear infection” the page can show links to ear specialist departments. When the user will do another search and returns to the same page the links will be different.</p>
<p>In the same way we can render content on the page. Imagine a webpage of a tools business that on its start page has two lists of products, most popular and newest tools. To make these lists more adapted for a user we only want show products that are of interest for the user. Instead of only showing the most popular and newest tools the lists can also be filtered on the last query a user has typed. Assume a user searches on “saw” and then returns to the page with the product lists. The lists will now show the most popular saws and the newest saws. This can also be used when a user finds the companies webpage by searching for “saw” on for instance Google.</p>
<p>This shows that search can be used in many ways to personalize a webpage and thereby increase <a title="findability by Findwise" href="http://www.findwise.com/what-we-do/findability-by-findwise">Findability</a>.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/TB_zq55l2a4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/search-driven-navigation-and-content/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/search-driven-navigation-and-content/</feedburner:origLink></item>
		<item>
		<title>Text Analytics in Enterprise Search</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/161v3yGVtDA/</link>
		<comments>http://findabilityblog.se/text-analytics-in-enterprise-search/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 14:55:25 +0000</pubDate>
		<dc:creator>Kristian Norling</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Text Analytics]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2836</guid>
		<description><![CDATA[A presentation made by Daniel Ling at Apache Lucene Eurocon in Barcelona, october 2011. We think this is the first of many forthcoming presentations. We also want to get more involved in the community in the future. By doing presentations, sponsoring, contributing code. Hope to bring more news on this subject in the next few [...]]]></description>
			<content:encoded><![CDATA[<p>A <a title="Text Analytics in Enterprise Search" href="http://2011.lucene-eurocon.org/talks/20836">presentation made by Daniel Ling at Apache Lucene Eurocon</a> in Barcelona, october 2011.</p>
<p>We think this is the first of many forthcoming presentations.</p>
<p>We also want to get more involved in the community in the future. By doing presentations, sponsoring, contributing code. Hope to bring more news on this subject in the next few weeks. Enjoy the presentation:</p>
<p><iframe src="http://player.vimeo.com/video/33458815" frameborder="0" width="400" height="225"></iframe></p>
<p><a href="http://vimeo.com/33458815">Text Analytics in Enterprise Search, Daniel Ling, Findwise, Eurocon 2011</a> from <a href="http://vimeo.com/user5025093">Lucene Revolution</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/161v3yGVtDA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/text-analytics-in-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/text-analytics-in-enterprise-search/</feedburner:origLink></item>
		<item>
		<title>ExternalFileField in Solr</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/QuK61g6JxrQ/</link>
		<comments>http://findabilityblog.se/externalfilefield-in-solr/#comments</comments>
		<pubDate>Wed, 04 Jan 2012 14:57:15 +0000</pubDate>
		<dc:creator>Maggie Michnik</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2825</guid>
		<description><![CDATA[Sometimes we want to update document values in an indexed field more often than other fields. A good solution to this is to use the field type ExternFileField. The ExternalFileField gets values from an external file instead of the index. Such file can easily be changed and update the field after a commit. Hence no [...]]]></description>
			<content:encoded><![CDATA[<p>Sometimes we want to update document values in an indexed field more often than other fields. A good solution to this is to use the field type <a title="ExternalFileField" href="http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html">ExternFileField</a>. The ExternalFileField gets values from an external file instead of the index. Such file can easily be changed and update the field after a commit. Hence no documents need to be re-indexed. A field that has ExternalFileField as type is not searchable. The field may currently only be used as a ValueSource in a FunctionQuery.</p>
<p>The external file contains keys and values:</p>
<p>key1=value1<br />
key2=value2</p>
<p>The keys don&#8217;t need to be unique.</p>
<p>The name of the external file must be <code>external_&lt;fieldname&gt;</code> or <code>external_&lt;fieldname&gt;.*</code> and must be placed in the index directory.</p>
<p>A new file type of the type ExternalFileField and field must be added to schema.xml.</p>
<p><code>&lt;fieldType name="file"</code></p>
<p><code>           keyField="keyField" defVal="1" indexed="false"</code></p>
<p><code>           stored="false" valType="float" /&gt;</code></p>
<p><code>&lt;field name="&lt;fieldname&gt;" type="file" /&gt;</code></p>
<p><code>keyField</code> is the field that contains the keys and <code>&lt;fieldname&gt;</code> contains the values from the external file.</p>
<p><code>valType</code> defines the value type of the field.</p>
<p>At Findwise we have used this method for a customer where we wanted to show the most visited pages higher up in the search result. These statistics are changing daily for a lot of pages and we don&#8217;t want to re-index all these pages every day.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/QuK61g6JxrQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/externalfilefield-in-solr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/externalfilefield-in-solr/</feedburner:origLink></item>
		<item>
		<title>Analytics and BigData at IBM Information On Demand 2011</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/Ic-gWY0KDoo/</link>
		<comments>http://findabilityblog.se/analytics-and-bigdata-at-ibm-information-on-demand-2011/#comments</comments>
		<pubDate>Tue, 20 Dec 2011 15:03:01 +0000</pubDate>
		<dc:creator>Leonard Saers</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[BigData]]></category>
		<category><![CDATA[IBM Content Analytics]]></category>
		<category><![CDATA[IBM Information on demand]]></category>
		<category><![CDATA[Watson]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2806</guid>
		<description><![CDATA[The big trend these days are in BigData and how you can analyze large amounts of information in order to gain important insights, and from those insights be able to take the right action. This trend was a hot topic at the IBM Information On Demand (IOD) conference in Las Vegas earlier this year. IBM [...]]]></description>
			<content:encoded><![CDATA[<p>The big trend these days are in BigData and how you can analyze large amounts of information in order to gain important insights, and from those insights be able to take the right action. This trend was a hot topic at the <a href="http://www-01.ibm.com/software/data/2011-conference/">IBM Information On Demand (IOD) conference in Las Vegas</a> earlier this year. IBM has a very strong position in this field, it&#8217;s hard to have missed how their computer system Watson challenged the top players of all time in Jeopardy recently, and won! <a title="IBM Watson" href="http://www-03.ibm.com/innovation/us/watson/index.html">Read more about Watson</a></p>
<p>Now IBM has taken the technology behind Watson and started to apply it in their different analytics products, where one specific area that is being targeted is healthcare. For this area IBM released a new product during IOD called <a href="http://www-01.ibm.com/software/ecm/content-analytics/predictive/healthcare.html">IBM Content and Predictive Analytics for Healthcare</a>, which can for example be used as a tool for physicians to support them in their diagnosis of patients.</p>
<p>In April this year IBM merged two of their products, their search engine OmniFind and their product for analyzing large amounts of unstructured information, Content Analytics. The new product is called <a title="IBM Content Analytics with Enterprice Search" href="http://www-01.ibm.com/software/ecm/content-analytics/bundle.html">IBM Content analytics with Enterprise search</a> and it too is based on much of the same technology that is used in Watson, more specifically it utilizes the same Natural Language Processing techniques. This means that it has the ability to understand text on a level just as sophisticated as that of Watson.</p>
<p>Content Analytics with enterprise search scales very well to many millions of documents. However, when there is a need for analyzing really enormous data sets, in the magnitude of petabytes or even exabytes, IBM has developed what they call their <a title="IBM Big Data Platform" href="http://www-01.ibm.com/software/data/bigdata/">BigData platform</a>. This platform mainly revolves around two products, InfoSphere Streams and InfoSphere BigInsights, and it builds on a foundation of open source software, such as Apache Hadoop and Apache Lucene. InfoSphere Streams is used for real time analysis of information in motion. This helps you understand what&#8217;s happening right at this moment in your organization and supports you in taking appropriate action as things are happening. InfoSphere BigInsights on the other hand lets you analyze and draw insight from massive amounts of already existing data.</p>
<p>Studies have shown how organizations that fall short in this area are overtaken by those who understand how to use the power of analytics.</p>
<p>IBM has surely chosen an interesting path when merging Analytics with Findability.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/Ic-gWY0KDoo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/analytics-and-bigdata-at-ibm-information-on-demand-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/analytics-and-bigdata-at-ibm-information-on-demand-2011/</feedburner:origLink></item>
		<item>
		<title>Inspiration from the Enterprise Search Europe conference</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/EuzbhRxu6rQ/</link>
		<comments>http://findabilityblog.se/inspiration-from-the-enterprise-search-europe-conference/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 11:06:15 +0000</pubDate>
		<dc:creator>Christian Ubbesen</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Market trends]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2792</guid>
		<description><![CDATA[A couple of weeks ago, me and some of my colleagues attended the Enterprise Search Europe conference in London. We&#8217;re very grateful to the organizer Martin White at IntranetFocus for arranging the event, and having us as one of the gold sponsors. For me it was the first time in years I attended a conference like [...]]]></description>
			<content:encoded><![CDATA[<p>A couple of weeks ago, me and some of my colleagues attended the Enterprise Search Europe conference in London. We&#8217;re very grateful to the organizer Martin White at IntranetFocus for arranging the event, and having us as one of the gold sponsors.</p>
<p>For me it was the first time in years I attended a conference like this, and while it was “same old, same old” for many of the attendees, for me it was enlightening to meet up with the industry and have a discussion on where we are as an industry.</p>
<p>There were mainly software vendors and professional services/consultants there, as well a few customers or actual users of enterprise search… and I think the consensus of the two days were that we in the industry STILL haven’t really figured out what we should do with the enterprise search concept, and how to make it valuable for our customers. We at Findwise are not alone with this challenge, but rather it is an industry challenge. There are some vendors who seem to be doing some good work of delivering real value to customers, and also there are a few colleagues to us in the industry that do good professional services/consultant work. At first it was a bit of a downer to realize that we haven&#8217;t progressed more during the 10 years I&#8217;ve been in the business, but at the same time it was very inspirational to see that we at Findwise together with a few other players, seem to be on the right track with our hard work, and that we have the position to solve some of the real industry challenges we&#8217;re facing.</p>
<p>As I see it, if we gather our forces and make a focused “push forward” together now, we will be able to take the industry to a new maturity level where we better solve real business challenges with enterprise search (or search-driven Findability solutions, as we like to call them).</p>
<p>My simple analysis of all the discussions at the conference is that we need to do two things:</p>
<ol>
<li>Manage the whole “full picture” of enterprise search – from strategy to organizational governance, involving necessary competencies to cover all aspects of a successful Findability solution.</li>
<li>Break down the customer challenge into manageable chunks, and solve actual business problems, not just solving the traditional “finding stuff when needed” challenge.</li>
</ol>
<p>I think we are on the right track, and it&#8217;s going to be a very interesting journey from here on!</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/EuzbhRxu6rQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/inspiration-from-the-enterprise-search-europe-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/inspiration-from-the-enterprise-search-europe-conference/</feedburner:origLink></item>
		<item>
		<title>Content choreography?</title>
		<link>http://feedproxy.google.com/~r/TheFindabilityBlog/~3/ywzLN660HI8/</link>
		<comments>http://findabilityblog.se/content-choreography/#comments</comments>
		<pubDate>Thu, 27 Oct 2011 16:02:21 +0000</pubDate>
		<dc:creator>Christopher Wallstrom</dc:creator>
				<category><![CDATA[Information Architecture]]></category>
		<category><![CDATA[Information management]]></category>
		<category><![CDATA[Strategy]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2783</guid>
		<description><![CDATA[Is getting the right content to the right users and customers a priority for you and your organisation? Do you drown in too much information? With some insight into how to manage content your answer is probably &#8220;Yes!&#8221;. Today we have loads of channels to choose from, e-mails, internet/intranets, Yammer feeds, blogs and different collaboration [...]]]></description>
			<content:encoded><![CDATA[<p>Is getting the right content to the right users and customers a priority for you and your organisation? Do you drown in too much information? With some insight into how to manage content your answer is probably &#8220;Yes!&#8221;.</p>
<p>Today we have loads of channels to choose from, e-mails, internet/intranets, Yammer feeds, blogs and different collaboration platforms and social media services. Some content is more beneficial in one channel and other content in another channel. But how do you make sure the right information reaches the right users, in the right channels?</p>
<p>Content Choreography aims to handle all that; Content, strategy, format and delivery.</p>
<p>We need to tailor the user/customer experience in order to achieve good Findability. How? Taxonomy, Metadata and Search!<br />
Taxonomy to ensure that we speak the same language, metadata to classify the content to fulfill a certain task or objective and search to deliver it to the right channel.</p>
<p>Need more information about Content Choreography?<br />
Join us in our joint seminar with KnowIT, Nov 22nd: <a href="http://www.knowit.se/en/Kampanjsidor/contentchoreography-en/" target="_blank">Future Choreography of Content Management</a>, where Seth Earley &#8211; CEO at <a href="http://www.earley.com/" target="_blank">Early Associates</a> will speak about Content Choreography &#8211; The Art of Dynamic Web Content. Seth Earley have more than 20 years experience in the field and is a very eloquent and interesting speaker. He will share his thoughts and ideas gathered from a number of large customers worldwide.</p>
<p>More information and registration can be found <a href="http://www.knowit.se/en/Kampanjsidor/contentchoreography-en/" target="_blank">here</a>.</p>
<img src="http://feeds.feedburner.com/~r/TheFindabilityBlog/~4/ywzLN660HI8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://findabilityblog.se/content-choreography/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://findabilityblog.se/content-choreography/</feedburner:origLink></item>
	</channel>
</rss>

