<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Text Analysis Blog</title>
	
	<link>http://blog.saplo.com</link>
	<description>Keep track of what is happening around Saplo by following our blog</description>
	<lastBuildDate>Thu, 02 Feb 2012 15:17:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/Saplo" /><feedburner:info uri="saplo" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Meet Saplo at Semantic Conference in Berlin</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/WhqVBxALyV8/</link>
		<comments>http://blog.saplo.com/meet-saplo-at-semantic-conference-in-berlin/#comments</comments>
		<pubDate>Thu, 02 Feb 2012 15:14:04 +0000</pubDate>
		<dc:creator>Mattias Tyrberg</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[predictive text analytics]]></category>
		<category><![CDATA[SemTech]]></category>
		<category><![CDATA[SemTech Berlin]]></category>
		<category><![CDATA[text analytics]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1889</guid>
		<description><![CDATA[Next week you will be able to meet Saplo at the Semantic Tech &#38; Business Conference in Berlin. Saplo will have a session and talk about predictive text analytics. If you are interested in knowing more please join our session on Tuesday (you can find tickets here). We will be in Berlin between Sunday to Tuesday, if you ... <br /><br /><a href="http://blog.saplo.com/meet-saplo-at-semantic-conference-in-berlin/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p><strong>Next week you will be able to meet Saplo at the <a href="http://semtechbizberlin2012.semanticweb.com/">Semantic Tech &amp; Business Conference</a> in Berlin.</strong></p>
<p><img class="wp-image-1892 alignleft" title="Semantic Conference with Saplo" src="http://blog.saplo.com/wp-content/uploads/2012/02/Semantic-Conference-with-Saplo.jpg" alt="" width="614" height="408" /></p>
<p>Saplo will have <a title="Saplo session at SemTech Berlin" href="http://semtechbizberlin2012.semanticweb.com/sessionPop.cfm?confid=66&amp;proposalid=4418">a session</a> and talk about predictive text analytics. If you are interested in knowing more please join our session on Tuesday (you can find tickets <a href="http://semtechbizberlin2012.semanticweb.com/reg.cfm">here</a>). We will be in Berlin between Sunday to Tuesday, if you are interested in meeting us at the conference, or at another place in Berlin, please <a title="Contact Saplo" href="http://saplo.com/contact">let us know</a>.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><small>Picture from <a href="http://www.flickr.com/photos/officialgdc">officialgdc</a></small></p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/WhqVBxALyV8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/meet-saplo-at-semantic-conference-in-berlin/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/meet-saplo-at-semantic-conference-in-berlin/</feedburner:origLink></item>
		<item>
		<title>Saplo is a Red Herring Global Finalist</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/eypAREYO3M4/</link>
		<comments>http://blog.saplo.com/red-herring-global-finalist/#comments</comments>
		<pubDate>Mon, 21 Nov 2011 16:09:27 +0000</pubDate>
		<dc:creator>Mattias Tyrberg</dc:creator>
				<category><![CDATA[Awards]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Red Herring]]></category>
		<category><![CDATA[Red Herring 100]]></category>
		<category><![CDATA[Red Herring Global]]></category>
		<category><![CDATA[Saplo]]></category>
		<category><![CDATA[text analysis technology]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1873</guid>
		<description><![CDATA[Today Red Herring announced that Saplo is a Finalist in the famous Red Herring 100 Global Award. Saplo was one of the winners of Red Herring Europe earlier this year and in competition with thousands of companies from all around the world Saplo was picked as one of the best companies world wide. Saplo’s text ... <br /><br /><a href="http://blog.saplo.com/red-herring-global-finalist/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p><strong>Today Red Herring announced that Saplo is a Finalist in the famous Red Herring 100 Global Award. Saplo was one of the winners of Red Herring Europe earlier this year and in competition with thousands of companies from all around the world Saplo was picked as one of the best companies world wide.</strong></p>
<div id="attachment_1874" class="wp-caption aligncenter" style="width: 310px"><img class="size-medium wp-image-1874" title="Red Herring 2011 Global Finalist - Saplo" src="http://blog.saplo.com/wp-content/uploads/2011/11/Red-Herring-2011-Global-Finalist-300x300.jpg" alt="Saplo a Red Herring 2011 Global Finalist" width="300" height="300" /><p class="wp-caption-text">Saplo is a Red Herring 2011 Global Finalist</p></div>
<p>Saplo’s <a href="http://saplo.com/products">text analysis technology</a> have been <a title="Awards" href="http://blog.saplo.com/category/awards/">awarded before</a> but this is something extra since Red Herring is a global competition finding the best and most promising technology companies in the world from thousands of companies. Back in May this year Saplo was <a href="http://blog.saplo.com/saplo-is-a-red-herring-europe-finalist/">awarded the Red Herring Europe Award</a> and thanks to that and the progress during the year Saplo is now been been honored as one of the Global Finalists.</p>
<p>The final award and the Top 100 winners will be announced at a special awards ceremony on December 7. So keep your fingers crossed =)</p>
<p><strong>About the Red Herring Award</strong><br />
The Red Herring editorial team selected the 2011 Finalists as the most innovative companies from a pool of thousands of companies from over 40 nations. These companies, representing past Red Herring Asia, Europe and North America winners.</p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/eypAREYO3M4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/red-herring-global-finalist/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/red-herring-global-finalist/</feedburner:origLink></item>
		<item>
		<title>Saplo’s Text Analytics on World Tour</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/-3SsD2-rCNk/</link>
		<comments>http://blog.saplo.com/saplo%e2%80%99s-text-analytics-on-world-tour/#comments</comments>
		<pubDate>Fri, 18 Nov 2011 08:38:57 +0000</pubDate>
		<dc:creator>Mattias Tyrberg</dc:creator>
				<category><![CDATA[Awards]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Asia]]></category>
		<category><![CDATA[Brasil]]></category>
		<category><![CDATA[Canada]]></category>
		<category><![CDATA[Cracking the Code of Innovation]]></category>
		<category><![CDATA[Europe]]></category>
		<category><![CDATA[Future of Mobile]]></category>
		<category><![CDATA[Innovate Sweden]]></category>
		<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Rio de Janeiro]]></category>
		<category><![CDATA[São Paulo]]></category>
		<category><![CDATA[Saplo]]></category>
		<category><![CDATA[SI]]></category>
		<category><![CDATA[Silicon Valley]]></category>
		<category><![CDATA[Stanford]]></category>
		<category><![CDATA[Swedish Institute]]></category>
		<category><![CDATA[text analysis]]></category>
		<category><![CDATA[text analytics]]></category>
		<category><![CDATA[text analytics world tour]]></category>
		<category><![CDATA[Toronto]]></category>
		<category><![CDATA[US]]></category>
		<category><![CDATA[Washington]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1858</guid>
		<description><![CDATA[Earlier this year Saplo was chosen as one of the top 20 innovations from Sweden for our text analytics technology. Now all 20 innovations are on a tour called Innovative Sweden and the exhibition will visit cities from all around the world. The first stop was California. The Innovative Sweden Exhibition shows the best innovations ... <br /><br /><a href="http://blog.saplo.com/saplo%e2%80%99s-text-analytics-on-world-tour/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p><strong>Earlier this year Saplo was chosen as one of the <a title="Saplo awarded top 20 innovative company in Sweden" href="http://blog.saplo.com/saplo-awarded-as-one-of-the-most-innovative-companies-in-sweden/">top 20 innovations</a> from Sweden for our <a title="text analytics technology" href="http://saplo.com/about/technology">text analytics technology</a>. Now all 20 innovations are on a tour called Innovative Sweden and the exhibition will visit cities from all around the world. The first stop was California.</strong></p>
<p><img class="aligncenter size-medium wp-image-1866" title="Text Analytics from Saplo.com - Sentiment Analysis" src="http://blog.saplo.com/wp-content/uploads/2011/11/Text-Analytics-from-Saplo.com-Sentiment-Analysis-276x300.jpg" alt="Text Analytics from Saplo.com at Innovate Sweden Stanford" width="276" height="300" /></p>
<p>The <a href="http://projects.si.se/innovative-sweden/">Innovative Sweden Exhibition</a> shows the best innovations from Sweden and the first stop was Stanford in the beginning of November. Swedish Institute held a one week program with different theme days such as Future of Mobile and Cracking the Code of Innovation. As one of the 20 selected companies Saplo and our text analytics innovation was a popular visit for hundreds of the top influencers from Silicon Valley that took the chance to visit the event.</p>
<p>Saplo will be a part of the full tour that will continue for several years and visit cities in all continents. Next stop will be Toronto in January 2012 (January 23 to February 12) and then we will visit Washington in March (March 1 – April 30). Later next year the exhibition will visit Brasil,  Rio de Janeiro in May (May 21 – June 24) and São Paulo in July (July 9 – middle of August). The tour will later also visit both Asia and Europe.</p>
<p>For latest information regarding program and not yet released cities please visit <a href="http://projects.si.se/innovative-sweden/">Swedish Institute’s website</a>. If you are in one of the cities or interested to have the exhibition in your city please contact Swedish Institute or <a href="http://saplo.com/contact">write us an email</a>.</p>
<p>Hope to see you on the text analytics world tour.</p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/-3SsD2-rCNk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/saplo%e2%80%99s-text-analytics-on-world-tour/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/saplo%e2%80%99s-text-analytics-on-world-tour/</feedburner:origLink></item>
		<item>
		<title>Saplo Text Analysis API Integration for CMS</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/_oqCo3r57Ow/</link>
		<comments>http://blog.saplo.com/saplo-api-integration-for-cms/#comments</comments>
		<pubDate>Fri, 28 Oct 2011 10:26:33 +0000</pubDate>
		<dc:creator>Fredrik Hörte</dc:creator>
				<category><![CDATA[API]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[CMS]]></category>
		<category><![CDATA[integration]]></category>
		<category><![CDATA[text analysis]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1837</guid>
		<description><![CDATA[When using Saplo Text Analysis API in a larger system e.g. a content management system which is used daily in production their are some “best-practises” to consider when planning and developing against Saplo API. Use this guide as a complement to the Saplo API documentation. This guide will be maintained on our Saplo Developer topic ... <br /><br /><a href="http://blog.saplo.com/saplo-api-integration-for-cms/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p><strong>When using <a href="http://saplo.com/api">Saplo Text Analysis API</a> in a larger system e.g. a content management system which is used daily in production their are some “best-practises” to consider when planning and developing against Saplo API. Use this guide as a complement to the <a href="http://developer.saplo.com">Saplo API documentation</a>.</strong><br />
<img src="http://blog.saplo.com/wp-content/uploads/2011/10/cache_results_on_client_side.png" alt="" title="Picture of how to cache results on client side" width="500" height="375" class="alignnone size-full wp-image-1847" /><br />
<em>This guide will be maintained on our <a href="http://developer.saplo.com/topics">Saplo Developer topic site</a>.</em></p>
<h2>Cache results on client-side</h2>
<p>When a  result has been processed and returned from Saplo API (e.g. related texts or tags), the result needs to be stored on the client side. Don’t connect and make a request to Saplo API everytime you are about to present a result for a user or on a website. Fetch the result from your local storage server.</p>
<p>Why? Otherwise you will run out of API calls and the response time is much faster.</p>
<h2>Store all data</h2>
<p>Always store everything you get back from Saplo API. For example when fetching tags for a text always store all tags regardless relevance value and store all information you get from the API about the tag, e.g. relevance and category.</p>
<p>Why? You never know what you want to do with your data in the future. Perhaps you want to search for tags that are related to each other, create trending tag lists, order by relevance or category, show 5 related texts one week and 20 another week.</p>
<h2>Store data in its own data structure</h2>
<p>Always try to have a separate data structure (persistent structure) for you meta data with relations to your texts. You would also want to have your meta data indexed and searchable.</p>
<p>Why? If the meta data is stored in separate places (e.g. XML-files, or together with each text) it is much harder to query the meta data. Example cases when it’s better to store all data in its own data structure:</p>
<ul>
<li>You want to be able to find related and common tags by searching through (count) all tags.</li>
<li>You want to update all tags from one certain name to another.</li>
<li>You want to get the trending tags for a time period.</li>
<li>You want to traverse through your related texts.</li>
</ul>
<h2>Blacklist</h2>
<p>If you are using <a href="http://saplo.com/products/tags">Saplo Tags</a> create a blacklist system that works on article level.</p>
<p>Why? There might be some tags you don’t want to show for a user and these can be different for each text. It’s kind of like an hide/show functionality. Still you might want to be able to index them in a search index.</p>
<h2>Handle manual changes</h2>
<p>If an editor have changed the results, e.g. added, deleted, edited tags or related texts you have to make sure you handle these cases. For instance re-tagging needs to be forced manually.</p>
<p>Why? If an editor deletes a tag from the result  and the text is sent to be tagged again the tag will re-appear which would be really annoying for the editor.</p>
<h2>Indicator if text has “auto” result or if it has been approved manually</h2>
<p>Use some kind of indicator so you know if a result has been reviewed or not.</p>
<p>Why? Some companies don’t want to push non-reviewed results to the GUI, though they want to have the result searchable and indexed automatically. This indicator can prevent a push to the GUI.</p>
<h2>Multiple languages</h2>
<p>Saplo API supports English and Swedish. You would need to create two collections (one Swedish and one English) and use one of them as default. If a text is not written in the default language you will not be able to add it to that collection. Than you can add it to the other collection and get results. If it can’t be added to any of the collections it has to be discarded.</p>
<h2>Batch job (cron job) and Response Time</h2>
<p>Depending on text length and method the response time can various and take longer than expected. Usually a response (tagging, related texts) takes a couple of seconds 2-5 seconds. You would want to keep track of texts that have not yet received results so you can fetch these later.</p>
<h2>Send Feedback to Saplo API</h2>
<p>Saplo is built using machine learning. This means that we become better the more users and the more feedback the system gets. For example if a tag is wrong we want to know it. Prepare your system so it is easy to send feedback to Saplo API when a user corrects something.</p>
<h2>Flexibility for different users</h2>
<p>What works for one user/site don’t have to be the correct setup and settings for another. Create a settings file (or admin ui) where the site owner can change properties for their setup. Examples of properties that might be important:</p>
<ul>
<li>Be able to change default input parameters to different API methods (e.g. change wait time, change limits, thresholds etc.)</li>
<li>How tags should be presented; alphabetically, by relevance, by category etc.</li>
<li>How related texts should be presented; by relevance, by date, how many etc.</li>
<li>If “auto” results can be pushed to GUI instantly.</li>
<li>Default collections to use in Saplo API</li>
<li>API- and Secret keys</li>
</ul>
<p><img src="http://blog.saplo.com/wp-content/uploads/2011/10/integration_overview.png" alt="" title="Overview of CMS Integration with Saplo Text Analysis API " width="500" height="375" class="alignnone size-full wp-image-1851" /></p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/_oqCo3r57Ow" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/saplo-api-integration-for-cms/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/saplo-api-integration-for-cms/</feedburner:origLink></item>
		<item>
		<title>No use getting sentimental? Think again!</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/VWAQlQ3Iji0/</link>
		<comments>http://blog.saplo.com/no-use-getting-sentimental-think-again/#comments</comments>
		<pubDate>Thu, 13 Oct 2011 12:28:11 +0000</pubDate>
		<dc:creator>Bengt Holm</dc:creator>
				<category><![CDATA[Development]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1827</guid>
		<description><![CDATA[The last few months, following summer, I have had a strong hunch that the prospects of sentiment analysis in general look very bright. Why? Because the marriage between semantic methods in linguistics and scientific methods from statistics and computer science is very timely and is starting to pay dividends. With social media becoming more and ... <br /><br /><a href="http://blog.saplo.com/no-use-getting-sentimental-think-again/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p>The  last few months, following summer, I have had a strong hunch that the  prospects of sentiment analysis in general look very bright. Why?  Because the marriage between semantic methods in linguistics and  scientific methods from statistics and computer science is very timely  and is starting to pay dividends.</p>
<p>With  social media becoming more and more an integrated part of people&#8217;s  lives and awareness, the question is no longer how to obtain information  in text, but how to select the relevant parts of it. The information  itself is out there readily available.</p>
<p>I  have always felt that the core of our prediction method is to extract  some entity or number connected to the mood or sentiment in texts and  process it according to the needs of a certain issue or costumer.</p>
<p>One  could compare the vision (haha) of text analysis to the fitting of a  pair of glasses. Each eye needs an individually tailored lens in order  to get a perfect impression of the outside world. If the lens is too  weak or strong, this impression will be blurred to a smaller or greater  extent. But there exists this optimal fitting that permits the viewer to  experience a razor sharp picture of the surrounding environment.</p>
<p><img class="aligncenter size-full wp-image-1828" title="Vision Of Eyechart With Glasses" src="http://blog.saplo.com/wp-content/uploads/2011/10/vision-of-text-analysis.jpg" alt="Vision Of Eyechart With Glasses" width="500" height="372" /></p>
<p>The  role of a practitioner of our trade is that of the optician&#8217;s. In an  environment with an abundance of text available, information can easily  be blurred by the sheer volumes of sources available. With our  technology there is an emerge of tools to obtain relevant information as  well as performing a ”read between the lines” analysis. And, as in the  case with the glasses, what relevant information constitutes is highly  individual and any tool provided by us must therefore contain the  possibility to be fine tuned according to the needs of the user.</p>
<p>Enough  said about visions. Lately we have been investigating the use of text  analysis in risk management. Some exciting results have emerged both in  terms of the interpretation of events but more importantly in the  predictive power in foreseeing future changes in entities that have text written about them.</p>
<p>Recently,  we have been using our sentiment analysis with other techniques in  order to predict future credit ratings of companies so that now not only  pure economical data is used for such an analysis, but also the  information contained in the text in numerous articles about that very  company.</p>
<p>The  results have been very encouraging, displaying a connection between the  average sentiment in articles and actual subsequent credit rating. A  perfect example of our aim to link information contained in writing to a  real quantity. We hope to see many opportunities of practical  importance in the wake of these findings in the near future.</p>
<p>&nbsp;</p>
<p><em>Picture: <a title="Senior Living" href="http://www.seniorliving.org/">http://www.seniorliving.org/</a></em></p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/VWAQlQ3Iji0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/no-use-getting-sentimental-think-again/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/no-use-getting-sentimental-think-again/</feedburner:origLink></item>
		<item>
		<title>When machine learning techniques scale</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/zCFiQh2Gjyk/</link>
		<comments>http://blog.saplo.com/when-machine-learning-techniques-scale/#comments</comments>
		<pubDate>Wed, 05 Oct 2011 10:57:21 +0000</pubDate>
		<dc:creator>Anders Hall</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[language independent]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[prediction]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[scale]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1822</guid>
		<description><![CDATA[As a startup in 2008, we found ourselves in a situation were we had the outlines for a heavy technology at our hands though hardly the perfect palette of competences, or the needed breadth of skills, to cover every programming aspect ranging from web to low level algorithm development (math). In our tiny company, we ... <br /><br /><a href="http://blog.saplo.com/when-machine-learning-techniques-scale/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p>As a startup in 2008, we found ourselves in a situation were we had the outlines for a heavy technology at our hands though hardly the perfect palette of competences, or the needed breadth of skills, to cover every programming aspect ranging from web to low level algorithm development (math).</p>
<p>In our tiny company, we were then forced to make many hard decision, often after heated debates, that seen in retrospect were followed through, despite hardships. What I find most interesting, is that many decisions we made are only recently starting to to pay off.</p>
<p>Notably long term decisions were;</p>
<ul>
<li>to commit to a development process that is based on existing or homegrown machine learning techniques</li>
<li>to build software that is language independent</li>
<li>to automate everything</li>
<li>to use all technologies we could get our hands on, as long as they had elements of prediction technology in them (were scalable in our opinion at the time).</li>
<li>to build a scalable technology platform that can spawn many types of easy accessible technologies or high level services</li>
<li>calculations and systems must scale (large amounts of text == more hidden information)</li>
<li>focus on text analysis (e.g., sound analysis was briefly on the table)</li>
</ul>
<p>Somewhat later, but nonetheless important, we decided to;</p>
<ul>
<li> build a user friendly and scalable API</li>
</ul>
<p>The task of building a fully language dependent technology platform is a painstakingly long effort, where hundreds of assorted system and coding projects needs to be completed. Thus, it is not entirely easy to defend that we went down that path, considered we primarily were active in the Swedish market the first two and half years. Today, we see growing demand and interest in language independent technology. It is on the other hand easy to defend the long term work on scalable systems, as we now have the capability to provide stable services to larger organizations and can perform large scale analysis of text.</p>
<p>However, the one effort that consistently have kept our head above the water, in a treacherous sea of marvelous competitors, is the constant strive to automate processes and technologies. The automation of system tests, deployment processes and system administration is to a great degree, but not entirely, based on knowledge from the ICT community. Thanks! We are not the first company to build this type of large scale systems with API:s. Still, we have less resources than many competitors and manage to keep good pace with other similar efforts &#8211; while being active in more text analysis markets than may seem healthy. Why?</p>
<p>My firm belief is that the decision to commit to predictive technologies, rather than building programs designed to solve a specific problem, is one major reason why we still are competitive. Machine learning (or genetic programming) is by no means a silver bullet. It requires (perhaps less today) good theoretical knowledge of prediction methods (e.g., we get many ideas from the field of psychology) and how to combine them, has a rather high learning curve and requires many non-standard system solutions to work well and scale performance-wise.</p>
<p>The main benefit is that machine learning scales well in terms of how many developers a technology or service require. Take the case of our tagging service, which we built from scratch at Saplo. Excluded below are all API, GUI, support and market efforts required to launch a service/product, which at least triple all numbers:</p>
<ul>
<li> (2 p team) firstly, we built a technology that could predict the category of entities in Swedish text</li>
<li>(2 p team) secondly, we built a technology that could predict entities in Swedish text</li>
<li>(0.5 p team) thirdly, we shaved of a few aspects that were not fully language independent and ran the same code on English text.</li>
<li>(1 p team) fourthly, we refined the above techniques to a second generation tagging technology.</li>
<li>(0.2 p team) to scale to other languages (markets), or support the code base, we now don&#8217;t need much basic code work.</li>
</ul>
<p>A third generation of this service will probably be merged with technology from the other 3 (soon 4) technologies we provide with a small team of dedicated coders.</p>
<p>Traditional rule based coding can also be implemented in smart ways, still in order to scale vertically in terms of languages, text formats, types of predictive tagging tasks, etc the code base will likely contain many special rules and is probably maintained by a larger team. The code will, in my opinion, also be harder to adopt to new complex problems and to combine with other technologies.</p>
<p>A second problem we solved, by using machine learning, is that often we found that we were entirely wrong in our assumptions of what was the best solution for a specific problems. Today, we to a great degree let the technology find the optimal solution to any given problem by rapidly implementing ideas (models) and testing them. The test effort is seldom considered lost time, even if we prove a model has poor standard. For instance, when analyzing sentiment predictions we can see that in some cases a particular technology is significant while in others it is not. It depends entirely on the purpose of the analysis task designated to our API by the end-user (or customer).</p>
<p>A third, perhaps hidden benefit, is that we let the technology predict/decide even minor options for running a service. This greatly reduces the need for complex decisions by the end user (e.g. when API methods are designed).</p>
<p>On an ending note. Yesterday, when watching an documentary about IBM:s Watson I was stricken by similarities between their and our team efforts, despite the apparent size difference between our projects. I’m not indicating we are their equals. What I interpreted as similarities was; The progress of technology based on machine learning can initially be slow &#8211; in the long term (in their case the 4:th year) it will accelerate beyond anything rule based programming can provide. In conclusion, I argue that machine learning greatly accelerates the development process in the long term.</p>
<p>Cheers,<br />
Anders Hall,<br />
coder</p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/zCFiQh2Gjyk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/when-machine-learning-techniques-scale/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/when-machine-learning-techniques-scale/</feedburner:origLink></item>
		<item>
		<title>Saplo in the finals of a… golf tournament?</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/3YnMZTIEGM4/</link>
		<comments>http://blog.saplo.com/saplo-in-the-finals-of-a-golf-tournament/#comments</comments>
		<pubDate>Tue, 06 Sep 2011 13:45:54 +0000</pubDate>
		<dc:creator>Joakim Stenberg</dc:creator>
				<category><![CDATA[Awards]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Barcelona]]></category>
		<category><![CDATA[Företagsgolfen]]></category>
		<category><![CDATA[Golf]]></category>
		<category><![CDATA[Nya Affärer]]></category>
		<category><![CDATA[PGA Catalunya]]></category>
		<category><![CDATA[PGA of Sweden National]]></category>
		<category><![CDATA[Sand Golf Club]]></category>
		<category><![CDATA[Spain]]></category>
		<category><![CDATA[Stora Entreprenörsgolfen 2011]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1816</guid>
		<description><![CDATA[Yes, you heard right, there are developers out there who know one or two things about golf. But what’s the odds that two of these very odd developers would be located under the same roof&#8230; Fredrik Hörte and Joakim Stenberg has taken Team Saplo all the way to the finals of the prestigious Swedish Business ... <br /><br /><a href="http://blog.saplo.com/saplo-in-the-finals-of-a-golf-tournament/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p>Yes, you heard right, there are developers out there who know one or two things about golf. But what’s the odds that two of these very odd developers would be located under the same roof&#8230;</p>
<p>Fredrik Hörte and Joakim Stenberg has taken Team Saplo all the way to the finals of the prestigious Swedish Business Golf Tournament. The finals will be played in mid October at <a href="http://www.pgacatalunya.com" title="PGA Catalunya Resort">PGA Catalunya Resort</a>, Barcelona, Spain.</p>
<p><img src="http://blog.saplo.com/wp-content/uploads/2011/09/golf.jpg" alt="Golf Course" title="Golf Course" width="500" height="375" class="aligncenter size-full wp-image-1818" /><br />
<small>Disclaimer: The image is not from any of the golf courses mentioned in the post, but from a course in France.</small></p>
<p>The tournament is played as a <a href="http://en.wikipedia.org/wiki/Foursome_%28golf%29" title="Foursome Golf">foursome</a>, where players compete in teams of two, taking alternate shots. Team Saplo entered the tournament August 16 at <a href="http://www.pgaswedennational.se/en" title="PGA of Sweden National">PGA of Sweden National</a>, together with another 100 teams.<br />
Solid drives and amazing short game got us a third place. Luckily, since Saplo was one of the entrepreneur teams invited by <a href="http://nyaaffarer.se" title="Nya Affärer">Nya Affärer</a>, that was enough to get us to the semi-finals.</p>
<p>The first semi-finals and the one Saplo was entering took place September 2 at <a href="http://www.sandgolfclub.com" title="Sand Golf Club">Sand Golf Club</a> together with over 60 other qualified teams from south Sweden. Team Saplo played better golf then ever before and mixed with a good portion of luck we managed amazing 7 strokes under par net after nine holes!</p>
<p>Unfortunately that made us realize that we had a great chance of winning the whole thing and suddenly nerves was a prominent part of the game. For another five holes we managed to keep our great score intact, then it all came crashing down. The last four holes we went from 6 under par net to even. A disaster!</p>
<p>Again, Team Saplo finished third, just missing the finals in Barcelona! We were pretty happy anyway, we had played great for most part of the round and got rewarded with some really great bags from Victorinox. Add to that playing two of the best golf courses in Sweden, PGA of Sweden National and Sand Golf Club, and we were all but sad.</p>
<p>Then September 6 during lunch Fredrik from Team Saplo gets a phone call. Apparently the best entrepreneur team from each of the semi-finals get a spot in the finals at PGA Catalunya in Barcelona!<br />
So, of more then 1300 teams, Team Saplo will be one of eight going to Spain in October! Isn’t that a treat!</p>
<p>Thanks a lot to all of you that make the tournament such a great event! For those of you who understand Swedish, you can read more about the tournament at <a href="http://www.foretagsgolfen.com" title="Företagsgolfen">http://www.foretagsgolfen.com</a></p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/3YnMZTIEGM4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/saplo-in-the-finals-of-a-golf-tournament/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/saplo-in-the-finals-of-a-golf-tournament/</feedburner:origLink></item>
		<item>
		<title>6 Reasons Why Start-ups Should Use Cloud Services Like Amazon Web Services (AWS)</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/kSTxAjfkLL4/</link>
		<comments>http://blog.saplo.com/6-reasons-why-start-ups-should-use-cloud-services-like-amazon-web-services-aws/#comments</comments>
		<pubDate>Tue, 30 Aug 2011 12:49:18 +0000</pubDate>
		<dc:creator>Ikrom Hotamov</dc:creator>
				<category><![CDATA[API]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[CloudWatch]]></category>
		<category><![CDATA[Elastic Block Storage]]></category>
		<category><![CDATA[Elastic Cloud Computing]]></category>
		<category><![CDATA[Elastic Load Balancer]]></category>
		<category><![CDATA[Route 53]]></category>
		<category><![CDATA[Simple Storage Service]]></category>
		<category><![CDATA[Start-Up]]></category>
		<category><![CDATA[start-ups]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1793</guid>
		<description><![CDATA[Amazon is a very convenient way for start-up companies to set their businesses up in a short amount of time. Especially with the variety of services it offers to easily scale out your business. We (Saplo) is one of such companies, taking advantage of Amazon&#8217;s Web Services and APIs here are our top six reasons ... <br /><br /><a href="http://blog.saplo.com/6-reasons-why-start-ups-should-use-cloud-services-like-amazon-web-services-aws/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p><strong>Amazon is a very convenient way for start-up companies to set their businesses up in a short amount of time. Especially with the variety of services it offers to easily scale out your business. We (Saplo) is one of such companies, taking advantage of Amazon&#8217;s Web Services and APIs here are our top six reasons why start-ups should use the cloud.</strong></p>
<p style="text-align: center;"><strong><img class="aligncenter size-full wp-image-1805" title="Cloud Computing" src="http://blog.saplo.com/wp-content/uploads/2011/08/cloud-3205676821_634f5b27bb_b1.jpg" alt="" width="614" height="462" /><br />
</strong></p>
<div>
<h4 id="internal-source-marker_0.9954989678226411" dir="ltr">1) Elastic Cloud Computing (EC2)</h4>
<p>We use <a href="http://aws.amazon.com/ec2/">EC2</a> in different ways and parts of our system. First off, it&#8217;s a really convenient way for us to run tests.<br />
We have set up and bundled several EC2 test machines. Whenever needed, we can start an EC2 server using those bundles and have our test machines running in several seconds. It costs almost nothing, and you only pay for storing those bundles on Amazon&#8217;s S3, and for having those EC2 servers running during the test, which rounds up to a couple of cents during each test run.</p>
<p>Moreover, EC2 is perfect for scaling out our systems. We have bundles for different Saplo services, like the API, widget, etc, and whenever there is any load on the servers, some magic scripts automatically start up additional work servers. Of course, the perfect way to handle it would be to take use of <a href="http://aws.amazon.com/autoscaling/">Amazon&#8217;s Auto Scaling</a> solutions, but in our case, the load can not really be determined by the machine load. Therefore, we have our custom scripts to manage load levels, and start up/shut down additional instances whenever necessary.<br />
The only downside of using S3 bundled servers for this is that, if we change or update the servers, we need to re-bundle and re-store them on S3. Although the process of re-bundling is also automatized, it requires a little manual work, just to make sure everything goes as desired. In the future, we are looking forward to moving towards a &#8220;puppetized&#8221; solution, using <a href="http://www.puppetlabs.com/">Puppet-Lab&#8217;s</a> awesome framework to push changes directly into the servers, and automatically setting up any dummy Amazon EC2 instance whenever we need it.</p>
<h4 dir="ltr">2) Simple Storage Service (S3)</h4>
<p>Amazon has proven its S3 as a reliable service to store whatever data you want on the cloud. S3 also stands behind some other AWS services, like storing your EC2 bundles, EBS instances, snapshots, etc. Other than that, we use S3 for storing various kinds of data files that we want available at any time we need.</p>
<h4 dir="ltr">3) Elastic Block Storage (EBS)</h4>
<p><a href="http://aws.amazon.com/ebs/">Elastic Block Storage</a> is used for more critical parts of our services. We have customized automatic snapshotting scripts for almost all of the EBS instances we run, taking daily/weekly snapshots for availability reasons. Unlike EC2 instances, EBS is more reliable and offers extended functionality, like backups, surviving service interruptions, etc. for a little extra cost :)<br />
There is a really cool snapshotting tool by <a href="https://plus.google.com/111045584683584396225/posts">Eric Hammond</a> called <a href="http://alestic.com/2009/09/ec2-consistent-snapshot">ec2-consistent-snapshot</a>.</p>
<h4 dir="ltr">4) Elastic Load Balancer (ELB)</h4>
<p>In order to easily scale out and down, we take use of Amazon&#8217;s <a href="http://aws.amazon.com/elasticloadbalancing/">Elastic Load Balancers</a>.  All you have to do is register an extra worker server with your load balancer, and it’s transparently added into the group of working servers behind the load balancers and your users won’t even notice a glitch.<br />
Besides, load balancers are a perfect way of handling Amazon availability-zone blackouts. We have our instances spread out onto different availability zones behind our ELBs. Furthermore, it can automatically detect unhealthy instances and route its traffic to other healthy ones, until the damaged server comes back online or replaced.</p>
<h4 dir="ltr">5) Route 53</h4>
<p>After having to struggle with loopia.se and repeating failures on their DNS services, we had to come up with a more reliable solution. And guess what we ended up with! Right, <a href="http://aws.amazon.com/route53/">Amazon Route 53</a>. Once you get familiar with its API, Route 53 is a really easy and simple way of handling DNS routings in the cloud. Plus, the changes you make take effect almost instantly.</p>
<h4 dir="ltr">6) CloudWatch</h4>
<p><a href="http://aws.amazon.com/cloudwatch/">Amazon CloudWatch</a> offers great monitoring options for all the instances within Amazon. However, since we use a hybrid cloud with different machines spread on different providers, we have chosen <a href="https://www.icinga.org/">Icinga</a> open source monitoring service as a central monitoring point for all of our cloud servers. It&#8217;s used for monitoring servers from a single location, sending alert emails and even sms messages, automatically taking critical measures, etc. For instance, we use some custom icinga plugins to determine load on our API servers, and it automatically starts a new instance, or replaces unhealthy instances whenever necessary.</p>
</div>
<p>&nbsp;</p>
<p><em>Picture: <strong id="yui_3_4_0_3_1314625500304_2818"><a href="http://www.flickr.com/photos/23045224@N04/">Athena&#8217;s Pix</a></strong></em></p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/kSTxAjfkLL4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/6-reasons-why-start-ups-should-use-cloud-services-like-amazon-web-services-aws/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/6-reasons-why-start-ups-should-use-cloud-services-like-amazon-web-services-aws/</feedburner:origLink></item>
		<item>
		<title>5 Ways Google+ Can Fight Facebook</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/RGxzu9dDH8k/</link>
		<comments>http://blog.saplo.com/5-ways-google-can-fight-facebook/#comments</comments>
		<pubDate>Wed, 17 Aug 2011 15:11:31 +0000</pubDate>
		<dc:creator>Mattias Tyrberg</dc:creator>
				<category><![CDATA[API]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google plus]]></category>
		<category><![CDATA[related posts]]></category>
		<category><![CDATA[text analysis]]></category>
		<category><![CDATA[text analytics]]></category>
		<category><![CDATA[text filter]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1766</guid>
		<description><![CDATA[Google+ has been out for one and a half month now and usage is growing but is it enough? There are some key features that are missing that would help Google+ to be a better alternative to Facebook. I will not focus on the most obvious ones (such as search, because its coming) but on ... <br /><br /><a href="http://blog.saplo.com/5-ways-google-can-fight-facebook/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p><strong><strong><strong>Google+ has been out for one and a half month now and usage is growing but is it enough? There are some key features that are missing that would help Google+ to be a better alternative to Facebook. I will not focus on the most obvious ones (such as search, because its coming) but on cool and life changing features that really could make Google+ better than Facebook.</strong></strong></strong></p>
<p><strong><strong><strong> </strong></strong></strong><strong>1) Filter posts &#8211; Read only the posts that are interesting for <strong>you<br />
</strong></strong>Reading through the Stream there is a big problem finding the most important and interesting information. Facebook has the “Top News” but in Google+ you only get the most recent news.</p>
<p><img class="aligncenter size-full wp-image-1776" title="Google plus - Filter by interest using text analytics" src="http://blog.saplo.com/wp-content/uploads/2011/08/Google-plus-Filter-by-interest-using-text-analytics.jpg" alt="Filter by Interest - Google+ with text analytics" width="963" height="622" /></p>
<p style="text-align: center;"><em>The picture show an example of how easy the “Filter by Interest” feature could be implemented in the current UI.</em></p>
<p><em> </em>I noticed that my stream get noisy with very few users and I need some type of filter to find the most relevant posts. The solution to this problem is to atomically find the most interesting posts and I want to take this one step further than what Facebook have done. I want to analyse the meaning of the content and based on what it is about show only posts that are interesting for me. By using information such as your Sparks, the posts (and links) you shared and the posts you commented on you can use advanced text analytics to predict the relevance of each post based on the users interests.</p>
<p><strong>2) Automatically Recommend Sparks</strong><br />
The first page you see when you visit Sparks can be more interesting. Instead of getting standard Sparks you should get personal recommendations of Sparks that are interesting for you. As before this can be done using what you have written and commented on plus the sparks you already have and in combination with text analysis this will give you some great recommendations.</p>
<p style="text-align: center;"><img class="aligncenter size-large wp-image-1777" title="Google plus - Recommended Sparks by interest using text analytics" src="http://blog.saplo.com/wp-content/uploads/2011/08/Google-plus-Recommended-Sparks-by-interest-using-text-analytics-1024x677.jpg" alt="Recommended Sparks by interest - Google+ with text analytics" width="1024" height="677" /></p>
<p style="text-align: center;">&nbsp;</p>
<p><strong>Get relevant information on a Spark page<br />
</strong>Instead of just show the posts on the Spark-page there is an opportunity to show so much more relevant information. For instance;</p>
<p><strong>3) Related Persons </strong>to the spark, showing people that often comment or share information in the area. This makes it possible for persons that are interested in an area to find other people to add to circles and to interact with them.</p>
<p><strong>4) Related Sparks</strong>, showing other sparks that are related to this one and makes it easy to find more interesting material.</p>
<p><strong>5) Related Posts</strong>, find other posts that are about the same subject/event so you can find out about what have happened before this article was written.</p>
<p style="text-align: center;"><img class="aligncenter size-large wp-image-1778" title="Google plus - Related Sparks using text analytics" src="http://blog.saplo.com/wp-content/uploads/2011/08/Google-plus-Related-Sparks-using-text-analytics-1024x677.jpg" alt="Related Sparks - Google+ with text analytics" width="1024" height="677" /></p>
<p style="text-align: center;"><em>The picture shows how related persons, related Sparks and related posts can be implemented in Google+.</em></p>
<p><strong>Summary</strong><br />
I think there are a few killer features that could be build very easily that would be able to change Google+’s position.<br />
This are my top five<br />
1) Smart filer of Stream<br />
2) Automatic recommendation of Sparks<br />
3) Related Persons<br />
4) Related Sparks<br />
5) Related Posts<br />
Sure there is also an opportunity for Facebook to implement similar features before Google does it. So who will win, Google+ or Facebook?</p>
<p>&nbsp;</p>
<p><em>Note: The features are not science fiction they can all be implemented using existing <a href="http://saplo.com/products">text analytics technologies</a>.</em></p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/RGxzu9dDH8k" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/5-ways-google-can-fight-facebook/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/5-ways-google-can-fight-facebook/</feedburner:origLink></item>
		<item>
		<title>Drupal Module For Entity Tagging And Related Content</title>
		<link>http://feedproxy.google.com/~r/Saplo/~3/H8i0GVDi1kM/</link>
		<comments>http://blog.saplo.com/drupal-module-for-entity-tagging-and-related-content/#comments</comments>
		<pubDate>Sat, 13 Aug 2011 10:59:36 +0000</pubDate>
		<dc:creator>Oskar Olsson</dc:creator>
				<category><![CDATA[Drupal]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Entity Tagging]]></category>
		<category><![CDATA[Related Content]]></category>

		<guid isPermaLink="false">http://blog.saplo.com/?p=1753</guid>
		<description><![CDATA[We are bringing the semantic capabilities of the Saplo Text Analysis API to Drupal, one of the greatest and most widely used open source Content Management Systems (CMS). The supported languages are Swedish and English, with additional languages coming in the future depending on interest. The module works with Drupal 6.x and PHP 5 or ... <br /><br /><a href="http://blog.saplo.com/drupal-module-for-entity-tagging-and-related-content/">[ Read more ]</a>]]></description>
			<content:encoded><![CDATA[<p>We are bringing the semantic capabilities of the <a href="http://saplo.com/api" title="Saplo Text Analysis API">Saplo Text Analysis API</a> to <strong>Drupal</strong>, one of the greatest and most widely used open source Content Management Systems (CMS).</p>
<p>The supported languages are Swedish and English, with additional languages coming in the future depending on interest.</p>
<p>The module works with <strong>Drupal 6.x</strong> and PHP 5 or later. If you’re interested in seeing a version for <strong>Drupal 7</strong> please <a href="http://saplo.com/contact" title="Get in contact with Saplo">contact us</a>. At this moment the module is being reviewed by the Drupal community, but you’re free to download it using Git from the <a href="http://drupal.org/sandbox/oskarols/1247304" title="Download Saplo Drupal Module for Entity Tagging and Related Content from Git">sandbox page</a>.</p>
<p>The Saplo <strong>Drupal module</strong> supports two of the main features from the Saplo <strong>API</strong>.</p>
<p><img src="http://blog.saplo.com/wp-content/uploads/2011/08/drupal-module-entity-tagging-related-content.png" alt="Drupal module for entity tagging and related content" title="Drupal module for entity tagging and related content" width="500" height="272" class="aligncenter size-full wp-image-1759" /></p>
<h3>Tagging</h3>
<p>Using our module you can &#8211; with the push of a button &#8211; <strong>extract tags</strong> from articles.<br />
The benefits of this versus a manual approach are twofold; first of all it’s very convenient to have the tags magically appear with the click of a button. Second of all you add consistency to the <strong>tagging</strong>, since the Saplo <strong>Text Analysis API</strong> is based on machine learning and will intelligently <em>analyze</em> and <em>extract</em> the tags the same way every time. It’s like having a dedicated employee just doing your <strong>tagging</strong>.</p>
<h3>Related Articles</h3>
<p>The module also provides you with a means to display <strong>related content</strong>, for example lists of <strong>“Related articles”</strong> that are often found at the end of news articles.<br />
Similar modules have the capability to do this as well, but are often powered solely on <strong>tags</strong>. In other words, they make the assumption that articles that share the same tags should be related.<br />
The Saplo <strong>API</strong> goes beyond that; we use <strong>semantic analysis</strong> and <strong>machine learning</strong> to give a higher precision to this <em>relevance</em>.  </p>
<p>The caveat is that the module can only give you <strong>related content</strong> if that content has been previously analyzed. So as you analyze more and more of your articles or text, naturally more related content will be able to be displayed.<br />
We basically establish relationships between different pieces of content, which is measured in a perceptual value. E.g. “this article is 60% similar to that article”. These <em>relationships</em> will be stored in the database powering your <strong>Drupal</strong> installation; so you’re free to use it in interesting ways if you want to.</p>
<p>Included in the module is also a basic filter, where you can change the relevance thresholds for displayed related content. So if you only want to display articles that are very related, you can easily do so in the administrative interface.</p>
<h3>Using the Saplo Drupal Module</h3>
<p>Installing the module is easy; you will be able to find the complete guide on the module page. In essence, it goes like this:</p>
<ol>
<li>Download and unpack the module inside your module folder.</li>
<li>Enable the module in the Drupal administrative interface.</li>
<li>Enter the API Key-pair that you’ve received from signing up at <a href="http://saplo.com/signup/free" title="Sign up for a free Saplo Text Analysis API account">http://saplo.com/signup/free</a> into the modules administrative interface.</li>
<li>Enable the service you want (tagging, related articles) for your node types.</li>
<li>Insert a new node, and when inside the view click the “Add Saplo Metadata”-button.</li>
</ol>
<p><img src="http://blog.saplo.com/wp-content/uploads/2011/08/drupal-module-entity-tagging-related-content-settings.png" alt="Drupal module settings for entity tagging and related content" title="Drupal module settings for entity tagging and related content" width="500" height="271" class="aligncenter size-full wp-image-1760" /></p>
<p>The “Add Saplo Metadata”-button is probably the most common way you will be interacting with the module, besides changing settings inside the administrative interface.<br />
Pressing this button sends the text from the node to the Saplo API. Once the Saplo API is done analysing the text it sends back the the extracted metadata that you’ve requested. The data is then saved locally, and displayed when the node is viewed.</p>
<h3>Other features</h3>
<p>The module implements various useful Drupal hooks. Amongst others we have different permissions that you can enable for users to access the administrative interface and to extract metadata for articles.<br />
We also provide a small template that you can further configure that shows the related articles.</p>
<img src="http://feeds.feedburner.com/~r/Saplo/~4/H8i0GVDi1kM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.saplo.com/drupal-module-for-entity-tagging-and-related-content/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.saplo.com/drupal-module-for-entity-tagging-and-related-content/</feedburner:origLink></item>
	</channel>
</rss>

