<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Clariba Blog</title>
	<atom:link href="http://www.clariba.com/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.clariba.com/blog</link>
	<description>Welcome to the Clariba business intelligence blog. We invite you to read our posts related to SAP BusinessObjects, Data Management and other BI topics, written by our consultants. If you have questions or comments, please let us know.</description>
	<lastBuildDate>Wed, 22 Oct 2014 12:09:56 +0000</lastBuildDate>
	<language>en-US</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.9.3</generator>
	<item>
		<title>Tableau Upgrade – Lessons Learned</title>
		<link>http://www.clariba.com/blog/tableau-upgrade-lessons-learned/</link>
		<comments>http://www.clariba.com/blog/tableau-upgrade-lessons-learned/#comments</comments>
		<pubDate>Wed, 22 Oct 2014 11:52:55 +0000</pubDate>
		<dc:creator><![CDATA[Franklin Mejias]]></dc:creator>
				<category><![CDATA[Tableau]]></category>
		<category><![CDATA["BI"]]></category>
		<category><![CDATA["business analytics"]]></category>
		<category><![CDATA["Business Intelligence"]]></category>
		<category><![CDATA[Clariba]]></category>
		<category><![CDATA[single sign on]]></category>
		<category><![CDATA[upgrade]]></category>

		<guid isPermaLink="false">http://www.clariba.com/blog/?p=6265</guid>
		<description><![CDATA[Recently, we upgraded Tableau from version 8.0 (32 Bits) to 8.1.7 (64 Bits) along with Automatic Login (Single Sign On) and PostgreSQL configuration (Access to the Tableau system Database). After the successful upgrade of this application and configuration of the named features, I would like to share my experience as they may be useful in [&#8230;]]]></description>
				<content:encoded><![CDATA[<p>Recently, we upgraded Tableau from version 8.0 (32 Bits) to 8.1.7 (64 Bits) along with Automatic Login (Single Sign On) and PostgreSQL configuration (Access to the Tableau system Database). After the successful upgrade of this application and configuration of the named features, I would like to share my experience as they may be useful in case you face a similar situation.</p>
<p><strong><a href="http://www.clariba.com/blog/wp-content/uploads/2014/10/lessons-learned.jpg"><img class="alignright wp-image-6279 " src="http://www.clariba.com/blog/wp-content/uploads/2014/10/lessons-learned.jpg" alt="lessons-learned" width="372" height="258" /></a>Experience with upgrading Tableau</strong></p>
<p>Before upgrading the server, we had to make sure our server met the pre requisites for the upgrade.</p>
<p>As Automatic Login was going to be configured, previously a domain service account to the IT department needed to be requested (Non-expiring password and administrator rights granted) and the communication between the Tableau server and the Windows AD server needed to be established (Tested via ping and telnet commands in the CMD window, in our case we had to request to open ports on the firewall).</p>
<p>Once the pre-requisites were checked and made sure the server met all of them, we proceeded with the installation, as we needed to upgrade from version 8.0 (32 bits) to 8.1.7 (64 bits) it was necessary to uninstall the current software and install the new version after a previous back-up of the current the data. The installation went smoothly, no errors or warnings until we realized that the current data was not present and in the application only remained sample data, even though in the Tableau upgrade guide it is written that the data remains in the server and it is picked up automatically by the new version.</p>
<p>We needed to recover the data, therefore we proceeded to restore the backup saved before the installation and after the restore of the data was finished, we encountered another issue, the Administrator account was not working, so we had no access to the system and a few scary minutes went by until we found the tabadmin reset command to set a new password for the administrator account.</p>
<p>More info on Tabadmin reset command: <a href="http://kb.tableausoftware.com/articles/howto/resetting-the-admin-user-account">http://kb.tableausoftware.com/articles/howto/resetting-the-admin-user-account</a></p>
<p>We finally recovered the data and Tableau was up and running, then it was time to configure Automatic Login, the way Tableau works regarding Automatic login is importing all users from the Windows AD group selected to allow access, we contacted the responsible for Windows AD in order to obtain a group suitable for us (In our case it is a big customer with many offices in many countries, we only needed users from Holland), after the Windows AD responsible provided the name of the group we proceeded with the synchronization, it took around 20 minutes to synchronize 3000+ users.</p>
<p>Once the users were synchronized, we tested and the automatic login was working fine but we could not find a way to link the existing local users to the new Windows AD users, due to this we had to tweak the user groups accordingly to the security already configured.</p>
<p>We also installed the PostgreSQL driver in order to allow access to the Tableau system database in order to create data governance reports, by default Tableau only provides access to a limited number of tables (Only tables which names start with underscore), but doing a little research we found a-way to provide full access to all tables. See below details for how to connect to the database and how to provide full access to all tables.</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/10/blog_franklin_11.png"><img class="alignnone size-full wp-image-6275" src="http://www.clariba.com/blog/wp-content/uploads/2014/10/blog_franklin_11.png" alt="blog_franklin_1" width="600" height="523" /></a> <a href="http://www.clariba.com/blog/wp-content/uploads/2014/10/blog_franklin_21.png"><img class="alignnone size-large wp-image-6276" src="http://www.clariba.com/blog/wp-content/uploads/2014/10/blog_franklin_21.png" alt="blog_franklin_2" width="600" height="236" /></a> <a href="http://www.clariba.com/blog/wp-content/uploads/2014/10/blog_franklin_31.png"><img class="alignnone size-large wp-image-6277" src="http://www.clariba.com/blog/wp-content/uploads/2014/10/blog_franklin_31.png" alt="blog_franklin_3" width="600" height="421" /></a></p>
<p>While doing the post installation testing, we found out that the Tableau maps were not being displayed correctly; this is due to the fact that we changed the account the Tableau service was running on, from local system account to a service account. At the moment of doing this, the proxy settings were not present in the session of the service account, configuring the proxy settings and restarting the Tableau server fixed the issue. More info on Maps not being displayed on: <a href="http://kb.tableausoftware.com/articles/issue/maps-do-not-display?keywords=map%20server">http://kb.tableausoftware.com/articles/issue/maps-do-not-display?keywords=map%20server</a></p>
<p><strong>Summary of lessons learned</strong></p>
<p>As a summary of this experience I can say that the lessons learned were the following:</p>
<ul>
<li>Always create a backup outside of the server before installing or modifying any application</li>
<li>After configuring a new account to run the service, check all the settings configured in the local account session and apply them in the new session</li>
<li>User groups administration and security for Tableau automatic login should be handle at the Windows AD group level and not in Tableau as the synchronization brings all users in the group.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.clariba.com/blog/tableau-upgrade-lessons-learned/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Clariba receives certification for SAP recognized expertise in business intelligence and SAP HANA</title>
		<link>http://www.clariba.com/blog/clariba-receive-certification-for-sap-recognized-expertise-in-business-intelligence-and-sap-hana/</link>
		<comments>http://www.clariba.com/blog/clariba-receive-certification-for-sap-recognized-expertise-in-business-intelligence-and-sap-hana/#comments</comments>
		<pubDate>Wed, 17 Sep 2014 11:41:41 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
				<category><![CDATA[Certification]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[SAP BusinessObjects]]></category>
		<category><![CDATA[SAP HANA]]></category>
		<category><![CDATA["BI"]]></category>
		<category><![CDATA["business analytics"]]></category>
		<category><![CDATA["Business Intelligence"]]></category>
		<category><![CDATA["BusinessObjects"]]></category>
		<category><![CDATA[Big data]]></category>
		<category><![CDATA[BO]]></category>
		<category><![CDATA[Clariba]]></category>
		<category><![CDATA[SAP]]></category>
		<category><![CDATA[SAP certification]]></category>

		<guid isPermaLink="false">http://www.clariba.com/blog/?p=6166</guid>
		<description><![CDATA[Based on Clariba’s specific expertise and successful completion of all requirements of the SAP® Recognized Expertise Program, Clariba Consulting Middle East has been granted the SAP Recognized Expertise designation in SAP HANA® and Business Intelligence.]]></description>
				<content:encoded><![CDATA[<p>Based on Clariba’s specific expertise and successful completion of all requirements of the SAP® Recognized Expertise Program, Clariba Consulting Middle East has been granted the SAP Recognized Expertise designation in SAP HANA® and Business Intelligence.</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/07/Clariba-in-SAP-Recognized-Expertise-Business-Intelligence-e1410953685796.jpg"><img class="alignnone size-full wp-image-6255" src="http://www.clariba.com/blog/wp-content/uploads/2014/07/Clariba-in-SAP-Recognized-Expertise-Business-Intelligence-e1410953685796.jpg" alt="Clariba-in-SAP-Recognized-Expertise-Business-Intelligence" width="340" height="76" /><img class="alignnone wp-image-6257" src="http://www.clariba.com/blog/wp-content/uploads/2014/09/bg_white-SAP_Recognized_Expertise-SAP_HANA-1024x217.gif" alt="bg_white-SAP_Recognized_Expertise-SAP_HANA" width="340" height="72" /></a></p>
<p>&#8220;This certification reinforces Clariba’s continued leadership position that we have carved out for ourselves in the SAP Business Intelligence and Big Data domains,&#8221; says Marc Haberland, managing director, Clariba.</p>
<p>The SAP Recognized Expertise Certification in SAP HANA and Business Intelligence is valid for two years, and complements Clariba’s other certifications including its <a href="http://www.clariba.com/blog/clariba-obtains-the-sap-partner-center-of-expertise-certification/">Partner Center of Expertise (PCoE) Certification</a>.</p>
<p>Certification is only awarded to SAP Partners with proven competencies in selected solutions or industries. This new certification means that Clariba will be listed in the Directory for SAP Partners with Recognized Expertise and SAP will endorse Clariba for applicable tenders/RFPs.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clariba.com/blog/clariba-receive-certification-for-sap-recognized-expertise-in-business-intelligence-and-sap-hana/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Clariba’s Findings: Choosing the right tool for geographical analysis</title>
		<link>http://www.clariba.com/blog/claribas-findings-choosing-the-right-tool-for-geographical-analysis/</link>
		<comments>http://www.clariba.com/blog/claribas-findings-choosing-the-right-tool-for-geographical-analysis/#comments</comments>
		<pubDate>Sun, 14 Sep 2014 07:51:12 +0000</pubDate>
		<dc:creator><![CDATA[LA]]></dc:creator>
				<category><![CDATA[Geographical Analysis]]></category>
		<category><![CDATA[Antivia]]></category>
		<category><![CDATA[Clariba]]></category>
		<category><![CDATA[GaliGeo]]></category>
		<category><![CDATA[Geo BI]]></category>
		<category><![CDATA[geographical analysis]]></category>
		<category><![CDATA[SAP]]></category>
		<category><![CDATA[SAP Dashboards (Xcelsius)]]></category>
		<category><![CDATA[SAP Design Studio]]></category>
		<category><![CDATA[SAP Lumira]]></category>
		<category><![CDATA[Tableau]]></category>

		<guid isPermaLink="false">http://www.clariba.com/blog/?p=6238</guid>
		<description><![CDATA[Building geographical information on top of maps is a topic that is becoming trendier and trendier and this is where big investments from companies are focusing on during this and coming years. We all have seen an explosion of public geo-information brought to our mobile devices and consumers at corporations are asking themselves why they cannot have that same flashy stuff with their corporate information being shown on a map, either on mobile or on desktops. We have researched across most of the SAP BI modules and also some competition and analysed the features that one or another can offer; you can find below the results of this benchmark study.]]></description>
				<content:encoded><![CDATA[<h4><strong>We have temporarily removed our findings about the geographical analysis tools to re-evaluate the results. </strong></h4>
<h4><strong>Stay posted!</strong></h4>
]]></content:encoded>
			<wfw:commentRss>http://www.clariba.com/blog/claribas-findings-choosing-the-right-tool-for-geographical-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Decision trees with SAP Predictive Analytics and SAP HANA</title>
		<link>http://www.clariba.com/blog/decision-trees-with-sap-predictive-analytics-and-sap-hana/</link>
		<comments>http://www.clariba.com/blog/decision-trees-with-sap-predictive-analytics-and-sap-hana/#comments</comments>
		<pubDate>Mon, 11 Aug 2014 09:53:42 +0000</pubDate>
		<dc:creator><![CDATA[Emilio Nieto]]></dc:creator>
				<category><![CDATA[SAP HANA]]></category>
		<category><![CDATA[SAP Predictive Analysis]]></category>

		<guid isPermaLink="false">http://www.clariba.com/blog/?p=6207</guid>
		<description><![CDATA[Data mining, as per definition, is a discipline whose main aim is to discover patterns and most importantly, to predict and gain more knowledge on your data. This is done by combining different methods and approaches from artificial intelligence and statistics disciplines. One of the most common problems is how to build accurate and optimal classifier that given raw data helps us to build a model to classify future cases. There are many algorithms and methods available and we will refer in this blog article to the C4.5 which is available in the SAP Predictive Analytics tool. SAP Predictive analytics (SAP Infinite Insight) provides data mining capabilities that help many companies to anticipate customer behaviors and demands. ]]></description>
				<content:encoded><![CDATA[<p style="text-align: center;"> <a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-11.jpg"><img class="size-full wp-image-6210 aligncenter" alt="image 1" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-11.jpg" width="628" height="238" /></a></p>
<p>Data mining, as per definition, is a discipline whose main aim is to discover patterns and most importantly, to predict and gain more knowledge on your data. This is done by combining different methods and approaches from artificial intelligence and statistics disciplines. One of the most common problems is how to build accurate and optimal classifier that given raw data helps us to build a model to classify future cases. There are many algorithms and methods available and we will refer in this blog article to the C4.5 which is available in the SAP Predictive Analytics tool. SAP Predictive analytics (SAP Infinite Insight) provides data mining capabilities that help many companies to anticipate customer behaviors and demands. SAP Predictive Analytics is very easy to use and very powerful, it can be downloaded <a href="http://www.sap.com/pc/analytics/predictive-analytics/software/predictive-analysis/index.html">here.</a></p>
<p>The C4.5 algorithm goal is to make decision trees based on datasets. Its first version came from ID3 algorithm which was developed by Ross Quinlan.  Although C4.5 is highly popular, there are many other options like J48 and the extended C5.0.</p>
<p><i>How does it work?</i></p>
<p>By taking a group of examples, C4.5 builds the simplest decision tree (not necessarily binary) in order to classify new cases. It establishes the most important attribute as a base and adds new nodes by evaluating the importance of the following attributes.</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-2.png"><img class="alignnone size-full wp-image-6211" alt="image 2" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-2.png" width="608" height="288" /></a></p>
<p>In the example above, we can classify an object according to its attributes and get whether is a class A or a class B object. The leaf of the tree shows the result of the analysis.</p>
<p>Going into more detail, getting the shortest decision tree is a problem known in Computer Science as NP-Complete. This means basically that there is no way to find the most optimal solution for this problem in a reasonable time. And how is this possible? The order of the nodes is going to affect the size of the decision tree and this increases the complexity to a very high level. However, C4.5 uses a greedy approach to get a solution which works reasonably well. This solution relies on the concept of Information Entropy:</p>
<p><i>Note: The entropy measures the lack of homogeneity of an examples set</i></p>
<p>That is, the algorithm builds the tree by selecting the attribute with the smallest entropy possible. There are many references about the C4.5 operation and performance, but this may be something to cover in another post.</p>
<p><i>How can SAP help?</i></p>
<p>Data-mining capabilities are a must for many companies. They provide an insight for the future and analyst can anticipate events or behaviors that will improve the whole decision making procedure. On one hand, SAP Predictive analytics tool will provide the user interface to build predictive models and apply data mining procedures to forecast and analyze. It is built on top of SAP Lumira, therefore is graphic-based and very easy-to-use.</p>
<p>On the other hand, SAP HANA provides an extremely fast and powerful in-memory database. It would make sense that we can actually take advantage of this horse power and run the algorithm in HANA.</p>
<ul>
<li>HANA’s Predictive Analysis Library (PAL) defines functions that can be used to perform predictive analysis algorithms.</li>
</ul>
<p><i>Example</i></p>
<p>For this analysis, we will use the following raw data. The below table represents a list of customers from an insurance company and the last column defines if it is a fraudulent customer or not.</p>
<table width="436" border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">ID</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">POLICY</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">AGE</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">NATIONALITY</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">OCCUPATION</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">FRAUD</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">1</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Home</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">24</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 1</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Home</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">41</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 1</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">IT</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">3</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Home</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">38</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 1</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">Yes</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">4</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Home</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">62</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 1</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Marketing</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">5</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Home</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">51</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">6</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Travel</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">33</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">7</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Travel</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">46</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">IT</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">8</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Travel</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">42</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Marketing</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">9</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Travel</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">21</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">10</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Vehicle</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">44</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 2</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">IT</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">11</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Vehicle</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">64</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 1</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">Yes</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">12</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Vehicle</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">54</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 3</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">IT</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">13</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Vehicle</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">26</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 3</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Sales</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">No</p>
</td>
</tr>
<tr>
<td valign="bottom" nowrap="nowrap" width="37">
<p align="center">14</p>
</td>
<td valign="bottom" nowrap="nowrap" width="64">
<p align="center">Vehicle</p>
</td>
<td valign="bottom" nowrap="nowrap" width="57">
<p align="center">44</p>
</td>
<td valign="bottom" nowrap="nowrap" width="95">
<p align="center">Nation 3</p>
</td>
<td valign="bottom" nowrap="nowrap" width="104">
<p align="center">Marketing</p>
</td>
<td valign="bottom" nowrap="nowrap" width="79">
<p align="center">Yes</p>
</td>
</tr>
</tbody>
</table>
<p>We would like to build a decision tree based on this data, that is going to help us to classify a customer to see if it’s probably fraudulent or not.</p>
<p>Let’s assume we have a model in HANA with this data available (check how to build a model find SAP reference guide  <a href="http://www.saphana.com/docs/DOC-1074">http://www.saphana.com/docs/DOC-1074</a>) in an Attribute View Called “PRED_SAMPLE”.  So First we open SAP PA and click New Document and select Connect to SAP HANA.<a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-3.jpg"><img class="alignnone size-full wp-image-6212" alt="image 3" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-3.jpg" width="627" height="318" /></a></p>
<p>After introducing host name and credentials, follow the process and select the view which contains the data to be analyzed and click on create.</p>
<p><i> Note: C4.5 is only available in HANA PAL and not in PA itself, so it will only be available when we are working with data in HANA.</i></p>
<p>After data has been loaded, we click on Predict to access the Predictive view. In the right panel we can select the available predictive algorithms and other procedures to use. HANA C4.5 should be available if we have established the connection successfully. We just have to double click and a node will be added in the composer.</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-4.jpg"><img class="alignnone size-full wp-image-6213" alt="image 4" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-4.jpg" width="457" height="263" /></a></p>
<p>We don’t need any data transformation since we assume the data has been prepared before.</p>
<p>Next step is to configure the execution, so we double click on the HANA C4.5 icon in the workflow and we will access the configuration panel. Once opened, in the Column Selection we select as Features the columns which are going to be the attributes of the decision tree:</p>
<ul>
<li>Policy</li>
<li>Age</li>
<li>Nationality</li>
<li>Occupation</li>
</ul>
<p>The value which is going to be the target of the classification is the column “FRAUD” (which is actually going to be the leaf of the tree). Therefore we specify it in the “Target Variable” field and click “Done”:</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-5.jpg"><img class="alignnone size-full wp-image-6214" alt="image 5" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-5.jpg" width="457" height="228" /></a></p>
<p>Now we simply click on “Run” and we will be redirected to the results view where we can get the decision tree design. Note that all leafs are providing a result for the analysis, while the nodes provide the probability of the result by going that way:</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-6.jpg"><img class="alignnone size-full wp-image-6215" alt="image 6" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-6.jpg" width="628" height="333" /></a></p>
<p><i>Some considerations</i></p>
<p>-&gt;  Check that you have HANA PAL installed on your HANA server</p>
<p>AFL (Application function library) includes PAL. We can open a SQL console and run the following command to check:</p>
<p><b>SELECT</b> * <b>FROM</b> &#8220;SYS&#8221;.&#8221;AFL_FUNCTIONS&#8221; <b>WHERE</b> SCHEMA_NAME = &#8216;_SYS_AFL&#8217; <b>AND</b> AREA_NAME = &#8216;AFLPAL&#8217;;</p>
<p>-&gt; Check if the user used to log in HANA through PA has the role <i>AFL_SYS_AFL_AFLPAL_EXECUTE</i> granted. Same way, check in the Object Privileges tab if the procedures AFL_WRAPPER_GENERATOR and AFL_WRAPPER_ERASER are granted with Execute:</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-7.jpg"><img class="alignnone size-full wp-image-6216" alt="image 7" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-7.jpg" width="538" height="119" /></a></p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-8.jpg"><img class="alignnone size-full wp-image-6217" alt="image 8" src="http://www.clariba.com/blog/wp-content/uploads/2014/08/image-8.jpg" width="628" height="134" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.clariba.com/blog/decision-trees-with-sap-predictive-analytics-and-sap-hana/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More and more SAP products integrate &#8220;R&#8221; as data processing, statistical modeling language of choice</title>
		<link>http://www.clariba.com/blog/more-and-more-sap-products-integrate-r-as-data-processing-statistical-modeling-language-of-choice/</link>
		<comments>http://www.clariba.com/blog/more-and-more-sap-products-integrate-r-as-data-processing-statistical-modeling-language-of-choice/#comments</comments>
		<pubDate>Wed, 30 Jul 2014 05:51:18 +0000</pubDate>
		<dc:creator><![CDATA[Luis Gonzalez]]></dc:creator>
				<category><![CDATA[Data processing]]></category>
		<category><![CDATA[Statistical Data Modeling]]></category>
		<category><![CDATA[Clariba]]></category>
		<category><![CDATA[R data processing]]></category>
		<category><![CDATA[SAP]]></category>
		<category><![CDATA[statistical modeling language]]></category>

		<guid isPermaLink="false">http://www.clariba.com/blog/?p=6200</guid>
		<description><![CDATA[More and more SAP products integrate "R" as data processing, statistical modeling language of choice. Acquiring R expressiveness in our skill set is now a strategic achievement in the BI sector, so Clariba has composed a list of resources available on the Internet for auto-training in R skills.]]></description>
				<content:encoded><![CDATA[<p>Widely used in the academic world, originally designed by and for statisticians, R has become a symbol for data scientists.</p>
<p><a href="http://www.clariba.com/blog/wp-content/uploads/2014/07/pic_Luis.png"><img class="alignnone  wp-image-6201" alt="pic_Luis" src="http://www.clariba.com/blog/wp-content/uploads/2014/07/pic_Luis.png" width="680" height="369" /></a></p>
<p><em><strong>Example above of R with RGui frontend plotting a 6 million rows dataset.</strong></em></p>
<p>Acquiring R expressiveness in our skill set is now a strategic achievement in the BI sector, so we have composed a list of resources available on the Internet for auto-training in R skills:</p>
<p><a href="http://www.r-fiddle.org/">Try it online!</a></p>
<p><a href="http://learnxinyminutes.com/docs/r/">Learn R in minutes</a></p>
<p><a href="https://www.datacamp.com/courses/introduction-to-r">Introduction to R Course</a><span style="text-decoration: underline;">​</span></p>
<p><a href="http://www.statmethods.net/">Quick-R (how to&#8217;s)</a></p>
<p><a href="http://www.saphana.com/docs/DOC-4049">SAP HANA R Integration Guide</a></p>
<p><a href="https://www.datacamp.com/courses/data-analysis-and-statistical-inference_mine-cetinkaya-rundel-by-datacamp">Data Analysis and Statistical Inference Course</a></p>
<p>(The authorship copyrights and trademarks correspond to the authors of the websites linked.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clariba.com/blog/more-and-more-sap-products-integrate-r-as-data-processing-statistical-modeling-language-of-choice/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
