<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
 
 <title>Rogue Wolves</title>
 <link href="http://roguewolves.com/atom.xml" rel="self"/>
 <link href="http://roguewolves.com/"/>
 <updated>2012-12-12T11:30:07-08:00</updated>
 <id>http://roguewolves.com/</id>
 <author>
   <name>Scott Langevin</name>
   <email>scott@roguewolves.com</email>
 </author>

 
 <entry>
   <title>Language Performance Benchmarks</title>
   <link href="http://roguewolves.com/2012/12/12/Language_Performance_Benchmarks/"/>
   <updated>2012-12-12T00:00:00-08:00</updated>
   <id>http://roguewolves.com/2012/12/12/Language_Performance_Benchmarks</id>
   <content type="html">&lt;p&gt;Recently, I came across &lt;a href='http://julialang.org/'&gt;Julia&lt;/a&gt;, a high-level, high-performance dynamic programming language for technical computing. The language itself is interesting, but on the Julia website is are comparative &lt;a href='http://julialang.org/'&gt;language performance benchmark results&lt;/a&gt;. Julia is compared against several other common scientific computing languages, such as, Python, Matlab, R, etc.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The following micro-benchmark results are from a MacBook Pro with a 2.53GHz Intel Core 2 Duo CPU and 8GB of 1066MHz DDR3 RAM&lt;/p&gt;
&lt;/blockquote&gt;
&lt;img alt='language performance benchmark results' src='files/language-benchmark.png' /&gt;
&lt;p&gt;(Figure from julialang.org)&lt;/p&gt;

&lt;p&gt;While the performance benchmarks are touting the advantages of the Julia language, I found it most striking the JavaScript performance. It was a surprise to me that JavaScript mostly outperforms Python by an &lt;em&gt;order of magnitude&lt;/em&gt;. I knew the optimizations Google and Apple were performing made dramatic improvements, but putting it into this perspective is very impressive. It would have more interesting if they had included Java in the benchmark as well.&lt;/p&gt;

&lt;p&gt;I guess I see why &lt;a href='http://nodejs.org/'&gt;node.js&lt;/a&gt; has been getting lots of attention for server side processing.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Rice University Data Visualization Course Materials</title>
   <link href="http://roguewolves.com/2012/12/11/Rice_University_Data_Visualization_Course_Materials/"/>
   <updated>2012-12-11T00:00:00-08:00</updated>
   <id>http://roguewolves.com/2012/12/11/Rice_University_Data_Visualization_Course_Materials</id>
   <content type="html">&lt;p&gt;Looking to learn more about data visualization? &lt;a href='http://had.co.nz/'&gt;Hadley Wickham&lt;/a&gt; at Rice University has posted all the course materials for his course &lt;a href='http://had.co.nz/stat645/'&gt;Stat 645: Data Visualization&lt;/a&gt; online. It&amp;#8217;s a nice collection of papers, tutorials and presentations covering a wide range of topics related to visualization and visual analytics.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Fix Eclipse Fonts on Retina MacBook Pro</title>
   <link href="http://roguewolves.com/2012/12/08/Fix_Eclipse_Fonts_on_Retina_MacBook_Pro/"/>
   <updated>2012-12-08T00:00:00-08:00</updated>
   <id>http://roguewolves.com/2012/12/08/Fix_Eclipse_Fonts_on_Retina_MacBook_Pro</id>
   <content type="html">&lt;p&gt;Are you being driven mad by the horrible non-retina font rendering of Eclipse on a retina MacBook Pro? Fortunately, there is a &lt;a href='https://bugs.eclipse.org/bugs/show_bug.cgi?id=382972'&gt;workaround&lt;/a&gt;:&lt;/p&gt;

&lt;p&gt;Open Finder and right click on the Eclipse.app then select &amp;#8220;Show package contents&amp;#8221;. Edit Contents/Info.plist file in a text editor and add lines 5 + 6.&lt;/p&gt;
&lt;div class='highlight'&gt;&lt;pre&gt;&lt;code class='xml'&gt;&lt;span class='lineno'&gt;1&lt;/span&gt;     &lt;span class='nt'&gt;&amp;lt;dict&amp;gt;&lt;/span&gt;
&lt;span class='lineno'&gt;2&lt;/span&gt;     &lt;span class='nt'&gt;&amp;lt;plist&amp;gt;&lt;/span&gt;
&lt;span class='lineno'&gt;3&lt;/span&gt;     ...
&lt;span class='lineno'&gt;4&lt;/span&gt; 
&lt;span class='lineno'&gt;5&lt;/span&gt;     &lt;span class='nt'&gt;&amp;lt;key&amp;gt;&lt;/span&gt;NSHighResolutionCapable&lt;span class='nt'&gt;&amp;lt;/key&amp;gt;&lt;/span&gt;
&lt;span class='lineno'&gt;6&lt;/span&gt;     &lt;span class='nt'&gt;&amp;lt;true/&amp;gt;&lt;/span&gt;
&lt;span class='lineno'&gt;7&lt;/span&gt; 
&lt;span class='lineno'&gt;8&lt;/span&gt;     &lt;span class='nt'&gt;&amp;lt;/dict&amp;gt;&lt;/span&gt;
&lt;span class='lineno'&gt;9&lt;/span&gt;     &lt;span class='nt'&gt;&amp;lt;/plist&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;
&lt;/div&gt;
&lt;p&gt;Then, log out or make a copy of the app so that OSX will notice the change. Launch Eclipse and it should now be retina.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Rapid Machine Learning with Ramp</title>
   <link href="http://roguewolves.com/2012/11/25/Rapid_Machine_Learning_with_Ramp/"/>
   <updated>2012-11-25T00:00:00-08:00</updated>
   <id>http://roguewolves.com/2012/11/25/Rapid_Machine_Learning_with_Ramp</id>
   <content type="html">&lt;p&gt;&lt;a href='https://github.com/kvh/ramp'&gt;Ramp&lt;/a&gt; is a python library for rapid machine learning prototyping. It provides a simple, declarative syntax for exploring features, algorithms and transformations quickly and efficiently. The library leverages &lt;a href='http://pandas.pydata.org/'&gt;pandas&lt;/a&gt;, which provides high-performance, easy-to-use data structures and data analysis tools, as well as various python machine learning and statistics libraries (scikit-learn, rpy2, etc.). Some features:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Fast caching and persistence of all intermediate and final calculations &amp;#8211; nothing is recomputed unnecessarily.&lt;/li&gt;

&lt;li&gt;Advanced training and preparation logic. Ramp respects the current training set, even when using complex trained features and blended predictions, and also tracks the given preparation set (the x values used in feature preparation &amp;#8211; e.g. the mean and stdev used for feature normalization.)&lt;/li&gt;

&lt;li&gt;A growing library of feature transformations, metrics and estimators. Ramp&amp;#8217;s simple API allows for easy extension.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;kvh has a nice &lt;a href='http://www.kenvanharen.com/2012/11/getting-started-with-ramp-detecting.html'&gt;blog post introducing Ramp&lt;/a&gt;. I&amp;#8217;m looking forward to experimenting with it.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Script Excel with Python using IronSpread</title>
   <link href="http://roguewolves.com/2012/06/11/Script_Excel_with_Python_using_IronSpread/"/>
   <updated>2012-06-11T00:00:00-07:00</updated>
   <id>http://roguewolves.com/2012/06/11/Script_Excel_with_Python_using_IronSpread</id>
   <content type="html">&lt;p&gt;&lt;a href='http://www.ironspread.com/'&gt;IronSpread&lt;/a&gt; is an Excel 2010 plugin for Windows that lets you script your spreadsheets with Python.&lt;/p&gt;

&lt;p&gt;I&amp;#8217;d choose python over VBA any day.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>The Minds of Modern Mathematics App</title>
   <link href="http://roguewolves.com/2012/04/08/The_Minds_of_Modern_Mathematics_App/"/>
   <updated>2012-04-08T00:00:00-07:00</updated>
   <id>http://roguewolves.com/2012/04/08/The_Minds_of_Modern_Mathematics_App</id>
   <content type="html">&lt;p&gt;IBM has released a new iOS app, &lt;a href='http://mindsofmath.com'&gt;The Minds of Modern Mathematics&lt;/a&gt;, which provides an interactive history of mathematics. The app recreates a 50-foot long info graphic designed by Charles and Ray Rames that depicts 1000 years of the history and influence of mathematics through the ages.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;If you have ever wondered how mathematicians and mathematics have shaped the world we live in, this is the app for you. The Minds of Modern Mathematics takes you on an interactive journey that spans nearly 1,000 years. It tells the story of mathematics and how it has impacted almost every aspect of human progress, from science to music, art, architecture and culture.&lt;/p&gt;
&lt;/blockquote&gt;</content>
 </entry>
 
 <entry>
   <title>Learning from Data Course</title>
   <link href="http://roguewolves.com/2012/03/25/Learning_from_Data/"/>
   <updated>2012-03-25T00:00:00-07:00</updated>
   <id>http://roguewolves.com/2012/03/25/Learning_from_Data</id>
   <content type="html">&lt;p&gt;Yet another &lt;a href='http://work.caltech.edu/telecourse.html'&gt;free online introductory course on Machine Learning&lt;/a&gt;. This one from Caltech.&lt;/p&gt;

&lt;p&gt;The course description from the web page reads:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;This is an introductory course on machine learning that covers the basic theory, algorithms, and applications. Machine learning (ML) enables computational systems to adaptively improve their performance with experience accumulated from the observed data. ML techniques are widely applied in engineering, science, finance, and commerce to build systems for which we do not have full mathematical specification (and that covers a lot of systems). The course balances theory and practice, and covers the mathematical as well as the heuristic aspects. Detailed topics are listed below.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I like the (apparent) dig at &lt;a href='http://www.ml-class.org'&gt;another free online ML course&lt;/a&gt;:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;A real Caltech course, &lt;strong&gt;not&lt;/strong&gt; a watered-down version&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Ah rivalries&amp;#8230;&lt;/p&gt;

&lt;p&gt;It really is astonishing the amount and quality of free online education available today. There had never been a better time to sharpen your skills or learn something new.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Introduction to Data Science course</title>
   <link href="http://roguewolves.com/2012/03/25/Introduction_to_Data_Science_course/"/>
   <updated>2012-03-25T00:00:00-07:00</updated>
   <id>http://roguewolves.com/2012/03/25/Introduction_to_Data_Science_course</id>
   <content type="html">&lt;p&gt;Interesting &lt;a href='http://datascienc.es/'&gt;introductory course offered at Berkeley on Data Science&lt;/a&gt;. From the course page:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Organizations use their data for decision support and to build data-intensive products and services. The collection of skills required by organizations to support these functions has been grouped under the term &amp;#8220;Data Science&amp;#8221;. &lt;em&gt;This course will attempt to articulate the expected output of Data Scientists and then equip the students with the ability to deliver against these expectations.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I&amp;#8217;m still not a fan of the marketing term &amp;#8220;data science&amp;#8221;, however more applied machine learning, statistical analysis and data mining courses are definitely a good thing.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Impending Windows 8 Disaster</title>
   <link href="http://roguewolves.com/2012/03/14/Impending_Windows_8_Disaster/"/>
   <updated>2012-03-14T00:00:00-07:00</updated>
   <id>http://roguewolves.com/2012/03/14/Impending_Windows_8_Disaster</id>
   <content type="html">&lt;p&gt;Interesting &lt;a href='http://www.fixingwindows8.com/2012/03/chris-pirillos-father-tries-windows-8'&gt;video from Chris Pirillo posted on the Fixing Windows 8 blog&lt;/a&gt;. The video illustrates (quite effectively) the major usability issues with Microsofts new operating system. In the video, we see Chris&amp;#8217; dad trying to navigate between the Metro/Windows modes. It&amp;#8217;s painful.&lt;/p&gt;

&lt;p&gt;I noticed this problem when some of my colleagues returned from a Windows Developer Conference. They demoed a pre-release version of Windows 8. What struck me about Metro and the multi-modal environment was the expansive use of &amp;#8220;magical gestures&amp;#8221; that were used pervasively throughout. I made a comment that this was a major UX problem. There is no discoverability of these features. You just have to &lt;em&gt;know&lt;/em&gt; a slew of new unintuitive gestures to interact with the operating system.&lt;/p&gt;

&lt;p&gt;Since seeing this internal demo, my prediction has been: Windows 8 is going to be an utter disaster for the company. I think it will be on par - if not worse - than the Vista disaster of 2007. I hope I&amp;#8217;m wrong, but unless they fix these UX problems post-haste, it&amp;#8217;s going to be rough going.&lt;/p&gt;

&lt;p&gt;This is a big gamble for Microsoft, but I have little confidence it will pay off. Metro is an interesting and fresh innovation, but I&amp;#8217;m unconvinced it will have appeal outside of phones or tablets. Bolting it on as a separate side-car to a &amp;#8220;Metro-ish&amp;#8221; version of Windows is muddled and confused. Further confusing, is the apparent need to switch back and forth between these modes and lack of integration between them.&lt;/p&gt;</content>
 </entry>
 
 <entry>
   <title>Cassovary Large Scale Graph Processing</title>
   <link href="http://roguewolves.com/2012/03/11/Cassovary_Large_Scale_Graph_Processing/"/>
   <updated>2012-03-11T00:00:00-08:00</updated>
   <id>http://roguewolves.com/2012/03/11/Cassovary_Large_Scale_Graph_Processing</id>
   <content type="html">&lt;p&gt;Twitter recently released an open source large scale graph precessing library named &lt;a href='https://github.com/twitter/cassovary'&gt;Cassovary&lt;/a&gt;. The library is written in Scala and therefore can be easily integrated into any JVM-based language. The library focus is on simplicity and efficient data structures to support processing of large graphical structures.&lt;/p&gt;

&lt;p&gt;From the project README:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Cassovary is a simple &amp;#8220;big graph&amp;#8221; processing library for the JVM. Most JVM-hosted graph libraries are flexible but not space efficient. Cassovary is designed from the ground up to first be able to efficiently handle graphs with billions of nodes and edges. A typical example usage is to do large scale graph mining and analysis of a big network.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The library appears to mainly provide data structures and some common graph search algorithms. Any dataming or graph related operations are currently not part of the library. It looks interesting and should generate a lot of interest.&lt;/p&gt;</content>
 </entry>
 
 
</feed>