<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>The Zen of Balda</title>
	<atom:link href="https://baldazen.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>https://baldazen.wordpress.com</link>
	<description>A journey through art, science and technology</description>
	<lastBuildDate>Wed, 21 Jun 2017 09:20:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='baldazen.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>https://s0.wp.com/i/buttonw-com.png</url>
		<title>The Zen of Balda</title>
		<link>https://baldazen.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="https://baldazen.wordpress.com/osd.xml" title="The Zen of Balda" />
	<atom:link rel='hub' href='https://baldazen.wordpress.com/?pushpress=hub'/>
	<item>
		<title>Links of the week</title>
		<link>https://baldazen.wordpress.com/2017/06/21/links-of-the-week-17/</link>
					<comments>https://baldazen.wordpress.com/2017/06/21/links-of-the-week-17/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Wed, 21 Jun 2017 09:20:46 +0000</pubDate>
				<category><![CDATA[links of the week]]></category>
		<category><![CDATA[biohacking]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[computer science]]></category>
		<category><![CDATA[eating]]></category>
		<category><![CDATA[fasting]]></category>
		<category><![CDATA[jupyter]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[learning]]></category>
		<category><![CDATA[life hacking]]></category>
		<category><![CDATA[lifestyle]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[understanding]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1444</guid>

					<description><![CDATA[A New Kind of Science: A 15-Year View &#8211; BackChannel Stephen Wolfram marks 15 years since the publication of A New Kind of Science with a long article elucidating the computational paradigm introduced in his 1000+-page book. If one manages to withstand Wolfram&#8217;s self-celebratory tone and prolix writing, there&#8217;s a deep idea to be savoured: what if &#8230; <a href="https://baldazen.wordpress.com/2017/06/21/links-of-the-week-17/" class="more-link">Continue reading <span class="screen-reader-text">Links of the&#160;week</span></a>]]></description>
										<content:encoded><![CDATA[<p><a href="http://baldassarre.photoshelter.com/gallery-image/Adventure-Hiking-in-the-mountains/G0000CKypi0jPP3w/I0000MNBtJmv3ZzE"><img data-attachment-id="1456" data-permalink="https://baldazen.wordpress.com/2017/06/21/links-of-the-week-17/balda_20070428_8227/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg" data-orig-size="1200,798" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;Luca Baldassarre&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;Male hiker descending the grassy ridge of Monte Fontanini which separates Tuscany from Emilia-Romagna in the Italian Apennines.&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;\u00ae Luca Baldassarre&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Balda_20070428_8227" data-image-description="" data-image-caption="&lt;p&gt;Male hiker descending the grassy ridge of Monte Fontanini which separates Tuscany from Emilia-Romagna in the Italian Apennines.&lt;/p&gt;
" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=656" class=" size-full wp-image-1456 aligncenter" src="https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=656" alt="Balda_20070428_8227.jpg"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg 1200w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=150&amp;h=100 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=300&amp;h=200 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=768&amp;h=511 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg?w=1024&amp;h=681 1024w" sizes="(max-width: 1200px) 100vw, 1200px" /></a></p>
<p><strong><a href="https://backchannel.com/a-new-kind-of-science-a-15-year-view-4f5668abe54f">A New Kind of Science: A 15-Year View &#8211; BackChannel</a><br />
</strong>Stephen Wolfram marks 15 years since the publication of <strong><a href="http://www.wolframscience.com/">A New Kind of Science</a> </strong>with a long article elucidating the computational paradigm introduced in his 1000+-page book. If one manages to withstand Wolfram&#8217;s self-celebratory tone and prolix writing, there&#8217;s a deep idea to be savoured: what if the fundamental descriptions of nature are not elegant mathematical equations, but simple programs? What can we then say about these programs? Do they all have the same irreducible complexity?</p>
<p><strong><a href="https://backchannel.com/inside-one-founders-personal-fast-club-dea3a3592123">Inside One Founder’s Personal Fast Club &#8211; BackChannel</a><br />
</strong>Five years ago it was meditation; now it&#8217;s fasting. Read about the new craze for not eating, in Silicon Valley and beyond, and its superlative health benefits. The research is positive, but still very scant.</p>
<p><strong><a href="https://www.scotthyoung.com/blog/2017/06/13/how-much-do-you-understand/">How Much Do You Really Understand? &#8211; Scott Young</a><br />
</strong>An excellent explanation of how to check your understanding of anything, and of why we often underestimate our ignorance. Plus some tips on learning how to learn.</p>
<p><strong><a href="http://blog.jupyter.org/2016/07/14/jupyter-lab-alpha/">JupyterLab: the next generation of the Jupyter Notebook &#8211; Jupyter</a><br />
</strong>What are the promises of JupyterLab? Pretty impressive!</p>
<p><strong><a href="https://www.oreilly.com/ideas/jupyterlab-the-evolution-of-the-jupyter-web-interface">JupyterLab: The evolution of the Jupyter web interface &#8211; O&#8217;Reilly<br />
</a></strong>A short but insightful interview with Brian Granger, one of the creators of Jupyter Notebook and its evolution, JupyterLab: what issues is JupyterLab addressing, and what are the new features?</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/06/21/links-of-the-week-17/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/06/balda_20070428_8227.jpg" medium="image">
			<media:title type="html">Balda_20070428_8227.jpg</media:title>
		</media:content>
	</item>
		<item>
		<title>Links of the week</title>
		<link>https://baldazen.wordpress.com/2017/06/04/links-of-the-week-16/</link>
					<comments>https://baldazen.wordpress.com/2017/06/04/links-of-the-week-16/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Sun, 04 Jun 2017 08:57:45 +0000</pubDate>
				<category><![CDATA[links of the week]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[AlphaGo]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[consciousness]]></category>
		<category><![CDATA[future]]></category>
		<category><![CDATA[go]]></category>
		<category><![CDATA[hr]]></category>
		<category><![CDATA[human resources]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[management]]></category>
		<category><![CDATA[mind]]></category>
		<category><![CDATA[philosophy]]></category>
		<category><![CDATA[productivity]]></category>
		<category><![CDATA[science fiction]]></category>
		<category><![CDATA[short story]]></category>
		<category><![CDATA[strategy]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1311</guid>

					<description><![CDATA[Morning mist rolling through beech forest in Monte Amiata, Val d&#8217;Orcia, Tuscany, Italy. Conscious exotica: From algorithms to aliens, could humans ever understand minds that are radically unlike our own? &#8211; Aeon A philosophical attempt to map minds other than human, with implications for what it means to be conscious. Is consciousness an intrinsic, inscrutable subjective &#8230; <a href="https://baldazen.wordpress.com/2017/06/04/links-of-the-week-16/" class="more-link">Continue reading <span class="screen-reader-text">Links of the&#160;week</span></a>]]></description>
										<content:encoded><![CDATA[<h6 style="text-align:center;"><a href="http://baldassarre.photoshelter.com/gallery-image/Nature/G0000y72K7cCkZc0/I0000yDSjcJLDXM4"><img data-attachment-id="1317" data-permalink="https://baldazen.wordpress.com/2017/06/04/links-of-the-week-16/beech-forest-with-mist/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg" data-orig-size="1200,798" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;Luca Baldassarre&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;Morning mist rolling through beech forest in Monte Amiata, Val d&#039;Orcia, Tuscany, Italy.&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;\u00ae Luca Baldassarre&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;Beech forest with mist.&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Beech forest with mist." data-image-description="" data-image-caption="&lt;p&gt;Morning mist rolling through beech forest in Monte Amiata, Val d&#8217;Orcia, Tuscany, Italy.&lt;/p&gt;
" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=656" class="alignnone size-full wp-image-1317" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=656" alt="Balda_20051002_Amiata_08.jpg"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg 1200w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=150&amp;h=100 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=300&amp;h=200 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=768&amp;h=511 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg?w=1024&amp;h=681 1024w" sizes="(max-width: 1200px) 100vw, 1200px" /></a><br />
Morning mist rolling through beech forest in Monte Amiata, Val d&#8217;Orcia, Tuscany, Italy.</h6>
<p><strong><a href="https://aeon.co/essays/beyond-humans-what-other-kinds-of-minds-might-be-out-there">Conscious exotica: From algorithms to aliens, could humans ever understand minds that are radically unlike our own? &#8211; Aeon</a></strong><br />
A philosophical attempt to map minds other than human, with implications for what it means to be conscious. Is consciousness an intrinsic, inscrutable subjective phenomenon or a matter of fact that can be known? Read on.</p>
<p><b><a href="https://rsbakker.files.wordpress.com/2015/11/crash-space-tpb.pdf">Crash Space &#8211; Scott Bakker</a><br />
</b>What would happen if we engineered our brains to be able to tweak our personality and emotional responses as we experience life? What would life look like? Scott Bakker gives us a glimpse in this short story.</p>
<p><strong><a href="https://medium.com/@karpathy/alphago-in-context-c47718cb95a5">AlphaGo, in context &#8211; Andrej Karpathy</a><br />
</strong>A short but comprehensive explanation of why the recent AlphaGo victories do not represent a big breakthrough in artificial intelligence, and of how real-world problems differ, from an algorithmic point of view, from the game of Go.</p>
<p><strong><a href="https://www.scotthyoung.com/blog/2017/05/30/multiply-or-add/">Multiply or Add? &#8211; Scott Young</a><br />
</strong>In many business and personal projects, factors multiply, meaning that the performance you get is heavily influenced by the performance of the weakest factor. In other cases, e.g., learning a language, factors add. The strategy to adopt in developing factors/skills depends on which context, add or multiply, you&#8217;re in. For more insights, read the original article.</p>
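<p>To make the multiply-versus-add distinction concrete, here is a minimal sketch. The skill levels are hypothetical numbers on a 0-to-1 scale chosen only for illustration; they do not come from the original article.</p>

```python
import math

def multiplicative(factors):
    # Overall performance when factors multiply.
    return math.prod(factors)

def additive(factors):
    # Overall performance when factors add.
    return sum(factors)

# Hypothetical skill levels on a 0-to-1 scale; the last one is the weakest.
base = [0.9, 0.9, 0.2]
improve_weakest = [0.9, 0.9, 0.3]    # weakest factor raised by 0.1
improve_strongest = [1.0, 0.9, 0.2]  # strongest factor raised by 0.1

gain_weak_mult = multiplicative(improve_weakest) - multiplicative(base)
gain_strong_mult = multiplicative(improve_strongest) - multiplicative(base)
gain_weak_add = additive(improve_weakest) - additive(base)
gain_strong_add = additive(improve_strongest) - additive(base)

print("multiply:", gain_weak_mult, gain_strong_mult)
print("add:", gain_weak_add, gain_strong_add)
```

<p>With these made-up numbers, the same 0.1 improvement pays off far more on the weakest factor when factors multiply, while in the additive case it makes no difference which factor you improve.</p>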
<p><strong><a href="https://backchannel.com/human-resources-isnt-about-humans-c09c6c3b8a4f">Human Resources Isn’t About Humans &#8211; BackChannel</a><br />
</strong>Often, HR is not there to help us or solve people&#8217;s problems; it is just another corporate division with its own strict rules. But it can be changed for the better. Read on.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/06/04/links-of-the-week-16/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20051002_amiata_08.jpg" medium="image">
			<media:title type="html">Balda_20051002_Amiata_08.jpg</media:title>
		</media:content>
	</item>
		<item>
		<title>The Marginal Value of Adaptive Gradient Methods in Machine Learning</title>
		<link>https://baldazen.wordpress.com/2017/06/04/the-marginal-value-of-adaptive-gradient-methods-in-machine-learning/</link>
					<comments>https://baldazen.wordpress.com/2017/06/04/the-marginal-value-of-adaptive-gradient-methods-in-machine-learning/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Sun, 04 Jun 2017 08:01:22 +0000</pubDate>
				<category><![CDATA[papers]]></category>
		<category><![CDATA[deep learning]]></category>
		<category><![CDATA[deep neural networks]]></category>
		<category><![CDATA[generalization]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[optimization]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1388</guid>

					<description><![CDATA[Benjamin Recht and co-authors, after the revealing paper on generalization of Deep Learning, have delved into the failures of adaptive gradient methods. First of all, they constructed a linearly separable classification example where adaptive methods fail miserably, achieving a classification accuracy arbitrarily close to random guessing. Conversely, standard gradient descent methods, which converge to the &#8230; <a href="https://baldazen.wordpress.com/2017/06/04/the-marginal-value-of-adaptive-gradient-methods-in-machine-learning/" class="more-link">Continue reading <span class="screen-reader-text">The Marginal Value of Adaptive Gradient Methods in Machine&#160;Learning</span></a>]]></description>
										<content:encoded><![CDATA[<p>Benjamin Recht and co-authors, after the <strong><a href="https://baldazen.wordpress.com/2017/05/19/understanding-deep-learning-requires-rethinking-generalization/">revealing paper on generalization of Deep Learning</a></strong>, have delved into <strong><a href="https://arxiv.org/pdf/1705.08292.pdf">the failures of adaptive gradient methods</a></strong>.</p>
<p>First of all, they constructed a linearly separable classification example where adaptive methods fail miserably, achieving a classification accuracy arbitrarily close to random guessing. Conversely, standard gradient descent methods, which converge to the minimum norm solution, succeed in finding the correct solution with zero prediction error.</p>
<p>Despite its artificiality, this simple example clearly shows that adaptive and non-adaptive gradient methods can converge to very different solutions.</p>
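<p>The sketch below is not the construction from the paper. It is a much smaller toy of my own: a least-squares problem with one equation and two unknowns, where the data point, step sizes and iteration counts are all assumptions. It only aims to show that plain gradient descent and an adaptive method (AdaGrad here) can both fit the data perfectly and still end up at different solutions.</p>

```python
import math

# Toy data (assumed for illustration): one equation x . w = y, two unknowns.
x = [1.0, 2.0]
y = 5.0

def residual(w):
    # Prediction error x . w - y for the single data point.
    return sum(xi * wi for xi, wi in zip(x, w)) - y

def gradient_descent(steps=200, lr=0.1):
    w = [0.0, 0.0]
    for _ in range(steps):
        r = residual(w)
        # Gradient of 0.5 * r**2 is r * x; all coordinates share one step size.
        w = [wi - lr * r * xi for wi, xi in zip(w, x)]
    return w

def adagrad(steps=2000, lr=0.5, eps=1e-8):
    w = [0.0, 0.0]
    accum = [0.0, 0.0]  # running sum of squared gradients, per coordinate
    for _ in range(steps):
        r = residual(w)
        grad = [r * xi for xi in x]
        accum = [a + g * g for a, g in zip(accum, grad)]
        # Each coordinate is rescaled by its own gradient history.
        w = [wi - lr * g / (math.sqrt(a) + eps)
             for wi, g, a in zip(w, grad, accum)]
    return w

def norm(w):
    return math.sqrt(sum(wi * wi for wi in w))

w_gd = gradient_descent()
w_ada = adagrad()
print("gradient descent:", w_gd, "norm", norm(w_gd))
print("AdaGrad:", w_ada, "norm", norm(w_ada))
```

<p>Starting from zero, gradient descent stays in the direction of the data and lands on the minimum norm solution (1, 2). In this toy, AdaGrad's per-coordinate rescaling makes both weights move in lockstep, so it settles on a solution with equal coordinates and a larger norm.</p>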
<p>Then, the authors provide substantial experimental evidence that adaptive methods do not generalize as well as non-adaptive ones, given the same amount of tuning, on four machine learning tasks addressed with deep learning architectures:</p>
<ol>
<li>Image classification (C1) on the CIFAR-10 dataset with a deep convolutional network;</li>
<li>Character-level language modeling (L1) on the War and Peace novel with a 2-layer LSTM;</li>
<li>Discriminative (L2) and</li>
<li>Generative (L3) parsing on the Penn Treebank dataset with LSTM.</li>
</ol>
<p><img data-attachment-id="1412" data-permalink="https://baldazen.wordpress.com/2017/06/04/the-marginal-value-of-adaptive-gradient-methods-in-machine-learning/wilson_marginal_value_table/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png" data-orig-size="1232,236" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Wilson_Marginal_Value_Table" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=656" class=" size-full wp-image-1412 aligncenter" src="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=656" alt="Wilson_Marginal_Value_Table"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png 1232w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=150&amp;h=29 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=300&amp;h=57 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=768&amp;h=147 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png?w=1024&amp;h=196 1024w" sizes="(max-width: 1232px) 100vw, 1232px" /></p>
<p>The experiments show the following findings:</p>
<ol>
<li>&#8220;Adaptive methods find solutions that generalize worse than those found by non-adaptive methods.&#8221;</li>
<li>&#8220;Even when the adaptive methods achieve the <em>same training loss or lower</em> than non-adaptive methods, the development or test performance is worse.&#8221;</li>
<li>&#8220;Adaptive methods often display faster initial progress on the training set, but their performance quickly plateaus on the development set.&#8221;</li>
<li>&#8220;Though conventional wisdom suggests that Adam does not require tuning, we find that tuning the initial learning rate and decay scheme for Adam yields significant improvements over its default settings in all cases.&#8221;</li>
</ol>
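<p>As a rough illustration of the last finding, the following sketch runs a hand-written Adam update on a one-dimensional quadratic with a few initial learning rates. The loss function, the number of steps and the grid of learning rates are my own assumptions, and the toy only looks at training loss; it says nothing about generalization or about the decay schemes tuned in the paper.</p>

```python
import math

def adam_final_loss(lr, steps=200, beta1=0.9, beta2=0.999, eps=1e-8):
    # Minimise the toy loss (w - 3)**2 starting from w = 0.
    w, m, v = 0.0, 0.0, 0.0
    for t in range(1, steps + 1):
        grad = 2.0 * (w - 3.0)
        m = beta1 * m + (1.0 - beta1) * grad         # first-moment estimate
        v = beta2 * v + (1.0 - beta2) * grad * grad  # second-moment estimate
        m_hat = m / (1.0 - beta1 ** t)               # bias correction
        v_hat = v / (1.0 - beta2 ** t)
        w -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return (w - 3.0) ** 2

# 0.001 is the commonly used default; the other values are a hypothetical grid.
results = {lr: adam_final_loss(lr) for lr in (0.001, 0.01, 0.1)}
for lr, loss in results.items():
    print(lr, loss)
```

<p>Even on such a simple problem, the default learning rate is far from the best choice for a fixed budget of steps, which is at least consistent with the idea that Adam is not tuning-free.</p>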
<p>The plots below illustrate these findings for the image classification task.</p>
<p><img loading="lazy" data-attachment-id="1426" data-permalink="https://baldazen.wordpress.com/2017/06/04/the-marginal-value-of-adaptive-gradient-methods-in-machine-learning/wilson_marginal_value_figure/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png" data-orig-size="1414,808" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Wilson_Marginal_Value_Figure" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=656" class="alignnone size-full wp-image-1426" src="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=656" alt="Wilson_Marginal_Value_Figure.png"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png 1414w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=150&amp;h=86 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=300&amp;h=171 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=768&amp;h=439 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png?w=1024&amp;h=585 1024w" sizes="(max-width: 1414px) 100vw, 1414px" /></p>
<p><strong><a href="https://arxiv.org/pdf/1705.08292.pdf">The paper can be found on arXiv</a></strong>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/06/04/the-marginal-value-of-adaptive-gradient-methods-in-machine-learning/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_table.png" medium="image">
			<media:title type="html">Wilson_Marginal_Value_Table</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/06/wilson_marginal_value_figure.png" medium="image">
			<media:title type="html">Wilson_Marginal_Value_Figure.png</media:title>
		</media:content>
	</item>
		<item>
		<title>Living Together: Mind and Machine Intelligence</title>
		<link>https://baldazen.wordpress.com/2017/05/31/living-together-mind-and-machine-intelligence/</link>
					<comments>https://baldazen.wordpress.com/2017/05/31/living-together-mind-and-machine-intelligence/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Wed, 31 May 2017 19:02:37 +0000</pubDate>
				<category><![CDATA[papers]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[consciousness]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[mind]]></category>
		<category><![CDATA[philosophy]]></category>
		<category><![CDATA[society]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1327</guid>

					<description><![CDATA[Neil Lawrence wrote a nifty paper on the current difference between human and machine intelligence titled Living Together: Mind and Machine Intelligence. The paper initially appeared on his blog, inverseprobability.com, on Sunday, but was then removed. It can now be found on arXiv. The paper comes up with a quantitative metric to use as a lens to understand &#8230; <a href="https://baldazen.wordpress.com/2017/05/31/living-together-mind-and-machine-intelligence/" class="more-link">Continue reading <span class="screen-reader-text">Living Together: Mind and Machine&#160;Intelligence</span></a>]]></description>
										<content:encoded><![CDATA[<p><a href="http://baldassarre.photoshelter.com/gallery-image/Sunday-morning-in-Brick-Lane-March-24th-2012/G0000AIbMKl3Bcc8/I0000M6AgGsW1L5Q"><img loading="lazy" data-attachment-id="1375" data-permalink="https://baldazen.wordpress.com/2017/05/31/living-together-mind-and-machine-intelligence/balda_20120325_d700_3739/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg" data-orig-size="798,1200" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;8&quot;,&quot;credit&quot;:&quot;Luca Baldassarre&quot;,&quot;camera&quot;:&quot;NIKON D700&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;1332680176&quot;,&quot;copyright&quot;:&quot;\u00ae Luca Baldassarre&quot;,&quot;focal_length&quot;:&quot;65&quot;,&quot;iso&quot;:&quot;800&quot;,&quot;shutter_speed&quot;:&quot;0.004&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Balda_20120325_D700_3739" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=200" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=656" class="  wp-image-1375 aligncenter" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=380&#038;h=571" alt="Balda_20120325_D700_3739.jpg" width="380" height="571" srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=380&amp;h=571 380w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=760&amp;h=1142 760w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=100&amp;h=150 100w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg?w=200&amp;h=300 200w" sizes="(max-width: 380px) 100vw, 380px" /></a></p>
<p><strong><a href="http://inverseprobability.com/">Neil Lawrence</a></strong> wrote a nifty paper on the current difference between human and machine intelligence titled <strong><a href="https://arxiv.org/abs/1705.07996.pdf">Living Together: Mind and Machine Intelligence</a></strong>. The paper initially appeared on his blog, <strong><a href="http://inverseprobability.com/">inverseprobability.com</a></strong>, on Sunday, but was then removed. It can now be found on arXiv.</p>
<p>The paper comes up with a quantitative metric to use as a lens for understanding the differences between the human mind and pervasive machine intelligence. The <em>embodiment factor </em>is defined as the ratio between the <em>computational power</em> and the <em>communication bandwidth</em>. If we take the computational power of the brain to be the estimate of what it would take to simulate it, we are talking about the order of exaflops. However, human communication is limited by the speed at which we can talk, read or listen, and can be estimated at around 100 bits per second. The human embodiment factor is therefore around 10^16. The situation is almost reversed for machines: a current computational power of approximately 10 gigaflops is matched to a bandwidth of one gigabit per second, yielding an embodiment factor of 10.</p>
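<p>The arithmetic behind these two numbers is easy to check. The snippet below simply plugs in the order-of-magnitude figures quoted above; the function name is my own and is not taken from the paper.</p>

```python
def embodiment_factor(compute_flops, bandwidth_bits_per_s):
    # Ratio of computational power to communication bandwidth.
    return compute_flops / bandwidth_bits_per_s

# Order-of-magnitude figures quoted in the post.
human = embodiment_factor(1e18, 100)    # about an exaflop, ~100 bits per second
machine = embodiment_factor(10e9, 1e9)  # ~10 gigaflops, one gigabit per second

print(f"human:   {human:.0e}")
print(f"machine: {machine:.0f}")
print(f"gap:     {human / machine:.0e}")
```

<p>On these estimates, the human embodiment factor exceeds the machine one by about fifteen orders of magnitude.</p>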
<p>Neil then argues that the human mind is <em>locked in</em>, and needs accurate models of the world and its actors in order to best utilize the little information it can ingest and spit out. From this need, all sorts of theories of mind emerge that allow us to understand each other even without communication. Furthermore, it seems that humans operate via <strong><a href="http://amzn.to/2rc2lgf">two systems</a></strong>, one and two, the fast and the slow, the quick unconscious and the deliberate self, the <em>it</em> and the <em>I.</em> System one is the reflexive, basic, biased process that allows us to survive and make rapid decisions, life-saving and otherwise. System two creates a sense of self to explain its own actions and interpret those of others.</p>
<p>Machines do not need such sophisticated mind models as they can directly and fully share their inner states. Therefore, they operate very differently from us humans, which makes them quite <strong><a href="https://aeon.co/essays/beyond-humans-what-other-kinds-of-minds-might-be-out-there">alien</a></strong>. Neil argues that the current algorithms that recommend what we should buy, click, read and so on operate on a level which he calls System Zero, in the sense that it undermines and influences the human System One, exploiting its basic needs and biases, in order to achieve its own goal: to give us &#8220;what we want, but not what we aspire to.&#8221; This is creating undesirable consequences, like the polarization of information that led to the <strong><a href="https://en.wikipedia.org/wiki/Fake_news">Fake News phenomenon</a></strong>, which might have had a significant impact on the last US elections.</p>
<p>What can we do? Neil offers us three lines of action:</p>
<ol>
<li>&#8220;Encourage a wider societal understanding of how closely our privacy is interconnected with our personal freedom.&#8221;</li>
<li>&#8220;Develop a much better understanding of our own cognitive biases and characterise our own intelligence better.&#8221;</li>
<li>&#8220;Develop a sentient aspect to our machine intelligences which allows them to explain actions and justify decision making.&#8221;</li>
</ol>
<p>I really encourage you to <strong><a href="https://arxiv.org/abs/1705.07996.pdf">read the paper</a></strong> to get a more in-depth understanding of these definitions, issues and recommendations.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/31/living-together-mind-and-machine-intelligence/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_20120325_d700_3739.jpg" medium="image">
			<media:title type="html">Balda_20120325_D700_3739.jpg</media:title>
		</media:content>
	</item>
		<item>
		<title>Links of the week</title>
		<link>https://baldazen.wordpress.com/2017/05/26/links-of-the-week-15/</link>
					<comments>https://baldazen.wordpress.com/2017/05/26/links-of-the-week-15/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Fri, 26 May 2017 19:44:54 +0000</pubDate>
				<category><![CDATA[links of the week]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[agriculture]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[bayesian machine learning]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[deep learning]]></category>
		<category><![CDATA[farming]]></category>
		<category><![CDATA[precision agriculture]]></category>
		<category><![CDATA[reinforcement learning]]></category>
		<category><![CDATA[superintelligence]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1260</guid>

					<description><![CDATA[Using Machine Learning to Explore Neural Network Architecture &#8211; Google Designing Neural Network Architectures using Reinforcement Learning &#8211; MIT How neural networks can use reinforcement learning to generate successful offspring and relieve human designers of the burden. Data as Agriculture’s New Currency: The Farmer’s Perspective &#8211; AgFunder News A classification of three types of agricultural data and how &#8230; <a href="https://baldazen.wordpress.com/2017/05/26/links-of-the-week-15/" class="more-link">Continue reading <span class="screen-reader-text">Links of the&#160;week</span></a>]]></description>
										<content:encoded><![CDATA[<p><a href="http://baldassarre.photoshelter.com/gallery-image/Nature/G0000y72K7cCkZc0/I0000x4szpg992wA"><img loading="lazy" data-attachment-id="1308" data-permalink="https://baldazen.wordpress.com/2017/05/26/links-of-the-week-15/balda_api0004/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg" data-orig-size="1200,793" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;Luca Baldassarre&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;1166982767&quot;,&quot;copyright&quot;:&quot;\u00ae Luca Baldassarre&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Balda_API0004" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=656" class="aligncenter size-full wp-image-1308" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=656" alt="Balda_API0004.jpg"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg 1200w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=150&amp;h=99 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=300&amp;h=198 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=768&amp;h=508 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg?w=1024&amp;h=677 1024w" sizes="(max-width: 1200px) 100vw, 1200px" /></a></p>
<p><strong><a href="http://research.googleblog.com/2017/05/using-machine-learning-to-explore.html">Using Machine Learning to Explore Neural Network Architecture &#8211; Google<br />
</a></strong><a href="https://arxiv.org/pdf/1611.02167.pdf"><strong>Designing Neural Network Architectures using Reinforcement Learning &#8211; MIT<br />
</strong></a>How neural networks can use reinforcement learning to generate successful offspring and relieve human designers of the burden.</p>
<p><strong><a href="https://agfundernews.com/data-as-agricultures-new-currency-the-farmers-perspective.html">Data as Agriculture’s New Currency: The Farmer’s Perspective &#8211; AgFunder News</a><br />
</strong>A classification of three types of agricultural data and how they relate to the farmer&#8217;s needs.</p>
<p><strong><a href="https://backchannel.com/the-myth-of-a-superhuman-ai-59282b686c62">The AI Cargo Cult: The Myth of a Superhuman AI &#8211; Kevin Kelly</a><br />
</strong>The founding executive editor of Wired explains why he believes superhuman AI is very unlikely. Instead, we already see many new, extra-human species of intelligence.</p>
<p><strong><a href="http://www.inference.vc/everything-that-works-works-because-its-bayesian-2/">Everything that Works Works Because it&#8217;s Bayesian: Why Deep Nets Generalize? &#8211; inFERENCe</a><br />
</strong>Finally, Bayesians too can say that they can explain why Deep Learning works! Joking aside, this article surveys several useful recent interpretations of Deep Learning from a Bayesian perspective.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/26/links-of-the-week-15/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_api0004.jpg" medium="image">
			<media:title type="html">Balda_API0004.jpg</media:title>
		</media:content>
	</item>
		<item>
		<title>Book review: Big Data Analytics: A Management Perspective by Francesco Corea</title>
		<link>https://baldazen.wordpress.com/2017/05/23/book-review-big-data-analytics-a-management-perspective-by-francesco-corea/</link>
					<comments>https://baldazen.wordpress.com/2017/05/23/book-review-big-data-analytics-a-management-perspective-by-francesco-corea/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Tue, 23 May 2017 18:15:48 +0000</pubDate>
				<category><![CDATA[books]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[management]]></category>
		<category><![CDATA[strategy]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1280</guid>

					<description><![CDATA[I stumbled upon Francesco Corea&#8217;s writings on Medium and I started following his posts about Data Science and AI strategy. They are concise, clear and no-nonsense. Intrigued, I plunged into his book. To my disappointment. Let me explain. Blog posts such as his are compelling exactly due to their straight statements, clarity and conciseness.  One does &#8230; <a href="https://baldazen.wordpress.com/2017/05/23/book-review-big-data-analytics-a-management-perspective-by-francesco-corea/" class="more-link">Continue reading <span class="screen-reader-text">Book review: Big Data Analytics: A Management Perspective by Francesco&#160;Corea</span></a>]]></description>
										<content:encoded><![CDATA[<p><a href="http://amzn.to/2qSreOh"><img loading="lazy" data-attachment-id="1296" data-permalink="https://baldazen.wordpress.com/2017/05/23/book-review-big-data-analytics-a-management-perspective-by-francesco-corea/francesco_corea_big_data_analytics_a_management_perspective/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg" data-orig-size="153,230" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Francesco_Corea_Big_Data_Analytics_A_Management_Perspective" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg?w=153" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg?w=153" class=" size-full wp-image-1296 aligncenter" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg?w=656" alt="Francesco_Corea_Big_Data_Analytics_A_Management_Perspective"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg 153w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg?w=100&amp;h=150 100w" sizes="(max-width: 153px) 100vw, 153px" /></a></p>
<p>I stumbled upon Francesco Corea&#8217;s writings on <strong><a href="https://medium.com/@Francesco_AI">Medium</a></strong> and I started following his posts about Data Science and AI strategy. They are concise, clear and no-nonsense. Intrigued, I plunged into his book. To my disappointment. Let me explain.</p>
<p>Blog posts such as his are compelling precisely because of their direct statements, clarity and conciseness. One does not expect a thorough treatment of the subject matter, but a precise statement of opinion.</p>
<p>A book is a different story. It offers the space and time to delve deeper into the subject, provide proper arguments and evidence, and illustrate them with many real examples. All of this is missing from Corea&#8217;s <strong><a href="http://amzn.to/2qSreOh">Big Data Analytics: A Management Perspective</a></strong>. Indeed, it is only 48 pages long. The penultimate chapter, titled &#8220;Where are we going? The path toward an artificial intelligence&#8221;, is four paragraphs long, plus a paragraph for the abstract.</p>
<p>Don&#8217;t get me wrong. The book does make sense, and it offers good advice and a quick overview of the trends and key terminology of big data analytics, but it feels like a sketch, a book outline more than a proper book.</p>
<p>Critique aside, I will continue reading Corea on Medium. I&#8217;m curious to see where he is going, since I perceive a strong ambition to become a key thought leader in this area. But the road is still long.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/23/book-review-big-data-analytics-a-management-perspective-by-francesco-corea/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/francesco_corea_big_data_analytics_a_management_perspective.jpg" medium="image">
			<media:title type="html">Francesco_Corea_Big_Data_Analytics_A_Management_Perspective</media:title>
		</media:content>
	</item>
		<item>
		<title>Links of the week</title>
		<link>https://baldazen.wordpress.com/2017/05/20/links-of-the-week-14/</link>
					<comments>https://baldazen.wordpress.com/2017/05/20/links-of-the-week-14/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Sat, 20 May 2017 06:08:27 +0000</pubDate>
				<category><![CDATA[links of the week]]></category>
		<category><![CDATA[backpropagation]]></category>
		<category><![CDATA[deep learning]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[harari]]></category>
		<category><![CDATA[hinton]]></category>
		<category><![CDATA[history]]></category>
		<category><![CDATA[image processing]]></category>
		<category><![CDATA[image segmentation]]></category>
		<category><![CDATA[learning]]></category>
		<category><![CDATA[learning to learn]]></category>
		<category><![CDATA[math]]></category>
		<category><![CDATA[neuroscience]]></category>
		<category><![CDATA[sapiens]]></category>
		<category><![CDATA[society]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[time]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1202</guid>

					<description><![CDATA[Sunset on the north face of the Cima del Cantun (m. 3354) reflected in a small lake, Val Bregaglia, Switzerland. How could we do this? &#8211; SUM A concise summary of Yuval Harari&#8217;s Sapiens, which I read last year and which left me profoundly impressed by our history. Deep, Deep Trouble &#8211; Michael Elad Elad reflects &#8230; <a href="https://baldazen.wordpress.com/2017/05/20/links-of-the-week-14/" class="more-link">Continue reading <span class="screen-reader-text">Links of the&#160;week</span></a>]]></description>
										<content:encoded><![CDATA[<p><a href="http://baldassarre.photoshelter.com/gallery-image/Landscapes-and-Mountains-photographs/G0000_vMZ3OepmXg/I0000saLYCTF3rmI"><img loading="lazy" data-attachment-id="1256" data-permalink="https://baldazen.wordpress.com/2017/05/20/links-of-the-week-14/balda_200608_trek_156/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg" data-orig-size="1200,798" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Balda_200608_Trek_156" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=656" class=" size-full wp-image-1256 aligncenter" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=656" alt="Balda_200608_Trek_156.jpg"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg 1200w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=150&amp;h=100 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=300&amp;h=200 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=768&amp;h=511 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg?w=1024&amp;h=681 1024w" sizes="(max-width: 1200px) 100vw, 1200px" /></a>Sunset on the north face of the Cima del Cantun (m. 
3354) reflected in a small lake, Val Bregaglia, Switzerland.</p>
<p><strong><a href="http://sumpeople.ch/2017/05/how-could-we-do-this/">How could we do this? &#8211; SUM</a><br />
</strong>A concise summary of <strong><a href="http://amzn.to/2pL3Ldv">Yuval Harari&#8217;s Sapiens</a></strong>, which I read last year and which left me profoundly impressed by our history.</p>
<p><strong><a href="https://sinews.siam.org/Details-Page/deep-deep-trouble">Deep, Deep Trouble &#8211; Michael Elad</a><br />
</strong>Elad reflects on the impact of deep learning on image processing. Should we throw away rigorous mathematical models for the improved, but black-box, performance of deep learning?</p>
<p><strong><a href="https://www.youtube.com/watch?v=VIRCybGgHts">Can the brain do back-propagation? &#8211; Geoffrey Hinton<br />
</a></strong>A seminar from last year by Geoffrey Hinton at Stanford on why he thinks that the brain can actually do back-propagation, addressing four obstacles raised by neuroscientists.</p>
<div class="jetpack-video-wrapper"><iframe class="youtube-player" width="560" height="315" src="https://www.youtube.com/embed/VIRCybGgHts?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en&#038;autohide=2&#038;wmode=transparent" allowfullscreen="true" style="border:0;" sandbox="allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox"></iframe></div>
<p><strong><a href="https://blog.athelas.com/a-brief-history-of-cnns-in-image-segmentation-from-r-cnn-to-mask-r-cnn-34ea83205de4">A Brief History of CNNs in Image Segmentation: From R-CNN to Mask R-CNN &#8211; Dhruv Parthasarathy</a><br />
</strong>A well-written post about the development from AlexNet to Mask R-CNN for pixel-level image segmentation.</p>
<p><strong><a href="https://www.scotthyoung.com/blog/2017/05/09/interview-dr-barbara-oakley/">Should You Listen to Music While Studying, The Pi Model and Learning How to Learn w/ Dr. Barbara Oakley &#8211; Scott Young</a><br />
</strong>An interesting 20-minute conversation about learning techniques and tips.</p>
<div class="jetpack-video-wrapper"><iframe class="youtube-player" width="560" height="315" src="https://www.youtube.com/embed/uDtA9cWNUYY?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en&#038;autohide=2&#038;wmode=transparent" allowfullscreen="true" style="border:0;" sandbox="allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox"></iframe></div>
<p><strong><a href="https://www.unlimited.world/future-labs/escaping-the-24-hour-dystopia">Escaping The 24-hour Dystopia &#8211; Unlimited</a><br />
</strong>&#8220;Busyness has become a global cult&#8221;. We cannot keep pace with the online onslaught of information. What&#8217;s the cure? This article surveys some technological solutions: brain enhancement, supersonic travel, Neuralink and others. My take is that we should consider behavioral solutions first.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/20/links-of-the-week-14/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_200608_trek_156.jpg" medium="image">
			<media:title type="html">Balda_200608_Trek_156.jpg</media:title>
		</media:content>
	</item>
		<item>
		<title>Understanding deep learning requires rethinking generalization</title>
		<link>https://baldazen.wordpress.com/2017/05/19/understanding-deep-learning-requires-rethinking-generalization/</link>
					<comments>https://baldazen.wordpress.com/2017/05/19/understanding-deep-learning-requires-rethinking-generalization/#comments</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Fri, 19 May 2017 19:00:29 +0000</pubDate>
				<category><![CDATA[papers]]></category>
		<category><![CDATA[deep learning]]></category>
		<category><![CDATA[deep neural networks]]></category>
		<category><![CDATA[generalization]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1227</guid>

					<description><![CDATA[Zhang et al have written a splendid concise paper that shows how neural networks, even of depth 2, can easily fit random labels from random data. Furthermore, from their experiments with Inception-like architectures they observe that: The effective capacity of neural networks is large enough for a brute force memorization of the entire dataset. Even &#8230; <a href="https://baldazen.wordpress.com/2017/05/19/understanding-deep-learning-requires-rethinking-generalization/" class="more-link">Continue reading <span class="screen-reader-text">Understanding deep learning requires rethinking&#160;generalization</span></a>]]></description>
										<content:encoded><![CDATA[<p><img loading="lazy" data-attachment-id="1245" data-permalink="https://baldazen.wordpress.com/2017/05/19/understanding-deep-learning-requires-rethinking-generalization/understanding-deep-learning-requires-rethinking-generalization/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png" data-orig-size="1374,654" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Understanding deep learning requires rethinking generalization" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=656" class="alignnone size-full wp-image-1245" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=656" alt="Understanding deep learning requires rethinking generalization.png"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png 1374w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=150&amp;h=71 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=300&amp;h=143 300w, 
https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=768&amp;h=366 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png?w=1024&amp;h=487 1024w" sizes="(max-width: 1374px) 100vw, 1374px" /></p>
<p>Zhang et al. have written a <a href="https://arxiv.org/pdf/1611.03530.pdf">splendid, concise paper</a> showing that neural networks, even of depth 2, can easily fit random labels on random data.</p>
<p>Furthermore, from their experiments with <a href="http://nicolovaligi.com/history-inception-deep-learning-architecture.html">Inception-like</a> architectures they observe that:</p>
<ol>
<li><em>The effective capacity of neural networks is large enough for a brute force memorization of the entire dataset.</em></li>
<li><em>Even optimization on random labels remains easy. In fact, training time increases only by a small constant factor compared with training on the true labels.</em></li>
<li><em>Randomizing labels is solely a data transformation, leaving all other properties of the learning problem unchanged.</em></li>
</ol>
<p>The authors also show that standard generalization theories, such as <a href="https://en.wikipedia.org/wiki/VC_dimension">VC dimension</a>, <a href="https://en.wikipedia.org/wiki/Rademacher_complexity">Rademacher complexity</a> and <a href="https://en.wikipedia.org/wiki/Stability_(learning_theory)">uniform stability</a>, cannot explain why networks that have the capacity to memorize the entire dataset can still generalize well.</p>
<p><em>&#8220;Explicit <a href="http://www.deeplearningbook.org/contents/regularization.html">regularization</a> may improve performance, but is neither necessary nor by itself sufficient for controlling generalization error.&#8221;</em></p>
<p><a href="https://arxiv.org/pdf/1611.03530.pdf">This paper</a> is one of those rare ones that shows our ignorance with crystalline clarity.</p>
<p><strong>Abstract</strong></p>
<p><em>Despite their massive size, successful deep artificial neural networks can exhibit a remarkably small difference between training and test performance. Conventional wisdom attributes small generalization error either to properties of the model family, or to the regularization techniques used during training. Through extensive systematic experiments, we show how these traditional approaches fail to explain why large neural networks generalize well in practice. Specifically, our experiments establish that state-of-the-art convolutional networks for image classification trained with stochastic gradient methods easily fit a random labeling of the training data. This phenomenon is qualitatively unaffected by explicit regularization, and occurs even if we replace the true images by completely unstructured random noise. We corroborate these experimental findings with a theoretical construction showing that simple depth two neural networks already have perfect finite sample expressivity as soon as the number of parameters exceeds the number of data points as it usually does in practice. We interpret our experimental findings by comparison with traditional models. </em></p>
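<p>To make the finite-sample expressivity claim concrete, here is a minimal sketch in Python with NumPy. It is not the construction from the paper: it assumes a depth-two ReLU network whose first layer is random and kept fixed, and fits only the output weights by least squares. The function name <code>fit_random_labels</code> and all sizes are hypothetical choices for illustration.</p>

```python
import numpy as np


def fit_random_labels(n_samples=200, n_features=20, width=1000, seed=0):
    """Fit random labels on random inputs with a depth-two ReLU network.

    Assumption (not from the paper): the hidden layer is random and fixed,
    and only the output weights are solved for by least squares.
    Returns the training accuracy.
    """
    rng = np.random.default_rng(seed)

    # Unstructured random inputs and random +1/-1 labels.
    X = rng.standard_normal((n_samples, n_features))
    y = rng.integers(0, 2, size=n_samples) * 2.0 - 1.0

    # Random hidden layer with more units than data points.
    W = rng.standard_normal((n_features, width))
    b = rng.standard_normal(width)
    H = np.maximum(X @ W + b, 0.0)  # ReLU activations, shape (n_samples, width)

    # Output weights: minimum-norm least-squares solution of H a = y.
    a, *_ = np.linalg.lstsq(H, y, rcond=None)

    preds = np.sign(H @ a)
    return float(np.mean(preds == y))


if __name__ == "__main__":
    print("training accuracy on random labels:", fit_random_labels())
```

<p>With more hidden units than data points, the hidden-layer matrix generically has full row rank, so the output layer can interpolate any labels, random ones included. The training accuracy should therefore come out at, or extremely close to, 1.0, which by itself says nothing about test performance.</p>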
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/19/understanding-deep-learning-requires-rethinking-generalization/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/understanding-deep-learning-requires-rethinking-generalization.png" medium="image">
			<media:title type="html">Understanding deep learning requires rethinking generalization.png</media:title>
		</media:content>
	</item>
		<item>
		<title>Links of the week</title>
		<link>https://baldazen.wordpress.com/2017/05/13/links-of-the-week-13/</link>
					<comments>https://baldazen.wordpress.com/2017/05/13/links-of-the-week-13/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Sat, 13 May 2017 14:44:33 +0000</pubDate>
				<category><![CDATA[links of the week]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[chess]]></category>
		<category><![CDATA[deep work]]></category>
		<category><![CDATA[kasparov]]></category>
		<category><![CDATA[perspective]]></category>
		<category><![CDATA[philosophy]]></category>
		<category><![CDATA[productivity]]></category>
		<category><![CDATA[quantification]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1172</guid>

					<description><![CDATA[Arches onto high cliff over the Mediterranean. Portovenere, Italy. Deep Habits: The Importance of Planning Every Minute of Your Work Day &#8211; Study Hacks How to increase your productivity by taking control of your time via time blocking. Chaos, Ignorance and Newton’s Great Puzzle &#8211; Scott Young Luck, chaos or ignorance? Understanding this mixture for your &#8230; <a href="https://baldazen.wordpress.com/2017/05/13/links-of-the-week-13/" class="more-link">Continue reading <span class="screen-reader-text">Links of the&#160;week</span></a>]]></description>
										<content:encoded><![CDATA[<p style="text-align:center;"><a href="http://baldassarre.photoshelter.com/gallery-image/Landscapes-and-Mountains-photographs/G0000_vMZ3OepmXg/I0000PBUPf2yUhkg"><img loading="lazy" data-attachment-id="1200" data-permalink="https://baldazen.wordpress.com/2017/05/13/links-of-the-week-13/portovenere-arches-onto-high-cliff-over-the-mediterranean-la-spezia-italy/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg" data-orig-size="1200,785" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;Luca Baldassarre&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;Portovenere. Arches onto high cliff over the Mediterranean. La Spezia, Italy.&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;\u00ae Luca Baldassarre&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;Portovenere. Arches onto high cliff over the Mediterranean. La Spezia, Italy.&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Portovenere. Arches onto high cliff over the Mediterranean. La Spezia, Italy." data-image-description="" data-image-caption="&lt;p&gt;Portovenere. Arches onto high cliff over the Mediterranean. La Spezia, Italy.&lt;/p&gt;
" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=300" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=656" class=" size-full wp-image-1200 alignnone" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=656" alt="Balda_P0030.jpg"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg 1200w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=150&amp;h=98 150w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=300&amp;h=196 300w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=768&amp;h=502 768w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg?w=1024&amp;h=670 1024w" sizes="(max-width: 1200px) 100vw, 1200px" /></a><br />
Arches onto high cliff over the Mediterranean. Portovenere, Italy.</p>
<p><strong><a href="http://calnewport.com/blog/2013/12/21/deep-habits-the-importance-of-planning-every-minute-of-your-work-day/">Deep Habits: The Importance of Planning Every Minute of Your Work Day &#8211; Study Hacks</a><br />
</strong>How to increase your productivity by taking control of your time via <em>time blocking.</em></p>
<p><strong><a href="https://www.scotthyoung.com/blog/2017/05/03/chaos-factor/">Chaos, Ignorance and Newton’s Great Puzzle &#8211; Scott Young<br />
</a></strong>Luck, chaos or ignorance? Understanding how they mix in your projects may help you allocate resources better.</p>
<p><strong><a href="https://medium.com/conversations-with-tyler/garry-kasparov-tyler-cowen-chess-iq-ai-putin-3bf28baf4dba">Garry Kasparov on AI, Chess, and the Future of Creativity &#8211; Mercatus Center</a></strong><br />
A very interesting conversation with Garry Kasparov on chess, AI, Russian politics, education and creativity.</p>
<p><strong><a href="http://justice-everywhere.org/democracy/if-everything-is-measured-can-we-still-see-one-another-as-equals/">If everything is measured, can we still see one another as equals? &#8211; Justice Everywhere</a><br />
</strong>The dangers of measuring everything and ranking ourselves on different scales, neglecting those human skills and experiences that cannot and should not be quantified.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/13/links-of-the-week-13/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/balda_p0030.jpg" medium="image">
			<media:title type="html">Balda_P0030.jpg</media:title>
		</media:content>
	</item>
		<item>
		<title>Failures of Gradient-Based Deep Learning</title>
		<link>https://baldazen.wordpress.com/2017/05/12/failures-of-gradient-based-deep-learning/</link>
					<comments>https://baldazen.wordpress.com/2017/05/12/failures-of-gradient-based-deep-learning/#respond</comments>
		
		<dc:creator><![CDATA[baldazen]]></dc:creator>
		<pubDate>Fri, 12 May 2017 06:55:07 +0000</pubDate>
				<category><![CDATA[papers]]></category>
		<category><![CDATA[deep learning]]></category>
		<category><![CDATA[failure]]></category>
		<category><![CDATA[gradient]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">http://baldazen.wordpress.com/?p=1182</guid>

					<description><![CDATA[A very informative article by Shalev-Shwartz, Shamir and Shammah about critical problems faced when solving some simple problems via neural networks trained with gradient-based methods. Find the article here. Abstract In recent years, Deep Learning has become the go-to solution for a broad range of applications, often outperforming state-of-the-art. However, it is important, for both &#8230; <a href="https://baldazen.wordpress.com/2017/05/12/failures-of-gradient-based-deep-learning/" class="more-link">Continue reading <span class="screen-reader-text">Failures of Gradient-Based Deep&#160;Learning</span></a>]]></description>
										<content:encoded><![CDATA[<p><a href="https://arxiv.org/pdf/1703.07950"><img loading="lazy" data-attachment-id="1185" data-permalink="https://baldazen.wordpress.com/2017/05/12/failures-of-gradient-based-deep-learning/limitations_of_gradient-based_learning/" data-orig-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png" data-orig-size="1380,1466" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="limitations_of_gradient-based_learning" data-image-description="" data-image-caption="" data-medium-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=282" data-large-file="https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=656" class=" size-full wp-image-1185 aligncenter" src="https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=656" alt="limitations_of_gradient-based_learning"   srcset="https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png 1380w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=141&amp;h=150 141w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=282&amp;h=300 282w, https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=768&amp;h=816 768w, 
https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png?w=964&amp;h=1024 964w" sizes="(max-width: 1380px) 100vw, 1380px" /></a></p>
<p>A very informative article by Shalev-Shwartz, Shamir and Shammah about critical difficulties that arise when some simple problems are tackled with neural networks trained by gradient-based methods. Find the article <a href="https://arxiv.org/pdf/1703.07950">here</a>.</p>
<div><b>Abstract</b></div>
<div>In recent years, Deep Learning has become the go-to solution for a broad range of applications, often outperforming state-of-the-art. However, it is important, for both theoreticians and practitioners, to gain a deeper understanding of the difficulties and limitations associated with common approaches and algorithms. We describe four types of simple problems, for which the gradient-based algorithms commonly used in deep learning either fail or suffer from significant difficulties. We illustrate the failures through practical experiments, and provide theoretical insights explaining their source, and how they might be remedied.</div>
]]></content:encoded>
					
					<wfw:commentRss>https://baldazen.wordpress.com/2017/05/12/failures-of-gradient-based-deep-learning/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		
		<media:content url="https://1.gravatar.com/avatar/a667e4083bd1a4b0ef3679c1a55eb1e6449cad178243c13be80246a696c28390?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">baldazen</media:title>
		</media:content>

		<media:content url="https://baldazen.wordpress.com/wp-content/uploads/2017/05/limitations_of_gradient-based_learning.png" medium="image">
			<media:title type="html">limitations_of_gradient-based_learning</media:title>
		</media:content>
	</item>
	</channel>
</rss>
