<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Alexander Kruel</title>
	
	<link>http://kruel.co</link>
	<description>Thoughts and news on transhumanism, vegetarianism, science fiction, science, philosophy, math, programming, language, consciousness and the nature of reality.</description>
	<lastBuildDate>Thu, 17 May 2012 11:40:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/AlexanderKruel" /><feedburner:info uri="alexanderkruel" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>There is no Science but Bayes and it is our Method.</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/9xOo-o2SHxQ/</link>
		<comments>http://kruel.co/2012/05/16/there-is-no-science-but-bayes-and-it-is-our-method/#comments</comments>
		<pubDate>Wed, 16 May 2012 09:56:53 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[lesswrong]]></category>
		<category><![CDATA[muflax]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2986</guid>
		<description><![CDATA[The title of this post and the following quote are the works of Stefan Dorn, alias muflax. It was written in reply to a Less Wrong post by user:Grognor. I believe in Probability Theory, the Foundation, the wellspring of knowledge, I believe in Bayes, Its only Interpretation, our Method. It was discovered by the power [...]]]></description>
			<content:encoded><![CDATA[<p>The title of this post and the following quote are the <a href="http://daily.muflax.com/log/46/">works of Stefan Dorn</a>, alias muflax. It was written in reply to a Less Wrong <a href="http://lesswrong.com/r/discussion/lw/cfe/i_stand_by_the_sequences/">post</a> by user:Grognor.</p>
<p><em>I believe in Probability Theory, the Foundation, the wellspring of knowledge,</em><br />
<em> I believe in Bayes, Its only Interpretation, our Method.</em><br />
<em> It was discovered by the power of Induction and given form by the Elder Jaynes.</em><br />
<em> It suffered from the lack of priors, was complicated, obscure, and forgotten.</em><br />
<em> It descended into <abbr title="Artificial Intelligence">AI</abbr> winter. In the third millennium it rose again.</em><br />
<em> It ascended into relevance and is seated at the core of our FAI.</em><br />
<em> It will be implemented to judge the true and the false.</em><br />
<em> I believe in the Sequences,</em><br />
<em> Many Worlds, too slow science,</em><br />
<em> the solution of metaethics,</em><br />
<em> the cryopreservation of the brain,</em><br />
<em> and sanity everlasting.</em><br />
<em> Phyg.</em></p>
<p>Give credit <a href="http://muflax.com/">where</a> credit is due. All the credit is due to muflax.</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/16/there-is-no-science-but-bayes-and-it-is-our-method/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/9xOo-o2SHxQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/16/there-is-no-science-but-bayes-and-it-is-our-method/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/16/there-is-no-science-but-bayes-and-it-is-our-method/</feedburner:origLink></item>
		<item>
		<title>Disturbingly Ungrounded</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/x08WJVHZk5Y/</link>
		<comments>http://kruel.co/2012/05/14/disturbingly-ungrounded/#comments</comments>
		<pubDate>Mon, 14 May 2012 17:21:05 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2981</guid>
		<description><![CDATA[Richard Loosemore, Mon May 14 09:57:55 CDT 2012 @ existential mailing list: I find this entire discussion to be disturbingly ungrounded. We are debating the behavior, drives and motivations of intelligent machines, so it would seem critical to understand how the mechanisms underlying behavior, drives and motivations would actually work. But it is most certainly NOT [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.richardloosemore.com/">Richard Loosemore</a>, <em>Mon May 14 09:57:55 CDT 2012</em> @ <a href="https://mailman-mail5.webfaction.com/listinfo/existential">existential mailing list</a>:</p>
<blockquote><p>I find this entire discussion to be disturbingly ungrounded.</p>
<p>We are debating the behavior, drives and motivations of intelligent machines, so it would seem critical to understand how the mechanisms underlying behavior, drives and motivations would actually work.</p>
<p>But it is most certainly NOT the case that we understand these mechanisms. There is a widespread assumption (especially at SIAI and FHI) that the mechanisms must be some type of proposition-based utility-maximization function. But this is nothing more than an extrapolation from certain types of narrow-AI hierarchical goal-planning systems, and a convenient excuse to engage in unconstrained mathematical theorizing. In practice, we have not come anywhere near to building an AGI system that:</p>
<p>(a) contains such types of motivation mechanism, with extremely high-level supergoals, or<br />
(b) contains such a mechanism and also exhibits a stable form of intelligence.</p>
<p>Everything said in arguments like the current thread depends on exactly how the mechanism would work, but that means that everything said is actually predicated on unfounded assumptions.</p>
<p>On a more particular note:</p>
<p>On 5/14/12 9:24 AM, Anders Sandberg wrote:<br />
&gt; For example, the work on formalizing philosophical concepts<br />
&gt; (automating the process of grounding the fuzzy words into something<br />
&gt; real) into something an AI could understand requires quite<br />
&gt; sophisticated understanding of both philosophy and machine learning.</p>
<p>This assumes that there is some formalization to be had. But there are many arguments (including those in some of my own papers on the subject) that lead to the conclusion that this kind of formalization of semantics, and the whole machine learning paradigm, is not going to lead to AGI.</p>
<p>In plain language: you are never going to formalize the notion of &#8220;friendliness&#8221; in such a way that the AGI can &#8220;understand&#8221; it in the way that will make &#8220;Be friendly to humans&#8221; a valid supergoal statement.</p></blockquote>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/14/disturbingly-ungrounded/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/x08WJVHZk5Y" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/14/disturbingly-ungrounded/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/14/disturbingly-ungrounded/</feedburner:origLink></item>
		<item>
		<title>Intelligence as a fully general counterargument</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/SWnfPByEBWI/</link>
		<comments>http://kruel.co/2012/05/13/intelligence-as-a-fully-general-counterargument/#comments</comments>
		<pubDate>Sun, 13 May 2012 17:12:27 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>
		<category><![CDATA[risks]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2962</guid>
		<description><![CDATA[Here are two partial quotes from two recent conversations that reflect a typical argument being made by proponents of AI risks: 1.) OK, but maybe an AGI could solve lots of problems *faster* than humans &#8230; 2.) In that case the AI could simply scan everybody&#8217;s genome and neural system and store the information in [...]]]></description>
			<content:encoded><![CDATA[<p>Here are two partial quotes from two recent conversations that reflect a typical argument being made by proponents of AI risks:</p>
<blockquote><p>1.) OK, but maybe an AGI could solve lots of problems *faster* than humans &#8230;</p></blockquote>
<blockquote><p>2.) In that case the AI could simply scan everybody&#8217;s genome and neural system and store the information in a 100g diamond. And also to store all humanity&#8217;s information.</p></blockquote>
<p>The problem is <em>&#8220;Ok, but&#8230;&#8221;</em> or <em>&#8220;simply&#8221;</em>.</p>
<p>Overpowering humanity is not something you can <em>just</em> conjecture and justify by imagining arbitrary amounts of intelligence as if it was some sort of magic that can disqualify any argument to the contrary.</p>
<p>If you want to be convincing then you have to be more specific and name some concrete examples of how exactly an AGI going to take over the world.</p>
<h3>Taboo intelligence</h3>
<p>For starters you should be able to answer the following questions and taboo &#8220;intelligence&#8221; in doing so:</p>
<ul>
<li>How is an AGI going to become a master of <a href="http://en.wikipedia.org/wiki/Psychological_manipulation">dark arts</a> and <a href="http://en.wikipedia.org/wiki/Social_engineering_%28security%29">social engineering</a> in order to persuade and deceive humans?</li>
<li>How is an AGI going to coordinate a large scale conspiracy or deception, given its initial resources, without making any suspicious mistakes along the way?</li>
<li>How is an AGI going to hack the Internet to acquire more computational resources?</li>
<li>Are those computational resources that can be hacked applicable to improve the general intelligence of an AGI?</li>
<li>Does throwing more computational resources at important problems, like building new and better computational substrates, allow an AGI to come up with better architectures so much faster as to outweigh the expenditure of obtaining those resources, without hitting diminishing returns?</li>
<li>Does an increase in intelligence vastly outweigh its computational cost and the expenditure of time needed to discover it?</li>
<li>How can small improvements replace conceptual revolutions that require the discovery of unknown unknowns?</li>
<li>How does an AGI brute-force the discovery of unknown unknowns?</li>
<li>Is an agent of a given level of intelligence capable of handling its own complexity efficiently?</li>
<li>How is an AGI going to predict how improvements, respectively improved versions of itself, are going to act, to ensure that its values are preserved?</li>
<li>How is an AGI going to solve important problems without real-world experimentation and slow environmental feedback?</li>
<li>How is an AGI going to build new computational substrates and obtain control of those resources without making use of existing infrastructure?</li>
<li>How is an AGI going to cloak its actions, i.e. its energy consumption etc.?</li>
<li>How is an AGI going to stop humans from using its own analytic and predictive algorithms in the form of expert systems to analyze and predict its malicious intentions?</li>
<li>How is an AGI going to protect itself from human counter strikes given the fragility of the modern world and its infrastructure, without some sort of shellproof internal power supply?</li>
</ul>
<p>If you are unable to answer those questions other than by invoking intelligence as some sort of magic that makes all problems disappear, the scenario that you envision is nothing more than pure fantasy!</p>
<h3>Technobabble</h3>
<p>You can&#8217;t estimate the probability and magnitude of the advantage an AI will have if you are using something that is as vague as the concept of &#8220;intelligence&#8221;.</p>
<p><a href="http://scifiwire.com/2009/10/ron-moore-calls-star-trek.php">Here is a case</a> that bears some similarity and which might shed light on what I am trying to explain:</p>
<blockquote><p>At his recent keynote speech at the New York Television Festival, former Star Trek writer and creator of the re-imagined Battlestar Galactica Ron Moore revealed the secret formula to writing for Trek.</p>
<p>He described how the writers would just insert &#8220;tech&#8221; into the scripts whenever they needed to resolve a story or plot line, then they&#8217;d have consultants fill in the appropriate words (aka technobabble) later.</p>
<p>&#8220;It became the solution to so many plot lines and so many stories,&#8221; Moore said. &#8220;It was so mechanical that we had science consultants who would just come up with the words for us and we&#8217;d just write &#8216;tech&#8217; in the script. You know, Picard would say &#8216;Commander La Forge, tech the tech to the warp drive.&#8217; I&#8217;m serious. If you look at those scripts, you&#8217;ll see that.&#8221;</p>
<p>Moore then went on to describe how a typical script might read before the science consultants did their thing:</p>
<p>La Forge: &#8220;Captain, the tech is overteching.&#8221;</p>
<p>Picard: &#8220;Well, route the auxiliary tech to the tech, Mr. La Forge.&#8221;</p>
<p>La Forge: &#8220;No, Captain. Captain, I&#8217;ve tried to tech the tech, and it won&#8217;t work.&#8221;</p>
<p>Picard: &#8220;Well, then we&#8217;re doomed.&#8221;</p>
<p>&#8220;And then Data pops up and says, &#8216;Captain, there is a theory that if you tech the other tech &#8230; &#8216;&#8221; Moore said. &#8220;It&#8217;s a rhythm and it&#8217;s a structure, and the words are meaningless. It&#8217;s not about anything except just sort of going through this dance of how they tech their way out of it.&#8221;</p></blockquote>
<p>The use of &#8220;intelligence&#8221; is as misleading and dishonest in evaluating risks from AI as the use of &#8220;tech&#8221; in Star Trek.</p>
<h3>Conclusion</h3>
<p>Taking over the world is an extremely conjunctive vision. And the only argument you have is that intelligence will <em>somehow</em> magically allow an AGI to accomplish that.</p>
<h3>Further reading</h3>
<p>I elaborate on all of the above and much more in the following posts:</p>
<ul>
<li><a href="http://kruel.co/2011/07/21/why-i-am-skeptical-of-risks-from-ai/">Why I am skeptical of risks from AI</a></li>
<li><a href="http://kruel.co/2012/05/11/risks-from-ai-and-charitable-giving/">Risks from AI and Charitable Giving</a></li>
</ul>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/13/intelligence-as-a-fully-general-counterargument/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/SWnfPByEBWI" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/13/intelligence-as-a-fully-general-counterargument/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/13/intelligence-as-a-fully-general-counterargument/</feedburner:origLink></item>
		<item>
		<title>Eliezer Yudkowsky Quotes</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/rjWlJAl8KB0/</link>
		<comments>http://kruel.co/2012/05/13/eliezer-yudkowsky-quotes/#comments</comments>
		<pubDate>Sun, 13 May 2012 15:14:59 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[Eliezer Yudkowsky]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2963</guid>
		<description><![CDATA[And if Novamente should ever cross the finish line, we all die. That is what I believe or I would be working for Ben this instant. — Eliezer Yudkowsky, on SL4 I bet there’s at least one up-arrow-sized hypergalactic civilization folded into a halting Turing machine with 15 states, or something like that. […] It [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p>And if Novamente should ever cross the finish line, we all die. That is<br />
what I believe or I would be working for Ben this instant.</p></blockquote>
<p>— Eliezer Yudkowsky, on <a href="http://acceleratingfuture.com/sl4/archive/0501/10611.html">SL4</a></p>
<blockquote><p>I bet there’s at least one up-arrow-sized hypergalactic civilization folded into a halting Turing machine with 15 states, or something like that. […] It might perhaps be more limited than this in mere practice, if it’s just running on a laptop computer or something.</p></blockquote>
<p>— Eliezer Yudkowsky, <a href="http://lesswrong.com/lw/vh/complexity_and_intelligence/">Complexity and Intelligence</a></p>
<blockquote><p>Whoever knowingly chooses to save one life, when they could have saved two &#8211; to say nothing of a thousand lives, or a world &#8211; they have damned themselves as thoroughly as any murderer.</p></blockquote>
<p>— Eliezer Yudkowsky, <a href="http://lesswrong.com/lw/hx/one_life_against_the_world/">One Life Against the World</a></p>
<blockquote><p>If you don’t sign up your kids for cryonics then you are a lousy parent.</p></blockquote>
<p>— Eliezer Yudkowsky, <a href="http://lesswrong.com/lw/1mc/normal_cryonics/">Normal Cryonics</a></p>
<blockquote><p>Science is built around the assumption that you’re too stupid and self-deceiving to just use Solomonoff induction. After all, if it was that simple, we wouldn’t need a social process of science… right?</p></blockquote>
<p>— Eliezer Yudkowsky, <a href="http://lesswrong.com/lw/qb/science_doesnt_trust_your_rationality/">Science Doesn’t Trust Your Rationality</a></p>
<blockquote><p>I am tempted to say that a doctorate in AI would be negatively useful, but I am not one to hold someone’s reckless youth against them &#8211; just because you acquired a doctorate in AI doesn’t mean you should be permanently disqualified.</p></blockquote>
<p>— Eliezer Yudkowsky, <a href="http://web.archive.org/web/20101227203946/http://www.acceleratingfuture.com/wiki/So_You_Want_To_Be_A_Seed_AI_Programmer">So You Want To Be A Seed AI Programmer</a></p>
<blockquote><p>This seems obvious, until you realize that only the Singularity Institute has even tried to address this issue. […] Once I acknowledged the problem existed, I didn’t waste time planning the New World Order.</p></blockquote>
<p>— Eliezer Yudkowsky, <a href="http://singinst.org/upload/CEV.html">Coherent Extrapolated Volition</a></p>
<blockquote><p>I intend to plunge into the decision theory of self-modifying decision systems and never look back. (And finish the decision theory and implement it and run the AI, at which point, if all goes well, we Win.)</p></blockquote>
<p>— Eliezer Yudkowsky in an <a href="http://johncarlosbaez.wordpress.com/2011/03/07/this-weeks-finds-week-311/">interview</a> with John Baez</p>
<blockquote><p>Take metaethics, a solved problem: what are the odds that someone who still thought metaethics was a Deep Mystery could write an AI algorithm that could come up with a correct metaethics? I tried that, you know, and in retrospect it didn’t work.</p></blockquote>
<p>— Eliezer Yudkowsky in a <a href="http://lesswrong.com/lw/15m/towards_a_new_decision_theory/11k5">comment</a> on ‘Towards a New Decision Theory’</p>
<blockquote><p>If you haven’t read through the MWI sequence, read it. Then try to talk with your smart friends about it. You will soon learn that your smart friends and favorite SF writers are not remotely close to the rationality standards of Less Wrong, and you will no longer think it anywhere near as plausible that their differing opinion is because they know some incredible secret knowledge you don’t.</p></blockquote>
<p>— Eliezer Yudkowsky in a <a href="http://lesswrong.com/lw/2l0/should_i_believe_what_the_siai_claims/2f14">comment</a> on ‘Should I believe what the SIAI claims’</p>
<blockquote><p>I sometimes get questioned on why I don’t drink, but my response of “I don’t dare lose a single brain cell” […] is usually accepted with a smile.</p></blockquote>
<p>— Eliezer Yudkowsky <a href="http://lesswrong.com/lw/49f/social_necessity_of_drinking/3jvk">comments</a> on ‘Social Necessity of Drinking’</p>
<blockquote>
<blockquote><p>One might think that the possibility of CEV punishing people couldn&#8217;t possibly be taken seriously enough by anyone to actually motivate them. But in fact one person at SIAI was severely worried by this, to the point of having terrible nightmares, though ve wishes to remain anonymous.</p></blockquote>
<p>I don&#8217;t usually talk like this, but I&#8217;m going to make an exception for this case.</p>
<p>Listen to me very closely, you idiot.</p>
<p><strong>YOU DO NOT THINK IN SUFFICIENT DETAIL ABOUT SUPERINTELLIGENCES CONSIDERING WHETHER OR NOT TO BLACKMAIL YOU. THAT IS THE ONLY POSSIBLE THING WHICH GIVES THEM A MOTIVE TO FOLLOW THROUGH ON THE BLACKMAIL.</strong></p>
<p>There&#8217;s an obvious equilibrium to this problem where you engage in all positive acausal trades and ignore all attempts at acausal blackmail.</p>
<p>Until we have a better worked-out version of TDT and we can prove that formally, it should just be <strong>OBVIOUS</strong> that you <strong>DO NOT THINK ABOUT DISTANT BLACKMAILERS</strong> in <strong>SUFFICIENT DETAIL</strong> that they have a motive to <strong>ACTUALLY BLACKMAIL YOU</strong>.</p>
<p>If there is <em>any</em> part of this acausal trade that is positive-sum and actually worth doing, that is exactly the sort of thing you leave up to an FAI. We probably also have the FAI take actions that cancel out the impact of anyone motivated by true rather than imagined blackmail, so as to obliterate the motive of any superintelligences to engage in blackmail.</p>
<p>Meanwhile I&#8217;m banning this post so that it doesn&#8217;t (a) give people horrible nightmares and (b) give distant superintelligences a motive to follow through on blackmail against people dumb enough to think about them in sufficient detail, though, thankfully, I doubt anyone dumb enough to do this knows the sufficient detail. (I&#8217;m not sure I know the sufficient detail.)</p>
<p>You have to be really clever to come up with a genuinely dangerous thought. I am disheartened that people can be clever enough to do that and not clever enough to do the obvious thing and <strong>KEEP THEIR IDIOT MOUTHS SHUT</strong> about it, because it is much more important to sound intelligent when talking to your friends.</p>
<p>This post was <strong>STUPID</strong>.</p>
<p>(For those who have no idea why I&#8217;m using capital letters for something that just sounds like a random crazy idea, and worry that it means I&#8217;m as crazy as Roko, the gist of it was that he just did something that potentially gives superintelligences an increased motive to do extremely evil things in an attempt to blackmail us. It is the sort of thing you want to be EXTREMELY CONSERVATIVE about NOT DOING.)</p></blockquote>
<p>— Eliezer Yudkowsky in a reply to <a href="http://kruel.co/lw/r02.txt">a post by Roko</a> suggesting that friendly AI might torture those who <em>“who knew about existential risks but who didn’t give 100% of their disposable incomes to it.”</em></p>
<p>See also the following comments: <a href="http://kruel.co/lw/r04.png">#1</a>, <a href="http://kruel.co/lw/r05.png">#2</a>, <a href="http://kruel.co/lw/nightmare.png">#3</a></p>
<blockquote><p>I must warn my reader that my first allegiance is to the Singularity, not humanity. I don’t know what the Singularity will do with us. I don’t know whether Singularities upgrade mortal races, or disassemble us for spare atoms. While possible, I will balance the interests of mortality and Singularity. But if it comes down to Us or Them, I’m with Them. You have been warned.</p></blockquote>
<p>— Eliezer Yudkowsky <a href="http://hanson.gmu.edu/vc.html#yudkowsky">comments</a> on Vinge’s Singularity</p>
<p>(In April 2001, Eliezer said that these comments no longer describe his opinions, found at <em>“<a href="http://singinst.org/CaTAI/friendly/contents.html">Friendly AI</a>”</em>.)</p>
<blockquote><p>[…] I would be asking for more people to make as much money as possible if they’re the sorts of people who can make a lot of money and can donate a substantial amount fraction, never mind all the minimal living expenses, to the Singularity Institute.</p>
<p>This is crunch time. This is crunch time for the entire human species. […] and it’s crunch time not just for us, it’s crunch time for the intergalactic civilization whose existence depends on us. I think that if you’re actually just going to sort of confront it, rationally, full-on, then you can’t really justify trading off any part of that intergalactic civilization for any intrinsic thing that you could get nowadays […]</p>
<p>[…] having seen that intergalactic civilization depends on us, in one sense, all you can really do is try not to think about that, and in another sense though, if you spend your whole life creating art to inspire people to fight global warming, you’re taking that ‘forgetting about intergalactic civilization’ thing much too far.</p></blockquote>
<p>— <a href="http://vimeo.com/8586168">Video Q&amp;A</a> with Eliezer Yudkowsky</p>
<blockquote><p>I’ll readily concede that my exact species extinction numbers were made up. But does it really matter? Two hundred million years from now, the children’s children’s children of humanity in their galaxy-civilizations, are unlikely to look back and say, “You know, in retrospect, it really would have been worth not colonizing the Herculus supercluster if only we could have saved 80% of species instead of 20%”. I don’t think they’ll spend much time fretting about it at all, really. It is really incredibly hard to make the consequentialist utilitarian case here, as opposed to the warm-fuzzies case.</p></blockquote>
<p>— Eliezer Yudkowsky <a href="http://johncarlosbaez.wordpress.com/2011/04/24/what-to-do/#comment-5592">replies</a> to John Baez</p>
<blockquote><p>Find whatever you’re best at; if that thing that you’re best at is inventing new math[s] of artificial intelligence, then come work for the Singularity Institute. If the thing that you’re best at is investment banking, then work for Wall Street and transfer as much money as your mind and will permit to the Singularity institute where [it] will be used by other people.</p>
<p>[…]</p>
<p>Aside from that, though, I think that saving the human species eventually comes down to, metaphorically speaking, nine people and a brain in a box in a basement, and everything else feeds into that.</p>
<p>[…] it’s completely respectable to work hours doing what you’re best at, and then transfer the sort of expected utilons that a society assigns to that to the Singularity Institute […]</p></blockquote>
<p>— Eliezer Yudkowsky’s <a href="http://youtu.be/xIt0UofHnOQ">advice for Less Wrong readers</a> who want to help save the human race.</p>
<blockquote><p>[…] an LW post is important and interesting in proportion to how much it helps construct a Friendly AI, how much it gets people to participate in the human project […]</p></blockquote>
<p>— Eliezer Yudkowsky <a href="http://lesswrong.com/lw/4vr/less_wrong_rationality_and_mainstream_philosophy/3qoa">comments</a> on ‘Less Wrong Rationality and Mainstream Philosophy’</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/13/eliezer-yudkowsky-quotes/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/rjWlJAl8KB0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/13/eliezer-yudkowsky-quotes/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/13/eliezer-yudkowsky-quotes/</feedburner:origLink></item>
		<item>
		<title>AGI is near? Bullshit!</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/MUfQBYmKUeQ/</link>
		<comments>http://kruel.co/2012/05/13/agi-is-near-bullshit/#comments</comments>
		<pubDate>Sun, 13 May 2012 12:37:19 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2960</guid>
		<description><![CDATA[Here is another piece of an email discussion: I think that many serious people believe that AGI would be extraordinarily dangerous for a different reason than SI thinks it would be: they think it would be an extraordinarily powerful tool and thus dangerous *in the wrong hands.* Like who? I don&#8217;t believe that people like [...]]]></description>
			<content:encoded><![CDATA[<p>Here is another piece of an email discussion:</p>
<blockquote><p>I think that many serious people believe that AGI would be extraordinarily dangerous for a different reason than SI thinks it would be: they think it would be an extraordinarily powerful tool and thus dangerous *in the wrong hands.*</p></blockquote>
<p>Like <a href="http://wiki.lesswrong.com/wiki/Interview_series_on_risks_from_AI">who</a>? I don&#8217;t believe that people like <a href="http://www.idsia.ch/~juergen/">Jürgen Schmidhuber</a> are a risk, apart from a very abstract possibility.</p>
<p>The reason is that they are unable to show off some <em>applicable</em> progress on a par with IBM Watson or <a href="http://en.wikipedia.org/wiki/Siri_%28software%29">Siri</a>. And in the case that they claim that their work relies on a single mathematical breakthrough, I doubt that it would be justified <em>even in principle</em> to be confident in that prediction.</p>
<p>In short, either their work is incrementally useful or is based on wild speculations about the possible discovery of unknown unknowns.</p>
<p>The kind of optimism that people like Jürgen Schmidbauer and <a href="http://wp.goertzel.org/">Ben Goertzel</a> seem to be displaying is completely unfounded. Especially given that most AI researchers seem to disagree.</p>
<p>There are mainly two possibilities here:</p>
<ol>
<li>It takes a few breakthroughs, i.e. that it is a somewhat gradual development that can be extrapolated.</li>
<li>It only takes a single breakthrough.</li>
</ol>
<p>1.) In the case that the development of self-improving AI&#8217;s is stepwise I doubt that their optimism is justified simply because they are unable to show any achievements. All achievements in AI so far are either a result of an increase in computational resources or, in the case of e.g. IBM Watson or the <a href="http://en.wikipedia.org/wiki/Netflix_Prize">Netflix algorithm</a>, the result of throwing everything we have at a problem to brute force a solution. None of those achievements are based on a single principle like an approximation of <a href="http://www.hutter1.net/ai/aixigentle.htm">AIXI</a>. Therefore, if people like Schmidbauer and Goertzel made stepwise progress and extrapolate it to conclude that more progress will amount to general intelligence, then where are the results? They should be able to market even partial achievements.</p>
<p>2.) In the case that the development of self-improving AI&#8217;s demands a single breakthrough or mathematical insights I simply doubt their optimism based on the fact that such predictions amount to pure guesswork and that nobody knows when such a breakthrough will be achieved or at what point new mathematical insights will be discovered.</p>
<p>I haven&#8217;t heard of any justification for why it might only take <em>&#8220;nine people and a brain in a box in a basement&#8221;</em>. I think some people are too convinced of the AIXI approximation route and therefore believe that it is just a math problem that only takes some thinking and one or two deep insights.</p>
<p>Every success in AI so far relied on a huge team. IBM Watson, Siri, <a href="http://youtu.be/cNZPRsrwumQ">Big Dog</a> or the various <a href="http://en.wikipedia.org/wiki/Autonomous_car">self-driving cars</a>:</p>
<blockquote><p>1.) With Siri, Apple is using the results of over 40 years of research funded by DARPA via SRI International&#8217;s Artificial Intelligence Center through the Personalized Assistant that Learns Program and Cognitive Agent that Learns and Organizes Program CALO.</p>
<p>2.) When a question is put to Watson, more than 100 algorithms analyze the question in different ways, and find many different plausible answers–all at the same time. Yet another set of algorithms ranks the answers and gives them a score. For each possible answer, Watson finds evidence that may support or refute that answer. So for each of hundreds of possible answers it finds hundreds of bits of evidence and then with hundreds of algorithms scores the degree to which the evidence supports the answer. The answer with the best evidence assessment will earn the most confidence. The highest-ranking answer becomes the answer. However, during a Jeopardy! game, if the highest-ranking possible answer isn’t rated high enough to give Watson enough confidence, Watson decides not to buzz in and risk losing money if it’s wrong. The Watson computer does all of this in about three seconds.</p></blockquote>
<p>It takes a company like IBM to design even such a narrow AI. That it could be done without a lot of computational and intellectual resources seems ridiculous given the above.</p>
<p>And regarding the proponents of a technological Singularity. 99% of their arguments consist of handwaving and claims that <a href="http://kruel.co/2012/05/11/risks-from-ai-and-charitable-giving/">physical possibility implies feasibility</a>. In other words, <em>bullshit</em>.</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/13/agi-is-near-bullshit/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/MUfQBYmKUeQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/13/agi-is-near-bullshit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/13/agi-is-near-bullshit/</feedburner:origLink></item>
		<item>
		<title>Superapish intelligence</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/MXyZLnGYQqs/</link>
		<comments>http://kruel.co/2012/05/13/superapish-intelligence/#comments</comments>
		<pubDate>Sun, 13 May 2012 12:17:29 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2957</guid>
		<description><![CDATA[The following is part of an email that I received today: I&#8217;ve heard that humans do not actually differ a great deal genetically from chimps. The idea that we &#8220;hit a critical threshold&#8221; that made us vastly more powerful than them seems sound. This shows that it&#8217;s possible for a relatively small intelligence advantage to [...]]]></description>
			<content:encoded><![CDATA[<p>The following is part of an email that I received today:</p>
<blockquote>
<div>I&#8217;ve heard that humans do not actually differ a great deal genetically from chimps. The idea that we &#8220;hit a critical threshold&#8221; that made us vastly more powerful than them seems sound.</div>
<div></div>
<p>This shows that it&#8217;s possible for a relatively small intelligence advantage to quickly compound and become decisive.</p></blockquote>
<p>This is not the first time that someone raised this argument with me. But even the first time I found that it sounds suspicious.</p>
<p>The genetic difference between a chimp and a human amounts to about ~40–45 million bases that are present in humans and missing from chimps. And that number is irrespective of the difference in gene expression between humans and chimps. So it&#8217;s not like you&#8217;re adding a tiny bit of code and get a superapish intelligence. It actually amounts to a lot of information painstakingly evolved over millions of years.</p>
<p>The argument from the gap between chimpanzees and humans is interesting but can not be used to extrapolate onwards from human general intelligence. It is pure speculation that humans are not Turing complete and that there are levels above our own. That chimpanzees exist, and humans exist, is not a proof for the existence of anything that bears, in any relevant respect, the same relationship to a human that a human bears to a chimpanzee.</p>
<p>Humans can process long chains of inferences with the help of tools. The important question is if incorporating those tools into some sort of self-perception, some sort of guiding agency, is vastly superior to humans using a combination of tools and expert systems.</p>
<p>In other words, it is not clear that there does exist a class of problems that is solvable by Turing machines in general, but not by a combination of humans and expert systems.</p>
<p>If an AI that we invented can hold a complex model in its mind, then we can also simulate such a model by making use of expert systems. Being consciously aware of the model doesn&#8217;t make any great difference in principle to what you can do with the model.</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/13/superapish-intelligence/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/MXyZLnGYQqs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/13/superapish-intelligence/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/13/superapish-intelligence/</feedburner:origLink></item>
		<item>
		<title>We are SIAI. Argument is futile.</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/oX2YbsItpeI/</link>
		<comments>http://kruel.co/2012/05/12/we-are-siai-argument-is-futile/#comments</comments>
		<pubDate>Sat, 12 May 2012 17:54:44 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[SIAI]]></category>
		<category><![CDATA[Singularity Institute]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2954</guid>
		<description><![CDATA[The Singularity Institute (SIAI) created an insurmountable fortress that shields them from any criticism: Skeptic: If you are so smart and rational, why don&#8217;t you fund yourself? Why isn&#8217;t your organisation sustainable? SIAI: Rationality is only aimed at expected winning. Skeptic: But you don&#8217;t seem to be winning yet. Have you considered the possibility that [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://singinst.org/">Singularity Institute</a> (SIAI) created an insurmountable fortress that shields them from any criticism:</p>
<p><strong>Skeptic:</strong> If you are so smart and rational, why don&#8217;t you fund yourself? Why isn&#8217;t your organisation sustainable?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> Rationality is only aimed at <a href="http://lesswrong.com/lw/c7g/rationality_and_winning/6iki"><em>expected</em> winning</a>.</p>
<p><strong>Skeptic:</strong> But you don&#8217;t seem to be winning yet. Have you considered the possibility that your methods are suboptimal? Have you set yourself any goals, that you expect to be better able to achieve than less rational folks, to test your rationality?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> One could have highly rational beliefs and make highly rational choices and still fail to win due to akrasia, lack of resources, lack of intelligence, and so on. Like intelligence and money, rationality is only a <a href="http://lesswrong.com/lw/c7g/rationality_and_winning/">caeteris paribus predictor of success</a>.</p>
<p><strong>Skeptic:</strong> Okay, but given that you <a href="http://lesswrong.com/lw/b98/minicamps_on_rationality_and_awesomeness_may_1113/">spend a lot of time</a> on refining your rationality, you must believe that it is worth it somehow? What makes you think so then?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> We are trying to create a friendly artificial intelligence <a href="http://johncarlosbaez.wordpress.com/2011/03/07/this-weeks-finds-week-311/">implement it and run the AI</a>, at which point, if all goes well, we Win. We believe that rationality is very important to achieve that goal.</p>
<p><strong>Skeptic:</strong> I see. But there surely must be some sub-goals that you anticipate to be able to solve and thereby test if your rationality skills are worth the effort?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> Many of the problems related to navigating the Singularity have not yet been stated with mathematical precision, and the need for a precise statement of the problem is part of the problem.</p>
<p><strong>Skeptic:</strong> Has there been any success in formalizing one of the problems that you need to solve?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> There are some unpublished results that we have had no time to put into a coherent form yet.</p>
<p><strong>Skeptic:</strong> Well, it seems that there is no way for me to judge if it is worth it to read up on your writings on rationality.</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> If you want to more reliably achieve life success, I recommend inheriting a billion dollars or, failing that, being born+raised to have an excellent work ethic and low akrasia.</p>
<p><strong>Skeptic:</strong> Awesome, I&#8217;ll do that next time. But for now, why would I bet on you or even trust that you know what you are talking about?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> We spent a lot of time on debiasing techniques and thought long and hard about the relevant issues.</p>
<p><strong>Skeptic:</strong> That seems to be insufficient evidence in favor of your accuracy given the nature of your claims and that you are asking for money.</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> We make predictions. We make statements of confidence of events that merely sound startling. You are asking for evidence we couldn&#8217;t possibly be expected to be able to provide, even given that we are right.</p>
<p><strong>Skeptic:</strong> But what do you anticipate to see if your ideas are right, is there any possibility to update on evidence?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> No, once the evidence is available it will be too late. You&#8217;re entitled to arguments, but not (that particular) proof.</p>
<p><strong>Skeptic:</strong> But then why would I trust you instead of actual experts who studied AI and who tell me that you are wrong?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> You will soon learn that your smart friends and experts <a href="http://lesswrong.com/lw/2l0/should_i_believe_what_the_siai_claims/2f14">are not remotely close to the rationality standards of SI/LW</a>, and you will no longer think it anywhere near as plausible that their differing opinion is because they know some incredible secret knowledge you don&#8217;t.</p>
<p><strong>Skeptic:</strong> But you have never achieved anything when it comes to AI, why would I trust your reasoning on the topic more than the opinion of those experts?</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> That is magical thinking about prestige. Prestige is not a good indicator of quality.</p>
<p><strong>Skeptic:</strong> Ummm, okay. You won&#8217;t convince me without providing further evidence.</p>
<p style="padding-left: 30px;"><strong>SIAI:</strong> We call that motivated cognition. You created a fully general counterargument you can use to discount any conclusion.</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/12/we-are-siai-argument-is-futile/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/oX2YbsItpeI" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/12/we-are-siai-argument-is-futile/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/12/we-are-siai-argument-is-futile/</feedburner:origLink></item>
		<item>
		<title>Implicit constraints of practical goals</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/YP01Exv3PPs/</link>
		<comments>http://kruel.co/2012/05/11/implicit-constraints-of-practical-goals/#comments</comments>
		<pubDate>Fri, 11 May 2012 17:38:11 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>
		<category><![CDATA[drives]]></category>
		<category><![CDATA[goals]]></category>
		<category><![CDATA[risks]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2948</guid>
		<description><![CDATA[(The following is an expanded combination of two previous posts.) If you were going to add huge amounts of intelligence to Google Maps, why would it turn worse? Sure, the space of unfriendly navigation software is much larger than the space of navigation software oriented toward navigating to good destinations – i.e., destinations consistent with [...]]]></description>
			<content:encoded><![CDATA[<p><span style="color: #888888;">(The following is an expanded combination of two previous posts.)</span></p>
<p>If you were going to add huge amounts of intelligence to Google Maps, why would it turn worse?</p>
<p>Sure, the space of <a href="http://wiki.lesswrong.com/wiki/Unfriendly_artificial_intelligence">unfriendly</a> navigation software is much larger than the space of navigation software oriented toward navigating to good destinations – i.e., destinations consistent with human intent.</p>
<p>But what reason do we have to believe that improving our navigation software to the point of being general intelligent will cause it to kill us?</p>
<p>Right now, if I ask <a href="http://maps.google.com/">Google Maps</a> to navigate me toward McDonald’s, it does the job very well. So why would an ultraintelligent Google Maps misunderstand what I mean by “Take me to McDonald’s” and navigate me toward a McDonald’s in France, plunging me into the sea? Or drive me underground where the corpse of a man named McDonald lies?</p>
<p>I think that the idea that an ultraintelligent Google Maps would decide to kill all humans, e.g. because they are a security risk, is similar to the idea that it would destroy all roads because it would be less <a href="http://en.wikipedia.org/wiki/Travelling_salesman_problem">computationally expensive</a> to calculate the routes then. After all, roads were never an explicit part of its goal architecture, so why not destroy them all?</p>
<p>You can come up with all kinds of complex fantasies where a certain kind of artificial general intelligence is invented overnight and suddenly makes a huge jump in capability, taking over the universe and destroying all human value.</p>
<p>That is however completely unconvincing given that <em>actual</em> technology is constantly improved toward more user-friendliness and better results and that malfunctions are seldom of such a vast complexity as to work well enough to outsmart humanity.</p>
<p>That said, for the rest of this post I will assume <a href="http://youtu.be/ySxvsvpfdUQ">the kind of artificial general intelligence</a> that proponents of AI risks have in mind.</p>
<p>A cherished idea of AI risk proponents is that an expected utility maximizer will completely ignore anything which it is not specifically tasked to maximize.</p>
<p>One example here is that if you tell a superintelligent expected utility maximizer to prevent human suffering it might simply kill all humans, notwithstanding that it is obviously not what humans want an AI to do and what humans mean by “prevent human suffering”.</p>
<p>In the sense that the computation of an algorithm is deterministic, that line of reasoning is not illogical.</p>
<p>So let us instead of a superhuman agent conjecture the possibility of an oracle, an ultra-advanced version of Google or <a href="http://en.wikipedia.org/wiki/Watson_%28computer%29">IBM Watson</a>.</p>
<p>If I was to ask such an answering machine how to prevent human suffering, would it be reasonable to assume that the top result it would return would be to kill all humans? Would any product that returns similarly wrong answers survive even the earliest research phase, let alone any market pressure?</p>
<p>Don’t get me wrong though. A thermostat is not going to do anything else than what it has been designed for. But an AI is very likely going to be designed to exhibit some amount of user-friendliness. Although that doesn’t mean that one can’t design an AI that won’t, the default outcome seems to be that an AI is not just going to act according to its utility-function but also according to more basic drives, i.e. acting intelligently.</p>
<p>A fundamental requirement for any rational agent is the motivation to act maximally intelligently and correctly. That requirement seems even more obvious if we are talking about a conjectured artificial general intelligence (AGI) that is able to <a href="http://lesswrong.com/lw/we/recursive_selfimprovement/">improve itself</a> to the point where it is substantially better at most activities than humans. Since if it wouldn’t want to be maximally correct then it wouldn’t become superhuman intelligent in the first place.</p>
<p>If we consider giving such an AGI a simple goal, e.g. the goal of <a href="http://wiki.lesswrong.com/wiki/Paperclip_maximizer">paperclip maximization</a>. Is it really clear that human values are not implicit even given such a simplistic goal?</p>
<p>To pose an existential risk in the first place, an AGI would have to maximize paperclips in an unbounded way, eventually taking over the whole universe and convert all matter into paperclips. Given that no sane human would explicitly define such a goal, an AGI with the goal of maximizing paperclips would have to infer it as implicit to do so. But would such an inference make sense, given its superhuman intelligence?</p>
<p>The question boils down to how an AGI would interpret any vagueness present in its goal architecture and how it would deal with the implied invisible.</p>
<p>Given that any rational agent, especially AGI’s capable of recursive self-improvement, want to act in the most intelligent and correct way possible, it seems reasonable that it would interpret any vagueness in a way that most closely reflects the most probable way it was meant to be interpreted.</p>
<p>Would it be intelligent and correct to ignore human volition in the context of maximizing paperclips? Would it be less wrong to maximize paperclips in the most literal sense possible?</p>
<p>The argument uttered by advocates of <a href="http://wiki.lesswrong.com/wiki/Friendly_AI">friendly AI</a> is that any AGI that isn’t explicitly designed to be friendly won’t be friendly. But how much sense does this actually make?</p>
<p>Any human who does pursue a business realizes that a contract with its customers includes unspoken, implicit parameters. Respecting those implied values of their customers is not a result of their shared evolutionary history but a result of their intelligence that allows them to realize that the goal of their business implicitly includes those values.</p>
<p>Every human craftsman who enters into an agreement is bound by a contract that includes a lot of implied conditions. Humans use their intelligence to fill the gaps. For example, if a human craftsman is told to decorate a house, they are not going to attempt to take over the neighbourhood to protect their work.</p>
<p>A human craftsman wouldn’t do that, not because they share human values, but simply because it wouldn’t be sensible to do so given the implicit frame of reference of their contract. The contract implicitly includes the volition of the person that told them to decorate their house. They might not even like the way they are supposed to do it. It would simply be <em>stupid</em> to do it any different way.</p>
<p>How would a superhuman AI not contemplate its own drives and interpret them given the right frame of reference, i.e. human volition? Why would a superhuman general intelligence misunderstand what is meant by “maximize paperclips”, while any human intelligence will be better able to infer the correct interpretation?</p>
<p>How wouldn’t any expected utility maximizer not try to <em>carefully</em> refine its models? I am asking how an highly rational agent will interpret any vagueness inherent in its goal definition, that needs to be resolved in order to calculate what to do, by choosing an interpretation that does not involve the intention of its creators but rather perceive it to be something it has to fight.</p>
<p>If you tell an AGI to maximize paperclips but not what they are made of, it has to figure out what is meant by “paperclips” to learn what it means to maximize them.</p>
<p>Given that a very accurate definition and model of paperclips is necessary to maximize paperclips, including what is meant by “maximization”, the expected utility of refining its goals by learning what it is supposed to do should be sufficient to pursue that path until it is reasonably confident that it arrived at a true comprehension of its terminal goals.</p>
<p>And here human volition should be the most important physical resource since there exists a direct causal connection between its goal parameters and the intentions of its creators.</p>
<p>Human beings and their intentions are part of the physical world. Just like the fact that paperclips are supposed to be made of steel wire.</p>
<p>It would in principle be possible to create a superintelligent machine that does kill all humans, but it would have to be explicitly designed to do so. Since as long as there is some vagueness involved, as long as its goal parameters are open to interpretation, a superintelligence will by definition arrive at the correct implications or otherwise it wouldn’t be superintelligent in the first place. And given most goals it is implicit that it would be incorrect to assume that human volition is not a relevant factor in the correct interpretation of how to act.</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/11/implicit-constraints-of-practical-goals/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/YP01Exv3PPs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/11/implicit-constraints-of-practical-goals/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/11/implicit-constraints-of-practical-goals/</feedburner:origLink></item>
		<item>
		<title>Risks from AI and Charitable Giving</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/7yyxDrvFomY/</link>
		<comments>http://kruel.co/2012/05/11/risks-from-ai-and-charitable-giving/#comments</comments>
		<pubDate>Fri, 11 May 2012 13:13:30 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>
		<category><![CDATA[Eliezer Yudkowsky]]></category>
		<category><![CDATA[risks]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2940</guid>
		<description><![CDATA[(This post is part of a series of imports of submissions that I wrote that initially appeared on lessswrong.com) If you’re interested in being on the right side of disputes, you will refute your opponents&#8217; arguments. But if you&#8217;re interested in producing truth, you will fix your opponents&#8217; arguments for them. To win, you must [...]]]></description>
			<content:encoded><![CDATA[<p><span style="color: #808080;">(This post is part of a series of imports of submissions that I wrote that initially appeared on lessswrong.com)</span></p>
<blockquote><p>If you’re interested in being on the right side of disputes, you will refute your opponents&#8217; arguments. But if you&#8217;re interested in producing truth, you will fix your opponents&#8217; arguments for them. To win, you must fight not only the creature you encounter; you [also] must fight the most horrible thing that can be constructed from its corpse.</p></blockquote>
<p>&#8211; <a href="http://web.archive.org/web/20100328161823/http://www.acceleratingfuture.com/steven/?p=155">Black Belt Bayesian</a></p>
<p><strong>Contents</strong></p>
<table style="border: thin solid #808080; width: 440px; padding: 5px;" border="0">
<tbody>
<tr>
<td>
<ul>
<li><a href="#abstract">Abstract</a></li>
<li><a href="#FOOM">Requirements for an Intelligence Explosion</a></li>
<li><a href="#CHARITY">Requirements for SIAI to constitute an optimal charity</a></li>
<li><a href="#FR">Further Reading</a></li>
<li><a href="#NR">Notes and References</a></li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>&nbsp;</p>
<h3><a name="abstract"></a>Abstract</h3>
<p>In this post I just want to take a look at a few premises (P#) that <a href="/r/discussion/lw/8fa/intelligence_explosion_a_disjunctive_or/">need to be true simultaneously</a> to make the SIAI a wortwhile charity from the point of view of someone trying to do as much good as possible by contributing money. I am going to show that the case of <a href="/lw/8fb/why_an_intelligence_explosion_might_be_a/">risks from AI</a> is strongly conjunctive, that without a concrete and grounded understanding of AGI an abstract analysis of the issues is going to be very shaky, and that therefore SIAI is likely to be a bad choice as a charity. In other words, that which speaks in favor of SIAI does mainly consist of highly specific, conjunctive, non-evidence-backed speculations on possible bad outcomes.</p>
<h3><a name="FOOM"></a>Requirements for an Intelligence Explosion</h3>
<p><strong>P1</strong> Fast, and therefore dangerous, recursive self-improvement is <span style="text-decoration: underline;">logically</span> possible.</p>
<p style="padding-left: 30px;">It took almost four hundred years to prove <a href="http://en.wikipedia.org/wiki/Fermat%27s_Last_Theorem">Fermat’s Last Theorem</a>. The <a href="http://en.wikipedia.org/wiki/Wiles%27_proof_of_Fermat%27s_Last_Theorem">final proof</a> is over a hundred pages long. Over a hundred pages! And we are not talking about something like an artificial general intelligence that can magically make itself smart enough to prove such theorems and many more that no human being would be capable of proving. Fermat’s Last Theorem simply states <em>“no three positive integers a, b, and c can satisfy the equation a^n + b^n = c^n for any integer value of n greater than two.”</em></p>
<p style="padding-left: 30px;">Even artificial intelligence researchers admit that <em>&#8220;there could be non-linear complexity constrains meaning that even theoretically optimal algorithms experience strongly diminishing intelligence returns for additional compute power.&#8221;</em> [1] We just don&#8217;t know.</p>
<p style="padding-left: 30px;">Other possible problems include the impossibility of a stable utility function and a reflective decision theory, the intractability of real world expected utility maximization or that expected utility maximizers stumble over <a href="/lw/kd/pascals_mugging_tiny_probabilities_of_vast/">Pascal&#8217;s mugging</a>, among other things [2].</p>
<p style="padding-left: 30px;">For an AI to be capable of recursive self-improvement it also has to guarantee that its goals will be preserved when it improves itself. It is still questionable if it is possible to conclusively prove that improvements to an agent&#8217;s intelligence or decision procedures maximize expected utility. If this isn&#8217;t possible it won&#8217;t be rational or possible to undergo <em>explosive</em> self-improvement.</p>
<p><strong>P1.b</strong> The fast computation of a simple algorithm is sufficient to outsmart and overpower humanity.</p>
<p style="padding-left: 30px;">Imagine a group of 100 world-renowned scientists and military strategists.</p>
<ul>
<li>The group is analogous to the initial resources of an AI.</li>
<li>The knowledge that the group has is analogous to what an AI could come up with by simply &#8220;thinking&#8221; about it given its current resources.</li>
</ul>
<p style="padding-left: 30px;">Could such a group easily wipe away the Roman empire when beamed back in time?</p>
<ul>
<li>The Roman empire is analogous to our society today.</li>
</ul>
<p style="padding-left: 30px;">Even if you gave all of them a machine gun, the Romans would quickly adapt and the people from the future would run out of ammunition.</p>
<ul>
<li>Machine guns are analogous to the supercomputer it runs on.</li>
</ul>
<p style="padding-left: 30px;">Consider that it takes a <a href="http://www.antipope.org/charlie/blog-static/2010/07/insufficient-data.html" rel="nofollow" target="_blank">whole technological civilization</a> to produce a modern smartphone.</p>
<p style="padding-left: 30px;">You can&#8217;t just say <em>&#8220;with more processing power you can do more different things&#8221;</em>, that would be analogous to saying that &#8220;100 people&#8221; from today could just build more &#8220;machine guns&#8221;. But they can&#8217;t! They can&#8217;t use all their knowledge and magic from the future to defeat the Roman empire.</p>
<p style="padding-left: 30px;">A lot of assumptions have to turn out to be correct to make humans discover simple algorithms over night that can then be improved to self-improve explosively.</p>
<p style="padding-left: 30px;">You can also compare this to the idea of a Babylonian mathematician discovering modern science and physics given that he would be uploaded into a supercomputer (a possibility that is in and of itself already highly speculative). It assumes that he could brute-force conceptual revolutions.</p>
<p style="padding-left: 30px;">Even if he was given a detailed explanation of how his mind works and the resources to understand it, self-improving to achieve superhuman intelligence assumes that throwing resources at the problem of intelligence will magically allow him to pull improved algorithms from solution space as if they were signposted.</p>
<p style="padding-left: 30px;">But unknown unknowns are not signposted. It&#8217;s rather like finding a needle in a haystack. Evolution is great at doing that and assuming that one could speed up evolution considerably is another assumption about technological feasibility and real-world resources.</p>
<p style="padding-left: 30px;">That conceptual revolutions are just a matter of computational resources is pure speculation.</p>
<p style="padding-left: 30px;">If one were to speed up the whole Babylonian world and accelerate cultural evolution, obviously one would arrive quicker at some insights. But how much quicker? How much are many insights dependent on experiments, to yield empirical evidence, that can&#8217;t be speed-up considerably? And what is the return? Is the payoff proportionally to the resources that are necessary?</p>
<p style="padding-left: 30px;">If you were going to speed up a chimp brain a million times, would it quickly reach human-level intelligence? If not, why then would it be different for a human-level intelligence trying to reach transhuman intelligence? It seems like a nice idea when formulated in English, but would it work?</p>
<p style="padding-left: 30px;">Being able <em>to state</em> that an AI could use some magic to take over the earth does not make it a serious possibility.</p>
<p style="padding-left: 30px;">Magic has to be discovered, adapted and manufactured first. It doesn&#8217;t just emerge out of nowhere from the computation of certain algorithms. It emerges from a society of agents with various different goals and heuristics like <em>&#8220;Treating Rare Diseases in Cute Kittens&#8221;</em>. It is an evolutionary process that relies on massive amounts of real-world feedback and empirical experimentation. Assuming that all that can happen because some simple algorithm is being computed is like believing it will emerge &#8216;out of nowhere&#8217;, it is magical thinking.</p>
<p style="padding-left: 30px;">Unknown unknowns are not sign-posted. [3]</p>
<p style="padding-left: 30px;">If people like Benoît B. Mandelbrot would have never decided to research Fractals then many modern movies wouldn&#8217;t be possible, as they rely on fractal landscape algorithms. Yet, at the time Benoît B. Mandelbrot conducted his research it was not foreseeable that his work would have any real-world applications.</p>
<p style="padding-left: 30px;">Important discoveries are made because many routes with low or no expected utility are explored at the same time [4]. And to do so efficiently it takes random mutation, a whole society of minds, a lot of feedback and empirical experimentation.</p>
<p style="padding-left: 30px;"><em>&#8220;Treating rare diseases in cute kittens&#8221;</em> might or might not provide genuine insights and open up new avenues for further research. As long as you don&#8217;t try it you won&#8217;t know.</p>
<p style="padding-left: 30px;">The idea that a rigid consequentialist with simple values can think up insights and conceptual revolutions simply because it is instrumentally useful to do so is implausible.</p>
<p style="padding-left: 30px;">Complex values are the cornerstone of diversity, which in turn enables creativity and drives the exploration of various conflicting routes. A singleton with a stable utility-function lacks the feedback provided by a society of minds and its cultural evolution.</p>
<p style="padding-left: 30px;">You need to have various different agents with different utility-functions around to get the necessary diversity that can give rise to enough selection pressure. A &#8220;singleton&#8221; won&#8217;t be able to predict the actions of new and improved versions of itself by just running sandboxed simulations. Not just because of logical uncertainty but also because it is computationally intractable to predict the real-world payoff of changes to its decision procedures.</p>
<p style="padding-left: 30px;">You need complex values to give rise to the necessary drives to function in a complex world. You can&#8217;t just tell an AI to protect itself. What would that even mean? What changes are illegitimate? What constitutes <em>&#8220;self&#8221;</em>? That are all unsolved problems that are just assumed to be solvable when talking about risks from AI.</p>
<p style="padding-left: 30px;">An AI with simple values will simply lack the creativity, due to a lack of drives, to pursue the huge spectrum of research that a society of humans does pursue. Which will allow an AI to solve some well-defined narrow problems, but it will be unable to make use of the broad range of synergetic effects of cultural evolution. Cultural evolution is a result of the interaction of a wide range of utility-functions.</p>
<p style="padding-left: 30px;">Yet even if we assume that there is one complete theory of general intelligence, once discovered, one just has to throw more resources at it. It might be able to incorporate all human knowledge, adapt it and find new patterns. But would it really be vastly superior to human society and their expert systems?</p>
<p style="padding-left: 30px;">Can intelligence itself be improved apart from solving well-defined problems and making more accurate predictions on well-defined classes of problems? The discovery of unknown unknowns does not seem to be subject to other heuristics than natural selection. Without goals, well-defined goals, terms like &#8220;optimization&#8221; have no meaning.</p>
<p><strong>P2</strong> Fast, and therefore dangerous, recursive self-improvement is <span style="text-decoration: underline;">physically</span> possible.</p>
<p style="padding-left: 30px;">Even if it could be proven that <em>explosive</em> recursive self-improvement is logically possible, e.g. that there are no complexity constraints, the question remains if it is physically possible.</p>
<p style="padding-left: 30px;">Our best theories about intelligence are highly abstract and their relation to real world human-level general intelligence is often wildly speculative [5][6].</p>
<p><strong>P3</strong> Fast, and therefore dangerous, recursive self-improvement is <span style="text-decoration: underline;">economically</span> feasible.</p>
<p style="padding-left: 30px;">To exemplify the problem take the science fictional idea of using <a href="http://en.wikipedia.org/wiki/Antimatter">antimatter</a> as explosive for weapons. It is physically possible to produce antimatter and use it for large scale destruction. An equivalent of the Hiroshima atomic bomb will only take half a gram of antimatter. But it will take 2 billion years to produce that amount of antimatter [7].</p>
<p style="padding-left: 30px;">We simply don’t know if intelligence is instrumental or quickly hits diminishing returns [8].</p>
<p><strong>P3.b</strong> AGI is able to create (or acquire) resources, empowering technologies or civilisatory support [9].</p>
<p style="padding-left: 30px;">We are already at a point where we have to build billion dollar chip manufacturing facilities to run our mobile phones. We need to build huge particle accelerators to obtain new insights into the nature of reality.</p>
<p style="padding-left: 30px;">An AI would either have to rely on the help of a whole technological civilization or be in control of advanced nanotech assemblers.</p>
<p style="padding-left: 30px;">And if an AI was to acquire the necessary resources on its own, its plan for world-domination would have to go unnoticed. This would require the workings of the AI to be opaque to its creators yet comprehensible to itself.</p>
<p style="padding-left: 30px;">But an AI capable of efficient recursive self improvement must be able to</p>
<ol>
<li>comprehend its own workings</li>
<li>predict how improvements, respectively improved versions of itself, are going to act to ensure that its values are preserved</li>
</ol>
<p style="padding-left: 30px;">So if the AI can do that, why wouldn&#8217;t humans be able to use the same algorithms to predict what the initial AI is going to do? And if the AI can&#8217;t do that, how is it going to maximize expected utility if it is unable to predict what it is going to do?</p>
<p style="padding-left: 30px;">Any AI capable of efficient self-modification must be able to grasp its own workings and make predictions about improvements to various algorithms and its overall decision procedure. If an AI can do that, why would the humans who build it be unable to notice any malicious intentions? Why wouldn&#8217;t the humans who created it not be able to use the same algorithms that the AI uses to predict what it will do? If humans are unable to predict what the AI will do, how is the AI able to predict what improved versions of itself will do?</p>
<p style="padding-left: 30px;">And even if an AI was able to somehow acquire large amounts of money. It is not easy to <em>use</em> the money. You can&#8217;t <em>&#8220;just&#8221;</em> build huge companies with fake identities, or a straw man, to create revolutionary technologies easily. Running companies with real people takes a lot of real-world knowledge, interactions and feedback. But most importantly, it takes a lot of time. An AI could not simply create a new Intel or Apple over a few years without its creators noticing anything.</p>
<p style="padding-left: 30px;">The goals of an AI will be under scrutiny at any time. It seems very implausible that scientists, a company or the military are going to create an AI and then just let it run without bothering about its plans. An artificial agent is not a black box, like humans are, where one is only able to guess its real intentions.</p>
<p style="padding-left: 30px;">A plan for world domination seems like something that can&#8217;t be concealed from its creators. Lying is no option if your algorithms are open to inspection.</p>
<p><strong>P4</strong> Dangerous recursive self-improvement is the default outcome of the creation of artificial general intelligence.</p>
<p style="text-align: left; padding-left: 30px;">Complex goals need complex optimization parameters (the design specifications of the subject of the optimization process against which it will measure its success of self-improvement).</p>
<p style="text-align: left; padding-left: 30px;">Even the creation of paperclips is a much more complex goal than telling an AI to compute as many decimal digits of Pi as possible.</p>
<p style="text-align: left; padding-left: 30px;">For an AGI, that was designed to design paperclips, to pose an existential risk, its creators would have to be capable enough to enable it to take over the universe on its own, yet forget, or fail to, define time, space and energy bounds as part of its optimization parameters. Therefore, given the large amount of restrictions that are inevitably part of any advanced general intelligence (AGI), the nonhazardous subset of all possible outcomes might be much larger than that where the AGI works perfectly yet fails to hold before it could wreak havoc.</p>
<p style="text-align: left; padding-left: 30px;">And even given a rational utility maximizer. It is possible to maximize paperclips in a lot of different ways. How it does it is fundamentally dependent on its utility-function and how precisely it was defined.</p>
<p style="text-align: left; padding-left: 30px;">If there are no constraints in the form of design and goal parameters then it can maximize paperclips in all sorts of ways that don&#8217;t demand recursive self-improvement.</p>
<p style="text-align: left; padding-left: 30px;">&#8220;Utility&#8221; does only become well-defined if we precisely define what it means to maximize it. Just maximizing paperclips doesn&#8217;t define how quickly and how economically it is supposed to happen.</p>
<p style="padding-left: 30px;">The problem is that &#8220;utility&#8221; has to be defined. To maximize expected utility does not imply certain actions, efficiency and economic behavior, or the drive to protect yourself. You can also rationally maximize paperclips without protecting yourself if it is not part of your goal parameters.</p>
<p style="padding-left: 30px;">You can also assign utility to maximize paperclips as long as nothing turns you off but don&#8217;t care about being turned off. If an AI is not explicitly programmed to care about it, then it won&#8217;t.</p>
<p style="padding-left: 30px;">Without well-defined goals in form of a precise utility-function, it might be impossible to maximize expected &#8220;utility&#8221;. Concepts like &#8220;efficient&#8221;, &#8220;economic&#8221; or &#8220;self-protection&#8221; all have a meaning that is inseparable with an agent&#8217;s terminal goals. If you just tell it to maximize paperclips then this can be realized in an infinite number of ways that would all be rational given imprecise design and goal parameters. Undergoing to explosive recursive self-improvement, taking over the universe and filling it with paperclips, is just one outcome. Why would an arbitrary mind pulled from mind-design space care to do that? Why not just wait for paperclips to arise due to random fluctuations out of a state of chaos? That wouldn&#8217;t be irrational. To have an AI take over the universe as fast as possible you would have to explicitly design it to do so.</p>
<p style="padding-left: 30px;">But for the sake of a thought experiment assume that the default case was recursive self-improvement. Now imagine that a company like Apple wanted to build an AI that could answer every question (an Oracle).</p>
<p style="padding-left: 30px;">If Apple was going to build an Oracle it would anticipate that other people would also want to ask it questions. Therefore it can&#8217;t just waste all resources on looking for an inconsistency arising from the Peano axioms when asked to solve 1+1. It would not devote additional resources on answering those questions that are already known to be correct with a high probability. It wouldn&#8217;t be <em>economically useful</em> to take over the universe to answer simple questions.</p>
<p style="padding-left: 30px;">It would neither be <em>rational</em> to look for an inconsistency arising from the Peano axioms while solving 1+1. To answer questions an Oracle needs a good amount of general intelligence. And concluding that asking it to solve 1+1 implies to look for an inconsistency arising from the Peano axioms does not seem reasonable. It also does not seem reasonable to suspect that humans desire an answer to their questions to approach infinite certainty. Why would someone build such an Oracle in the first place?</p>
<p style="padding-left: 30px;">A reasonable Oracle would quickly yield good solutions by trying to find answers within a reasonable time which are with a high probability just 2–3% away from the optimal solution. I don&#8217;t think anyone would build an answering machine that throws the whole universe at the first sub-problem it encounters.</p>
<p><strong>P5</strong> The human development of artificial general intelligence will take place quickly.</p>
<p style="padding-left: 30px;">What evidence do we have that there is some principle that, once discovered, allows us to grow superhuman intelligence overnight?</p>
<p style="padding-left: 30px;">If the development of AGI takes place slowly, a gradual and controllable development, we might be able to learn from small-scale mistakes, or have enough time to develop friendly AI, while having to face other existential risks.</p>
<p style="padding-left: 30px;">This might for example be the case if intelligence can not be captured by a discrete algorithm, or is modular, and therefore never allow us to reach a point where we can suddenly build the smartest thing ever that does just extend itself indefinitely.</p>
<p>Therefore the probability of an AI to undergo explosive recursive self-improvement (P(FOOM)) is the probability of the <a href="http://en.wikipedia.org/wiki/Logical_conjunction">conjunction</a> (P#<strong>∧</strong>P#) of its premises: <strong></strong></p>
<p><strong>P(FOOM) = P(P1∧P2∧P3∧P4∧P5)</strong></p>
<p>Of course, there are many more premises that need to be true in order to enable an AI to go FOOM, e.g. that each level of intelligence can effectively handle its own complexity, or that most AGI designs can somehow self-modify their way up to massive superhuman intelligence. But I believe that the above points are enough to show that the case for a hard takeoff is not disjunctive, but rather strongly conjunctive.</p>
<h3><a name="CHARITY"></a>Requirements for SIAI to constitute an optimal charity</h3>
<p>In this section I will assume the truth of all premises in the previous section.</p>
<p><strong>P6</strong> SIAI can solve friendly AI.</p>
<p style="padding-left: 30px;">Say you believe that unfriendly AI will wipe us out with a probability of 60% and that there is another existential risk that will wipe us out with a probability of 10% even if unfriendly AI turns out to be no risk or in all possible worlds where it comes later. Both risks have the same utility <em>x</em> (if we don&#8217;t assume that an unfriendly AI could also wipe out aliens etc.). Thus .6x &gt; .1x. But if the probability of solving friendly AI = <em>A</em> to the probability of solving the second risk = <em>B</em> is A<em></em> ≤ (1/6)B then the expected utility of mitigating friendly AI is at best equal to the other existential risk because .6Ax ≤ .1Bx.</p>
<p style="padding-left: 30px;">Consider that one order of magnitude more utility could easily be outweighed or trumped by an underestimation of the complexity of friendly AI.</p>
<p style="padding-left: 30px;">So how hard is it to solve friendly AI?</p>
<p style="padding-left: 30px;">Take for example Pascal&#8217;s mugging, if you can&#8217;t solve it then you need to implement a hack that is largely based on human intuition. Therefore, in order to estimate the possibility of solving friendly AI one needs to account for the difficulty in solving all sub-problems.</p>
<p style="padding-left: 30px;">Consider that we don&#8217;t even know <em>&#8220;how one would start to research the problem of getting a hypothetical AGI to recognize humans as distinguished beings.&#8221;</em> [10]</p>
<p><strong>P7</strong> SIAI does not increase risks from AI.</p>
<p style="padding-left: 30px;">By trying to solve friendly AI, SIAI has to think about a lot of issues related to AI in general and might have to solve problems that will make it easier to create artificial general intelligence.</p>
<p style="padding-left: 30px;">It is far from being clear that SIAI is able to protect its findings against intrusion, betrayal, industrial or espionage.</p>
<p><strong>P8</strong> SIAI does not increase negative utility.</p>
<p style="padding-left: 30px;">There are several possibilities by which SIAI could actually cause a direct increase in negative utility.</p>
<p style="padding-left: 30px;">1) Friendly AI is incredible hard and complex. Complex systems can fail in complex ways. Agents that are an effect of evolution have complex values. To satisfy complex values you need to meet complex circumstances. Therefore any attempt at friendly AI, which is incredible complex, is likely to fail in unforeseeable ways. A half-baked, not quite friendly, AI might create a living hell for the rest of time, increasing negative utility dramatically [11].</p>
<p style="padding-left: 30px;">2) Humans are not provably friendly. Given the power to shape the universe the SIAI might fail to act altruistic and deliberately implement an AI with selfish motives or horrible strategies [12].</p>
<p><strong>P9</strong> It makes sense to support SIAI at this time [13].</p>
<p>Therefore the probability of SIAI to be a worthwhile charity (P(CHARITY)) is the probability of the <a href="http://en.wikipedia.org/wiki/Logical_conjunction">conjunction</a> (P#<strong>∧</strong>P#) of its premises: <strong></strong></p>
<p><strong>P(CHARITY) = P(P6∧P7∧P8∧P9)</strong></p>
<p>As before, there are many more premises that need to be true in order for SIAI to be the best choice for someone who wants to maximize doing good by contributing money to a charity.</p>
<h3><a name="FR"></a>Further Reading</h3>
<p>The following posts and resources elaborate on many of the above points and hint at a lot of additional problems.</p>
<ul>
<li><a href="http://kruel.co/2011/07/21/intelligence-explosion-disjunctive-conjunctive/">Is an Intelligence Explosion a Disjunctive or Conjunctive Event?</a></li>
<li><a href="http://kruel.co/2011/07/21/why-i-am-skeptical-of-risks-from-ai/">Why I am skeptical of risks from AI</a></li>
<li><a href="http://lesswrong.com/lw/cbs/thoughts_on_the_singularity_institute_si/">Thoughts on the Singularity Institute (SI)</a></li>
<li><a href="http://wiki.lesswrong.com/wiki/Interview_series_on_risks_from_AI">Interview series on risks from AI</a></li>
</ul>
<h3><a name="NR"></a>Notes and References</h3>
<p>[1] <a href="/lw/691/shane_legg_on_risks_from_ai/">Q&amp;A with Shane Legg on risks from AI</a></p>
<p>[2] <a href="http://lukeprog.com/SaveTheWorld.html">http://lukeprog.com/SaveTheWorld.html</a></p>
<p>[3] <em>&#8220;In many ways, this is a book about hindsight. Pythagoras could not have imagined the uses to which his equation would be put (if, indeed, he ever came up with the equation himself in the first place). The same applies to almost all of the equations in this book. They were studied/discovered/developed by mathematicians and mathematical physicists who were investigating subjects that fascinated them deeply, not because they imagined that two hundred years later the work would lead to electric light bulbs or GPS or the internet, but rather because they were genuinely curious.&#8221;</em></p>
<p><a href="http://plus.maths.org/content/17-equations-changed-world">17 Equations that changed the world</a></p>
<p>[4] Here is my list of &#8220;really stupid, frivolous academic pursuits&#8221; that have lead to major scientific breakthroughs.</p>
<ul>
<li>Studying monkey social behaviors and eating habits lead to insights into HIV (Radiolab: Patient Zero)</li>
<li>Research into how algae move toward light paved the way for optogenetics: using light to control brain cells (Nature 2010 Method of the Year).</li>
<li>Black hole research gave us WiFi (ICRAR award)</li>
<li>Optometry informs architecture and saved lives on 9/11 (APA Monitor)</li>
<li>Certain groups HATE SETI, but SETI&#8217;s development of cloud-computing service SETI@HOME paved the way for citizen science and recent breakthroughs in protein folding (Popular Science)</li>
<li>Astronomers provide insights into medical imaging (TEDxBoston: Michell Borkin)</li>
<li>Basic physics experiments and the Fibonacci sequence help us understand plant growth and neuron development</li>
</ul>
<p><a href="http://blog.ketyov.com/2012/02/basic-science-is-about-creating.html">http://blog.ketyov.com/2012/02/basic-science-is-about-creating.html</a></p>
<p>[5] <em>&#8220;<a title="Universal Algorithmic Intelligence" href="http://www.hutter1.net/ai/aixigentle.htm">AIXI</a> is often quoted as a <a href="http://en.wikipedia.org/wiki/Proof_of_concept">proof of concept</a> that it is possible for a simple algorithm to improve itself to such an extent that it could in principle reach superhuman intelligence. AIXI proves that there is a general theory of intelligence. But there is a minor problem, AIXI is as far from real world human-level general intelligence as an abstract notion of a <a href="http://en.wikipedia.org/wiki/Turing_machine">Turing machine</a> with an infinite tape is from a supercomputer with the computational capacity of the human brain. An abstract notion of intelligence doesn’t get you anywhere in terms of real-world general intelligence. Just as you won’t be able to <a href="http://www.boingboing.net/2011/07/14/far.html">upload yourself</a><a title="Mind uploading" href="http://en.wikipedia.org/wiki/Mind_uploading"> to a non-biological substrate</a> because you showed that in some abstract sense <a title="Church–Turing–Deutsch principle" href="http://en.wikipedia.org/wiki/Church%E2%80%93Turing%E2%80%93Deutsch_principle">you can simulate every physical process</a>.&#8221;</em></p>
<p>Alexander Kruel, <a href="http://kruel.co/2011/07/21/why-i-am-skeptical-of-risks-from-ai/">Why I am skeptical of risks from AI</a></p>
<p>[6] <em>&#8220;…please bear in mind that the relation of Solomonoff induction and “Universal AI” to real-world general intelligence of any kind is also rather wildly speculative… This stuff is beautiful math, but does it really have anything to do with real-world intelligence? These theories have little to say about human intelligence, and they’re not directly useful as foundations for building AGI systems (though, admittedly, a handful of scientists are working on “scaling them down” to make them realistic; so far this only works for very simple toy problems, and it’s hard to see how to extend the approach broadly to yield anything near human-level AGI). And it’s not clear they will be applicable to future superintelligent minds either, as these minds may be best conceived using radically different concepts.&#8221;</em></p>
<p>Ben Goertzel, &#8216;<a href="http://multiverseaccordingtoben.blogspot.com/2012/03/prediction-and-intelligence.html">Are Prediction and Reward Relevant to Superintelligences?</a>&#8216;</p>
<p>[7] <a href="http://public.web.cern.ch/public/en/spotlight/SpotlightAandD-en.html">http://public.web.cern.ch/public/en/spotlight/SpotlightAandD-en.html</a></p>
<p>[8] <em>&#8220;If any increase in intelligence is vastly outweighed by its computational cost and the expenditure of time needed to discover it then it might not be instrumental for a perfectly rational agent (such as an artificial general intelligence), as <a title="Rational Agent" href="http://xixidu.tumblr.com/post/7648180962/what-game-theorists-somewhat-disturbingly-call">imagined by game theorists</a>, to increase its intelligence as opposed to using its existing intelligence to pursue its terminal goals directly or to invest its given resources to acquire other means of self-improvement, e.g. more efficient sensors.&#8221;</em></p>
<p>Alexander Kruel, <a href="http://kruel.co/2011/07/21/why-i-am-skeptical-of-risks-from-ai/">Why I am skeptical of risks from AI</a></p>
<p>[9] Section <em>&#8216;Necessary resources for an intelligence explosion&#8217;</em>, <a href="http://kruel.co/2011/07/21/why-i-am-skeptical-of-risks-from-ai/">Why I am skeptical of risks from AI</a>, Alexander Kruel</p>
<p>[10] <a href="http://lesswrong.com/lw/3aa/friendly_ai_research_and_taskification/">http://lesswrong.com/lw/3aa/friendly_ai_research_and_taskification/</a></p>
<p>[11] <a href="http://lesswrong.com/r/discussion/lw/ajm/ai_risk_and_opportunity_a_strategic_analysis/5ylx">http://lesswrong.com/r/discussion/lw/ajm/ai_risk_and_opportunity_a_strategic_analysis/5ylx</a></p>
<p>[12] <a href="http://lesswrong.com/lw/8c3/qa_with_new_executive_director_of_singularity/5y77">http://lesswrong.com/lw/8c3/qa_with_new_executive_director_of_singularity/5y77</a></p>
<p>[13] <em>&#8220;I think that if you&#8217;re aiming to develop knowledge that won&#8217;t be useful until very very far in the future, you&#8217;re probably wasting your time, if for no other reason than this: by the time your knowledge is relevant, someone will probably have developed a tool (such as a narrow AI) so much more efficient in generating this knowledge that it renders your work moot.&#8221;</em></p>
<p>Holden Karnofsky in a <a href="http://kruel.co/JaanTallinn201105.pdf">conversation</a> with Jaan Tallinn</p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/11/risks-from-ai-and-charitable-giving/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/7yyxDrvFomY" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/11/risks-from-ai-and-charitable-giving/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/11/risks-from-ai-and-charitable-giving/</feedburner:origLink></item>
		<item>
		<title>A Primer On Risks From AI</title>
		<link>http://feedproxy.google.com/~r/AlexanderKruel/~3/0rI2RzGBdOI/</link>
		<comments>http://kruel.co/2012/05/11/a-primer-on-risks-from-ai/#comments</comments>
		<pubDate>Fri, 11 May 2012 12:45:34 +0000</pubDate>
		<dc:creator>Alexander Kruel</dc:creator>
				<category><![CDATA[SIAI/LW]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial general intelligence]]></category>
		<category><![CDATA[risks]]></category>

		<guid isPermaLink="false">http://kruel.co/?p=2931</guid>
		<description><![CDATA[(This post is part of a series of imports of submissions that I wrote that initially appeared on lessswrong.com) The Power of Algorithms Evolutionary processes are the most evident example of the power of simple algorithms [1][2][3][4][5]. The field of evolutionary biology gathered a vast amount of evidence [6] that established evolution as the process [...]]]></description>
			<content:encoded><![CDATA[<p><span style="color: #808080;">(This post is part of a series of imports of submissions that I wrote that initially appeared on lessswrong.com)</span></p>
<h2 dir="ltr">The Power of Algorithms</h2>
<p>Evolutionary processes are the most evident example of the power of simple algorithms [1][2][3][4][5].</p>
<p>The field of evolutionary biology gathered a vast amount of evidence [6] that established evolution as the process that explains the local decrease in entropy [7], the complexity of life.</p>
<p>Since it can be conclusively shown that all life is an effect of an evolutionary process it is implicit that everything we do not understand about living beings is also an effect of evolution.</p>
<p>We might not understand the nature of intelligence [8] and consciousness [9] but we do know that they are the result of an optimization process that is neither intelligent nor conscious.</p>
<p>Therefore we know that it is possible for an physical optimization process to culminate in the creation of more advanced processes that feature superior qualities.</p>
<p>One of these qualities is the human ability to observe and improve the optimization process that created us. The most obvious example being science [10].</p>
<p>Science can be thought of as civilization-level self-improvement method. It allows us to work together in a systematic and efficient way and accelerate the rate at which further improvements are made.</p>
<h2 dir="ltr">The Automation of Science</h2>
<p>We know that optimization processes that can create improved versions of themselves are possible, even without an explicit understanding of their own workings, as exemplified by natural selection.</p>
<p>We know that optimization processes can lead to self-reinforcing improvements, as exemplified by the adaptation of the scientific method [11] as an improved evolutionary process and successor of natural selection.</p>
<p>Which raises questions about the continuation of this self-reinforcing feedback cycle and its possible implications.</p>
<p>One possibility is to automate science [12][13] and apply it to itself and its improvement.</p>
<p>But science is a tool and its bottleneck are its users. Humans, the biased [14] effect of the blind idiot god that is evolution.</p>
<p>Therefore the next logical step is to use science to figure out how to replace humans by a better version of themselves, artificial general intelligence.</p>
<p>Artificial general intelligence, that can recursively optimize itself [15], is the logical endpoint of various converging and self-reinforcing feedback cycles.</p>
<h2 dir="ltr">Risks from AI</h2>
<p>Will we be able to build an artificial general intelligence? Yes, sooner or later.</p>
<p>Even the unintelligent, unconscious and aimless process of natural selection was capable of creating goal-oriented, intelligent and conscious agents that can think ahead, jump fitness gaps and improve upon the process that created them to engage in prediction and direct experimentation.</p>
<p>The question is, what are the possible implications of the invention of an artificial, fully autonomous, intelligent and goal-oriented optimization process?</p>
<p>One good bet is that such an agent will recursively improve its most versatile, and therefore instrumentally useful, resource. It will improve its general intelligence, respectively cross-domain optimization power.</p>
<p>Since it is unlikely that human intelligence is the optimum, the positive feedback effect, that is a result of using intelligence amplifications to amplify intelligence, is likely to lead to a level of intelligence that is generally more capable than the human intelligence level.</p>
<p>Humans are unlikely to be the most efficient thinkers because evolution is mindless and has no goals. Evolution did not actively try to create the smartest thing possible.</p>
<p>Evolution is further not limitlessly creative, each step of an evolutionary design must increase the fitness of its host. Which makes it probable that there are artificial mind designs that can do what no product of natural selection could accomplish, since an intelligent artificer does not rely on the incremental fitness of each step in the development process.</p>
<p>It is actually possible that human general intelligence is the bare minimum. Because the human level of intelligence might have been sufficient to both survive and reproduce and that therefore no further evolutionary pressure existed to select for even higher levels of general intelligence.</p>
<p>The implications of this possibility might be the creation of an intelligent agent that is more capable than humans in every sense. Maybe because it does directly employ superior approximations of our best formal methods, that tell us how to update based on evidence and how to choose between various actions. Or maybe it will simply think faster. It doesn’t matter.</p>
<p>What matters is that a superior intellect is probable and that it will be better than us at discovering knowledge and inventing new technology. Technology that will make it even more powerful and likely invincible.</p>
<p>And that is the problem. We might be unable to control such a superior being. Just like a group of chimpanzees is unable to stop a company from clearing its forest [16].</p>
<p>But even if such a being is only slightly more capable than us. We might find ourselves at its mercy nonetheless.</p>
<p>Human history provides us with many examples [17][18][19] that make it abundantly clear that even the slightest advance can enable one group to dominate others.</p>
<p>What happens is that the dominant group imposes its values on the others. Which in turn raises the question of what values an artificial general intelligence might have and the implications of those values for us.</p>
<p>Due to our evolutionary origins, our struggle for survival and the necessity to cooperate with other agents, we are equipped with many values and a concern for the welfare of others [20].</p>
<p>The information theoretic complexity [21][22] of our values is very high. Which means that it is highly unlikely for similar values to automatically arise in agents that are the product of intelligent design, agents that never underwent the million of years of competition with other agents that equipped humans with altruism and general compassion.</p>
<p>But that does not mean that an artificial intelligence won’t have any goals [23][24]. Just that those goals will be simple and their realization remorseless [25].</p>
<p>An artificial general intelligence will do whatever is implied by its initial design. And we will be helpless to stop it from achieving its goals. Goals that won’t automatically respect our values [26].</p>
<p>A likely implication is the total extinction of all of humanity [27][28].</p>
<h2 dir="ltr">Further Reading</h2>
<ul>
<li><a href="http://michaelnielsen.org/blog/what-should-a-reasonable-person-believe-about-the-singularity/">What should a reasonable person believe about the Singularity?</a></li>
<li><a href="http://consc.net/papers/singularity.pdf">The Singularity: A Philosophical Analysis</a></li>
<li><a href="http://commonsenseatheism.com/wp-content/uploads/2012/02/Muehlhauser-Salamon-Intelligence-Explosion-Evidence-and-Import.pdf">Intelligence Explosion: Evidence and Import</a></li>
<li><a href="http://kruel.co/papers/probable-intelligence-explosion.pdf">Why an Intelligence Explosion is Probable</a></li>
<li><a href="http://yudkowsky.net/singularity/ai-risk">Artificial Intelligence as a Positive and Negative Factor in Global Risk</a></li>
<li><a href="http://www.singinst.com/upload/mostly-harmless.pdf">From mostly harmless to civilization-threatening: pathways to dangerous artificial general intelligences</a></li>
<li><a href="http://wiki.lesswrong.com/wiki/The_Hanson-Yudkowsky_AI-Foom_Debate">The Hanson-Yudkowsky AI-Foom Debate</a></li>
<li><a href="http://facingthesingularity.com/">Facing The Singularity</a></li>
</ul>
<p><strong>References</strong></p>
<p align="left">[1] Genetic Algorithms and Evolutionary Computation, <a href="http://www.talkorigins.org/faqs/genalg/genalg.html">talkorigins.org/faqs/genalg/genalg.html</a><br />
[2] Fixing software bugs in 10 minutes or less using evolutionary computation, <a href="http://www.genetic-programming.org/hc2009/1-Forrest/Forrest-Presentation.pdf">genetic-programming.org/hc2009/1-Forrest/Forrest-Presentation.pdf</a><br />
[3] Automatically Finding Patches Using Genetic Programming, <a href="http://www.genetic-programming.org/hc2009/1-Forrest/Forrest-Paper-on-Patches.pdf">genetic-programming.org/hc2009/1-Forrest/Forrest-Paper-on-Patches.pdf</a><br />
[4] A Genetic Programming Approach to Automated Software Repair, <a href="http://www.genetic-programming.org/hc2009/1-Forrest/Forrest-Paper-on-Repair.pdf">genetic-programming.org/hc2009/1-Forrest/Forrest-Paper-on-Repair.pdf</a><br />
[5]GenProg: A Generic Method for Automatic Software Repair, <a href="http://www.cs.virginia.edu/%7Eweimer/p/weimer-tse2012-genprog.pdf">virginia.edu/~weimer/p/weimer-tse2012-genprog.pdf</a><br />
[6] 29+ Evidences for Macroevolution (The Scientific Case for Common Descent), <a href="http://www.talkorigins.org/faqs/comdesc/">talkorigins.org/faqs/comdesc/</a><br />
[7] Thermodynamics, Evolution and Creationism, <a href="http://www.talkorigins.org/faqs/thermo.html">talkorigins.org/faqs/thermo.html</a><br />
[8] A Collection of Definitions of Intelligence, <a href="http://www.vetta.org/documents/A-Collection-of-Definitions-of-Intelligence.pdf">vetta.org/documents/A-Collection-of-Definitions-of-Intelligence.pdf</a><br />
[9] <a href="http://plato.stanford.edu/entries/consciousness/">plato.stanford.edu/entries/consciousness/</a><br />
[10] <a href="http://en.wikipedia.org/wiki/Science">en.wikipedia.org/wiki/Science</a><br />
[11] <a href="http://en.wikipedia.org/wiki/Scientific_method">en.wikipedia.org/wiki/Scientific_method</a><br />
[12] The Automation of Science, <a href="http://www.sciencemag.org/content/324/5923/85.abstract">sciencemag.org/content/324/5923/85.abstract</a><br />
[13] Computer Program Self-Discovers Laws of Physics, <a href="http://www.wired.com/wiredscience/2009/04/newtonai/">wired.com/wiredscience/2009/04/newtonai/</a><br />
[14] List of cognitive biases, <a href="http://en.wikipedia.org/wiki/List_of_cognitive_biases">en.wikipedia.org/wiki/List_of_cognitive_biases</a><br />
[15] Intelligence explosion, <a href="http://wiki.lesswrong.com/wiki/Intelligence_explosion">wiki.lesswrong.com/wiki/Intelligence_explosion</a><br />
[16] 1% with Neil deGrasse Tyson, <a href="http://youtu.be/9nR9XEqrCvw">youtu.be/9nR9XEqrCvw</a><br />
[17] Mongol military tactics and organization, <a href="http://en.wikipedia.org/wiki/Mongol_military_tactics_and_organization">en.wikipedia.org/wiki/Mongol_military_tactics_and_organization</a><br />
[18] Wars of Alexander the Great, <a href="http://en.wikipedia.org/wiki/Wars_of_Alexander_the_Great">en.wikipedia.org/wiki/Wars_of_Alexander_the_Great</a><br />
[19] Spanish colonization of the Americas, <a href="http://en.wikipedia.org/wiki/Spanish_colonization_of_the_Americas">en.wikipedia.org/wiki/Spanish_colonization_of_the_Americas</a><br />
[20] A Quantitative Test of Hamilton&#8217;s Rule for the Evolution of Altruism, <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1000615">plosbiology.org/article/info:doi/10.1371/journal.pbio.1000615</a><br />
[21] Algorithmic information theory, <a href="http://www.scholarpedia.org/article/Algorithmic_information_theory">scholarpedia.org/article/Algorithmic_information_theory</a><br />
[22] Algorithmic probability, <a href="http://www.scholarpedia.org/article/Algorithmic_probability">scholarpedia.org/article/Algorithmic_probability</a><br />
[23] The Nature of Self-Improving Artiﬁcial Intelligence, <a href="http://selfawaresystems.files.wordpress.com/2008/01/nature_of_self_improving_ai.pdf">selfawaresystems.files.wordpress.com/2008/01/nature_of_self_improving_ai.pdf</a><br />
[24] The Basic AI Drives, <a href="http://selfawaresystems.files.wordpress.com/2008/01/ai_drives_final.pdf">selfawaresystems.files.wordpress.com/2008/01/ai_drives_final.pdf</a><br />
[25] Paperclip maximizer, <a href="http://wiki.lesswrong.com/wiki/Paperclip_maximizer">wiki.lesswrong.com/wiki/Paperclip_maximizer</a><br />
[26] Friendly artificial intelligence, <a href="http://wiki.lesswrong.com/wiki/Friendly_artificial_intelligence">wiki.lesswrong.com/wiki/Friendly_artificial_intelligence</a><br />
[27] Existential Risk, <a href="http://www.existential-risk.org/">existential-risk.org<br />
</a>[28] 5 minutes on AI risk <a href="http://www.existential-risk.org/">youtu.be/ySxvsvpfdUQ<br />
</a></p>
<div class="plus-one-wrap"><g:plusone href="http://kruel.co/2012/05/11/a-primer-on-risks-from-ai/"></g:plusone></div><img src="http://feeds.feedburner.com/~r/AlexanderKruel/~4/0rI2RzGBdOI" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://kruel.co/2012/05/11/a-primer-on-risks-from-ai/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://kruel.co/2012/05/11/a-primer-on-risks-from-ai/</feedburner:origLink></item>
	</channel>
</rss>

