<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:blogger="http://schemas.google.com/blogger/2008" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;DkEHRX89eSp7ImA9WhJUGUg.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453</id><updated>2012-09-18T00:50:34.161-07:00</updated><category term="robot" /><category term="SL" /><category term="RL" /><category term="monkey" /><category term="BMI" /><category term="data" /><category term="AI" /><category term="robotics" /><category term="theory of mind" /><title>Reinforcement Learning</title><subtitle type="html">A blog about reinforcement learning, machine learning, and more generally artificial intelligence, with occasional posts about professorial life.</subtitle><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://reinforcementlearning.blogspot.com/" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>19</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/ReinforcementLearning" /><feedburner:info uri="reinforcementlearning" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;CUECQXc4cCp7ImA9WhJTF04.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-4675351902198817883</id><published>2012-06-24T18:37:00.001-07:00</published><updated>2012-06-26T11:01:00.938-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-06-26T11:01:00.938-07:00</app:edited><title>A great talk on "Are Humans Just another Primate"</title><summary type="html">
Every so often I have discussions with my AI colleagues about whether the best research approach to building human-level intelligence is to build machines that have the linguistic abilities of humans or perhaps the high-level problem-solving skills of humans or whether it is to builds machines that can navigate, physically manipulate their world, and deal with other machines at a basic social &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/YvlastMZops" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/4675351902198817883/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=4675351902198817883" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4675351902198817883?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4675351902198817883?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/YvlastMZops/great-talk-on-are-humans-just-another.html" title="A great talk on &quot;Are Humans Just another Primate&quot;" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2012/06/great-talk-on-are-humans-just-another.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0EBQHczfCp7ImA9Wx5XEU8.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-7805238328426499151</id><published>2010-09-10T06:34:00.000-07:00</published><updated>2010-09-10T06:34:11.984-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-09-10T06:34:11.984-07:00</app:edited><title>To "know everything"</title><summary type="html">Eric Schmidt, CEO of Google, describes the circumstance when "all the information in the world is available at our fingertips" (presumably through always networked mobile devices, etc.) as being a circumstance when "we can literally know everything".

It is interesting that common parlance is shifting to call looking something up on the internets as knowing that something. At one level this is &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/f0zX9qXKaro" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/7805238328426499151/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=7805238328426499151" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7805238328426499151?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7805238328426499151?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/f0zX9qXKaro/to-know-everything.html" title="To &quot;know everything&quot;" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2010/09/to-know-everything.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0YFQXk5fip7ImA9Wx5SGUo.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-943536579187624394</id><published>2010-08-16T08:18:00.000-07:00</published><updated>2010-08-16T09:11:50.726-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-08-16T09:11:50.726-07:00</app:edited><title>A Visual Representation of a Doctorate</title><summary type="html">An amusing representation of what a doctorate means.&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/PO_K_fESZZ0" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/943536579187624394/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=943536579187624394" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/943536579187624394?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/943536579187624394?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/PO_K_fESZZ0/graphical-representation-of-doctorate.html" title="A Visual Representation of a Doctorate" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>1</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2010/08/graphical-representation-of-doctorate.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Dk4FQH84eCp7ImA9WxJUFEw.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-7698700345926255204</id><published>2009-07-12T09:05:00.000-07:00</published><updated>2009-07-12T09:08:31.130-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-12T09:08:31.130-07:00</app:edited><title>Data on Scientists and what they and the public think of them?</title><summary type="html">An interesting post from the Pew Research Center for the People &amp;amp; Press. A must read for academics and researchers. Lots of poll results and some commentary.&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/IgbsmAE5r40" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/7698700345926255204/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=7698700345926255204" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7698700345926255204?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7698700345926255204?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/IgbsmAE5r40/data-on-scientists-and-what-they-and.html" title="Data on Scientists and what they and the public think of them?" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>2</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2009/07/data-on-scientists-and-what-they-and.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEQGRngycSp7ImA9WxVUF0s.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-3085195992488559209</id><published>2009-03-22T16:52:00.000-07:00</published><updated>2009-03-22T16:58:47.699-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-22T16:58:47.699-07:00</app:edited><title>Is University Science somehow more Pure than Industry Science?</title><summary type="html">In this Washington post article the writer (a former stem cell researcher at Harvard) argues that the view that University research/science is "curiosity-driven" is misplaced and that the incentive structure is broken. I agree!Here are two relevant quotes from his argument that I find real."University researchers are in a constant battle for recognition and the rewards associated with success: &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/Wibg-lS81i8" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/3085195992488559209/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=3085195992488559209" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/3085195992488559209?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/3085195992488559209?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/Wibg-lS81i8/is-university-science-somehow-more-pure.html" title="Is University Science somehow more Pure than Industry Science?" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2009/03/is-university-science-somehow-more-pure.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0cBR3Y4eSp7ImA9WxRUFUk.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-7782960751383602469</id><published>2008-11-24T07:30:00.000-08:00</published><updated>2008-11-24T07:37:36.831-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-11-24T07:37:36.831-08:00</app:edited><title>On Academic Freedom</title><summary type="html">A New York Times opinion writer, Stanley Fish, often writes about questions of interest to university professors. In this article, he writes about the notion of "academic freedom" and what it means and how it is quite different from individual First Amendment rights. He also refers to a new book on this topic. Much of his discussion makes sense to me. In particular, the notion that "... academic &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/NejO5nu46sg" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/7782960751383602469/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=7782960751383602469" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7782960751383602469?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7782960751383602469?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/NejO5nu46sg/on-academic-freedom.html" title="On Academic Freedom" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/11/on-academic-freedom.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D08CRHk8cSp7ImA9WxRXFkw.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-3181067478324484120</id><published>2008-10-21T12:03:00.001-07:00</published><updated>2008-10-21T12:04:25.779-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-10-21T12:04:25.779-07:00</app:edited><title>On Reviewing Reinforcement Learning Papers</title><summary type="html">*I had drafted this earlier this summer but never got around to finishing and posting this. Am doing so now*I just finished reviewing for a conference and in looking over my reviews was reminded of the following comment by a senior colleague to me at ICML this year. He has been program chair for several conferences and said that he thought that RL reviewers were the hardest on their own community&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/vG-A5L_GDq4" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/3181067478324484120/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=3181067478324484120" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/3181067478324484120?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/3181067478324484120?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/vG-A5L_GDq4/on-reviewing-reinforcement-learning.html" title="On Reviewing Reinforcement Learning Papers" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>4</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/10/on-reviewing-reinforcement-learning.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEQMRnk4eCp7ImA9WxRXEE4.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-2105329632843765368</id><published>2008-10-14T16:49:00.001-07:00</published><updated>2008-10-14T19:06:27.730-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-10-14T19:06:27.730-07:00</app:edited><title>General Results versus Illuminating Examples</title><summary type="html">I recently came across this description of economists attributed by Paul Krugman to Robert Solow. "There are two kinds of economists: those who look for general results and those who look for illuminating examples." This dichotomy struck me as rather interesting and upon reflection, it was pretty clear to me that my own AI-research methodology is overwhelmingly towards the general results side. I&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/VU4I1ghBjKA" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/2105329632843765368/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=2105329632843765368" title="6 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/2105329632843765368?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/2105329632843765368?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/VU4I1ghBjKA/general-results-versus-illuminating.html" title="General Results versus Illuminating Examples" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>6</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/10/general-results-versus-illuminating.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkEGQHc-fip7ImA9WxRQGUk.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-1628950944990731127</id><published>2008-10-13T17:28:00.001-07:00</published><updated>2008-10-13T17:37:01.956-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-10-13T17:37:01.956-07:00</app:edited><title>Krugman and Foundation</title><summary type="html">On a non-RL note: Paul Krugman, one of the writers whose nytimes blog I have enjoyed reading over the past few months (and incidentally the one who just got a Nobel Memorial Prize in Economics) credits the reading of the Foundation series of books by Isaac Asimov for sparking his interest in economics. His notion being that short of inventing the field of psychohistory, economics is the field &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/ojZ2BDSc0jY" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/1628950944990731127/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=1628950944990731127" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/1628950944990731127?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/1628950944990731127?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/ojZ2BDSc0jY/krugman-and-foundation.html" title="Krugman and Foundation" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/10/krugman-and-foundation.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0UNQX8zcCp7ImA9WxdRE0s.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-2238434055351252377</id><published>2008-06-01T18:10:00.000-07:00</published><updated>2008-06-01T18:14:50.188-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-06-01T18:14:50.188-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="AI" /><title>On "What it means to be Human"</title><summary type="html">A far too brief report of the World Science Festival stellar panelon What it means to be human.(If someone finds a better report or video, please point us to it.)&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/yzvRLBi1fnE" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/2238434055351252377/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=2238434055351252377" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/2238434055351252377?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/2238434055351252377?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/yzvRLBi1fnE/on-what-it-means-to-be-human.html" title="On &quot;What it means to be Human&quot;" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>1</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/06/on-what-it-means-to-be-human.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0EHSHg4fSp7ImA9WxdRE04.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-7746610765222084997</id><published>2008-06-01T08:03:00.000-07:00</published><updated>2008-06-01T10:00:39.635-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-06-01T10:00:39.635-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="AI" /><title>An interesting conversation about "Conversation"</title><summary type="html">Conversation with Ian McEwan &amp;amp; Steven Pinker (audio only) [Warning: has mature content](or, why building conversational systems would be so hard)&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/VIH3uIyg4VA" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/7746610765222084997/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=7746610765222084997" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7746610765222084997?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/7746610765222084997?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/VIH3uIyg4VA/great-conversation-about-conversation.html" title="An interesting conversation about &quot;Conversation&quot;" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/06/great-conversation-about-conversation.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkADSHo9cCp7ImA9WxdREUo.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-1275913919544462638</id><published>2008-05-30T11:01:00.000-07:00</published><updated>2008-05-30T11:06:19.468-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-30T11:06:19.468-07:00</app:edited><title>Some RL Community Statistics</title><summary type="html">In response to Satinder's post about growth of the field, I did a little experiment.  Google scholar allows for date searches.  (Citeseer has similar information, but I couldn't figure out a way to work with it.)  I searched for the phrase "reinforcement learning" and counted the number of hits in each year.  Google scholar also creates a "key author" list for each search, so I included that &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/7E9GrHsQx8k" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/1275913919544462638/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=1275913919544462638" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/1275913919544462638?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/1275913919544462638?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/7E9GrHsQx8k/some-rl-community-statistics.html" title="Some RL Community Statistics" /><author><name>Michael Littman</name><uri>http://www.blogger.com/profile/05878303953204580887</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>2</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/some-rl-community-statistics.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUUDRng-cCp7ImA9WxdREUs.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-4384346342187128361</id><published>2008-05-30T09:59:00.000-07:00</published><updated>2008-05-30T10:07:57.658-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-30T10:07:57.658-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="AI" /><category scheme="http://www.blogger.com/atom/ns#" term="robotics" /><category scheme="http://www.blogger.com/atom/ns#" term="BMI" /><title>Another amazing BMI robot arm</title><summary type="html">An online Wired article describes a Robot arm built by Dean Kamen (of the Segway fame). I think it is great that folks like Dean want to innovate in this space. Great for AI and robotics, not to mention all the folks this could end up helping. (BTW, does RL have a role to play in building brain machine interfaces?)Interview of Dean by Walter Mossberg, and another video&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/oU79Nq6G4NM" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/4384346342187128361/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=4384346342187128361" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4384346342187128361?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4384346342187128361?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/oU79Nq6G4NM/another-amazing-bmi-robot-arm.html" title="Another amazing BMI robot arm" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/another-amazing-bmi-robot-arm.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkAHSHo6eCp7ImA9WxdREUs.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-4197612825472709742</id><published>2008-05-29T19:35:00.001-07:00</published><updated>2008-05-30T10:32:19.410-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-30T10:32:19.410-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="AI" /><category scheme="http://www.blogger.com/atom/ns#" term="SL" /><category scheme="http://www.blogger.com/atom/ns#" term="theory of mind" /><title>AI in Second Life (post 1)</title><summary type="html">I have had a casual interest in Second Life (SL) for a little while and periodically I am going to collect links to and briefly discuss AI work being done in SL. Here is the first such discussion.This page has a brief description of work by RPI on building an AI avatar that can do rudimentary conversations in SL. The description characterizes the avatar as a "four year old" based on a theory of &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/sUmITdftzFc" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/4197612825472709742/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=4197612825472709742" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4197612825472709742?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4197612825472709742?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/sUmITdftzFc/ai-in-second-life-post-1_29.html" title="AI in Second Life (post 1)" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>4</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/ai-in-second-life-post-1_29.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkMARn4yeip7ImA9WxdREEQ.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-354767025250031671</id><published>2008-05-29T13:48:00.000-07:00</published><updated>2008-05-29T13:54:07.092-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-29T13:54:07.092-07:00</app:edited><title>Suggest Captions :)</title><summary type="html">Matthew Taylor just sent me this funny picture of me at the recent Barbados workshop on RL along with his caption which was "Satinder is ready to wring the last bit of performance out of some poor PSR algorithm.  "How would you caption it? :)&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/BLjnXqo-ZZ8" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/354767025250031671/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=354767025250031671" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/354767025250031671?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/354767025250031671?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/BLjnXqo-ZZ8/suggest-captions.html" title="Suggest Captions :)" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://bp3.blogger.com/_pgMEjMHzg3I/SD8XNZFBqtI/AAAAAAAAAAY/EiSZ1GdEf80/s72-c/2462359772_9380f908d5.jpg" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/suggest-captions.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkUDQnw8fyp7ImA9WxdREE8.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-1148087446183937795</id><published>2008-05-28T19:16:00.000-07:00</published><updated>2008-05-28T19:31:13.277-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-28T19:31:13.277-07:00</app:edited><title>What a totally crazy wacky fruit!</title><summary type="html">Ok, this has nothing to do with RL or AI. But what a fruit! Someone should bring a bag to the next conference for fun.To quote the nytimes article:"The miracle fruit, Synsepalum dulcificum, is native to West Africa and has been known to Westerners since the 18th century. The cause of the reaction is a protein called miraculin, which binds with the taste buds and acts as a sweetness inducer when &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/pa_hnbqxzKY" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/1148087446183937795/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=1148087446183937795" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/1148087446183937795?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/1148087446183937795?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/pa_hnbqxzKY/what-totally-crazy-wacky-fruit.html" title="What a totally crazy wacky fruit!" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>2</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/what-totally-crazy-wacky-fruit.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0cEQXo5fyp7ImA9WxdREE8.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-4878780115778234178</id><published>2008-05-28T17:13:00.000-07:00</published><updated>2008-05-28T17:30:00.427-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-28T17:30:00.427-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="robot" /><category scheme="http://www.blogger.com/atom/ns#" term="BMI" /><category scheme="http://www.blogger.com/atom/ns#" term="monkey" /><title>Monkeying around with robots</title><summary type="html">This is a cool project and brain-machine interfaces like this will be a great application of machine learning that will help society. This stuff is exciting!&lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/Xrtok8XChD8" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/4878780115778234178/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=4878780115778234178" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4878780115778234178?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/4878780115778234178?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/Xrtok8XChD8/monkeying-around-with-robots.html" title="Monkeying around with robots" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/monkeying-around-with-robots.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEUHQn4-eCp7ImA9WxdREUU.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-2672780675529029431</id><published>2008-05-27T18:32:00.001-07:00</published><updated>2008-05-30T15:23:53.050-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-30T15:23:53.050-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="data" /><category scheme="http://www.blogger.com/atom/ns#" term="RL" /><title>Measuring Reinforcement Learning</title><summary type="html">When I started my PhD in Andy Barto's lab in 1988, there were perhaps a handful of folks doing research in the field of modern RL. There was an outpost of ex-students from Andy's lab at GTE Laboratories including folks like Rich Sutton, Chuck Anderson and Judy Franklin. By 1990 or so, there were a few others including Leslie Kaelbling and Peter Dayan. It was a pretty lonely field back then.But &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/j_fDT8qzQms" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/2672780675529029431/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=2672780675529029431" title="7 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/2672780675529029431?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/2672780675529029431?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/j_fDT8qzQms/measuring-rl_27.html" title="Measuring Reinforcement Learning" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>7</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/measuring-rl_27.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkUBQHczeSp7ImA9WxdSGU4.&quot;"><id>tag:blogger.com,1999:blog-3231233230519160453.post-6924064705105578457</id><published>2008-05-27T18:22:00.001-07:00</published><updated>2008-05-27T18:30:51.981-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-05-27T18:30:51.981-07:00</app:edited><title>A beginning</title><summary type="html">So I am starting a blog in which I (and others to be invited) will initiate conversations with those who wish to comment (or just read) on things pertaining to RL and more generally AI. The intention is to keep this related to research and only occasionally stray into other areas of life.At this point I have no rules for posts or comments other than to keep things friendly and civil. This may be &lt;img src="http://feeds.feedburner.com/~r/ReinforcementLearning/~4/3VLfb1iFZFk" height="1" width="1"/&gt;</summary><link rel="replies" type="application/atom+xml" href="http://reinforcementlearning.blogspot.com/feeds/6924064705105578457/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=3231233230519160453&amp;postID=6924064705105578457" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/6924064705105578457?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/3231233230519160453/posts/default/6924064705105578457?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/ReinforcementLearning/~3/3VLfb1iFZFk/beginning.html" title="A beginning" /><author><name>Satinder Singh</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="21" height="32" src="http://bp3.blogger.com/_pgMEjMHzg3I/SD35y5FBqrI/AAAAAAAAAAM/cWDTG5e9144/S220/Baveja.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://reinforcementlearning.blogspot.com/2008/05/beginning.html</feedburner:origLink></entry></feed>
