<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;A04DR3o9fyp7ImA9WhRUF0o.&quot;"><id>tag:blogger.com,1999:blog-21224994</id><updated>2012-01-28T11:46:16.467-08:00</updated><category term="ACL" /><category term="UNIX" /><category term="CVPR" /><category term="correlate" /><category term="datasets" /><category term="jsm2011" /><category term="China" /><category term="MapReduce" /><category term="osdi10" /><category term="localization" /><category term="Machine Learning" /><category term="UI" /><category term="resource optimization" /><category term="Speech" /><category term="conference" /><category term="NIPS" /><category term="Google Books" /><category term="distributed systems" /><category term="trends" /><category term="Government" /><category term="market algorithms" /><category term="internationalization" /><category term="jsm" /><category term="accessibility" /><category term="Structured Data" /><category term="Awards" /><category term="Public Data Explorer" /><category term="video" /><category term="Africa" /><category term="Visiting Faculty" /><category term="osdi" /><category term="Android" /><category term="Translate" /><category term="Korean" /><category term="Natural Language Processing" /><category term="grants" /><category term="Policy" /><category term="Cantonese" /><category term="Vision Research" /><category term="K-12" /><category term="search ads" /><category term="Machine Hearing" /><category term="ACM" /><category term="TV" /><category term="Ngram" /><category term="operating systems" /><category 
term="ph.d. fellowship" /><category term="YouTube" /><category term="Voice Search" /><category term="Image Annotation" /><category term="App Inventor" /><category term="economics" /><category term="Research Awards" /><category term="University Relations" /><category term="Labs" /><category term="EMNLP" /><category term="Interspeech" /><category term="publication" /><category term="Publications" /><category term="Fusion Tables" /><category term="statistics" /><category term="Education" /><category term="conferences" /><title type="text">Google Research Blog</title><subtitle type="html">The latest news on Google Research.</subtitle><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://googleresearch.blogspot.com/" /><link rel="next" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default?start-index=26&amp;max-results=25&amp;redirect=false&amp;v=2" /><author><name>Kate Berrio</name><uri>http://www.blogger.com/profile/07320530782831519185</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>177</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/blogspot/gJZg" /><feedburner:info uri="blogspot/gjzg" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry 
gd:etag="W/&quot;D0cASX44fyp7ImA9WhRUEEQ.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-7755764154292321053</id><published>2012-01-20T13:30:00.000-08:00</published><updated>2012-01-20T13:30:48.037-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-20T13:30:48.037-08:00</app:edited><title>Open-sourcing Sky Map and collaborating with Carnegie Mellon University</title><content type="html">Posted by John Taylor and Kevin Serafini&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://googleblog.blogspot.com/2009/05/planetarium-in-your-pocket.html."&gt;In May 2009&lt;/a&gt; we launched Google Sky Map: our “window on the sky” for Android phones.  Created by half a dozen Googlers at the &lt;a href="http://googleblog.blogspot.com/2010/12/its-beautiful-day-in-this-neighborhood.html."&gt;Pittsburgh office&lt;/a&gt; in our 20% time, the app was designed to show off the amazing capabilities of the sensors in the first-generation Android phones.  Mostly, however, we wrote it because we love astronomy. And, thanks to Android’s broad reach, we have managed to share this passion with over 20 million Android users as well as with our local community at events such as the &lt;a href="http://www.pittsburghparks.org/urbanstarparty"&gt;Urban Sky Party&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Today, we are delighted to announce that we are going to share Sky Map in a different way: we are donating Sky Map to the community.  We are collaborating with Carnegie Mellon University in an exciting partnership that will see further development of Sky Map as a series of student projects.  Sky Map’s development will now be driven by the students, with Google engineers remaining closely involved as advisors. Additionally, we have &lt;a href="http://code.google.com/p/stardroid/"&gt;open-sourced&lt;/a&gt; the app so that other astronomy enthusiasts can take the code and augment it as they wish.&lt;br /&gt;
&lt;br /&gt;
The Google Sky Map team would like to thank all of our users who have taken the time to send us comments over the past 3 years.  You tell us that Sky Map has helped you show off your phone, see the stars when urban light pollution or weather obscured them, and even find romance!  The feedback that touched us most, though, can be summarized by this short email:&lt;br /&gt;
&lt;br /&gt;
“sat down with my son and looked around at the planets for about 45 minutes...time well spent, thanx”&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-7755764154292321053?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/8glcHFvzSt8" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/7755764154292321053/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=7755764154292321053" title="9 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/7755764154292321053?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/7755764154292321053?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/8glcHFvzSt8/open-sourcing-sky-map-and-collaborating.html" title="Open-sourcing Sky Map and collaborating with Carnegie Mellon University" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/12098626514775266161</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>9</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2012/01/open-sourcing-sky-map-and-collaborating.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkMFSH0ycSp7ImA9WhRVFEU.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-5091762402114985010</id><published>2012-01-13T10:46:00.000-08:00</published><updated>2012-01-13T10:46:59.399-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-13T10:46:59.399-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="datasets" /><title>CDC Birth Vital Statistics in BigQuery</title><content type="html">&lt;span class="byline-author"&gt;Posted by Dan Vanderkam, Software Engineer&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
Google’s &lt;a href="http://code.google.com/apis/bigquery/"&gt;BigQuery Service&lt;/a&gt; lets enterprises and developers crunch large-scale data sets quickly. But what if you don’t have a large-scale data set of your own?&lt;br /&gt;
&lt;br /&gt;
To help the data-less masses, BigQuery offers several &lt;a href="http://code.google.com/apis/bigquery/docs/sample-datasets.html"&gt;large, public data sets&lt;/a&gt;. One of these is the &lt;a href="http://code.google.com/apis/bigquery/docs/dataset-natality.html"&gt;natality&lt;/a&gt; data set, which records information about live births in the United States. The data is derived from the &lt;a href="http://www.cdc.gov/nchs/nvss.htm"&gt;Division of Vital Statistics&lt;/a&gt; at the &lt;a href="http://www.cdc.gov/"&gt;Centers for Disease Control and Prevention&lt;/a&gt;, which has collected an electronic record of birth statistics &lt;a href="http://www.cdc.gov/nchs/data_access/Vitalstatsonline.htm"&gt;since 1969&lt;/a&gt;. It is one of the longest-running electronic records in existence.&lt;br /&gt;
&lt;br /&gt;
Each row in this database represents a live birth. Using simple queries, you can discover fascinating trends from the last forty years.&lt;br /&gt;
&lt;br /&gt;
For example, here’s the average age of women giving birth to their first child:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-AZNr0o5S97M/TxB3osZ-JtI/AAAAAAAAAAo/Ue2-2XQk100/s1600/age-at-first-birth.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="237" src="http://3.bp.blogspot.com/-AZNr0o5S97M/TxB3osZ-JtI/AAAAAAAAAAo/Ue2-2XQk100/s400/age-at-first-birth.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
The average age has increased from 21.3 years in 1969 to 25.1 years in 2008. Using more complex queries, one could analyze the factors that have contributed to this increase, e.g. whether it can be explained by the changing racial/ethnic composition of the population.&lt;br /&gt;
&lt;br /&gt;
You can see more &lt;a href="http://goo.gl/yvlJ9" target="_blank"&gt;examples&lt;/a&gt; like this one on the BigQuery site.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-5091762402114985010?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/LAHkUn-0Jpc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/5091762402114985010/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=5091762402114985010" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/5091762402114985010?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/5091762402114985010?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/LAHkUn-0Jpc/cdc-birth-vital-statistics-in-bigquery.html" title="CDC Birth Vital Statistics in BigQuery" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/12098626514775266161</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-AZNr0o5S97M/TxB3osZ-JtI/AAAAAAAAAAo/Ue2-2XQk100/s72-c/age-at-first-birth.png" height="72" width="72" /><thr:total>1</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2012/01/cdc-birth-vital-statistics-in-bigquery.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUIFRXg_eSp7ImA9WhRWFkw.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-594702638846459145</id><published>2012-01-03T09:58:00.000-08:00</published><updated>2012-01-03T09:58:34.641-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-03T09:58:34.641-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="economics" /><category scheme="http://www.blogger.com/atom/ns#" term="correlate" /><category 
scheme="http://www.blogger.com/atom/ns#" term="internationalization" /><category scheme="http://www.blogger.com/atom/ns#" term="trends" /><title>Google Correlate expands to 49 additional countries</title><content type="html">&lt;span class="byline-author"&gt;Posted by Matt Mohebbi, Software Engineer&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
In May 2011 we &lt;a href="http://googleblog.blogspot.com/2011/05/mining-patterns-in-search-data-with.html"&gt;launched&lt;/a&gt; Google Correlate on Google Labs. &lt;a href="http://www.google.com/trends/correlate"&gt;This system&lt;/a&gt; enables a correlation search between a user-provided time series and millions of time series of Google search traffic. Since our initial launch, Correlate has graduated to Google Trends, and we've seen a number of great applications of Correlate in several domains, including economics (&lt;a href="http://www.thinkwithgoogle.com/quarterly/people/hal-varian-predicting-the-present.html"&gt;consumer spending&lt;/a&gt;, &lt;a href="http://www.freakonomics.com/2011/05/25/mining-for-correlations-it-works/"&gt;unemployment rate&lt;/a&gt; and &lt;a href="http://radar.oreilly.com/2011/06/google-correlate.html"&gt;housing inventory&lt;/a&gt;), &lt;a href="http://thesocietypages.org/socimages/2011/10/06/google-index-of-poor-mothers%E2%80%99-pain/"&gt;sociology&lt;/a&gt; and &lt;a href="http://www.newscientist.com/blogs/onepercent/2011/05/google-correlate-passes-our-we.html"&gt;meteorology&lt;/a&gt;. The correspondence of &lt;a href="http://www.google.com/trends/correlate/search?e=id:RqxskAqoRkX&amp;amp;t=weekly"&gt;gas prices and search activity for fuel-efficient cars&lt;/a&gt; was even briefly discussed in a &lt;a href="http://www.youtube.com/watch?v=dKNNN0NvVrc&amp;amp;feature=player_embedded#t=56m43s"&gt;Fox News presidential debate&lt;/a&gt;, and NPR recently &lt;a href="http://www.npr.org/2012/01/02/144572891/google-searches-are-a-window-into-our-culture"&gt;covered&lt;/a&gt; correlations related to political commentators. &lt;br /&gt;
&lt;br /&gt;
Health has always been an area of particular interest to our team (Matt Mohebbi, Julia Kodysh, Rob Schonberger and Dan Vanderkam). Correlate was inspired by Google Flu Trends and many of us worked on both systems. So we were very excited when the BioSense division at the CDC &lt;a href="http://cdc.gov/biosense/correlate/"&gt;published&lt;/a&gt;&amp;nbsp;a page which shows correlations between some of their national trends in patient diagnosis activity and Google search activity. With just three years of weekly data, relevant search terms are surfaced. For example, the time series for &lt;a href="http://www.google.com/trends/correlate/search?e=id:gLrYbR8MP9P&amp;amp;t=weekly"&gt;bloody nose&lt;/a&gt; surfaces "bloody snot" and "blood in snot". &lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-PPw_Iz2432c/TwM0bDoP0HI/AAAAAAAAAAc/IM6qDx2kO0c/s1600/bloody%2Bsnot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="222" src="http://1.bp.blogspot.com/-PPw_Iz2432c/TwM0bDoP0HI/AAAAAAAAAAc/IM6qDx2kO0c/s400/bloody%2Bsnot.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
While these terms shouldn't come as a surprise, there are others which are more interesting, including searches related to static electricity, dry skin, and red cheeks. Of course, correlation is not causation, but we hope that Correlate can help researchers generate new hypotheses from their data. &lt;br /&gt;
&lt;br /&gt;
To help researchers outside the United States, we're pleased to announce support for 49 additional countries in Google Correlate. It's now possible to see correlations like &lt;a href="http://www.google.com/trends/correlate/search?e=snorkeling&amp;amp;t=weekly&amp;amp;p=au"&gt;"snorkeling" in Australia&lt;/a&gt;, &lt;a href="http://www.google.com/trends/correlate/search?e=cherry+blossoms&amp;amp;t=weekly&amp;amp;p=jp"&gt;"cherry blossoms" in Japan&lt;/a&gt;, and &lt;a href="http://www.google.com/trends/correlate/search?e=beer+garden&amp;amp;t=weekly&amp;amp;p=de"&gt;"beer garden" in Germany&lt;/a&gt;. We look forward to seeing what new correlations researchers can find with this data!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-594702638846459145?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/sTJZqz8PPL0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/594702638846459145/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=594702638846459145" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/594702638846459145?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/594702638846459145?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/sTJZqz8PPL0/google-correlate-expands-to-49.html" title="Google Correlate expands to 49 additional countries" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/12098626514775266161</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-PPw_Iz2432c/TwM0bDoP0HI/AAAAAAAAAAc/IM6qDx2kO0c/s72-c/bloody%2Bsnot.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2012/01/google-correlate-expands-to-49.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Dk8HSHszcCp7ImA9WhRXFUQ.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1809234908579026219</id><published>2011-12-22T15:00:00.000-08:00</published><updated>2011-12-22T15:00:39.588-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-22T15:00:39.588-08:00</app:edited><title>Academic Successes in Cluster Computing</title><content type="html">Posted by Alfred Spector, VP of Research&lt;br /&gt;
&lt;br /&gt;
Access to massive computing resources is foundational to Research and Development. Fifteen awardees of the National Science Foundation (NSF) &lt;a href="http://www.nsf.gov/awardsearch/progSearch.do?SearchType=progSearch&amp;page=2&amp;QueryText=&amp;ProgOrganization=&amp;ProgOfficer=&amp;ProgEleCode=7782&amp;BooleanElement=false&amp;ProgRefCode=&amp;BooleanRef=false&amp;ProgProgram=&amp;ProgFoaCode=&amp;RestrictActive=on&amp;Search=Search#results"&gt;Cluster Exploratory Service&lt;/a&gt; (CLuE) program have been applying large scale computational resources &lt;a href="http://googleblog.blogspot.com/2008/02/supporting-cluster-computing-in.html"&gt;donated by Google and IBM&lt;/a&gt;. &lt;br /&gt;
&lt;br /&gt;
Overall, 1,328 researchers have used the cluster to perform over 120 million computing tasks and, in the process, have produced 49 scientific publications, educated thousands of students on parallel computing, and supported numerous post-doctoral candidates in their academic careers. Researchers have used the program for such diverse fields as astronomy, oceanography and linguistics. Besides validating &lt;a href="http://research.google.com/archive/mapreduce.html"&gt;MapReduce&lt;/a&gt; as a useful tool in academic research, the program has also generated significant scientific knowledge. &lt;br /&gt;
&lt;br /&gt;
Three years later, there are many viable, affordable alternatives to the Academic Cloud Computing Initiative, so we have decided to bring our part of the program to a close. It has been a great opportunity to collaborate with IBM, the NSF and the many universities on this program. It was state-of-the-art four years ago when it was started; now, academic cloud computing is a worldwide phenomenon and many low-cost cloud computing options are available.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1809234908579026219?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/ZwCMlAETwMY" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1809234908579026219/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1809234908579026219" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1809234908579026219?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1809234908579026219?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/ZwCMlAETwMY/academic-successes-in-cluster-computing.html" title="Academic Successes in Cluster Computing" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/12/academic-successes-in-cluster-computing.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0QASHsycCp7ImA9WhRQGUs.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-4621301871806736434</id><published>2011-12-09T08:48:00.000-08:00</published><updated>2011-12-15T09:15:49.598-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-15T09:15:49.598-08:00</app:edited><title>Measuring Ad Effectiveness Using Geo Experiments</title><content type="html">&lt;span class="byline-author"&gt;Posted by Lizzy Van Alstine and Jon Vaver, Quantitative Analysis Team&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
Advertisers want to be able to measure the effectiveness of their advertising. Many methods have been used to address this need, but the most rigorous and trusted of these are randomized experiments, which involve randomly assigning experimental units to control and test conditions. At Google, we have found that randomized geo experiments are a powerful approach to measuring the effectiveness of advertising.&lt;br /&gt;
&lt;br /&gt;
Many advertising platforms allow advertising to be targeted by geographical region. In these experiments, we first assign geographic regions to test or control conditions and employ AdWords’ geo-targeted advertising capabilities to increase or decrease the regional advertising spend accordingly. The use of randomized assignments guards against potential hidden test/control biases that could impact the measurements. Our approach also accounts for seasonal changes that impact the volume and cost of advertising across the length of the experiment.&lt;br /&gt;
&lt;br /&gt;
In &lt;a href="http://services.google.com/fh/files/blogs/geo_experiments_final_version.pdf"&gt;this paper&lt;/a&gt;, we describe the application of geo experiments for measuring the impact of advertising on consumer behavior (e.g. clicks, conversions, downloads, etc.). This description includes the results of a geo experiment that our research team ran for a Google advertiser.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-4621301871806736434?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/L62rVlggJyI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/4621301871806736434/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=4621301871806736434" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/4621301871806736434?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/4621301871806736434?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/L62rVlggJyI/measuring-ad-effectiveness-using-geo.html" title="Measuring Ad Effectiveness Using Geo Experiments" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/12098626514775266161</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/12/measuring-ad-effectiveness-using-geo.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkMAQHk7eCp7ImA9WhRQE0s.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1347738637523703608</id><published>2011-12-08T07:30:00.000-08:00</published><updated>2011-12-08T08:07:21.700-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-08T08:07:21.700-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Awards" /><category scheme="http://www.blogger.com/atom/ns#" term="ACM" /><title>ACM Fellows for 2011</title><content type="html">Posted by Alfred Spector, Google Research&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Cross-posted with the &lt;a href="http://googleblog.blogspot.com/2011/12/congratulations-to-three-googlers.html"&gt;Official Google Blog&lt;/a&gt;&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Congratulations to three Googlers elected ACM Fellows&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
It gives me great pleasure to share that the &lt;a href="http://www.acm.org/"&gt;Association for Computing Machinery&lt;/a&gt; (ACM) has &lt;a href="http://www.acm.org/press-room/news-releases/2011/fellows-2011/"&gt;announced&lt;/a&gt; that three Googlers have been elected ACM Fellows in 2011. The ACM is the world’s largest educational and scientific computing society, and the Fellows Program celebrates the exceptional contributions of leaders in the computing field. This year the society has selected &lt;a href="https://plus.google.com/115744399689614835150/about"&gt;Amit Singhal&lt;/a&gt;, &lt;a href="https://plus.google.com/110401818717224273095/posts"&gt;Peter S. Magnusson&lt;/a&gt; and &lt;a href="http://cseweb.ucsd.edu/~vahdat/"&gt;Amin Vahdat&lt;/a&gt; for their outstanding work, which has provided fundamental knowledge to the field.&lt;br /&gt;
&lt;br /&gt;
The recently named Fellows join 14 &lt;a href="http://googleblog.blogspot.com/2010/12/four-googlers-elected-acm-fellows-this.html"&gt;prior Googler ACM Fellows&lt;/a&gt; and other professional society honorees in exemplifying our extraordinarily talented people. On behalf of Google, I congratulate our colleagues. They embody Google’s commitment to innovation with impact, and I hope that they’ll serve as an inspiration to students as well as the broader community of computer scientists.&lt;br /&gt;
&lt;br /&gt;
You can read more detailed summaries of their achievements below, including the official citations from the ACM. &lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Dr. Amit Singhal, Google Fellow&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;For contributions to search and information retrieval&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
Since 2000, Dr. Amit Singhal has been pioneering search as the technical lead for Google's core search algorithms. He is credited with most of the information retrieval design decisions in Google Search – a massive system that has responded to hundreds of billions of queries. More than anyone, Amit has a deep understanding of Google’s entire algorithmic system. He is responsible for prioritization and has overseen the development of numerous algorithmic signals and their progression over time. He is the clear thought and managerial leader who has led critically important initiatives at the company. Among many other things, Amit catalyzed Universal Search, which returns multi-modal results from all available corpora; he was the force behind Realtime Search, which returns results from dynamic corpora with low latency; and he championed Google Instant, which returns search results as the user types. &lt;br /&gt;
&lt;br /&gt;
Prior to joining Google, Amit boasted a prolific publication record, averaging five publications per year from 1996 to 1999 while at AT&amp;amp;T Labs. Since that time, you could say Google Search has been one long, sustained publication demonstrating a constant advancement in the state of the art of information retrieval. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Peter S. Magnusson, Engineering Director&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;For contributions to full-system simulation&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
Peter has made a tremendous impact by driving full-system simulation. His approach was so advanced that it can be used in real-world production of commercial CPUs and prototyping of system software. Starting in 1991, Peter began to challenge the notion that simulators could not be made fast enough to run large workloads, nor accurate enough to run commercial operating systems. His innovations in simulator design culminated in Simics, the first academic simulator that could boot and run commercial multiprocessor workloads. Simics saw huge academic success and has been used to run simulations for research presented in several hundred subsequent publications. &lt;br /&gt;
&lt;br /&gt;
Peter founded Virtutech in 1998 to commercially develop Simics, and he ultimately forged and became the leader in a new market segment for software tools. With Peter at the helm, Virtutech pushed Simics beyond several performance barriers to make it the first simulator to exceed 1 billion instructions per second and the first simulator to model over 1,000 processors. Peter joined Google in 2010 to work on cloud computing.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Dr. Amin Vahdat, Principal Engineer&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;For contributions to data center scalability and management&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
Amin’s work made an impact at Google long before he arrived here. Amin is known for conducting research through bold, visionary projects that combine creativity with careful consideration of the engineering constraints needed to make them applicable in real world applications. Amin’s infrastructure ideas have underpinned the shift in the computing field from the pure client-server paradigm to a landscape in which major web services are hosted “in the cloud” across multiple data centers. In addition to pioneering “third-party cloud computing” through his work on WebOS and Rent-A-Server in the mid-90s, Amin has made important advancements in managing wide-area consistency between data centers, scalable modeling of data center applications, and building scalable data center networks. &lt;br /&gt;
&lt;br /&gt;
Amin’s innovations have penetrated and broadly influenced the networking community within academia and industry, including Google, and his research has been recapitulated and expanded upon in a number of publications. Conferences that formerly did not even cover data centers now have multiple sessions covering variants of what Amin and his team have proposed. At Google, Amin continues to drive next-generation data center infrastructure focusing on Software Defined Networking and new opportunities from optical technologies. This is emblematic of Amin’s ability to build real systems, and perhaps more significantly, convince people of their value.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1347738637523703608?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=KasnsGdFrA0:M6-MZE7rZoY:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/KasnsGdFrA0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1347738637523703608/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1347738637523703608" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1347738637523703608?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1347738637523703608?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/KasnsGdFrA0/acm-fellows-for-2011.html" title="ACM Fellows for 2011" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/12/acm-fellows-for-2011.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkYCRH4_fCp7ImA9WhRQEUU.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-2153916703336370099</id><published>2011-12-06T08:00:00.000-08:00</published><updated>2011-12-06T08:16:05.044-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-06T08:16:05.044-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="University Relations" /><category scheme="http://www.blogger.com/atom/ns#" term="Research Awards" /><title>Our second round of Google Research Awards for 2011</title><content type="html">Posted by Maggie Johnson, Director of Education &amp;amp; University Relations&lt;br /&gt;
&lt;br /&gt;
We’ve just finished the review process for the latest round of the &lt;a href="http://research.google.com/university/relations/research_awards.html"&gt;Google Research Awards&lt;/a&gt;, which provide funding to full-time faculty working on research in areas of mutual interest with Google. We are delighted to be funding 119 awards across 21 different focus areas for a total of $6 million. The subject areas that received the highest level of support this time were systems and infrastructure, human-computer interaction, social and mobile. In addition, 24% of the funding was awarded to universities outside the U.S.&lt;br /&gt;
&lt;br /&gt;
One way in which we measure the impact of the research award program is through surveys of Principal Investigators (PIs) and their Google sponsors (a Googler with whom grantees can discuss research directions, provide progress updates, engage in knowledge transfer, etc.). Here are some highlights from our most recent survey, covering projects funded over the last two years:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;433 papers were published as a result of a Google research award&lt;/li&gt;
&lt;li&gt;126 projects made data sets or software publicly available&lt;/li&gt;
&lt;li&gt;63 research talks were given by sponsored PIs at Google offices&lt;/li&gt;
&lt;/ul&gt;&lt;br /&gt;
An important aspect of the program is that it often gives early career academics a head start on their research agenda.  Many new PIs commented on how a Google research award allowed them to explore their initial ideas and build a foundation for obtaining more significant funding from other sources. This type of seed funding is especially hard to get in the current economic environment.  &lt;br /&gt;
&lt;br /&gt;
The goal of the research award program is to initiate and sustain strong collaborations with our academic colleagues. The collaborations take many forms, from working on a project together, to co-writing a paper, to coming to Google to give a research talk. Whatever the form, the most important aspect is building strong relationships that last. Case in point, many of our &lt;a href="http://research.google.com/university/relations/focused_research_awards.html"&gt;focused awards&lt;/a&gt; (multi-year, unrestricted grants that include access to Google’s tools, technology and expertise) started as Google research awards.&lt;br /&gt;
&lt;br /&gt;
Congratulations to the &lt;a href="http://services.google.com/fh/files/blogs/2011R2%20recipients%20for%20blog.pdf"&gt;well-deserving recipients of this round’s awards&lt;/a&gt;, and if you are interested in applying for the next round (deadline is April 15), please visit &lt;a href="http://research.google.com/university/relations/research_awards.html"&gt;our website&lt;/a&gt; for more information.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-2153916703336370099?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=_-TBfRhTGWw:0rILOGDq0Iw:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/_-TBfRhTGWw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/2153916703336370099/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=2153916703336370099" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/2153916703336370099?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/2153916703336370099?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/_-TBfRhTGWw/our-second-round-of-google-research.html" title="Our second round of Google Research Awards for 2011" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/12/our-second-round-of-google-research.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkMGR3Y8fCp7ImA9WhRRGEg.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-621494373520213050</id><published>2011-12-02T11:30:00.000-08:00</published><updated>2011-12-02T11:33:46.874-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-02T11:33:46.874-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="University Relations" /><category scheme="http://www.blogger.com/atom/ns#" term="China" /><category scheme="http://www.blogger.com/atom/ns#" term="Education" /><title>2011 Google China Faculty Summit in Hangzhou</title><content type="html">Posted by Aimin Zhu, University Relationship Manager, Google 
China&lt;br /&gt;
&lt;br /&gt;
We just wrapped up a highly successful 2011 Google China Faculty Summit in &lt;a href="http://maps.google.com/maps?q=hangzhou,china&amp;amp;hl=en&amp;amp;ll=30.221102,120.146484&amp;amp;spn=45.110857,79.013672&amp;amp;sll=34.01609,-117.856509&amp;amp;sspn=0.086083,0.154324&amp;amp;vpsrc=6&amp;amp;hnear=Hangzhou,+Zhejiang,+China&amp;amp;t=m&amp;amp;z=4"&gt;Hangzhou, China&lt;/a&gt;. On November 17 and 18, Googlers from China and the U.S. gathered with more than 80 faculty members representing more than 45 universities and institutes, including Tsinghua University, Peking University and The Chinese Academy of Sciences. The two-day event revolved around the theme of “Communication, Exploration and Expansion,” with day one covering research and day two focusing on academic development. &lt;br /&gt;
&lt;br /&gt;
The summit provided a unique setting for both sides to share the results of their research and exchange ideas. Speakers included: &lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Maggie Johnson, director of education and university relations at Google, presenting on innovation in Google research and global university relations programs,&lt;/li&gt;
&lt;li&gt;Dr. Boon-Lock Yeo, head of engineering and research for Google China, providing an overview of innovation in China engineering and corporate social responsibility efforts and accomplishments, and&lt;/li&gt;
&lt;li&gt;Prof. Edward Chang, director of research for Google China, delivering a keynote on mobile information management and retrieval.&lt;/li&gt;
&lt;/ul&gt;&lt;br /&gt;
The discussions on November 17 focused on two tracks, mobile computing and natural language processing, while discussions on November 18 focused on curriculum development with a special focus on Android app development. The attendees also spent time discussing joint research and development between universities and industry.&lt;br /&gt;
&lt;br /&gt;
This summit is part of a continuing effort to collaborate with Chinese universities in order to support education in China. Click &lt;a href="http://www.google.com/intl/zh-CN/corporate/university/en/index.html"&gt;here&lt;/a&gt; for a list of the variety of education programs we have launched there in recent years. We look forward to expanding partnership opportunities in the future.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-621494373520213050?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=Yz1fuPessVA:cC1ecfj2iJo:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/Yz1fuPessVA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/621494373520213050/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=621494373520213050" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/621494373520213050?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/621494373520213050?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/Yz1fuPessVA/2011-google-china-faculty-summit-in.html" title="2011 Google China Faculty Summit in Hangzhou" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/12/2011-google-china-faculty-summit-in.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkMCQHw5eip7ImA9WhRRFk0.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-243460295690219074</id><published>2011-11-29T14:07:00.000-08:00</published><updated>2011-11-29T14:07:41.222-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-11-29T14:07:41.222-08:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="datasets" /><title>More Google Cluster Data</title><content type="html">&lt;span class="byline-author"&gt;Posted by John Wilkes, Principal Software Engineer&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
Google has a strong interest in promoting high quality systems research, and we believe that providing information about real-life workloads to the academic community can help.&lt;br /&gt;
&lt;br /&gt;
In support of this we published a small (7-hour) sample of resource-usage information from a Google production cluster in 2010 (&lt;a href="http://googleresearch.blogspot.com/2010/01/google-cluster-data.html"&gt;research blog on Google Cluster Data&lt;/a&gt;).  Approximately a dozen researchers at UC Berkeley, CMU, Brown, NCSU, and elsewhere have made use of it.&lt;br /&gt;
&lt;br /&gt;
Recently, we released a larger dataset.  It covers a longer period of time (29 days) for a larger cell (about 11k machines) and includes significantly more information, including:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;the original resource requests, to permit scheduling experiments&lt;/li&gt;
&lt;li&gt;request constraints and machine attributes&lt;/li&gt;
&lt;li&gt;machine availability and failure events&lt;/li&gt;
&lt;li&gt;some of the reasons for task exits&lt;/li&gt;
&lt;li&gt;(obfuscated) job and job-submitter names, to help identify repeated or related jobs&lt;/li&gt;
&lt;li&gt;more types of usage information&lt;/li&gt;
&lt;li&gt;CPI (cycles per instruction) and memory traffic for some of the machines&lt;/li&gt;
&lt;/ul&gt;
&lt;br /&gt;
&lt;br /&gt;
Note that this trace primarily provides data about resource requests and usage.  It contains no information about end users, their data, or access patterns to storage systems and other services.&lt;br /&gt;
&lt;br /&gt;
More information can be found via &lt;a href="http://goo.gl/GIDUh"&gt;this link&lt;/a&gt;, which will (after a short questionnaire) take you to a site that provides access instructions, a description of the data schema, and information about how the data was derived and its meaning.&lt;br /&gt;
&lt;br /&gt;
We hope this data will facilitate a range of research in cluster management. Let us know if you find it useful, are willing to share tools that analyze it, or have suggestions for how to improve it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-243460295690219074?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=g4Rk64O3noo:Cn87IXccR0o:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/g4Rk64O3noo" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/243460295690219074/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=243460295690219074" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/243460295690219074?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/243460295690219074?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/g4Rk64O3noo/more-google-cluster-data.html" title="More Google Cluster Data" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/12098626514775266161</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/11/more-google-cluster-data.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkMGRn84cCp7ImA9WhRTEkk.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-5952642640319180625</id><published>2011-11-02T08:00:00.000-07:00</published><updated>2011-11-02T08:40:27.138-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-11-02T08:40:27.138-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="YouTube" /><category scheme="http://www.blogger.com/atom/ns#" term="Machine Hearing" /><title>Discovering Talented Musicians with Acoustic Analysis</title><content type="html">Posted by Charles DuHadway, YouTube Slam Team, Google Research &lt;br /&gt;
&lt;br /&gt;
In an &lt;a href="http://googleresearch.blogspot.com/2011/06/instant-mix-for-music-beta-by-google.html"&gt;earlier post&lt;/a&gt; we talked about the technology behind Instant Mix for &lt;a href="http://music.google.com/"&gt;Music Beta by Google&lt;/a&gt;. Instant Mix uses machine hearing to characterize musical attributes such as timbre, mood and tempo. Today we would like to talk about acoustic and visual analysis -- this time on YouTube. A fundamental part of YouTube's mission is to allow anyone anywhere to showcase their talents -- occasionally leading to &lt;a href="http://www.youtube.com/watch?v=eQOFRZ1wNLw"&gt;life-changing success&lt;/a&gt; -- but many talented performers are never discovered. Part of the problem is the sheer volume of videos: forty-eight hours of video are uploaded to YouTube every minute (that’s eight years of content every day). We wondered whether we could use acoustic analysis and machine learning to pore over these videos and automatically identify talented musicians.&lt;br /&gt;
&lt;br /&gt;
First we analyzed audio and visual features of videos being uploaded. We wanted to find “singing at home” videos -- often correlated with features such as ambient indoor lighting, a head-and-shoulders view of a person singing in front of a fixed camera, few instruments and often a single dominant voice. Here’s a sample set of videos we found.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://2.bp.blogspot.com/-uIV4jRrylSI/TrB14eH590I/AAAAAAAAASc/xrHHAIj50X0/s1600/cover_wall_short.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/-uIV4jRrylSI/TrB14eH590I/AAAAAAAAASc/xrHHAIj50X0/s1600/cover_wall_short.png" /&gt;&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
Then we estimated the quality of singing in each video. Our approach is based on acoustic analysis similar to that used by Instant Mix, coupled with a small set of singing quality annotations from human raters. Given these data, we used machine learning to build a ranker that predicts whether an average listener would like a performance. &lt;br /&gt;
&lt;br /&gt;
While machines are useful for weeding through thousands of not-so-great videos to find potential stars, we know they alone can't pick the next great star. So we turn to YouTube users to help us identify the real hidden gems by playing a voting game called &lt;a href="http://www.youtube.com/slam"&gt;YouTube Slam&lt;/a&gt;. We're putting an equal amount of effort into the game itself -- how do people vote? What makes it fun? How do we know when we have a true hit? We're looking forward to your feedback to help us refine this process: &lt;a href="http://www.youtube.com/slam/music/vote"&gt;give it a try&lt;/a&gt;*.  You can also check out singer and voter &lt;a href="http://www.youtube.com/slam/music"&gt;leaderboards&lt;/a&gt;. Toggle “All time” to “Last week” to find emerging talent in fresh videos or all-time favorites. &lt;br /&gt;
&lt;br /&gt;
Our “Music Slam” has been running for only a few weeks and we have already found some very talented musicians. Many of the videos have fewer than 100 views when we find them.&lt;br /&gt;
&lt;br /&gt;
&lt;div&gt;
&lt;table&gt;
&lt;tr&gt;&lt;td&gt;&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://2.gvt0.com/vi/ZMZr83rwdNI/0.jpg" height="266" width="320"&gt;
&lt;param name="movie" value="http://www.youtube.com/v/ZMZr83rwdNI&amp;fs=1&amp;source=uds" /&gt;
&lt;param name="bgcolor" value="#FFFFFF" /&gt;
&lt;embed width="320" height="266"  src="http://www.youtube.com/v/ZMZr83rwdNI&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/td&gt;
&lt;td&gt;&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://3.gvt0.com/vi/OY2vWMtSsIM/0.jpg" height="266" width="320"&gt;
&lt;param name="movie" value="http://www.youtube.com/v/OY2vWMtSsIM&amp;fs=1&amp;source=uds" /&gt;
&lt;param name="bgcolor" value="#FFFFFF" /&gt;
&lt;embed width="320" height="266"  src="http://www.youtube.com/v/OY2vWMtSsIM&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://0.gvt0.com/vi/wRCPCNtViGA/0.jpg" height="266" width="320"&gt;
&lt;param name="movie" value="http://www.youtube.com/v/wRCPCNtViGA&amp;fs=1&amp;source=uds" /&gt;
&lt;param name="bgcolor" value="#FFFFFF" /&gt;
&lt;embed width="320" height="266"  src="http://www.youtube.com/v/wRCPCNtViGA&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/td&gt;
&lt;td&gt;&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://2.gvt0.com/vi/gBfynvifkOY/0.jpg" height="266" width="320"&gt;
&lt;param name="movie" value="http://www.youtube.com/v/gBfynvifkOY&amp;fs=1&amp;source=uds" /&gt;
&lt;param name="bgcolor" value="#FFFFFF" /&gt;
&lt;embed width="320" height="266"  src="http://www.youtube.com/v/gBfynvifkOY&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://2.gvt0.com/vi/LVFe6P-C7iY/0.jpg" height="266" width="320"&gt;
&lt;param name="movie" value="http://www.youtube.com/v/LVFe6P-C7iY&amp;fs=1&amp;source=uds" /&gt;
&lt;param name="bgcolor" value="#FFFFFF" /&gt;
&lt;embed width="320" height="266"  src="http://www.youtube.com/v/LVFe6P-C7iY&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/td&gt;
&lt;td&gt;&lt;object class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://3.gvt0.com/vi/Qh-4qF07V1s/0.jpg" height="266" width="320"&gt;
&lt;param name="movie" value="http://www.youtube.com/v/Qh-4qF07V1s&amp;fs=1&amp;source=uds" /&gt;
&lt;param name="bgcolor" value="#FFFFFF" /&gt;
&lt;embed width="320" height="266"  src="http://www.youtube.com/v/Qh-4qF07V1s&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;
&lt;/div&gt;


&lt;br /&gt;
&lt;br /&gt;
And while we're excited about what we've done with music, there's as much undiscovered potential in almost any subject you can think of. Try our other slams: &lt;a href="http://www.youtube.com/slam/cute"&gt;cute&lt;/a&gt;, &lt;a href="http://www.youtube.com/slam/bizarre"&gt;bizarre&lt;/a&gt;, &lt;a href="http://www.youtube.com/slam/comedy"&gt;comedy&lt;/a&gt;, and &lt;a href="http://www.youtube.com/slam/dance"&gt;dance&lt;/a&gt;*. Enjoy!&lt;br /&gt;
&lt;br /&gt;
Related work by Google Researchers:&lt;br /&gt;
“&lt;a href="http://research.google.com/pubs/pub35638.html"&gt;Video2Text: Learning to Annotate Video Content&lt;/a&gt;”, &lt;a href="http://research.google.com/pubs/author37818.html"&gt;Hrishikesh Aradhye&lt;/a&gt;, &lt;a href="http://research.google.com/pubs/author38233.html"&gt;George Toderici&lt;/a&gt;, &lt;a href="http://research.google.com/pubs/author36197.html"&gt;Jay Yagnik&lt;/a&gt;, ICDM Workshop on Internet Multimedia Mining, 2009.&lt;br /&gt;
&lt;br /&gt;
* Music and dance slams are currently available only in the US.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-5952642640319180625?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=qBMF3t_cFq0:fle4X7ngyJ8:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/qBMF3t_cFq0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/5952642640319180625/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=5952642640319180625" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/5952642640319180625?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/5952642640319180625?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/qBMF3t_cFq0/discovering-talented-musicians-with.html" title="Discovering Talented Musicians with Acoustic Analysis" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-uIV4jRrylSI/TrB14eH590I/AAAAAAAAASc/xrHHAIj50X0/s72-c/cover_wall_short.png" height="72" width="72" /><thr:total>2</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/11/discovering-talented-musicians-with.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEMFRno7cCp7ImA9WhdUEk4.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-751723561172313008</id><published>2011-09-28T12:00:00.000-07:00</published><updated>2011-09-28T12:00:17.408-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-28T12:00:17.408-07:00</app:edited><title>Fresh Perspectives about People and the Web from Think Quarterly</title><content type="html">Posted by Allison Mooney, Christina Park, and Caroline 
McCarthy, The Think Quarterly Team&lt;br /&gt;
&lt;br /&gt;
There’s a lot of research, analysis and insights—from inside and outside Google—that we use in building our products and making decisions. To share what we’ve learned with our partners, we created &lt;a href="http://www.google.com/url?q=http%3A%2F%2Fwww.thinkwithgoogle.com%2Fquarterly%23utm_medium%3DBlogs%26utm_campaign%3DGoogle%2BResearch%2BBlog%26utm_source%3DGoogle"&gt;Think Quarterly&lt;/a&gt;. It’s intended to be a snapshot of what Google and other industry leaders are talking about and inspired by right now.&lt;br /&gt;
&lt;br /&gt;
Today we’re launching our second edition, the &lt;a href="http://www.google.com/url?q=http%3A%2F%2Fwww.thinkwithgoogle.com%2Fquarterly%23utm_medium%3DBlogs%26utm_campaign%3DGoogle%2BResearch%2BBlog%26utm_source%3DGoogle"&gt;“People” issue&lt;/a&gt;, exploring the latest technologies connecting us and the big ideas driving society forward. It also includes some of the research and analysis that helps us shape our strategies. &lt;br /&gt;
&lt;br /&gt;
For those who love data as much as we do, here are a few articles worth reading:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;“Following Generation Z,” in which Google research scientist Ed Chi details what he’s learned from monitoring the course of digital innovation and mapping patterns of digital technology use in the future&lt;/li&gt;
&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;“Predicting the Present,” by chief economist Hal Varian, about how publicly available search tools can help anyone gain valuable insights into the behavior of web users and predict what they might do next&lt;/li&gt;
&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;“Power to the People,” by Meg Pickard, anthropologist turned head of digital engagement at Guardian News and Media, about tracking the influence and power of online communities&lt;/li&gt;
&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;“From Cash to Contentment,” about the use of happiness as a metric of success, with insights from Nobel Prize winner Joseph Stiglitz&lt;/li&gt;
&lt;/ul&gt;&lt;br /&gt;
&lt;a href="http://www.google.com/url?q=http%3A%2F%2Fwww.thinkwithgoogle.com%2Fquarterly%23utm_medium%3DBlogs%26utm_campaign%3DGoogle%2BResearch%2BBlog%26utm_source%3DGoogle"&gt;Click here&lt;/a&gt; to read all the articles, and if you have a suggestion for our next issue please tell us &lt;a href="https://services.google.com/fb/forms/contactthinkquarterlyus/#utm_medium=Blogs&amp;amp;utm_campaign=Google+Research+Blog&amp;amp;utm_source=Google"&gt;here&lt;/a&gt;. We hope you enjoy (and +1) it!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-751723561172313008?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=poGDAEWLRow:aVEfDveGVVI:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/poGDAEWLRow" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/751723561172313008/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=751723561172313008" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/751723561172313008?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/751723561172313008?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/poGDAEWLRow/fresh-perspectives-about-people-and-web.html" title="Fresh Perspectives about People and the Web from Think Quarterly" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/09/fresh-perspectives-about-people-and-web.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DU8AQ305eCp7ImA9WhdUEUs.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1038159944734838452</id><published>2011-09-27T15:11:00.000-07:00</published><updated>2011-09-27T16:57:22.320-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-27T16:57:22.320-07:00</app:edited><title>Trying on the new Dynamic Views from Blogger</title><content type="html">&lt;span class="post-author"&gt;Posted by Alison Powell, Google Research Team&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
As you may have noticed, the Google Research blog looks a lot different today. That’s because we—along with a few other Google blogs—are trying out a new set of &lt;a href="http://www.blogger.com/"&gt;Blogger&lt;/a&gt; templates called Dynamic Views.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://buzz.blogger.com/2011/09/dynamic-views-seven-new-ways-to-share.html"&gt;Launched today&lt;/a&gt;, Dynamic Views is a unique browsing experience that makes it easier and faster for readers to explore blogs in interactive ways. We’re using the Magazine view, but you can also preview this blog in any of the other six new views by using the view selection bar at the top left of the screen.&lt;br /&gt;
&lt;br /&gt;
&lt;iframe allowfullscreen="" frameborder="0" height="284" src="http://www.youtube.com/embed/lpDQF2lFnBU" width="500"&gt;&lt;/iframe&gt;&lt;br /&gt;
&lt;br /&gt;
We’re eager to hear what you think about the new Dynamic Views. You can submit feedback using the “Send feedback” link on the bottom right of this page.&lt;br /&gt;
&lt;br /&gt;
If you like what you see here, and we hope you do, we encourage you to try out the new look(s) on your own blog—read the &lt;a href="http://buzz.blogger.com/2011/09/dynamic-views-seven-new-ways-to-share.html"&gt;Blogger Buzz post&lt;/a&gt;&amp;nbsp;for more info.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1038159944734838452?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=j_k-5gfUjr8:AWdx0849VJ8:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/j_k-5gfUjr8" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1038159944734838452/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1038159944734838452" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1038159944734838452?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1038159944734838452?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/j_k-5gfUjr8/trying-on-new-dynamic-views-from.html" title="Trying on the new Dynamic Views from Blogger" /><author><name>A Googler</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://img.youtube.com/vi/lpDQF2lFnBU/default.jpg" height="72" width="72" /><thr:total>4</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/09/trying-on-new-dynamic-views-from.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Dk8HQH0_eyp7ImA9WhdWFEQ.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1105872079913092982</id><published>2011-09-07T16:50:00.000-07:00</published><updated>2011-09-08T08:13:51.343-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-08T08:13:51.343-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="MapReduce" /><title>Sorting Petabytes with MapReduce - The Next Episode</title><content type="html">&lt;span itemscope itemtype="http://schema.org/Article"&gt;&lt;meta itemprop="name" content="Sorting Petabytes with MapReduce - The Next 
Episode"&gt;&lt;meta itemprop="description" content="Sorting a ten petabyte input set took 6 hours and 27 minutes to complete on 8000 computers. We are not aware of any other sorting experiment successfully completed at this scale."&gt;&lt;/span&gt;

&lt;span class="byline-author"&gt;Posted by Grzegorz Czajkowski, Marián Dvorský, Jerry Zhao, and Michael Conley, Systems Infrastructure&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
Almost three years ago we announced &lt;a href="http://googleblog.blogspot.com/2008/11/sorting-1pb-with-mapreduce.html"&gt;results of the first-ever "petasort"&lt;/a&gt; (sorting a petabyte's worth of 100-byte records, following the &lt;a href="http://sortbenchmark.org/"&gt;Sort Benchmark&lt;/a&gt; rules). It completed in just over six hours on 4000 computers. Recently we repeated the experiment using 8000 computers. The execution time was 33 minutes, an order-of-magnitude improvement.&lt;br /&gt;
&lt;br /&gt;
Our sorting code is based on &lt;a href="http://labs.google.com/papers/mapreduce.html"&gt;MapReduce&lt;/a&gt;, which is a key framework for running multiple processes simultaneously at Google. Thousands of applications, supporting most services offered by Google, have been expressed in MapReduce. Not many MapReduce applications operate at petabyte scale, but some do, and their scale is likely to continue growing quickly. The need to help such applications scale motivated us to experiment with data sets larger than one petabyte. In particular, sorting a ten-petabyte input set took 6 hours and 27 minutes to complete on 8000 computers. We are not aware of any other sorting experiment successfully completed at this scale.&lt;br /&gt;
&lt;br /&gt;
We are excited by these results.  While internal improvements to the MapReduce framework contributed significantly, a large part of the credit goes to numerous advances in Google's hardware, cluster management system, and storage stack.  &lt;br /&gt;
&lt;br /&gt;
What would it take to scale MapReduce by further orders of magnitude and make processing of such large data sets efficient and easy? One way to find out is to join Google’s systems infrastructure team. If you have a passion for distributed computing, are an expert or plan to become one, and feel excited about the challenges of exascale, then definitely consider applying for a &lt;a href="http://www.google.com/intl/en/jobs/swe/#src=storageresearchblog"&gt;software engineering position&lt;/a&gt; with Google.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1105872079913092982?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=KWMkx2i3dyU:rrVpIrkO1Rw:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/KWMkx2i3dyU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1105872079913092982/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1105872079913092982" title="8 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1105872079913092982?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1105872079913092982?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/KWMkx2i3dyU/sorting-petabytes-with-mapreduce-next.html" title="Sorting Petabytes with MapReduce - The Next Episode" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>8</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/09/sorting-petabytes-with-mapreduce-next.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkACSXk-eCp7ImA9WhdXEEg.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-9079268293716491067</id><published>2011-08-22T15:06:00.000-07:00</published><updated>2011-08-22T15:06:08.750-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-08-22T15:06:08.750-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="jsm2011" /><category scheme="http://www.blogger.com/atom/ns#" term="jsm" /><category scheme="http://www.blogger.com/atom/ns#" term="conferences" /><title>Google at the Joint Statistical Meetings in Miami</title><content 
type="html">Posted by Marianna Dizik, Statistician&lt;br /&gt;
&lt;br /&gt;
The Joint Statistical Meetings (JSM) were held in Miami, Florida, this year. Nearly 5,000 participants from academia and industry came to present and discuss the latest in statistical research, methodology, and applications. As in previous years, several Googlers shared expertise in large-scale experimental design and implementation, statistical inference with massive datasets, forecasting, data mining, parallel computing, and much more.&lt;br /&gt;
&lt;br /&gt;
Our session "Statistics: The Secret Weapon of Successful Web Giants" attracted over one hundred people, which is surprising for an 8:30 AM session! Revolution Analytics reviewed it in their official blog post &lt;a href="http://blog.revolutionanalytics.com/2011/08/google-r-effective-ads.html"&gt;"How Google uses R to make online advertising more effective"&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
The following talks were given by Googlers at JSM 2011.  Please check the upcoming Proceedings of the JSM 2011 for the full papers.&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.amstat.org/meetings/jsm/2011/onlineprogram/AbstractDetails.cfm?abstractid=300146"&gt;Statistical Plumbing: Effective use of classical statistical methods for large scale applications&lt;/a&gt;&lt;/li&gt;
&lt;ul&gt;&lt;li&gt;Author(s): Ni Wang, Yong Li, Daryl Pregibon, and Rachel Schutt&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;&lt;a href="http://www.amstat.org/meetings/jsm/2011/onlineprogram/AbstractDetails.cfm?abstractid=300570"&gt;Parallel Computations in R, with Applications for Statistical Forecasting&lt;/a&gt;&lt;/li&gt;
&lt;ul&gt;&lt;li&gt;Author(s): Murray Stokely, Farzan Rohani, and Eric Tassone&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;&lt;a href="http://www.amstat.org/meetings/jsm/2011/onlineprogram/AbstractDetails.cfm?abstractid=300008"&gt;Conditional Regression Models&lt;/a&gt;&lt;/li&gt;
&lt;ul&gt;&lt;li&gt;Author(s): William D. Heavlin&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;&lt;a href="http://www.amstat.org/meetings/jsm/2011/onlineprogram/AbstractDetails.cfm?abstractid=300051"&gt;The Effectiveness of Display Ads&lt;/a&gt;&lt;/li&gt;
&lt;ul&gt;&lt;li&gt;Author(s): Tim Hesterberg, Diane Lambert, David X. Chan, Or Gershony, and Rong Ge&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;&lt;a href="http://www.amstat.org/meetings/jsm/2011/onlineprogram/AbstractDetails.cfm?abstractid=300257"&gt;Measuring Ad Effectiveness Using Continuous Geo Experiments&lt;/a&gt;&lt;/li&gt;
&lt;ul&gt;&lt;li&gt;Author(s): Jon Vaver, Deepak Kumar, and Jim Koehler&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;&lt;a href="http://www.amstat.org/meetings/jsm/2011/onlineprogram/AbstractDetails.cfm?abstractid=300200"&gt;Post-Stratification and Network Sampling&lt;/a&gt;&lt;/li&gt;
&lt;ul&gt;&lt;li&gt;Author(s): Rachel Schutt, Andrew Gelman, and Tyler McCormick&lt;/li&gt;
&lt;/ul&gt;&lt;/ul&gt;Google has participated in JSM each year since 2004. We have been increasing our involvement significantly by providing sponsorship, organizing and giving talks at sessions and roundtables, teaching courses and workshops, hosting a booth with demos of new Google products, submitting posters, and more. This year Googlers participated in sessions sponsored by ASA sections for Statistical Learning and Data Mining, Statistics and Marketing, Statistical Computing, Bayesian Statistical Science, Health Policy Statistics, Statistical Graphics, Quality and Productivity, Physical and Engineering Sciences, and Statistical Education.&lt;br /&gt;
&lt;br /&gt;
We also hosted the Google faculty reception, which was well attended by faculty and their promising students. Google hires a growing number of statisticians and we were happy to participate in JSM again this year. People had a chance to talk to Googlers, ask about working here, encounter elements of Google culture (good food! T-shirts! 3D puzzles!), meet old friends and make new ones, and just have fun!&lt;br /&gt;
&lt;br /&gt;
Thanks to everyone who presented, attended, or otherwise engaged with the statistical community at JSM this year. We’re looking forward to seeing you in San Diego next year.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-9079268293716491067?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=BtFffo4zB3M:8ThloEE_ch8:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/BtFffo4zB3M" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/9079268293716491067/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=9079268293716491067" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/9079268293716491067?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/9079268293716491067?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/BtFffo4zB3M/google-at-joint-statistical-meetings-in.html" title="Google at the Joint Statistical Meetings in Miami" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/08/google-at-joint-statistical-meetings-in.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUIHQXYyfyp7ImA9WhdQFUw.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-5622208624483313484</id><published>2011-08-16T09:00:00.000-07:00</published><updated>2011-08-16T10:58:50.897-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-08-16T10:58:50.897-07:00</app:edited><title>A new MIT center for mobile learning, with support from Google</title><content type="html">Posted by Hal Abelson, Professor of Computer Science and Engineering, MIT&lt;br /&gt;
&lt;br /&gt;
MIT and Google have a &lt;a href="http://googleblog.blogspot.com/2011/05/celebrating-150-years-of-mit.html"&gt;long-standing relationship&lt;/a&gt; based on mutual interests in education and technology. Today, we took another step forward in our shared goals with the establishment of the MIT Center for Mobile Learning, which will strive to transform learning and education through innovation in mobile computing. The new center will be actively engaged in studying and extending &lt;a href="http://appinventor.googlelabs.com/about/"&gt;App Inventor for Android&lt;/a&gt;, which Google recently announced it will be open sourcing.&lt;br /&gt;
&lt;br /&gt;
The new center, housed at MIT’s Media Lab, will focus on designing and studying new mobile technologies that enable people to learn anywhere, anytime, with anyone. The center was made possible in part by support from &lt;a href="http://research.google.com/university/"&gt;Google University Relations&lt;/a&gt; and will be run by me and two distinguished MIT colleagues: Professors Eric Klopfer (science education) and Mitchel Resnick (media arts and sciences).&lt;br /&gt;
&lt;br /&gt;
App Inventor for Android—a programming system that makes it easy for learners to create mobile apps for Android smartphones—currently supports a community of about 100,000 educators, students and hobbyists. Through the new initiatives at the MIT Center for Mobile Learning, App Inventor will be connected to MIT’s premier research in educational technology and MIT’s long track record of creating and supporting open software.&lt;br /&gt;
&lt;br /&gt;
Google first launched App Inventor internally in order to move it forward with speed and focus, and then developed it to a point where it started to gain critical mass. Now, its impact can be amplified by collaboration with a top academic institution. At MIT, App Inventor will adopt an enriched research agenda with increased opportunities to influence the educational community. In a way, App Inventor has now come full circle, as I actually initiated App Inventor at Google by proposing it as a project during my sabbatical with the company in 2008. The core code for App Inventor came from Eric Klopfer’s &lt;a href="http://education.mit.edu/projects"&gt;lab&lt;/a&gt;, and the inspiration came from Mitch Resnick’s &lt;a href="http://scratched.media.mit.edu/"&gt;Scratch project&lt;/a&gt;. The new center is a perfect example of how industry and academia can collaborate effectively to create change enabled by technology, and we look forward to seeing what we can do next, together.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-5622208624483313484?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=OMjMMSsNcYA:DqLsiifad9U:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/OMjMMSsNcYA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/5622208624483313484/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=5622208624483313484" title="20 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/5622208624483313484?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/5622208624483313484?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/OMjMMSsNcYA/new-mit-center-for-mobile-learning-with.html" title="A new MIT center for mobile learning, with support from Google" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>20</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/08/new-mit-center-for-mobile-learning-with.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEUMQ307cCp7ImA9WhdQEUo.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-8929157003305149385</id><published>2011-08-12T11:00:00.000-07:00</published><updated>2011-08-12T11:04:42.308-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-08-12T11:04:42.308-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Education" /><title>Our Faculty Institute brings faculty back to the drawing board</title><content type="html">Posted by Nina Kim Schultz, Google Education Research&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Cross-posted with the &lt;a href="http://googleblog.blogspot.com/2011/08/faculty-institute-brings-faculty-back.html"&gt;Official Google Blog&lt;/a&gt;&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
School may still be out for summer, but teachers remain hard at work. This week, we hosted Google’s inaugural Faculty Institute at our Mountain View, Calif. headquarters. The three-day event was created for esteemed faculty from schools of education and math and science to explore teaching paradigms that leverage technology in K-12 classrooms. Selected via a rigorous nomination and application process, the 39 faculty members hail from 19 California State Universities (CSUs), as well as Stanford and UC Berkeley, and teach high school STEM (Science, Technology, Engineering and Math) teachers currently getting their teaching credentials. CSU programs credential 60 percent of California’s teachers—or 10 percent of all U.S. K-12 teachers—and one CSU campus alone can credential around 1,000 new teachers in a year. The purpose of gathering together at the Institute was to ensure our teachers’ teachers have the support they need to help educators adjust to a changing landscape.&lt;br /&gt;
&lt;br /&gt;
There is so much technology available to educators today, but unless they learn how to use it effectively, it does little to change what is happening in our classrooms. Without the right training and inspiration, interactive displays become merely expensive projection screens, and laptops simply replace paper rather than shifting the way teachers teach and students learn. Although the possibilities for technology use in schools are endless, teacher preparation for the 21st century classroom also has many constraints. For example: beyond the expense involved, there’s the time it costs educators to match a technological innovation to the improvement of pedagogy and curriculum; there’s a distinct shift in thinking that needs to take place to change classrooms; and there’s an essential challenge to help teachers develop the dispositions and confidence to be lifelong evaluators, learners and teachers of technology, instead of continuing to rely on traditional skill sets that will soon be outdated.&lt;br /&gt;
&lt;br /&gt;
The Institute featured keynote addresses from respected professors from Stanford and Berkeley, case studies from distinguished high school teachers from across California, hands-on technology workshops with a variety of Google and non-Google tools, and panels with professionals in the tech-education industry. Notable guests included representatives from &lt;a href="http://www.teachforamerica.org/"&gt;Teach for America&lt;/a&gt;, &lt;a href="http://tntp.org/"&gt;The New Teacher Project&lt;/a&gt;, the &lt;a href="http://www.ed.gov/"&gt;Department of Education&lt;/a&gt; and &lt;a href="http://www.edutopia.org/"&gt;Edutopia&lt;/a&gt;. Topics included how to distinguish learning paths, how to use technology to transform classrooms into project-based, collaborative spaces, and how to adopt a more interactive teaching style in place of the traditional lecture model.&lt;br /&gt;
&lt;br /&gt;
On the last day of the Institute, faculty members were invited to submit grant proposals to scale best practices outside of the meeting. Deans of the participating universities will convene at the end of the month to further brainstorm ways to scale new ideas in teacher preparation programs. Congratulations to all of the faculty members who were accepted into the inaugural Institute, and thank you for all that you do to help bring technology and new ways of thinking into the classroom. &lt;br /&gt;
&lt;br /&gt;
&lt;div style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-_JrercX4bfo/TkVnu72IZwI/AAAAAAAAIYs/6ear903_-80/s1600/Faculty+Institute.jpeg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/-_JrercX4bfo/TkVnu72IZwI/AAAAAAAAIYs/6ear903_-80/Faculty+Institute.jpeg" width="500" /&gt;&lt;/a&gt;
This program is a part of Google’s continued commitment to supporting STEM education.  Details on our other programs can be found on &lt;a href="http://www.google.com/education"&gt;www.google.com/education&lt;/a&gt;.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-8929157003305149385?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=Ma_lx8JvQhE:ruwNt6-y978:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/Ma_lx8JvQhE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/8929157003305149385/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=8929157003305149385" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/8929157003305149385?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/8929157003305149385?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/Ma_lx8JvQhE/our-faculty-institute-brings-faculty.html" title="Our Faculty Institute brings faculty back to the drawing board" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-_JrercX4bfo/TkVnu72IZwI/AAAAAAAAIYs/6ear903_-80/s72-c/Faculty+Institute.jpeg" height="72" width="72" /><thr:total>1</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/08/our-faculty-institute-brings-faculty.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEENQH8_eyp7ImA9WhdQEEw.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-6009201590116295031</id><published>2011-08-10T15:51:00.000-07:00</published><updated>2011-08-10T15:51:31.143-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-08-10T15:51:31.143-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Google Books" /><category scheme="http://www.blogger.com/atom/ns#" term="Ngram" 
/><title>Culturomics, Ngrams and new power tools for Science</title><content type="html">&lt;span class="byline-author"&gt;Posted by Erez Lieberman Aiden and Jean-Baptiste Michel, Visiting Faculty at Google&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
Four years ago, we set out to create a research engine that would help people explore our cultural history by statistically analyzing the world’s books. In January 2011, the resulting method, &lt;a href="http://www.culturomics.org/"&gt;culturomics&lt;/a&gt;, was featured on the cover of the journal &lt;i&gt;&lt;a href="http://www.sciencemag.org/content/331/6014/176"&gt;Science&lt;/a&gt;&lt;/i&gt;. More importantly, Google implemented and launched a web-based version of our prototype research engine, the Google Books Ngram Viewer.&lt;br /&gt;
&lt;br /&gt;
Now scientists, scholars, and web surfers around the world can take advantage of the Ngram Viewer to study a vast array of phenomena. And that's exactly what they've done. Here are a few of our favorite examples.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Poverty&lt;/b&gt;&lt;br /&gt;
Martin Ravallion, head of the Development Research Group at the World Bank, has been using the ngrams to study the history of poverty. In a &lt;a href="http://www.psocommons.org/ppp/vol3/iss2/art2/"&gt;paper&lt;/a&gt; published in the journal Poverty and Public Policy, he argues for the existence of two ‘poverty enlightenments’ marked by increased awareness of the problem: one towards the end of the 18th century, and another in the 1970s and 80s. But he makes the point that only the second of these enlightenments brought with it a truly enlightened idea: that poverty can be and should be completely &lt;a href="http://ngrams.googlelabs.com/graph?content=eradicate+poverty&amp;amp;year_start=1800&amp;amp;year_end=2000&amp;amp;corpus=0&amp;amp;smoothing=3"&gt;eradicated&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://ngrams.googlelabs.com/chart?content=eradicate%20poverty&amp;amp;corpus=0&amp;amp;smoothing=3&amp;amp;year_start=1800&amp;amp;year_end=2000"&gt;&lt;img alt="" border="0" src="http://ngrams.googlelabs.com/chart?content=eradicate%20poverty&amp;amp;corpus=0&amp;amp;smoothing=3&amp;amp;year_start=1800&amp;amp;year_end=2000" style="cursor: hand; cursor: pointer; float: center; height: 247px; margin: 0 10px 10px 0; width: 550px;" /&gt;&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;The Science Hall of Fame&lt;/b&gt;&lt;br /&gt;
Adrian Veres and John Bohannon wondered who the most famous scientists of the past two centuries were. But there was no hall of fame for scientists, or a committee that determines who deserves to get into such a hall. So they used the ngrams data to define a metric for celebrity – the milliDarwin – and algorithmically created a &lt;a href="http://www.sciencemag.org/site/feature/misc/webfeat/gonzoscientist/episode14/index.xhtml"&gt;Science Hall of Fame&lt;/a&gt; listing the most famous scientists born since 1800. They found that things like a popular book or a major controversy did more to increase discussion of a scientist than, for instance, winning a Nobel Prize.&lt;br /&gt;
&lt;br /&gt;
(Other users have been exploring the history of particular sciences with the Ngram Viewer, covering everything from &lt;a href="http://egosumdaniel.blogspot.com/2011/02/brief-history-of-neuroscience-in-google.html"&gt;neuroscience&lt;/a&gt; to the &lt;a href="http://www.theatlantic.com/technology/archive/2011/03/the-nuclear-century-in-google-ngrams/72461/"&gt;nuclear&lt;/a&gt; age.)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;b style="font-weight: bold;"&gt;The History of Typography&lt;/b&gt;&lt;br /&gt;
When we introduced the Ngram Viewer, we pointed out some potential pitfalls with the data. For instance, the ‘medial s’ ( ſ ), an older form of the letter s that looked like an integral sign and appeared at the beginning or in the middle of words, tends to be classified as an instance of the letter ‘f’ by the OCR algorithm used to create our version of the data. Andrew West, blogging at &lt;a href="http://babelstone.blogspot.com/2006/06/rules-for-long-s.html"&gt;Babelstone&lt;/a&gt;, found a clever way to exploit this error: using queries like ‘husband’ and ‘hufband’ to study the history of medial s typography, he pinned down when the medial s disappeared from English (around 1800), French (1780), and Spanish (1760).&lt;br /&gt;
&lt;br /&gt;
People are clearly having a good time with the Ngram Viewer, and they have been learning a few things about science and history in the process. Indeed, the tool has proven so popular and so useful that Google recently announced its imminent graduation from Google Labs to become a permanent part of Google Books.&lt;br /&gt;
&lt;br /&gt;
Similar ‘big data’ approaches can also be applied to a wide variety of other problems. From books to maps to the structure of the web itself, 'the world's information' is one amazing dataset. &lt;br /&gt;
&lt;i&gt;&lt;br /&gt;
Erez Lieberman Aiden is Visiting Faculty at Google and a Fellow of the Harvard Society of Fellows. Jean-Baptiste Michel is Visiting Faculty at Google and a Postdoctoral Fellow in Harvard's Department of Psychology.&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-6009201590116295031?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=DDbuOHzqXfE:-wr_VjdKfSE:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/DDbuOHzqXfE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/6009201590116295031/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=6009201590116295031" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/6009201590116295031?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/6009201590116295031?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/DDbuOHzqXfE/culturomics-ngrams-and-new-power-tools.html" title="Culturomics, Ngrams and new power tools for Science" /><author><name>Research Admin</name><uri>http://www.blogger.com/profile/00043158880867757514</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/08/culturomics-ngrams-and-new-power-tools.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D04HSH89fCp7ImA9WhdSGEo.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-6244240591590895894</id><published>2011-07-28T10:58:00.000-07:00</published><updated>2011-07-28T10:58:59.164-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-28T10:58:59.164-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Policy" /><category scheme="http://www.blogger.com/atom/ns#" term="Structured Data" /><category scheme="http://www.blogger.com/atom/ns#" term="Government" /><category scheme="http://www.blogger.com/atom/ns#" term="Fusion Tables" /><title>President's Council Recommends Open Data for Federal 
Agencies</title><content type="html">&lt;span class="byline-author"&gt;Posted by Alon Halevy, Senior Staff Research Scientist&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Cross-posted with the &lt;a href="http://googlepublicsector.blogspot.com/2011/07/presidents-council-recommends-open-data.html"&gt;Public Sector and Elections Lab Blog&lt;/a&gt;&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
One of the things I most enjoy about working on data management is the ability to work on a variety of problems, both in the private sector and in government. I recently had the privilege of serving on a working group of the President’s Council of Advisors on Science and Technology (&lt;a href="http://www.whitehouse.gov/administration/eop/ostp/pcast"&gt;PCAST&lt;/a&gt;) studying the challenges of conserving the nation’s ecosystems. The working group’s report, titled “Sustaining Environmental Capital: Protecting Society and the Economy,” was presented to President Obama on July 18, 2011. The &lt;a href="http://www.whitehouse.gov/sites/default/files/microsites/ostp/pcast_sustaining_environmental_capital_report.pdf"&gt;full report&lt;/a&gt; is now available to the public. &lt;br /&gt;
&lt;br /&gt;
The &lt;a href="http://www.whitehouse.gov/sites/default/files/microsites/ostp/biodiversity_press_release_7-22-11.pdf"&gt;press release&lt;/a&gt; announcing the report summarizes its recommendations: &lt;br /&gt;
&lt;blockquote&gt;The Federal Government should launch a series of efforts to assess thoroughly the condition of U.S. ecosystems and the social and economic value of the services those ecosystems provide, according to a new report by the President’s Council of Advisors on Science and Technology (PCAST), an independent council of the Nation’s leading scientists and engineers. The report also recommends that the Nation apply modern informatics technologies to the vast stores of biodiversity data already collected by various Federal agencies in order to increase the usefulness of those data for decision- and policy-making.&lt;/blockquote&gt;&lt;br /&gt;
One of the key challenges we face in assessing the condition of ecosystems is that a lot of the data pertaining to these systems is locked up in individual databases. Even though this data is often collected using government funds, it is not always available to the public, and in other cases it is available but not in usable formats. This is a classic example of a data integration problem, one that occurs in many other domains.&lt;br /&gt;
&lt;br /&gt;
The report calls for creating an ecosystem, EcoINFORMA, around data. The crucial piece of this ecosystem is to make the relevant data publicly available in a timely manner and, most importantly, in a machine-readable form. Publishing data embedded in a PDF file is a classic example of what does not count as machine-readable. By contrast, if you are publishing a tabular data set, a computer program should be able to directly access the meta-data (e.g., column names, date collected) and the data rows without having to heuristically extract them from surrounding text. &lt;br /&gt;
&lt;br /&gt;
Once the data is published, it can be discovered by search engines. Data from multiple sources can be combined to provide additional insight, and the data can be visualized and analyzed by sophisticated tools. The main point is that innovation should be pursued by many parties (academic, commercial, government), each applying their own expertise and passions.&lt;br /&gt;
&lt;br /&gt;
There is a subtle point about how much meta-data should be provided before publishing the data. Unfortunately, requiring too much meta-data (e.g., standard schemas) often stymies publication.  When meta-data exists, that’s great, but when it’s not there or is not complete, we should still publish the data in a timely manner. If the data is valuable and discoverable, there will be someone in the ecosystem who will enhance the data in an appropriate fashion. &lt;br /&gt;
&lt;br /&gt;
I look forward to seeing this ecosystem evolve and am excited that &lt;a href="http://www.google.com/fusiontables/public/tour/index.html"&gt;Google Fusion Tables&lt;/a&gt;, our own cloud-based service for visualizing, sharing and integrating structured data, can contribute to its development.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-6244240591590895894?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=6PXn1guYyck:VQHpcKU9USM:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/6PXn1guYyck" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/6244240591590895894/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=6244240591590895894" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/6244240591590895894?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/6244240591590895894?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/6PXn1guYyck/presidents-council-recommends-open-data.html" title="President's Council Recommends Open Data for Federal Agencies" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/03833875495392153515</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/presidents-council-recommends-open-data.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DU4DRn49cSp7ImA9WhdWFEk.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-6538985493372526158</id><published>2011-07-21T08:10:00.000-07:00</published><updated>2011-09-07T19:12:57.069-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-07T19:12:57.069-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="search ads" /><title>Studies Show Search Ads Drive 89% Incremental Traffic</title><content type="html">&lt;span class="byline-author"&gt;Posted by David Chan and Lizzy Van Alstine, Quantitative Management Team&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
Advertisers often wonder whether search ads cannibalize their organic traffic. In other words, if search ads were paused, would clicks on organic results increase, and make up for the loss in paid traffic? Google statisticians recently ran over 400 studies on paused accounts to answer this question.&lt;br /&gt;
&lt;br /&gt;
In what we call “Search Ads Pause Studies”, our group of researchers observed organic click volume in the absence of search ads. Then they built a statistical model to predict the click volume for given levels of ad spend, using spend and organic impression volume as predictors. These models generated estimates of the incremental ad clicks (IAC) attributable to search ads: in other words, the percentage of paid clicks that are not made up for by organic clicks when search ads are paused.&lt;br /&gt;
&lt;br /&gt;
The results were surprising. On average, the incremental ad clicks percentage across verticals was 89%. This means that a full 89% of the traffic generated by search ads is not replaced by organic clicks when ads are paused. This number was consistently high across verticals. The full study can be found &lt;a href="http://research.google.com/pubs/archive/37161.pdf"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-6538985493372526158?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=N0gVP0-DkHk:Ud7wgTgDJ5U:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/N0gVP0-DkHk" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/6538985493372526158/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=6538985493372526158" title="18 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/6538985493372526158?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/6538985493372526158?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/N0gVP0-DkHk/studies-show-search-ads-drive-89.html" title="Studies Show Search Ads Drive 89% Incremental Traffic" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/03833875495392153515</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>18</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/studies-show-search-ads-drive-89.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DU8HR38_cSp7ImA9WhdSEUU.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1563979813476126957</id><published>2011-07-20T11:08:00.000-07:00</published><updated>2011-07-20T11:50:36.149-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-20T11:50:36.149-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="conference" /><category scheme="http://www.blogger.com/atom/ns#" term="University Relations" /><title>Faculty from across the Americas meet in New York for the Faculty Summit</title><content type="html">&lt;span class="byline-author"&gt;Posted by Maggie Johnson, Director of Education &amp;amp; 
University Relations&lt;/span&gt; &lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;(Cross-posted from the &lt;a href="http://googleblog.blogspot.com/2011/07/faculty-from-across-americas-meet-in.html"&gt;Official Google Blog&lt;/a&gt;)&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
Last week, we held our seventh annual &lt;a href="https://sites.google.com/site/facultysummit2011/"&gt;Computer Science Faculty Summit&lt;/a&gt;. For the first time, the event took place at our &lt;a href="http://maps.google.com/?q=Google%20New%20York@40.741962,-74.004624&amp;amp;hl=en"&gt;New York City office&lt;/a&gt;;&amp;nbsp;nearly 100 faculty members from universities in the U.S., Canada and Latin America attended. The two-day Summit focused on systems, artificial intelligence and mobile computing. Alfred Spector, VP of research and special initiatives, hosted the conference and led lively discussions on privacy, security and Google’s approach to research.  &lt;br /&gt;
&lt;br /&gt;
Google’s Internet evangelist, Vint Cerf, opened the Summit with a talk on the challenges involved in securing the “&lt;a href="http://en.wikipedia.org/wiki/Internet_of_Things"&gt;Internet of things&lt;/a&gt;”—that is, uniquely identifiable objects (“things”) and their virtual representations. With almost 2 billion international Internet users and 5 billion mobile devices out there in the world, Vint expounded upon the idea that Internet security is not just about technology, but also about policy and global institutions. He stressed that our new digital ecosystem is complex and large in scale, and includes both hardware and software. It also has multiple stakeholders, diverse business models and a range of legal frameworks. Vint argued that making and keeping the Internet secure over the next few years will require technical innovation and global collaboration.&lt;br /&gt;
&lt;br /&gt;
After Vint kicked things off, faculty spent the two days attending presentations by Google software engineers and research scientists, including John Wilkes on the management of Google's large hardware infrastructure,  Andrew Chatham on the self-driving car, Johan Schalkwyk on mobile speech technology and Andrew Moore on the research challenges in commerce services. Craig Nevill-Manning, the engineering founder of Google’s NYC office, gave an update on Google.org, particularly its recent work in crisis response. Other talks covered the engineering work behind products like Ad Exchange and Google Docs, and the range of engineering projects taking place across 35 Google offices in 20 countries. For a complete list of the topics and sessions, visit the &lt;a href="https://sites.google.com/site/facultysummit2011/agenda"&gt;Faculty Summit site&lt;/a&gt;. Also, a few of our attendees heeded Alfred’s call to recap their breakout sessions in verse—&lt;a href="http://services.google.com/fh/files/blogs/google_facultysummitpoem.pdf"&gt;download a PDF&lt;/a&gt; of one of our favorite poems, about the future of mobile computing, penned by NYU professor Ken Perlin.&lt;br /&gt;
&lt;br /&gt;
A highlight of this year’s Summit was Bill Schilit’s presentation of the Library Wall, a Chrome OS experiment featuring an eight-foot tall full-color virtual display of ebooks that can be browsed and examined individually via touch screen. Faculty members were invited to play around with the digital-age “bookshelf,” which is one of the newest additions to our NYC office. &lt;br /&gt;
&lt;br /&gt;
We’ve already posted deeper dives on a few of the talks—including &lt;a href="http://googleresearch.blogspot.com/2011/07/google-north-american-faculty-summit_15.html"&gt;cluster management&lt;/a&gt;, &lt;a href="http://googleresearch.blogspot.com/2011/07/google-north-american-faculty-summit.html"&gt;mobile search&lt;/a&gt; and &lt;a href="http://googleresearch.blogspot.com/2011/07/google-north-american-faculty-summit_18.html"&gt;commerce&lt;/a&gt;. We also collected some interesting &lt;a href="http://googleresearch.blogspot.com/2011/07/google-americas-faculty-summit.html"&gt;faculty reflections&lt;/a&gt;. For more information on all of our programs, visit our &lt;a href="http://research.google.com/university/"&gt;University Relations website&lt;/a&gt;. The Faculty Summit is meant to connect forerunners across the computer science community—in business, research and academia—and we hope all our attendees returned home feeling informed and inspired.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1563979813476126957?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=52kYp2kVmKc:DCkHnDnwDDQ:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/52kYp2kVmKc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1563979813476126957/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1563979813476126957" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1563979813476126957?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1563979813476126957?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/52kYp2kVmKc/faculty-from-across-americas-meet-in.html" title="Faculty from across the Americas meet in New York for the Faculty Summit" /><author><name>A Googler</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/faculty-from-across-americas-meet-in.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkAASHY4fip7ImA9WhdSEUQ.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1603334121997100669</id><published>2011-07-19T14:39:00.000-07:00</published><updated>2011-07-20T13:45:49.836-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-20T13:45:49.836-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Education" /><title>Google Americas Faculty Summit: Reflections from our attendees</title><content type="html">&lt;span class="byline-author"&gt;Posted by Alfred Spector, Vice President, Research&lt;/span&gt; &lt;br /&gt;&lt;br /&gt;Last week, we held our seventh annual Americas &lt;a 
href="https://sites.google.com/site/facultysummit2011/"&gt;Computer Science Faculty Summit&lt;/a&gt; at our New York City office. About 100 faculty members from universities in the Western Hemisphere attended the two-day Summit, which focused on systems, artificial intelligence and mobile.  To finish up our series of Summit recaps, I asked four faculty members to give us their perspectives on the summit, thinking their views would complement our own blog:  Jeannette Wing from Carnegie Mellon, Rebecca Wright from Rutgers, Andrew Williams from Spelman and Christos Kozyrakis from Stanford.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Jeannette M. Wing, Carnegie Mellon University&lt;/b&gt;&lt;br /&gt;Fun, cool, edgy and irreverent.  Those words describe my impression of Google after attending the Google Faculty Summit, held for the first time at its New York City location.  Fun and cool: The Library Wall prototype, which attendees were privileged to see, is a peek at the future where e-books have replaced physical books, but where physical space, equipped with wall-sized interactive displays, still encourages the kind of serendipitous browsing we enjoy in the grand libraries of today.  Cool and edgy: Being in the immense old Port Authority building in the midst of the Chelsea district of Manhattan  is just plain cool and adds an edgy character to Google not found at the corporate campuses of Silicon Valley.  Edgy, or more precisely “on the edge,” is Google as it explores new directions: social networking (Google+), mobile voice search (check out the microphone icon in your search bar) and commerce (e.g., selling soft goods on-line).    Why these directions? Some are definitely for business reasons, but some are also simply because Google can (self-driving cars) and because it’s good for society (e.g., emergency response in Haiti, Chile, New Zealand and Japan). 
“Irreverent” is Alfred Spector’s word and sums it up—Google is a fun place to work, where smart people can be creative, build cool products and make a difference in untraditional ways.&lt;br /&gt;&lt;br /&gt;But the one word that epitomizes Google is “scale.”  How do you manage clusters on the order of hundreds of thousands of processors where the focus is faults, not performance or power?   What knowledge about humanity can machine learning discover from 12 million scanned books in 400 languages that generated five billion pages and two trillion words digitized? Beyond Google, how do you secure the Internet of Things when eventually everything from light bulbs to pets will all be Internet-enabled and accessible?&lt;br /&gt;&lt;br /&gt;One conundrum. Google’s hybrid model of research clearly works for Google and for Googlers.  It is producing exciting advances in technology and having an immeasurable impact on society.   Evident from our open and intimate breakout sessions, Google stays abreast of cutting-edge academic research, often by hiring our Ph.D. students.  The challenge for computer science research is, “how can academia build on the shoulders of Google’s scientific results?” &lt;br /&gt;&lt;br /&gt;Academia does not have access to the scale of data or the complexity of system constraints found within Google.   For the good of the entire industry-academia-government research ecosystem, I hope that Google continues to maintain an open dialogue with academia—through faculty summits, participation and promotion of open standards, robust university relations programs and much more.&lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Rebecca Wright, Rutgers University&lt;/b&gt;&lt;br /&gt;This was my first time attending a Google Faculty Summit.  It was great to see it held in my "backyard," which emphasized the message that much of Google's work takes place outside their Mountain View campus.  
There was a broad variety of excellent talks, each of which only addressed the tip of the iceberg of the particular problem area.   The scope and scale of the work being done at Google is really mind-boggling.  It both drives Google’s need for new solutions and allows the company to consider new approaches.  At Google’s scale, automation is critical and almost everything requires research advances, engineering advances, considerable development effort and engagement of people outside Google (including academics, the open source community, policymakers and "the crowd").&lt;br /&gt;&lt;br /&gt;A unifying theme in much of Google’s work is the use of approaches that leverage its scale rather than fight it (such as MapMaker, which combines Google's data and computational resources with people's knowledge about and interest in their own geographic areas). In addition to hearing presentations, the opportunity to interact with the broad variety of Googlers present as well as other faculty was really useful and interesting.  As a final thought, I would like to see Google get more into education, particularly in terms of advancing hybrid in-class/on-line technologies that take advantage of the best features of each.&lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Andrew Williams, Spelman College&lt;/b&gt;&lt;br /&gt;At the 2011 Google Faculty Summit in New York, the idea that we are moving past the Internet of computers to an "Internet of Things" became a clear theme. After hearing presentations by Googlers, such as Vint Cerf dapperly dressed in a three piece suit, I realized that we are in fact moving to an Internet of Things &lt;i&gt;and&lt;/i&gt; People.  The pervasiveness of connected computing devices and very large systems for cloud computing all interacting with socially connected people were expounded upon both in presentations and in informal discussions with faculty from around the world. 
The "Internet of people" aspect was also evident in emerging policies we touched on, involving security, privacy and social networks (like the Google+ project). I also enjoyed the demonstration of the Google self-driving car as an advanced application of artificial intelligence that integrates computer vision, localization and decision making in a real world transportation setting. I was impressed with how Google volunteers its talent, technology and time to help people, as it did with its crisis response efforts in Haiti, Japan and other parts of the world.&lt;br /&gt;&lt;br /&gt;As an educator and researcher in humanoid robotics and AI at a historically black college for women in Atlanta, the Google Faculty Summit motivated me to improve how I educate our students to eventually tackle the grand challenges posed by the Internet of Things and People. It was fun to learn how Google is actively seeking to solve these grand challenges on a global scale.&lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Christos Kozyrakis, Stanford University&lt;/b&gt;&lt;br /&gt;What makes the Google Faculty Summit a unique event to attend is its wide-reaching focus. Our discipline-focused conferences facilitate in-depth debates over a narrow set of challenges. In contrast, the Faculty Summit is about bringing together virtually all disciplines of computer science to turn information into services with an immediate impact on our everyday lives. It is fascinating to discuss how large data centers and distributed software systems allow us to use machine learning algorithms on massive datasets and get voice based search, tailored shopping recommendations or driver-less cars. Apart from the general satisfaction of seeing these applications in action, one of the important takeaways for me is that specifying and managing the behavior of large systems in an end-to-end manner is currently a major challenge for our field. 
Now is probably the best time to be a computer scientist, and I am leaving with a better understanding of what advances in my area of expertise can have the biggest overall impact.&lt;br /&gt;&lt;br /&gt;I also enjoyed having the summit at the New York City office, away from Google headquarters in Silicon Valley. It’s great to see in practice how the products of our field (networking, video-conferencing and online collaboration tools) allow for technology development anywhere in the world. &lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;As per Jeannette Wing’s comments about Google being “irreverent,” I own up to using the term—initially about a subject on which Aristophanes once wrote (I’ll leave that riddle open). As long as you take my usage in the right way (that is, we’re very serious about the work we do, but perhaps not about all the things one would expect of a large company), I’m fine with it. There’s so much in the future of computer science and its potential impact that we should always be coming at things in new ways, with the highest aspirations and with joy at the prospects.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1603334121997100669?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/blogspot/gJZg?a=LWMg0BE4ToM:OaAOkiXGJog:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/blogspot/gJZg?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/LWMg0BE4ToM" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1603334121997100669/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1603334121997100669" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1603334121997100669?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1603334121997100669?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/LWMg0BE4ToM/google-americas-faculty-summit.html" title="Google Americas Faculty Summit: Reflections from our attendees" /><author><name>A Googler</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/google-americas-faculty-summit.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEYAQHc9fCp7ImA9WhdSEUw.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-8581749400476506301</id><published>2011-07-18T14:01:00.001-07:00</published><updated>2011-07-19T14:49:01.964-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-19T14:49:01.964-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Education" /><title>Google Americas Faculty Summit Day 2: Shopping, Coupons and Data</title><content type="html">&lt;span class="byline-author"&gt;Posted by Andrew W. Moore, Director, Google Commerce and Site Director, Pittsburgh&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;On July 14 and 15, we held our seventh annual Faculty Summit for the Americas with our New York City offices hosting for the first time. Over the next few days, we will be bringing you a series of blog posts dedicated to sharing the Summit's events, topics and speakers. --Ed&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
Google is ramping up its commitment to making shopping and commerce fun, convenient and useful. As a computer scientist with a background in algorithms and large-scale artificial intelligence, I am most interested in the breadth of fundamental new technologies needed in this area. They range from the computer vision technology that recognizes fashion styles and visually similar items of clothing, to a deep understanding of (potentially) all goods for sale in the world, to new and convenient payments technologies, to the intelligence that can be brought to the mobile shopping experience, to the infrastructure needed to make these technologies work on a global scale. &lt;br /&gt;
&lt;br /&gt;
At the Faculty Summit this week, I took the opportunity to engage faculty in some of the fascinating research questions that we are working on within Google Commerce. For example, consider the processing flow required to present a user with an appropriate set of shoes from which to choose, given an input image of a high-heeled shoe. First, we need to segment or identify the object of interest in the input image. If the input is an image of a high heel with the Alps in the background, we don’t want to find images of different types of shoes with the Alps in the background; we want images of high heels.&lt;br /&gt;
&lt;br /&gt;
The second step is to extract the object’s “visual signature” and build an index using color, shape, pattern and metadata.  Then, a search is performed using a variety of similarity measures. The implementation of this processing flow raises several research challenges. For example, the calculations required to determine similar shoes could be slow due to the number of factors that must be considered. Segmentation can also pose a difficult problem because of the complexity of the feature extraction algorithms.&lt;br /&gt;
&lt;br /&gt;
Another important consideration is personalization. Consumers want items that correspond to their interests, so we should include results based on historical search and shopping data for a particular person (who has opted in to such features). More importantly, we want to downweight styles that the shopper has indicated they do not like. Finally, we also need to include some creative items to simulate the serendipitous connections one makes when shopping in a store. This is a new kind of search experience, which requires a new kind of architecture and new ways to infer shopper satisfaction. As a result, we find ourselves exploring new kinds of statistical models and the underlying infrastructure to support them. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-8581749400476506301?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/5BQhtzu-cPQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/8581749400476506301/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=8581749400476506301" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/8581749400476506301?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/8581749400476506301?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/5BQhtzu-cPQ/google-north-american-faculty-summit_18.html" title="Google Americas Faculty Summit Day 2: Shopping, Coupons and Data" /><author><name>A Googler</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/google-north-american-faculty-summit_18.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEYBRHoyfSp7ImA9WhdSEUw.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-56624785241734158</id><published>2011-07-15T11:32:00.000-07:00</published><updated>2011-07-19T14:49:15.495-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-19T14:49:15.495-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Education" /><title>Google Americas Faculty Summit Day 1: Cluster Management</title><content type="html">&lt;span class="byline-author"&gt;Posted by John Wilkes, Principal Software Engineer&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;On July 14 and 15, we held our seventh annual Faculty Summit for the Americas with our New York City offices hosting for the first time. Over the next few days, we will be bringing you a series of blog posts dedicated to sharing the Summit's events, topics and speakers. --Ed&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
At this year’s Faculty Summit, I had the opportunity to provide a glimpse into the world of cluster management at Google. My goal was to brief the audience on the challenges of this complex system and explain a few of the research opportunities that these kinds of systems provide.&lt;br /&gt;
&lt;br /&gt;
First, a little background. Google’s fleet of machines is spread across many data centers, each of which consists of a number of clusters (a cluster is a set of machines with a high-speed network between them). Each cluster is managed as one or more cells. A user (in this case, a Google engineer) submits jobs to a cell for it to run. A job could be a service that runs for an extended period, or a batch job that runs, for example, a MapReduce updating an index.&lt;br /&gt;
&lt;br /&gt;
Cluster management operates on a very large scale: whereas a storage system that can hold a petabyte of data is considered large by most people, our storage systems will send us an emergency page when they have only a few petabytes of free space remaining. This scale gives us opportunities (e.g., a single job may use several thousand machines at a time), but also many challenges (e.g., we constantly need to worry about the effects of failures). The cluster management system juggles the needs of a large number of jobs in order to achieve good utilization, trying to strike a balance between a number of conflicting goals.&lt;br /&gt;
&lt;br /&gt;
To complicate things, data centers can have multiple types of machines, different network and power-distribution topologies, a range of OS versions and so on.  We also need to handle changes, such as rolling out a software or a hardware upgrade, while the system is running.&lt;br /&gt;
&lt;br /&gt;
Our current cluster management system is about seven years old now (several generations for most Google software) and, although it has been a huge success, it is beginning to show its age. We are currently prototyping a new system that will replace it; most of my talk was about the challenges we face in building this system.  We are building it to handle larger cells, to look into the future (by means of a calendar of resource reservations) to provide predictable behavior, to support failures as a first-class concept, to unify a number of today’s disjoint systems and to give us the flexibility to add new features easily.  A key goal is that it should provide predictable, understandable behavior to users and system administrators. For example, the latter want to know answers to questions like “Are we in trouble? Are we about to be in trouble? If so, what should we do about it?”  &lt;br /&gt;
&lt;br /&gt;
Putting all this together requires advances in a great many areas. I touched on a few of them, including scheduling and ways of representing and reasoning with user intentions. One of the areas that I think doesn’t receive nearly enough attention is system configuration—describing how systems should behave, how they should be set up, how those setups should change, etc.  Systems at Google typically rely on dozens of other services and systems.  It’s vital to simplify the process of making controlled changes to configurations that result in predictable outcomes, every time, even in the face of heterogeneous infrastructure environments and constant flux.&lt;br /&gt;
&lt;br /&gt;
We’ll be taking steps toward these goals ourselves, but the intent of today’s discussion was to encourage people in the academic community to think about some of these problems and come up with new and better solutions, thereby raising the bar for us all.&lt;br /&gt;
&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-56624785241734158?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/uPLQQ2mf9YY" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/56624785241734158/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=56624785241734158" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/56624785241734158?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/56624785241734158?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/uPLQQ2mf9YY/google-north-american-faculty-summit_15.html" title="Google Americas Faculty Summit Day 1: Cluster Management" /><author><name>A Googler</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/google-north-american-faculty-summit_15.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEYCRnc4fyp7ImA9WhdSEUw.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-1287606688776274007</id><published>2011-07-15T10:29:00.001-07:00</published><updated>2011-07-19T14:49:27.937-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-19T14:49:27.937-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Voice Search" /><category scheme="http://www.blogger.com/atom/ns#" term="Education" /><title>Google Americas Faculty Summit Day 1: Mobile Search</title><content type="html">&lt;span class="byline-author"&gt;Posted by Johan Schalkwyk, Software Engineer&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;On July 14 and 15, we held our seventh annual Faculty Summit for the Americas with our New York City offices hosting for the first time. Over the next few days, we will be bringing you a series of blog posts dedicated to sharing the Summit's events, topics and speakers. --Ed&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
Google’s mobile speech team has a lofty goal: recognize any search query spoken in English and return the relevant results. Regardless of whether your accent skews toward a Southern drawl, a Boston twang, or anything in between, spoken searches like “navigate to the Metropolitan Museum,” “call California Pizza Kitchen” or “weather, Scarsdale, New York” should provide immediate responses with a map, the voice of the hostess at your favorite pizza place or an online weather report. The responses must be fast and accurate or people will stop using the tool, and—given that the number of speech queries has more than doubled over the past year—the team is clearly succeeding.&lt;br /&gt;
&lt;br /&gt;
As a software engineer on the mobile speech team, I used this week’s Faculty Summit to present some of the interesting challenges in developing and implementing mobile search. One of the immediate puzzles we have to solve is how to train a computer system to recognize speech queries. There are two aspects to consider: the acoustic model, or the sound of letters and words in a language; and the language model, which in English is essentially grammar, or what allows us to predict words that follow one another. We can put the language model together using a huge amount of data gathered from our query logs. The acoustic model, however, is more challenging.&lt;br /&gt;
&lt;br /&gt;
To build our acoustic model, we could conduct “supervised learning,” in which we collect 100+ hours of audio data from search queries and then transcribe and label the data. We use this data to translate a speech query into a written query. This approach works fairly well, but it doesn’t improve as we collect more audio data. Thus, we use an “unsupervised model” in which we continuously add more audio data to our training set as users make speech queries.&lt;br /&gt;
&lt;br /&gt;
Given the scale of this system, another interesting challenge is testing accuracy. The traditional approach is to have human testers run assessments. Over the past year, however, we have determined that our automated system is at least as accurate as our human testers, so we’ve decided to create a new method for automated testing at scale, a project we are working on now. &lt;br /&gt;
&lt;br /&gt;
The current voice search system is trained on over 230 billion words and has a one million word vocabulary, meaning it understands all the different contexts in which those one million words can be used. It requires multiple CPU decades for training and data processing, plus a significant amount of storage, so this is an area where Google’s large infrastructure is essential. It’s exciting to be a part of such cutting edge research, and the Faculty Summit was an excellent opportunity to share our latest innovations with people who are equally inspired by this area of computer science.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-1287606688776274007?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/7aS6aQvB8QE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/1287606688776274007/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=1287606688776274007" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1287606688776274007?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/1287606688776274007?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/7aS6aQvB8QE/google-north-american-faculty-summit.html" title="Google Americas Faculty Summit Day 1: Mobile Search" /><author><name>A Googler</name><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/google-north-american-faculty-summit.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUYGQ3o-cCp7ImA9WhdTFU0.&quot;"><id>tag:blogger.com,1999:blog-21224994.post-3074129224865597838</id><published>2011-07-12T14:45:00.000-07:00</published><updated>2011-07-12T14:45:22.458-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-07-12T14:45:22.458-07:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Android" /><title>What You Capture Is What You Get: A New Way for Task Migration Across Devices</title><content type="html">&lt;span class="byline-author"&gt;Posted by Yang Li, Research Scientist&lt;/span&gt;  &lt;br /&gt;
&lt;br /&gt;
We constantly move from one device to another while carrying out everyday tasks. For example, we might find an interesting article on a desktop computer at work, then bring the article with us on a mobile phone during the commute and keep reading it on a laptop or a TV when we get home. Cloud computing and web applications have made it possible to access the same data and applications on different devices and platforms. However, there are not many ways to easily move tasks across devices that are as intuitive as drag-and-drop in a graphical user interface.&lt;br /&gt;
&lt;br /&gt;
Since last year, our research team has been developing new technologies that let users easily migrate their tasks across devices. In a project named Deep Shot, we demonstrated how a user can easily move web pages and applications, such as Google Maps directions, between a laptop and an Android phone by using the phone camera. With Deep Shot, a user can simply take a picture of their monitor with a phone camera, and the captured content automatically shows up and becomes instantly interactive on the mobile phone.&lt;br /&gt;
&lt;br /&gt;
This project was inspired by our observation that many people take a picture of map directions on the monitor using their mobile phone camera, rather than using other approaches such as email. Taking pictures feels more direct and convenient, and fits well with our everyday activities, which are often opportunistic. Instead of just capturing raw pixels, Deep Shot recovers the actual contents and applications on the mobile phone based on these pixels. You can find out how Deep Shot keeps user interaction simple and what happens behind the scenes &lt;a href="http://research.google.com/pubs/archive/37153.pdf"&gt;here&lt;/a&gt;. Similar to WYSIWYG (What You See Is What You Get) for graphical user interfaces, Deep Shot demonstrates WYCIWYG (What You Capture Is What You Get) for cross-device interaction. We are exploring this interaction style for various task migration situations in our everyday life.&lt;br /&gt;
&lt;br /&gt;
&lt;object width="320" height="266" class="BLOGGER-youtube-video" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" data-thumbnail-src="http://1.gvt0.com/vi/iGTM6xs2sck/0.jpg"&gt;&lt;param name="movie" value="http://www.youtube.com/v/iGTM6xs2sck&amp;fs=1&amp;source=uds" /&gt;&lt;param name="bgcolor" value="#FFFFFF" /&gt;&lt;embed width="320" height="266"  src="http://www.youtube.com/v/iGTM6xs2sck&amp;fs=1&amp;source=uds" type="application/x-shockwave-flash"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;
&lt;br /&gt;
Deep Shot remains a research project at Google. With increasing capabilities of mobile phones and fast growing web applications, we hope to explore more exciting ways to help users carry out their everyday activities.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/21224994-3074129224865597838?l=googleresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/blogspot/gJZg/~4/AriXps6Z9RE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://googleresearch.blogspot.com/feeds/3074129224865597838/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://www.blogger.com/comment.g?blogID=21224994&amp;postID=3074129224865597838" title="3 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/3074129224865597838?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/21224994/posts/default/3074129224865597838?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/blogspot/gJZg/~3/AriXps6Z9RE/what-you-capture-is-what-you-get-new.html" title="What You Capture Is What You Get: A New Way for Task Migration Across Devices" /><author><name>Research @ Google</name><uri>http://www.blogger.com/profile/03833875495392153515</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>3</thr:total><feedburner:origLink>http://googleresearch.blogspot.com/2011/07/what-you-capture-is-what-you-get-new.html</feedburner:origLink></entry></feed>

