<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;Dk8FRXg4eSp7ImA9WxJUEkg.&quot;"><id>tag:blogger.com,1999:blog-6555947</id><updated>2009-07-10T13:40:14.631-06:00</updated><title>The Geomblog</title><subtitle type="html">Ruminations on computational geometry, algorithms, theoretical computer science and life</subtitle><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://geomblog.blogspot.com/" /><link rel="next" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default?start-index=26&amp;max-results=25&amp;redirect=false&amp;v=2" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>966</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><link rel="self" href="http://feeds.feedburner.com/TheGeomblog" type="application/atom+xml" /><feedburner:browserFriendly>This is an XML content feed. It is intended to be viewed in a newsreader or syndicated to another site, subject to copyright and fair use.</feedburner:browserFriendly><entry gd:etag="W/&quot;CEYMRn8zfSp7ImA9WxJUEUU.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-7797791710702404906</id><published>2009-07-09T17:24:00.004-06:00</published><updated>2009-07-09T17:29:47.185-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-09T17:29:47.185-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="miscellaneous" /><category scheme="http://www.blogger.com/atom/ns#" term="awards" /><category scheme="http://www.blogger.com/atom/ns#" term="blogosphere" /><title>Quick note</title><content type="html">&lt;span style="font-style:italic;"&gt;(am posting this from 37000 ft. isn't technology cool ?)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The clustering series will be on hiatus for a few days while I wrap up some more pressing deadlines. It will restart two weeks from now. &lt;br /&gt;&lt;br /&gt;Congratulations to Adam Smith and Sean Hallgren on getting a &lt;a href="http://bit.ly/13mMOI"&gt;PECASE award&lt;/a&gt;. See what &lt;a href="http://adamdsmith.wordpress.com/"&gt;creating a new blog&lt;/a&gt; can do !&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-7797791710702404906?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=GGLjmjq5QWY:PRdV6uRkEBg:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=GGLjmjq5QWY:PRdV6uRkEBg:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/GGLjmjq5QWY" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/7797791710702404906/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=7797791710702404906" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/7797791710702404906?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/7797791710702404906?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/GGLjmjq5QWY/quick-note.html" title="Quick note" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">5</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/07/quick-note.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUcHQXcyeCp7ImA9WxJUEUs.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-1187765454327736034</id><published>2009-07-09T11:58:00.003-06:00</published><updated>2009-07-09T12:10:30.990-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-09T12:10:30.990-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="nsf" /><category scheme="http://www.blogger.com/atom/ns#" term="research" /><title>NSF EDA Workshop: Wrap up</title><content type="html">Today morning was wrap up day. The theory group was "tasked" with coming up with a few slides suggested avenues for further cooperation. Dick talked about creating (or re-creating) an environment in academia where researchers are building chips and talking to other experts down the corridor (like theoreticians) rather than the 'planted' model where a theoretician is planted in a corporate design environment for 6 months or so. &lt;br /&gt;&lt;br /&gt;I talked briefly about what I called "new" theory: the twin ideas of randomization and approximation that have been so powerful in algorithm design (even for intractable prolems), and how these play into the problems of dealing with massive high dimensional data. &lt;br /&gt;&lt;br /&gt;Vijaya gave a synopsis of cache-obliviousness, multicore research, and her recent work in this area. There's a lot of interest in EDA and multicore work, and she made the argument that theoreticians are learning how to design efficient multicore algorithms and can be of great help in adapting EDA methods.&lt;br /&gt;&lt;br /&gt;There were also four sub-panels that addressed various aspects of EDA. What to me was interesting that many of the panels brought up "EDA needs in theory" that matched some of the things we talked about. For example, there's a pressing need for methods that run linear (and even sublinear) time, tools that are incremental, in that they can progressively generate better and better solutions given more time, and tools that can parallelize well.&lt;br /&gt;&lt;br /&gt;They also talked about providing benchmarks: for example, a 10,000 variable SAT instance and code that solves it. I thought (and said) that this would be an excellent way to get people dabbling in this area. &lt;br /&gt;&lt;br /&gt;Randomization was a trickier matter. Although the EDA folks recognize the power of randomization, they are concerned about reproducibility. The DA pipelne is long and involved, and once you've fixed the output of a piece of the pipeline, you'd like to keep it in place and not change from iteration to iteration.&lt;br /&gt;&lt;br /&gt;Of course this is something that can be fixed with seed control. For more portability, it's conceivable that you can cache the random coin tosses once you've settled on a reasonable solution to any piece in the pipeline. &lt;br /&gt;&lt;br /&gt;Overall, it was quite interesting, although exhausting. The general idea with such workshops is that the findings make their way into the next call for proposals (sometime in October/November), so if you have EDA people you'd like to collaborate it, this might be a good opportunity. &lt;br /&gt;&lt;br /&gt;It's time to go home.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-1187765454327736034?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=m9QENwHrGfw:rJFxPo0zQiI:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=m9QENwHrGfw:rJFxPo0zQiI:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/m9QENwHrGfw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/1187765454327736034/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=1187765454327736034" title="10 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/1187765454327736034?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/1187765454327736034?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/m9QENwHrGfw/nsf-eda-workshop-wrap-up.html" title="NSF EDA Workshop: Wrap up" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">10</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/07/nsf-eda-workshop-wrap-up.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUYDRnY7eCp7ImA9WxJUEUg.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-9101619299838130794</id><published>2009-07-09T07:55:00.002-06:00</published><updated>2009-07-09T09:26:17.800-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-09T09:26:17.800-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="nsf" /><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="eda" /><title>NSF EDA Workshop: A comment</title><content type="html">Paul Beame had a brilliant comment on my post about the EDA Workshop, and I liked it so much I'm reproducing it in full here:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;One fault is that we don't even teach the proper tools in our algorithms courses! There are relatively few algorithms texts that even support the teaching of backtracking as an algorithmic paradigm. Moreover, the sort of very nice work with provable small exponential upper bounds that some theory researchers (e.g., David Eppstein, Richard Beigel, Martin Furer) have done in backtracking algorithms for combinatorial problems is only a very small part of the landscape for algorithmic solutions to hard problems.&lt;br /&gt;&lt;br /&gt;Too often, we in theory leave the important hard problems to those in AI and formal methods. One of the standard paradigms in practical verification is to attack problems that in their full generality are PSPACE-hard (or worse), find a useful NP subclass, apply a reduction from these problems to SAT, and use SAT solvers that involve worst-case exponential backtracking (or to a much lesser extent local search) algorithms that are fast in many cases. The key techniques used in these solvers are powerful heuristics that are usually studied in AI but rarely in algorithms research.&lt;br /&gt;&lt;br /&gt;The potential for practical impact of improved versions of these algorithms (even with only polynomial speed-up) could be huge. Traditional algorithmic research has occasionally had something useful to say about these topics (such as Moser's recent beautiful analysis of variants of the random walk algorithm on the special classes of CNF formulas satisfying the Lovasz Local Lemma conditions) but it is somehow not viewed as part of the mainstream.&lt;br /&gt;&lt;br /&gt;Applications in EDA are only part of the utility of some of the exponential algorithms used in formal methods. There has been a large and surprisingly successful body of work on software model checking that uses quite clever theoretical ideas. One of my favorite examples is the following: Though techniques for checking properties of communicating finite state machines (one of those nasty PSPACE-complete problems) had been in use for EDA since the early 1990's, model checking software presented an additional problem because of the need to handle the call stack. Suppose that one needs to check a safety property (i.e., an invariant). The key observation (which goes back to the early 1960's) is that the language consisting of the possible stack contents of a PDA is always a regular language for which an NFA can be easily constructed from the PDA! One can then apply a small extension the previous algorithm to check safety properties (which must hold for all possible stack contents).&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-9101619299838130794?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=HOYeJm9A0DM:QIXu_0bVcZo:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=HOYeJm9A0DM:QIXu_0bVcZo:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/HOYeJm9A0DM" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/9101619299838130794/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=9101619299838130794" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/9101619299838130794?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/9101619299838130794?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/HOYeJm9A0DM/nsf-eda-workshop-comment.html" title="NSF EDA Workshop: A comment" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/07/nsf-eda-workshop-comment.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEUMRX4zeSp7ImA9WxJUEUs.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-332867922452184238</id><published>2009-07-08T20:20:00.005-06:00</published><updated>2009-07-09T11:58:04.081-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-09T11:58:04.081-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="nsf" /><category scheme="http://www.blogger.com/atom/ns#" term="research" /><title>NSF Workshop: Electronic Design Automation</title><content type="html">I'm currently at an NSF Worshop on Electronic Design Automation (the thing that used to be called VLSI design). I'm here as part of the 'theory audience' along with Vijaya Ramachandran and Dick Lipton (blogger power!).&lt;br /&gt;&lt;br /&gt;Thankfully, Dick has posted an &lt;a href="http://rjlipton.wordpress.com/2009/07/08/nsf-workshop-on-design-automation-and-theory/"&gt;extensive summary&lt;/a&gt; of the day of talks, so I don't have to. Our mandate here is to listen in on the discussions and come up with a response that suggests avenues where theory folk might have something useful to contribute. &lt;br /&gt;&lt;br /&gt;From a purely biased algorithms perspective, one thing that strikes me about the EDA community (at least in the formal methods/verification realm) is their unwillingness to give up in the face of beyond-NP-completeness. What I mean is this: most of my training in algorithms (and this is likely true for you as well) is in the polynomial-time regime: all the algorithic paradigms we learn are effective at reducing the complexity of an algorithm from one polynomial to another. &lt;br /&gt;&lt;br /&gt;When we engage with NP-hardness, we switch modes to approximations, and focus on the issue of quality: even the approximation algorithms themselves run in poly time. There are very few people (David Eppstein comes to mind) who work on algorithms in the exponential/subexponential realm and will worry about (say) reducing the base of the exponent for SAT or graph coloring or other hard problems. &lt;br /&gt;&lt;br /&gt;The verification folks don't necessarily solve their (very hard) problems exactly, but they do design all kinds of tricks (and heuristics) to deal with these problems, because they actually need to solve them ! In my view, it wouldn't be a bad idea for students learning algorithms to learn at least a few tricks for designing algorithms that might run in exponential time, but are efficient. Remember that &lt;a href="http://rjlipton.wordpress.com/2009/07/03/is-pnp-an-ill-posed-problem/"&gt;exponential might be better than n^100 for many values of n&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;One thing that came to mind as I listened to talks. With the exception of a talk by Rupak Mazumdar on faulty computations, and a talk by Ed Clarke (yes, &lt;a href="http://geomblog.blogspot.com/2008/02/2007-acm-turing-award-and-perspective.html"&gt;that Ed Clarke&lt;/a&gt;) on statistical model checking (based on the Neyman-Pearson hypothesis testing framework), there was little talk of the role that randomization might have to play in the problems of EDA. &lt;br /&gt;&lt;br /&gt;A second thought was how the lessons of massive data analysis might be useful in the realm of DA. One speakr described one critical problem as being the degree of complexity associated with current DA tools: there are over 4000 "knobs" to turn in one such tool ! It's believed that these knobs are not independent, and might even be contradictory. If we think of each "run" of the DA tool, outputing some kind of chip layout, as a point in this 4000+ dimensional space, I wonder whether techniques for dimensionality reduction and manifold analysis might be useful to find a set of "core knobs" that control the process. &lt;br /&gt;&lt;br /&gt;I have to say that it's nice to attend a workshop with a community that throws out terms like NP-Complete, \Sigma_2, and PSPACE so freely :).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-332867922452184238?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=-qeRyKkw5S4:gbVpoVm_vXY:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=-qeRyKkw5S4:gbVpoVm_vXY:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/-qeRyKkw5S4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/332867922452184238/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=332867922452184238" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/332867922452184238?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/332867922452184238?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/-qeRyKkw5S4/nsf-workshop-electonic-design.html" title="NSF Workshop: Electronic Design Automation" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/07/nsf-workshop-electonic-design.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkUDQXs8fCp7ImA9WxJVFkk.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-4253584603827229177</id><published>2009-07-03T10:30:00.000-06:00</published><updated>2009-07-03T10:57:50.574-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-03T10:57:50.574-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="focs" /><category scheme="http://www.blogger.com/atom/ns#" term="research" /><title>FOCS 2009 results.</title><content type="html">The FOCS list is out, (with abstracts). 73 papers accepted. Some papers that popped up on my radar screen (post more in the comments since I'll probably miss many good ones):&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a target="_top" href="http://cs.uiowa.edu/%7Emrgibson"&gt;Matt Gibson&lt;/a&gt; and &lt;a target="_top" href="http://www.cs.uiowa.edu/%7Ekvaradar/"&gt;Kasturi Varadarajan&lt;/a&gt;. Decomposing Coverings and the Planar Sensor Cover Problem.&lt;br /&gt;&lt;br /&gt;This paper continues a line of work that I like to think we started in a SODA paper a few years ago. The problem is this: you're given a collection of sensors that monitor a region. But they have limited battery life, and they're also redundant (many different subsets of sensors have the range to cover the region). Can you schedule the sensors to go on and off so that at all times,  the entire region is covered, and the region is covered for as long as possible ?&lt;br /&gt;&lt;br /&gt;Early work on this problem formulated it in a combinatorial setting, which immediately led to things like log n-approximability lower bounds via set cover variants. Our belief was that the geometry of the regions would make things a bit easier. We made some small progress, getting a sublogarithmic approximation ratio for intervals on the line, and a constant factor for ranges with special structure. Subsequent work made steady improvements, and this work has brought things down to a constant  for general classes of regions in the plane&lt;/li&gt;&lt;li&gt;&lt;a target="_top" href="http://www.math.purdue.edu/%7Esbasu"&gt;Saugata Basu&lt;/a&gt; and &lt;a target="_top" href="http://ww2.lr.edu/mat/zell/zell.htm"&gt;Thierry Zell&lt;/a&gt;. Polynomial hierarchy, Betti numbers and a real analogue of Toda's Theorem&lt;br /&gt;&lt;br /&gt;Betti number computation has been a hot topic in computational geometry of late, but the idea of using Betti numbers to bound the (algebraic complexity) of basic geometric operations dates back to work by Dobkin and Lipton that relates the complexity of certain computations to the number of connected components of "invariant paths" in a computation. Ben-Or generalized these results to more complicated algebraic computation trees, and noting that "number of connected components" is the zero-th Betti number of the set of computations, asked whether higher-order Betti numbers might be useful for proving lower bounds. A series of results by Bjorner, Lovasz and Yao, Bjorner and Lovasz, and finally Yao showed that indeed the higher-order Betti numbers can be used to lower bound the complexity of algebraic computations. &lt;br /&gt;&lt;br /&gt;So Betti numbers are important as a measure of "counting" in algebraic complexity. This paper reinforces this intuition, by relating #P over the reals to the complexity of computing Betti numbers over semi-algebraic sets. Interestingly, this paper mentions the difficult nature of Toda's original proof, and Lance just recently tweeted a new very simple proof of Toda's theorem that he just published in ToC.  &lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;David Arthur, &lt;a target="_top" href="http://www-cc.cs.uni-saarland.de/manthey/"&gt;Bodo Manthey&lt;/a&gt; and &lt;a target="_top" href="http://www.roeglin.org/"&gt;Heiko Roeglin&lt;/a&gt;. k-Means has Polynomial Smoothed Complexity&lt;br /&gt;&lt;br /&gt;Read my post on k-means for more on the significance of such results. This paper finally resolves the smoothed complexity issue, giving a polynomial bound (expected).&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;a target="_top" href="http://www.daimi.au.dk/%7Epeyman/"&gt;Peyman Afshani&lt;/a&gt;, Jeremy Barbay and &lt;a target="_top" href="http://www.cs.uwaterloo.ca/%7Etmchan"&gt;Timothy M. Chan&lt;/a&gt;. Instance-Optimal Geometric Algorithms&lt;br /&gt;&lt;br /&gt;An interesting result that seems to be able to take away order-dependence from some geometric problems.&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;a target="_top" href="http://www.almaden.ibm.com/cs/people/jayram"&gt;T.S. Jayram&lt;/a&gt; and             &lt;a target="_top" href="http://www.almaden.ibm.com/cs/people/dpwoodru/"&gt;David Woodruff&lt;/a&gt;. The Data Stream Space Complexity of Cascaded Norms&lt;br /&gt;&lt;br /&gt;Cascaded norms (where you compute one aggregate on each of a collection of streams, and compute another aggregate on these aggregates) are tricky for streaming algorithms, and this paper provides some interesting results. Again, I'm waiting for the document to show up, and I'd be interesting in seeing the kinds of techniques they use.&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;Papers I'd like to learn more about:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a target="_top" href="http://cs.yale.edu/people/kannan.html"&gt;Ravindran Kannan&lt;/a&gt;. A new probability using typical moments and concentration results&lt;/li&gt;&lt;li&gt;&lt;a target="_top" href="http://www.cs.uu.nl/%7Ehansb"&gt;Hans Bodlaender&lt;/a&gt;, &lt;a target="_top" href="http://www.ii.uib.no/%7Efomin/"&gt;Fedor Fomin&lt;/a&gt;, &lt;a target="_top" href="http://www.ii.uib.no/%7Edaniello"&gt;Daniel Lokshtanov&lt;/a&gt;, &lt;a target="_top" href="http://www.cs.uu.nl/staff/penninkx.html"&gt;Eelko Penninkx&lt;/a&gt;, &lt;a target="_top" href="http://www.imsc.res.in/%7Esaket/"&gt;Saket Saurabh&lt;/a&gt; and &lt;a target="_top" href="http://www.thilikos.info/"&gt;Dimitrios Thilikos&lt;/a&gt;. (Meta) Kernelization&lt;/li&gt;&lt;li&gt;&lt;a target="_top" href="http://lightless.org/"&gt;Flavio Chierichetti&lt;/a&gt;, &lt;a target="_top" href="http://research.yahoo.com/%7Eravi_k53"&gt;Ravi Kumar&lt;/a&gt;, Silvio Lattanzi, Alessandro Panconesi and Prabhakar Raghavan. Models for the compressible Web&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;Comments ?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-4253584603827229177?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=cTUKAiGixE4:-NL5xWeNRsk:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=cTUKAiGixE4:-NL5xWeNRsk:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/cTUKAiGixE4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/4253584603827229177/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=4253584603827229177" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4253584603827229177?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4253584603827229177?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/cTUKAiGixE4/focs-2009-results.html" title="FOCS 2009 results." /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/07/focs-2009-results.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkUMRn4zeSp7ImA9WxJVFk4.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-5133500106942523697</id><published>2009-07-03T10:00:00.000-06:00</published><updated>2009-07-03T10:24:47.081-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-07-03T10:24:47.081-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="data-mining" /><category scheme="http://www.blogger.com/atom/ns#" term="clustering" /><title>Clustering: k-means...</title><content type="html">&lt;span style="font-style: italic;"&gt;(this is part of an &lt;/span&gt;&lt;a style="font-style: italic;" href="http://geomblog.blogspot.com/2009/06/clustering-occasional-series.html"&gt;occasional series of essays on clustering&lt;/a&gt;&lt;span style="font-style: italic;"&gt;: for all posts in this topic, &lt;/span&gt;&lt;a style="font-style: italic;" href="http://geomblog.blogspot.com/search/label/clustering"&gt;click here&lt;/a&gt;&lt;span style="font-style: italic;"&gt;)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;k-means (also known as Lloyd's algorithm) is probably the most well known clustering algorithm, and as such, deserves some discussion of its own. We've already seen that it's part of the "I don't like you" family of clustering formulations. How does the method work ?&lt;br /&gt;&lt;br /&gt;We start with a set of points in R^d. The algorithm is really simple: Fix a collection of k centers. Now perform the following alternating optimization till you reach a steady state: (1) assign each point to its nearest center (2) for each group of points assigned to the same center, compute a new center by taking the centroid of the points.&lt;br /&gt;&lt;br /&gt;More importantly, what problem does this algorithm solve ? The quick answer is: none ! Yes, that's right: k-means gives no answer with any provable guarantee for a generic set of points for any cost measure. Pretty dismal, isn't it ? And yet, it's a really popular algorithm, a state of affairs that generally drives theoreticians to tears. So what's the deal ? The answer, as always, lies in the details, and is a good story of how theoretical work has managed to make some sense of an algorithm that was notoriously hard to analyze properly.&lt;br /&gt;&lt;br /&gt;The underlying problem that k-means attempts to solve is the one I mentioned in the last post: let a cluster cost be the sum of squared distances to the cluster center, and minimize the sum of cluster costs over all k-clusterings. Now here are some of the bad things that we CAN say about k-means:&lt;div&gt;&lt;ul&gt;&lt;li&gt;It might take exponential time to converge, even in the plane&lt;/li&gt;&lt;li&gt;It can get stuck in a local minimum that has an arbitrarily bad cost. &lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;b&gt;Running Time:&lt;/b&gt;&lt;br /&gt;&lt;div&gt;Let's first consider the problem of guaranteeing running time. Although k-means can take exponential time to converge, a smoothed complexity result says that w.h.p, a perturbed input will converge in polynomial time (for those not familiar with the concept, the smoothed complexity of a problem is its (expected) running time after the inputs have been perturbed randomly by a small amount). The smoothed complexity results suggest one reason why k-means converges quickly "in practice". The kinds of careful constructions (high dimensional, and low dimensional) needed to force super polynomial convergence times are very artificial. &lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;A crucial initialization&lt;/b&gt;:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;So what about the quality of the results produced ? One unspecified aspect of the k-means algorithm is how the initial set of k centers is created. Two recent works have indicated that the key to extracting good performance from k-means is by being very careful with this initial choice.The idea is very neat, and really should be getting more play in the experimental community than I think it does.&lt;br /&gt;&lt;br /&gt;Recall the Gonzalez algorithm for the k-center problem: pick a point, and then its furthest neighbor, and then the point whose closest neighbor in this set is as far as possible, and so on. Consider the following randomized variant:&lt;br /&gt;&lt;blockquote&gt;Pick the next candidate center with probability proportional to its minimum distance squared from the current set of clusters.&lt;/blockquote&gt;Papers by &lt;a href="http://www.stanford.edu/%7Edarthur/kMeansPlusPlus.pdf"&gt;Arthur and Vassilvitski&lt;/a&gt;, and by &lt;a href="http://www.cs.ucla.edu/%7Erafail/PUBLIC/76.pdf"&gt;Ostrovksy, Rabani, Swamy and Schulman&lt;/a&gt; both show that this simple seeding strategy allows for provable guarantees on the quality of the solution returned by k-means. The actual results are slightly different, and give two different views of the behaviour of the algorithm:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The Ostrovsky et al paper starts with the assumption that the point set is epsilon-separated: the intuition behind this definition is that the cost of a k-clustering is significantly better than the cost of a (k-1)-clustering (controlled by a parameter epsilon). Under these conditions, they show that the initial seeding gives a PTAS for the clustering problem.&lt;br /&gt;&lt;br /&gt;I'm eliding a number of details about their algorithm. One interesting feature of their algorithm is what they do after the initial seeding: rather than run Lloyd's iterative step, they fix a radius around each center (different for each center) and compute the centroid of the portion of the set within this ball. They also use a more complicated procedure to convert this into a PTAS, and in my mind that makes their approach a lot less practical.&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;The Arthur/Vassilvitski paper starts with the same initialization procedure, but they assume no epsilon-separation result. Their algorithm is simply: run the initialization, and then run regular k-means. The guarantee they get is weaker (a log k approximation ratio), but they also provide empirical results showing the superiority of their approach compared to standard k-means. I think it'd be interesting to compare their approach with the 'ball k-means' approach by Ostrovsky et al to see which work better on benchmark data sets.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another pleasing property of the k-means algorithm is its relationship to mixture modelling and the EM algorithm. I'll have more to say about EM later on, but the gist of the matter is this: The twin steps of finding new centroids and assigning points to nearest neighbours have direct analogs in the world of maximum likelihood concepts and distribution-generated data. There is a general procedure that takes a given exponential family (Gaussian, Poisson and the like), and generates a distance function d such that the density described by the distribution can be expressed as exp(-d(x, mu)), where mu is the mean of the distribution, and x is the point whose probability we're evaluating. ML folks will recognize the Bregman distances here, and the punchline is that for Gaussian distributions, the corresponding distance function is squared Euclidean distance (the exact function used in the k-means evaluation). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;What does this all mean ? It means that the k-means process has a semantic meaning in the context of clustering data drawn from a mixture of Gaussians: what's even neater is that a k-means-like algorithm works for data drawn from other distributions, as long as you replace squared Euclidean distance by the appropriate (Bregman) distance function.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Apart from the simplicity of the algorithm, there are some aspects that make it attractive, regardless of lack of guaranteed bounds. From a heuristic-design standpoint, k-means is a particularly atractive example of a technique that's often called &lt;a href="http://geomblog.blogspot.com/2008/01/important-heuristics-alternating.html"&gt;alternating optimization&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-weight: bold;"&gt;Notes&lt;/span&gt;:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;I was asked if there were standard data sets for benchmarking clustering algorithms. I don't often see papers doing this in a systematic manner, but a good source of data sets for experimentation is the UCI Machine Learning Repository: there aren't too many clustering data sets, but they can be &lt;a href="http://archive.ics.uci.edu/ml/datasets.html"&gt;found here&lt;/a&gt; (look for clustering under the default task).&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;Next up: hierarchical methods...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-5133500106942523697?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=Eihpsiv3A5g:tjYXJmTxC6s:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=Eihpsiv3A5g:tjYXJmTxC6s:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/Eihpsiv3A5g" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/5133500106942523697/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=5133500106942523697" title="6 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/5133500106942523697?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/5133500106942523697?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/Eihpsiv3A5g/clustering-k-means.html" title="Clustering: k-means..." /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">6</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/07/clustering-k-means.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Ck8MQHY-cSp7ImA9WxJVE00.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-2599013145451522804</id><published>2009-06-29T12:40:00.000-06:00</published><updated>2009-06-29T12:41:21.859-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-29T12:41:21.859-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="soda" /><category scheme="http://www.blogger.com/atom/ns#" term="deadline" /><title>SODA 2010 submission server</title><content type="html">Remember: &lt;a href="http://soda10.cs.princeton.edu/"&gt;abstracts are due today at 5pm ET&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-2599013145451522804?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=oKtnv9nadAk:TWg9fYmGz3U:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=oKtnv9nadAk:TWg9fYmGz3U:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/oKtnv9nadAk" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/2599013145451522804/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=2599013145451522804" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2599013145451522804?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2599013145451522804?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/oKtnv9nadAk/soda-2010-submission-server.html" title="SODA 2010 submission server" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">4</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/soda-2010-submission-server.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0cNRHc7eSp7ImA9WxJVEkw.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-8211560338624490290</id><published>2009-06-28T02:03:00.001-06:00</published><updated>2009-06-28T11:44:55.901-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-28T11:44:55.901-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="data-mining" /><category scheme="http://www.blogger.com/atom/ns#" term="clustering" /><title>Clustering: The "I don't like you" view</title><content type="html">&lt;span style="font-style: italic;"&gt;(this is part of an &lt;/span&gt;&lt;a style="font-style: italic;" href="http://geomblog.blogspot.com/2009/06/clustering-occasional-series.html"&gt;occasional series of essays on clustering&lt;/a&gt;&lt;span style="font-style: italic;"&gt;: for all posts in this topic, &lt;/span&gt;&lt;a style="font-style: italic;" href="http://geomblog.blogspot.com/search/label/clustering"&gt;click here&lt;/a&gt;&lt;span style="font-style: italic;"&gt;)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Any clustering problem starts with a data set. And you can basically tell what algorithm you're going to need by asking, 'what structure do I have on the data ?'. The bare minimum you need is some way of taking two items and returning a number that tells you how similar they are, or more usefully, how far apart they are.&lt;br /&gt;&lt;br /&gt;Of course, if I know that A and B are a certain distance apart, and B and C are a certain distance apart, I might like to infer something about the distance between A and C. This is what the triangle inequality gives you, and so the most elementary form of clustering takes items and a metric defined on these items (&lt;span style="font-style: italic;"&gt;humans actually DON'T think about distances in this manner, as many psychological studies have shown: more on that later on&lt;/span&gt;).&lt;br /&gt;&lt;br /&gt;I find it convenient to view clustering algorithms from a pop-psychology self-help perspective, and so I'll deem algorithms that operate with a distance metric the 'I don't like you' methods. The general goal with such methods is to group the items into "clusters" where there isn't too much hatred :).&lt;br /&gt;&lt;br /&gt;There are many ways to quantify this "lack of hatred". The one most natural to theoreticians is the 'minimize the worst-case angst' viewpoint: in other words, find k clusters such that over all clusters, the maximum pairwise distance between points in a cluster is minimized. This is the well-known &lt;span style="font-weight: bold;"&gt;k-center&lt;/span&gt; problem.&lt;br /&gt;&lt;br /&gt;If you don't like the lack of robustness (one point can mess up the cost), you could instead sum up all the pairwise distances to assign the cost for a cluster. This gives a variant of what's normally considered the &lt;span style="font-weight: bold;"&gt;k-median&lt;/span&gt; problem.&lt;br /&gt;&lt;br /&gt;A third way of measuring cluster cost is to sum the squares of distances, rather than the distances themselves. This gives you a cost function that looks a lot like what &lt;span style="font-weight: bold;"&gt;k-means&lt;/span&gt; attempts to solve.&lt;br /&gt;&lt;br /&gt;It's often useful to define a representative of a cluster: in the k-center view of the world, the center of a cluster is a point whose maximum distance to all points in the cluster is minimum. Note that by triangle inequality, this is within a factor of two of the k-center cluster cost, so it's often convenient to use this as the definition of the cluster cost (the cluster radius, instead of diameter, if you will). In the sum-of-all-costs view, the center correspondingly can be defined as as the point whose sum of all distances to all other points is minimized. Similarly for the case of sum-of-square-costs.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Remarks on complexity&lt;/span&gt;:&lt;br /&gt;There is &lt;a href="http://dblp.uni-trier.de/rec/bibtex/journals/tcs/Gonzalez85"&gt;an elegant 2-approximation&lt;/a&gt; for the k-center problem by Gonzalez. The algorithm itself is the essence of simplicity: pick any point in the space, take its furthest neighbor, take the point furthest from these two, and so on. The first k points picked in this manner yield the desired cluster centers, and a simple triangle inequality argument yields the 2-approximation.&lt;br /&gt;&lt;br /&gt;The k-median problem is a &lt;a href="http://geomblog.blogspot.com/2007/09/np-hardness-of-euclidean-k-median.html"&gt;little harder to solve&lt;/a&gt;. The best known bound in a general metric space is a 3+eps approximation, and it's known that you can't get a PTAS. The approximation algorithm itself is not too hard: it's a local search heuristic that gets closer and closer to 3 as you permit larger and larger sets of centers to be swapped out.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Going to Euclidean space, or what's in a name ? &lt;/span&gt;&lt;br /&gt;General metrics come either from some kind of similarity function between pairs of elements or from the induced shortest path metric on a graph (there are other metrics one can induce from a graph, and we'll talk about them later). But what if you have more information ? There's a general principle of clustering that's worth enunciating here: more structure on the data can lead you to more targeted algorithms that will probably work better, so resist the urge to be too general.&lt;br /&gt;&lt;br /&gt;The most natural space to consider is a Euclidean space, and it is this space that explains the names 'median' and 'mean'. It's well known that on the line, the point minimizing the sum of distances to a set of points (the 1-median) is given by the actual median of the numbers. This explains why the problem of minimizing sum of distances is called the k-median problem. Similarly, the point that minimizes the sum of squared Euclidean distance is the centroid or the 'mean', giving rise to the k-means problem.&lt;br /&gt;&lt;br /&gt;There's a long line of results on clustering under various measures in Euclidean space. In brief, for most of the above formulations, you can get a PTAS in Euclidean space, but you end up playing whack-a-mole with the parameters n, d, epsilon, and k (that is to say, something ends up exhibiting exponential dependence, and which parameter it is depends on what you care most about).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;The Voronoi trick&lt;/span&gt;&lt;br /&gt;But there's a general principle that governs much of the algorithm design for these problems. It's the so-called &lt;a href="http://geomblog.blogspot.com/2007/09/voronoi-trick.html"&gt;Voronoi trick&lt;/a&gt;, and can be summarized by pointing out that you can always improve the clustering cost by making sure that each point is assigned to its nearest neighbor. Effectively this means that the clusters define a Voronoi partition, with the center as a site, and all points within a Voronoi cell being assigned to the corresponding site. First of all, this means that the number of possible outcomes reduces from k^n (the number of ways of partitioning n things into k groups) to n^{kd} (since the number of Voronoi partitions is much smaller than the number of partitions). Secondly, it suggests a simple heuristic for improvement: Take a given clustering: reassign points so that each point is assigned to its nearest cluster center, and then recompute the center for the cluster associated with all points in the (erstwhile) Voronoi cell. Rinse and repeat....&lt;br /&gt;&lt;br /&gt;This heuristic is the basis of the k-means algorithm, and is also the basis for a heuristic for the k-median problem clumsily named the k-medoids algorithm.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Notes&lt;/span&gt;:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;all the above algorithms have the parameter k for the number of clusters. It's obvious why: all the above cost measures can be minimized by placing each item in a cluster by itself, but the resulting answer is meaningless, and "no man is an island"&lt;/li&gt;&lt;li&gt;In a general metric space defined over a finite set of points, the cluster centers will by default by elements of the input. This is often a desirable property, if you want representatives to be from the input. But in a smooth space like a Euclidean space, there's no reason to limit yourselves to data points, and this creates a dichotomy between the so-called discrete and continuous variants of a clustering problem. Often, you can use triangle inequality to argue that the answers don't differ significantly in cost, but it's important to ask yourself which variant you care about. The discrete problem can often be "easier" to solve, because you can enumerate over the choices, but this "easier" can be expensive: for example, minimizing the sum of squares is easy in a (continuous) Euclidean space, but is more expensive in a (discrete) metric space.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;For some unknown reason, it's common to describe clustering problems by the algorithm used to "solve" them, which causes no end of confusion. I  lost count of the number of times I had to point out that k-means is an algorithm that attempts to optimize a specific cost function. This is more than just pedantry, because unless you realize the difference between a problem and a specific solution, you'll never be able to conceive of an alternate approach, and you'll never even try to abstract out what's special about your problem. To theoreticians, this might be obvious, but it isn't really obvious IRL.&lt;/li&gt;&lt;/ul&gt;Next up: everything you ever wanted to know about k-means.&lt;br /&gt;&lt;br /&gt;p.s this is written in a much more breezy style than I'd use for any formal document, but the organization and thinking reflects how I'd like to structure the material. Comments, as always, are welcome.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-8211560338624490290?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=e2fJWhSDV5Y:DsswY6oZZII:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=e2fJWhSDV5Y:DsswY6oZZII:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/e2fJWhSDV5Y" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/8211560338624490290/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=8211560338624490290" title="8 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/8211560338624490290?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/8211560338624490290?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/e2fJWhSDV5Y/clustering-i-dont-like-you-view.html" title="Clustering: The &quot;I don't like you&quot; view" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">8</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/clustering-i-dont-like-you-view.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEUMRXc6fyp7ImA9WxJVEEo.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-8635788128119820146</id><published>2009-06-26T13:52:00.003-06:00</published><updated>2009-06-26T21:11:24.917-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-26T21:11:24.917-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="media" /><category scheme="http://www.blogger.com/atom/ns#" term="blogosphere" /><category scheme="http://www.blogger.com/atom/ns#" term="outreach" /><title>"Bloggers", the media and science reporting.</title><content type="html">Sometimes I read an article that I just cannot understand. The words make sense, and I can understand the logical flow, but the entire premise just escapes me. Consider the following exhibit: &lt;a href="http://www.nature.com/news/2009/090624/full/4591050a.html?s=news_rss"&gt;an article in Nature News on the "problem" of blogging from a conference&lt;/a&gt;. A snippet from the article (which I recommend reading in its entirety):&lt;br /&gt;&lt;blockquote&gt;By disseminating scientific results far beyond the lecture hall, blogging and social networking blurs the line between journalists and researchers. Scientists in competitive fields may be more reluctant to discuss new findings if they can be posted on the Internet within seconds.&lt;/blockquote&gt;and then later:&lt;br /&gt;&lt;blockquote&gt;MacArthur's comprehensive postings were read by many scientists but they irked journalists attending the meeting. The meeting rules stated that reporters had to seek permission from speakers before publishing material on their work, rules that Cold Spring Harbor instituted in part because some journals, such as &lt;span class="i"&gt;Nature&lt;/span&gt;, discourage scientists from talking to the press before their work is published. But those rules didn't apply to scientist-bloggers like MacArthur and, after he posted details from a few talks, reporters contacted Stewart for clarification on the policies. The complaint was a wake-up call: "For the first time, I became aware that people were blogging about the data at the meeting," Stewart says.&lt;/blockquote&gt;If I understand this correctly, the premise of the article is that blogging from a conference is bad because it&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"scoops" the poor journalists under embargo ?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;disseminates information faster than someone might like ? &lt;/li&gt;&lt;/ul&gt;to which my only response is "HUH " ? Isn't the whole point of a PUBLIC presentation to disseminate results to - you kn0w- THE PUBLIC ? and how on earth does it matter how fast this happens ? And although I feel for the journalists who agree to weird embargo rules by control-freak journals, why on earth should I, as a researcher who happens to blog from conferences, care about this ?&lt;br /&gt;&lt;br /&gt;Sometimes I think the media needs to get over itself....&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-8635788128119820146?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=f_BteQ8fQ3c:A7DZNOfO6go:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=f_BteQ8fQ3c:A7DZNOfO6go:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/f_BteQ8fQ3c" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/8635788128119820146/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=8635788128119820146" title="6 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/8635788128119820146?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/8635788128119820146?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/f_BteQ8fQ3c/bloggers-media-and-science-reporting.html" title="&quot;Bloggers&quot;, the media and science reporting." /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">6</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/bloggers-media-and-science-reporting.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CU8HQ3k5eip7ImA9WxJWGUw.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-486205740724537689</id><published>2009-06-25T00:54:00.004-06:00</published><updated>2009-06-25T01:10:32.722-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-25T01:10:32.722-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="data-mining" /><category scheme="http://www.blogger.com/atom/ns#" term="clustering" /><title>Clustering: An Occasional Series</title><content type="html">Clustering is one of those areas that epitomizes the 'ooze' principle in data mining: it's 3 miles wide and an inch deep. Every year, copious ink get spilt on some clustering method or another, and yet it's not really clear what key ideas are driving the field forward.&lt;br /&gt;&lt;br /&gt;I ran a seminar this spring on clustering. I wanted to see if there was a way of organizing material on clustering in a way that was broad without necessarily being exhaustive, but that hit the "eigenvectors" of the field, so to speak. I was surprised that for a topic as important and broad-based as clustering, there was little out there in the way of textbook-level source material to work from. Most texts that I did find were too specialized for my taste, and didn't have the kind of broad perspective of clustering that I felt was critical to really understanding the area.&lt;br /&gt;&lt;br /&gt;So I organized my material, and was surprised to find that there actually was a way of collecting the base material on the topic in a way that could be covered in a semester format without leaving crucial pieces out. What I did not try to do:&lt;br /&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Cover the details of the best/greatest/fastest algorithm for each problem. In other words, I made a conscious decision to shy away from theoretical overkill. &lt;/li&gt;&lt;li&gt;Try to cover all the variants of different clustering methods. I felt that as long as one knew what k-means was, and what spectral clustering was, there was no real need to study papers that tried to put them together in strange ways.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;What I did try to do was:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Where possible, try to evaluate the practical significance of the methods (so a linear time clustering algorithm that actually had a running time doubly exponential in epsilon would not necessarily make the cut, unless it had some other redeeming features)&lt;/li&gt;&lt;li&gt;Try to view the methods from the perspective of the user: given some data, with some structure, what are your options in terms of clustering algorithms ? &lt;/li&gt;&lt;/ul&gt;&lt;div&gt;The results of this organization &lt;a href="http://apollonius.cs.utah.edu/mediawiki/index.php/Algorithms_Seminar/Spring09"&gt;can be found here&lt;/a&gt;. I was relatively happy with the outcome, and I think the students were too. In fact, many of the discussions in class were interesting enough that I put them together in a collection of notes. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;What I want to do in this occasional series is sketch out the "top-level" view of clustering as I see it, and as inspired by our seminar. My vague idea was that these notes might eventually be converted into some kind of monograph, or at the very least a web document that others might find useful. So I'm going to "test-drive" the ideas here, in the hope that my very well-informed readers will add to the discussion, and improve it (as usually happens). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;First up: the basic trinity - k-(center/median/means). &lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-486205740724537689?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=1upgpMHlIyA:Acjaw32g-nQ:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=1upgpMHlIyA:Acjaw32g-nQ:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/1upgpMHlIyA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/486205740724537689/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=486205740724537689" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/486205740724537689?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/486205740724537689?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/1upgpMHlIyA/clustering-occasional-series.html" title="Clustering: An Occasional Series" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">2</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/clustering-occasional-series.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0QAR3s_cCp7ImA9WxJWFkk.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-2193242209325516551</id><published>2009-06-21T22:34:00.001-06:00</published><updated>2009-06-21T22:35:46.548-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-21T22:35:46.548-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="community" /><title>"I, for one, welcome our new TCS overlords"</title><content type="html">Via Richard Ladner:&lt;br /&gt;&lt;blockquote&gt;I am pleased to announce the new SIGACT officers for the term beginning July 1, 2009 and ending in three years, June 30, 2012.&lt;br /&gt;&lt;br /&gt;SIGACT Chair&lt;br /&gt;Lance Fortnow, Northwestern University&lt;br /&gt;&lt;br /&gt;Members-at-large&lt;br /&gt;Cynthia Dwork , Microsoft Research&lt;br /&gt;Anna Lysyanskaya, Brown University&lt;br /&gt;Michael Mitzenmacher, Harvard University&lt;br /&gt;Mike Saks, Rutgers University&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-2193242209325516551?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=rdA_iKmTFRI:cu2amNoKrXg:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=rdA_iKmTFRI:cu2amNoKrXg:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/rdA_iKmTFRI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/2193242209325516551/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=2193242209325516551" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2193242209325516551?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2193242209325516551?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/rdA_iKmTFRI/i-for-one-welcome-our-new-tcs-overlords.html" title="&quot;I, for one, welcome our new TCS overlords&quot;" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/i-for-one-welcome-our-new-tcs-overlords.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUYGSHc8fSp7ImA9WxJWFUs.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-5745297563836806980</id><published>2009-06-21T00:39:00.003-06:00</published><updated>2009-06-21T00:52:09.975-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-21T00:52:09.975-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="miscellaneous" /><title>Quick hits</title><content type="html">&lt;ul&gt;&lt;li&gt;If you limit yourself to actually reading one Terry Tao post each week (I'm told this frequency prevents brain explosion), then &lt;a href="http://terrytao.wordpress.com/2009/06/12/the-szemeredi-trotter-theorem-and-the-cell-decomposition/"&gt;read this one&lt;/a&gt;.&lt;/li&gt;&lt;li&gt;Joachim Gudmundsson wants to remind you that jausage is much taster than sausage. Oh yes, and &lt;a href="http://jocg.org"&gt;JoCG&lt;/a&gt; is open for submissions.&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.siam.org/meetings/da10/"&gt;Get those SODA papers in&lt;/a&gt; (but not too many, I have work to do !)&lt;/li&gt;&lt;li&gt;I kid, I kid....&lt;/li&gt;&lt;li&gt;I wanted to spend more time talking about &lt;a href="http://www.langorigami.com/"&gt;Robert Lang's&lt;/a&gt; talk on origami. It had deliciously beautiful pictures, and some even more juicy mathematics, and he pitched it at JUST the right level. No slides available, alas.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cs.mcgill.ca/%7Edenis/notes09.pdf"&gt;Lecture notes on streaming&lt;/a&gt; from a workshop in Barbados...&lt;/li&gt;&lt;li&gt;I had to explain to someone that Denmark is not a city in Sweden. And this after watching the Denmark-Sweden world cup qualifying game in a bar with a hundred roaring Danes, and feeling like I had been transported to a Viking battlefield. &lt;/li&gt;&lt;/ul&gt;And because I can't help myself, "SK&amp;#197;L"&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-5745297563836806980?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=UkegRtfyJnU:ERZ5kHeCyIk:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=UkegRtfyJnU:ERZ5kHeCyIk:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/UkegRtfyJnU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/5745297563836806980/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=5745297563836806980" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/5745297563836806980?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/5745297563836806980?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/UkegRtfyJnU/quick-hits.html" title="Quick hits" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/quick-hits.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkQHRHYzeSp7ImA9WxJWFEo.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-7846898756789180823</id><published>2009-06-19T20:13:00.001-06:00</published><updated>2009-06-19T23:05:35.881-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-19T23:05:35.881-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="socg" /><title>SoCG 2009: Local search, geometric approximations and PTASs</title><content type="html">&lt;span style="font-style: italic;"&gt;I planned on this post coming a lot sooner after the last three, but got caught up in my own version of a Danish flu. sigh....&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;So let's get greedy. In the last post, I talked about new developments in the epsilon-net route to approximating covering/hitting problems. The problem with these approaches though (as &lt;a href="http://dblp.uni-trier.de/db/indices/a-tree/r/Ray:Saurabh.html"&gt;Saurabh Ray&lt;/a&gt; pointed out) is that they (at best) get you to "some" constant. They don't get you down to a specific constant (unless you can really optimize the net-construction) and they certainly don't give you a PTAS.&lt;br /&gt;&lt;br /&gt;So what's a poor approximator to do if their livelihood depends on getting a 1+epsilon-approximation ? Two papers at SoCG 2009 answer this question independently using essentially the same idea, and there's already a third paper that applies this idea as well. You know what they say: "three applications doth a general technique make" (or at least they should say it !), and the technique is so neat that I thought I'd spend some time outlining the basic principle.&lt;br /&gt;&lt;br /&gt;But first, a detour. What follows is my own take on the underlying argument, and I'm the only one responsible for all the inaccuracies in interpretation and exposition. Consider the problem of computing a maximum cardinality matching in a bipartite graph. As we all know, this can solved using a max-flow algorithm. A well-known "spin" on this result is the following:&lt;br /&gt;&lt;blockquote&gt;Suppose we construct a max flow F_k using augmenting paths of length at most 2k+1. Then OPT &amp;lt;= (1 + 1/(k+1)) F_k&lt;/blockquote&gt;Or in other words, F_k is a 1 + 1/(k+1) approximation to the true answer. The proof is fairly easy: consider any augmenting path of length at least 2k+3 that could increase the total matching size. Because we've tried all augmenting paths of length 2k+1, it must be that this path alternates between OPT (red) and algorithm (blue) edges, starting and ending in a red edge. In other words, if X is the number of blue edges along this path, then  2X+1= 2k+3, and X = k+1. This means that the number of optimal edges is at most (1 + 1/(k+1)) times the number of algorithm edges. Note that all augmenting paths must be disjoint, so some algebra yields the above result.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another way of viewing the above fact about alternation is that if we think of the edges as vertices of a bipartite graph, (colored as the edges were), then each red set is adjacent to a reasonable number of blue vertices (in fact if this were not the case, the analysis would break). The edge-disjointness then allows us to scale things up to the entire graph&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Notice that the process of generating longer and longer augmenting paths can be viewed as a local search operation. Starting from a given path, we modify it by constructing an alternating path using the current edges.In other words, we're able to make claims about local search yielding optimal solutions because we can control the way OPT interacts with the results of the algorithm, with the interaction parametrized by the "degree of locality". &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Now let's return to the papers, keeping this in mind. &lt;/div&gt;&lt;br /&gt;&lt;a href="http://www.madalgo.au.dk/socg2009/Images/Accepted%20papers%20w%20abstracts/21_Mustafa.pdf"&gt;Mustafa and Ray&lt;/a&gt; apply this idea to get a PTAS for hitting set problems. Specifically, they show that for &lt;span style="font-style: italic;"&gt;r-admissible&lt;/span&gt; regions in the plane and for halfspaces in R^3, the hitting set problem admits a PTAS that runs in time O(mn^(1/eps^2)) (m: number of regions, n: number of points). A collection of regions in the plane is r-admissible if any pair intersect in at most r points, and the differences are connected (these are often called non-piercing collections for that reason).&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.madalgo.au.dk/socg2009/Images/Accepted%20papers%20w%20abstracts/32_Chan.pdf"&gt;Chan and Har-Peled&lt;/a&gt; use the idea to get a PTAS for max-independent-set problems for pseudo-disks in the plane (which are 2-admissible). Their running time is similar.&lt;br /&gt;&lt;br /&gt;Both papers run the following algorithm, parametrized by a number b. Take any feasible solution, and call it L. Now check all sets that differ from L in at most b places and see if any of them can be used to improve the quality of L by at least one unit. If so, make the swap and repeat. This process will terminate in at most n steps, and at each stage, you need to do a search over at most n^b sets. We'll call such a solution "b-local"&lt;br /&gt;&lt;br /&gt;The idea that roughly governs both analyses is this: Consider the solution you're currently at (let's call it red) and the optimal solution (color it blue). Think of building a bipartite graph (this works slightly differently for the different problems) which contains the red and blue objects as vertices, and connects two (differently colored) objects by an edge if there's a range that contains both (for the hitting set problem) or if they intersect (for the MIS) (call this a 'locality property' as Mustafa and Ray do) Then they show some things:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;This graph is planar&lt;/li&gt;&lt;li&gt;It "expands": neighborhoods of blue sets of size at most b are large&lt;br /&gt;&lt;/li&gt;&lt;li&gt;As a consequence of 1,2, the size of the blue set is upper bounded by (1+1/sqrt(b))(size of red set)&lt;/li&gt;&lt;/ol&gt;See the resemblance to the max flow argument ? (or maybe not ;)). A reasonably quick corollary then shows that by setting b = 1/eps^2, the desired PTAS follows (since by 3), the optimal solution is reasonably close to the current solution. 2) holds by the locality argument, i.e if it weren't true, then there'd be some way to locally improve the red set using a "lookahead" of size at most b. 1) is case-by-case, but works for these problems. The proof of 3) itself requires separator machinery for planar graphs.&lt;br /&gt;&lt;br /&gt;An upcoming paper by Gibson, Kanade, Krohn and Varadarajan applies the same idea for the 1.5D terrain guarding problem (one of the few nontrivial art gallery problems that has yielded to approximation attacks). They produce a planar graph that satisfies the locality property, and then apply the rest of the local search machinery as before. What's neat is that the machinery really does apply just as before: "all they have to do" (and I'm not trying to minimizing the effort here, just pointing out the smoothness of the technique) is construct the planar graph.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Some thoughts: &lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;There's some room for improvement in the running time perhaps, at least down to a linear dependence on 1/epsilon in the exponent. We can't get a poly dependence because of strong-NP-completeness&lt;/li&gt;&lt;li&gt;I don't think the weighted versions of these problems would fall to the same "trick": indeed the Chan-Har-Peled paper considers a neat LP-based approach for the weighted MIS.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;There have been numerous PTASs for geometric problems: the Arora/Mitchell TSP approximation is the most famous of course. But this technique is a relatively new kind of generic tool, and has demonstrated immediate applicability for different problems, which makes it noteworthy. Secondly, it's a way of reasoning about the efficacy of local search, which is always neat, since local search algorithms are generally not that complicated.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-7846898756789180823?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=IcKbt4Wy9VY:hBrrdVoT7I0:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=IcKbt4Wy9VY:hBrrdVoT7I0:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/IcKbt4Wy9VY" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/7846898756789180823/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=7846898756789180823" title="3 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/7846898756789180823?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/7846898756789180823?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/IcKbt4Wy9VY/socg-2009-local-search-geometric.html" title="SoCG 2009: Local search, geometric approximations and PTASs" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">3</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/socg-2009-local-search-geometric.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0ANQX07eSp7ImA9WxJWEUs.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-583824111197133384</id><published>2009-06-13T20:25:00.005-06:00</published><updated>2009-06-16T08:16:30.301-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-16T08:16:30.301-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="socg" /><title>SoCG 2009: Epsilon-nets and covering/hitting problems.</title><content type="html">The 30,000 horsepower engine of approximation algorithms is linear programming. The fact that an LP gives you a bound on the cost of an NP-hard problem has been the major force behind ever more sophisticated methods for proving approximation guarantees.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Ironically enough, given the geometric nature of an LP, this engine has not been particularly useful for geometric approximation problems. The high level reason (somewhat over-simplified) is that it is generally hard to encode specifics about geometry in a linear program. What this means is that (for example) if you want to solve a geometric covering problem and can't encode the geometry in your LP, you're stuck with the log n approximation that combinatorial set cover gives you, inspite of the fact that there are numerous examples of problems where throwing geometry in actually improves the approximation bound considerably.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;To put it mildly, this is a pain in the neck. It basically means that you have to design methods that work for a specific problem, and there are only a few general purpose techniques available, many of them outlined in &lt;a href="http://biogeometry.duke.edu/pubs-pankaj/surveys/randomized-opt.ps.gz"&gt;this survey&lt;/a&gt;. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;But if your problem can be framed as a covering/hitting problem, then there's a general body of work that has provided better and better results, (and for an added bonus, actually has an LP-formulation). The story dates back to a paper on iterative reweighting by Clarkson, and actually goes back even earlier if you jump to the ML literature: &lt;a href="http://www.cs.princeton.edu/~arora/pubs/MWsurvey.pdf"&gt;Arora, Hazan and Kale have a nice survey&lt;/a&gt; on this general approach. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;But I digress: the paper by Bronnimann and Goodrich put the technique in the language we're familiar with today: in short, &lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;div&gt;Given a range space that admits epsilon-nets of size (1/epsilon)f(1/epsilon), you can solve the hitting set problem to an approximation of f(OPT). &lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;div&gt;In particular, range spaces that admit epsilon-nets of size 1/epsilon admit constant-factor approximations for the hitting set problem. This led to a "formula" for approximating these problems: find an epsilon-net of an appropriate size, and declare victory. There's been much work on constructing epsilon-nets since then, partly motivated by this. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Another angle to this story was put forward by Clarkson and Varadarajan in their &lt;a href="http://www.almaden.ibm.com/u/kclarkson/set_cover/p.pdf"&gt;2005 SoCG paper&lt;/a&gt;: the brief summary from that paper was: &lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;div&gt;Given a range space in which you can bound the complexity of the union of a random subcollection  of size r by rf(r), you can find an epsilon net of size (1/epsilon)f(1/epsilon). &lt;/div&gt;&lt;/blockquote&gt;&lt;div&gt;This approach gave another tool for finding good approximations for covering problems (and in particular gave a nice constant factor approximation for a thorny guarding problem). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;At SoCG 2009, &lt;a href="http://portal.acm.org/citation.cfm?id=1542362.1542366&amp;amp;coll=GUIDE&amp;amp;dl=GUIDE"&gt;Kasturi Varadarajan presented an improved bound&lt;/a&gt; along these lines: his main result improves the transfer complexity: modulo some technical details,&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;div&gt;Given a range space in which you can bound the complexity of the union of a random subcollection  of size r by rf(r), you can find an epsilon net of size (1/epsilon)&lt;b&gt;log&lt;/b&gt; f(1/epsilon). &lt;/div&gt;&lt;/blockquote&gt;&lt;div&gt;Although this doesn't improve results in the constant factor regime, where f() is a constant, it makes a significant difference when f() is non-constant. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The interesting story here is that Kasturi's result was simultaneous with another paper by &lt;a href="http://www.cs.tau.ac.il/~estere/Research/Eps_Nets/epsnets.pdf"&gt;Aronov, Ezra, and Sharir&lt;/a&gt; that just appeared in STOC 2009. They get the same bound without the technical details that limit his result, however their methods are quite different.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Returning to LPs, what I failed to mention is that the epsilon-net construction technique used by Bronnimann and Goodrich can be seen as a byproduct of an LP-algorithm, as was shown by &lt;a href="http://www.eng.tau.ac.il/~guy/Papers/VC.pdf"&gt;Even, Rawitz and Shahar&lt;/a&gt;, and &lt;a href="http://www.phillong.info/publications/intprog.html"&gt;Philip Long before that&lt;/a&gt; (thanks, Sariel)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-583824111197133384?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=Y4bGmP3rHrw:FbmaCDbiJEg:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=Y4bGmP3rHrw:FbmaCDbiJEg:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/Y4bGmP3rHrw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/583824111197133384/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=583824111197133384" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/583824111197133384?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/583824111197133384?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/Y4bGmP3rHrw/socg-2009-epsilon-nets-and.html" title="SoCG 2009: Epsilon-nets and covering/hitting problems." /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">5</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/socg-2009-epsilon-nets-and.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUEGSXc7eyp7ImA9WxJWEEU.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-9098884174452567136</id><published>2009-06-13T19:33:00.004-06:00</published><updated>2009-06-15T11:40:28.903-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-15T11:40:28.903-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="socg" /><title>SoCG 2009: A neat lower bound example</title><content type="html">I've had a number of posts queued up to write (for those who ask me "how do you get the time?", the answer is sometimes, "I don't !"). This SoCG was full of really interesting papers, and even collections of papers that together moved our understanding forward in very nontrivial ways. Definitely read &lt;a href="http://11011110.livejournal.com/174354.html"&gt;David E's post&lt;/a&gt; from earlier today.&lt;br /&gt;&lt;br /&gt;Gabriel Nivasch has been doing some great work on lower bound constructions. He recently &lt;a href="http://arxiv.org/abs/0807.0484"&gt;had a paper&lt;/a&gt; on analysing upper and lower bounds for Davenport-Schinzel sequences (got a best student paper award at SODA 2009), and the kind of tight bounds he gets are very impressive (upto terms involving powers of alpha(n) in the exponent). Here, he presented&lt;a href="http://arxiv.org/abs/0812.5039"&gt; joint work with Boris Bukh and Jiri Matousek&lt;strike&gt;Micha Sharir&lt;/strike&gt;&lt;/a&gt; on a new lower bound for weak epsilon-nets.&lt;br /&gt;&lt;br /&gt;A central construction here, that might be useful for other lower bound constructions is a very fast growing exponential grid. Roughly speaking, if we think of how it would map to a "regular grid", then the point (i1, i2, ....) on the [0..m] grid maps to the point whose jth coordinate is x_j = 2^{i_j * [m^{j-1} + m^{j-2} + ... + 1]d}. So things start getting really jumpy as the coordinates go up. A couple of observations do the heavy lifting for this grid:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;A straight line segment between two points in the stretched grid maps to a collection of d line segments in the "regular" grid: d-1 that goes almost straight up from the first (say lower) point, and another that goes across the final dimension (it's easier to think of this recursively)&lt;/li&gt;&lt;li&gt;Convex sets in the stretched grid correspond to so-called "stair-convex sets" in the regular grid&lt;/li&gt;&lt;li&gt;A diagonal in the stretched grid maps to a curve that doesn't touch many points in the regular grid. &lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Basically, the grid construction allows them to reason about lower bounds by going back to the regular grid. They use these ideas to prove the lower bound for weak epsilon-nets, but also for some incidence problems (using the third property). It seems to me that this construction might prove useful for other lower bounds in the future as well. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Coming up next: max flows, local search and a plethora of PTASs (try saying that fast)...&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-9098884174452567136?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=XSTEihxrTZ4:wDDXPYaX4UI:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=XSTEihxrTZ4:wDDXPYaX4UI:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/XSTEihxrTZ4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/9098884174452567136/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=9098884174452567136" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/9098884174452567136?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/9098884174452567136?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/XSTEihxrTZ4/socg-2009-neat-lower-bound-example.html" title="SoCG 2009: A neat lower bound example" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/socg-2009-neat-lower-bound-example.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DEcDRH87cSp7ImA9WxJXFkk.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-4263070419264601822</id><published>2009-06-10T08:53:00.004-06:00</published><updated>2009-06-10T09:01:15.109-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-10T09:01:15.109-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="randomness" /><title>Probability of a graph being connected...</title><content type="html">I ran across the following question in a problem I'm working on, and thought I'd ping the collective wisdom of the blog-reader-o-sphere.&lt;br /&gt;&lt;blockquote&gt;Given an undirected graph G with (possibly different) probabilities on edges, consider the usual random process where we pick each edge independently, but with probability p_e (this is the difference from the standard erdos-renyi model). &lt;br /&gt;&lt;br /&gt;What is the probability that the resulting graph is connected ? &lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;This seems possibly to have something to do with the spanning tree polytope, but it seems quite nontrivial. In general, I'm given a subset of vertices S in this graph, and want to ask for the probability that S forms a connected component. Making sure that S is disconnected from the rest of the graph is easy, but then the remainder of the probability comes from the answer to the above question. &lt;br /&gt;&lt;br /&gt;I should add that the subset connectivity question is trivial if G is a tree: it's the higher connectivity case that makes things harder.&lt;br /&gt;&lt;br /&gt;p.s More SoCG posts should be on their way in the next few days after I return.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-4263070419264601822?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=zBsWew_fgSU:AjxPuyQDejc:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=zBsWew_fgSU:AjxPuyQDejc:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/zBsWew_fgSU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/4263070419264601822/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=4263070419264601822" title="7 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4263070419264601822?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4263070419264601822?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/zBsWew_fgSU/probability-of-graph-being-connected.html" title="Probability of a graph being connected..." /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">7</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/probability-of-graph-being-connected.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Dk4DR3o_fip7ImA9WxJXFE0.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-8977237414219174543</id><published>2009-06-07T13:35:00.003-06:00</published><updated>2009-06-07T14:02:56.446-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-07T14:02:56.446-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="socg" /><title>SoCG 2009: Day 0</title><content type="html">At least in my mind, things have been somewhat overshadowed by the sad news of yesterday, thus preventing me (thankfully) from writing the more frivolous posts that I might have penned.&lt;br /&gt;&lt;br /&gt;Today was the 25th anniversary celebration for SoCG (1985-2009), and we had four speakers to do a retrospective on the field.&lt;br /&gt;&lt;br /&gt;David Dobkin was first, talking about how some of the first geometric algorithms came to be. He interspersed many fun facts about the early days of SoCG:&lt;br /&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;One of the reasons CG is tied to the STOC/FOCS community is because when Michael Shamos first asked him where to publish a geometry paper, STOC was the first conference that came to mind.&lt;/li&gt;&lt;li&gt;The Preparata-Shamos book only came about because (after Shamos left for law school) Ron Graham brokered a deal for Franco Preparata to write a book based on Shamos' thesis. &lt;/li&gt;&lt;li&gt;SoCG was almost called the Symposium on Computer Geometry (ugh). &lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;br /&gt;It was interesting to see that even way back then, there was serious intersection between numerical algorithms, the graphics and vision folks, and the nascent geometry area. Some things haven't really changed. It also explains (for me) why there is somewhere deep down a sense that CG doesn't wholly situate within the larger algorithms community, but has a core identity that's slightly different. &lt;br /&gt;&lt;br /&gt;One point that David made was very interesting. He talked about how Michael Shamos essentially created a problem book from scratch, listing basic problems in the new area of computational geometry, and methodically solving them one by one. He then asserted that it was the creation of this problem book that played a major role in the unification of the area. I think that makes a lot of sense. It's not so much that a list of problems got created. It's not too hard to do that. What I think was the critical factor was the listing of problems within-reach, that encouraged more people to start thinking about them. Muthu is very good at doing this with any field he jumps into, and I think he's able to jumpstart a lot of research for this exact reason. It's not easy to do: the problems have to be core to the area, but still not that difficult to solve. &lt;br /&gt;&lt;br /&gt;Micha Sharir went next with a talk on the evolution of work in arrangements. I've always felt that work on arrangements seems to have reached the point of diminishing returns after some point: that new results were mildly interesting, but didn't necessarily hook into algorithm design in a nontrivial way. I still have somewhat of the same opinion after the talk, but it was good to see the evolution of the problems into higher dimensional surface arrangement questions.&lt;br /&gt;&lt;br /&gt;Emo Welzl gave a nice overview of the exciting years between 1987-1992 when randomization began to bloom into a powerful tool for doing geometry. Of course, the very first randomized algorithm was a geometric one (Rabin's closest pair algorithm), but the really powerful methods came into being really with Clarkson's work, and the eps-net work of Haussler and Welzl. It was amusing that almost every single paper mentioned in Emo's talk was written or co-written by Ken: just goes to show his profound influence on the field both for new techniques, as well as new results. &lt;br /&gt;For me, another great influence was Raimund Seidel's amazingly lucid exposition of backwards analysis: there's something to be said for crystal-clear writing and the impact it can have. &lt;br /&gt;&lt;br /&gt;The final talk was by Kurt Mehlhorn, on the history that led to the development of LEDA and CGAL. I must confess that I've always felt rather intimidated by CGAL and LEDA: there's something about the phrase "traits class" that strikes fear into my heart. But Kurt's talk was immensely enjoyable: he gave several simple examples of inputs where floating point implementations of geometric algorithms fail spectacularly, and really reinforced the importance of precise implementations, something (if I may add) the rest of the algorithms community often can avoid if they're dealing with graphs or numeric data. I REALLY need to  weave CGAL into the next incarnation of my geometry class: must be less lazy, must be less lazy, .....&lt;br /&gt;&lt;br /&gt;Notes from the conference: I was fervently hoping that the organizers would lower expectation for next year, but no such luck :(. Impeccable organization, food, and free beer. Sigh......&lt;br /&gt;&lt;br /&gt;I'll be tweeting occasionally from the conference. If you're reading this at the conference and want to do the same, please the hashtag #socg so all tweets can be collected in one place.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-8977237414219174543?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=uqH90V5GeIE:bmbRNG5fd-M:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=uqH90V5GeIE:bmbRNG5fd-M:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/uqH90V5GeIE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/8977237414219174543/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=8977237414219174543" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/8977237414219174543?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/8977237414219174543?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/uqH90V5GeIE/socg-2009-day-0.html" title="SoCG 2009: Day 0" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/socg-2009-day-0.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUcNQXo6cSp7ImA9WxJXE0s.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-2414048047734029471</id><published>2009-06-06T03:34:00.005-06:00</published><updated>2009-06-07T02:24:50.419-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-07T02:24:50.419-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="rajeev motwani" /><title>Rajeev Motwani, R.I.P</title><content type="html">Rajeev Motwani passed away unexpectedly yesterday. He will be remembered for many things: his influence on the creation of Google, the foundational randomized algorithms book that he wrote with Prabhakar Raghavan, his role in the PCP theorem, and his extensive presence in the Bay Area investor community. Within the theory community, he will also be remembered by the legion of students he trained, including your humble blogger. &lt;span style="font-style:italic;"&gt;Update: Ashish Goel @ Stanford &lt;a href="http://www.stanford.edu/~ashishg/cgi-bin/RememberingRajeev/"&gt;has a page&lt;/a&gt; where you can also post comments&lt;/span&gt;. &lt;br /&gt;&lt;br /&gt;The influence an advisor has on their students can be subtle: as a grad student, I spent much of my time resisting, pushing back, trying to find my own voice. But more and more, as I look back, I see how he influenced my taste in problems and the sensibility I bring to my work. &lt;br /&gt;&lt;br /&gt;He had incredible breadth to go along with his depth. I remember scouring his web page in the early years of grad school, and seeing papers in robotics, databases, compilers, parallel systems, as well as in algorithms. Having spent time working in more applied areas, I can appreciate now how difficult it must have been to do that, and have the kinds of impact he had. That kind of incisiveness, that could cut to the heart of a modelling problem, was a hallmark of his more "applied" work. &lt;br /&gt;&lt;br /&gt;There's something special that comes with the ability to have a great career and train so many students. There are many famous researchers in the field who do one or the other, and Rajeev had a knack of catalysing the best from his students, and helping them succeed no matter what area they happened to work in. To this day, I'll catch myself thinking, "what did Rajeev do in this situation" when I'm interacting with my own students. &lt;br /&gt;&lt;br /&gt;As an advisor, the best way to describe him was 'laid-back'. He wasn't the kind to breathe down your neck, or demand regular updates. But I felt the terror all the same when going into his office to describe what I'd been upto. But he'd sit there with those half-closed eyes staring off into space as I spoke, and then the observations would come, carefully crafted, and enough to make me scurry off and think hard before our next meeting.&lt;br /&gt;&lt;br /&gt;As a teacher, he was brilliant. The very first time I took a class with him, I saw this. His style of presentation, how he organized his material, and even his impeccable boardwork: these are all things I teach my students today. His lectures weren't flashy: they were solid, and profound, and in his own mellow way, he was able to convey intuition, rigor and beauty with the kind of clarity one dreams of possessing. To flip the truism about doing and teaching around, he did, and taught even better as a result of it. &lt;br /&gt;&lt;br /&gt;I had less contact with him after graduation than I should have had. I was doing the whole 'be your own person, strike out on your own' thing. I'd bump into him occasionally at conferences, but as he got more involved in his investment efforts, he stopped attending conferences regularly (or so it seemed) and even that contact dried up. As someone just said to me in an email today morning, 'Life is fleeting'. &lt;br /&gt;&lt;br /&gt;It's a strange feeling, to lose your advisor so suddenly like this, and to hear about it on twitter, of all things. I'm not even sure what I'm feeling, except a sense of numbness, and a feeling that something has shifted, permanently, underneath. &lt;br /&gt;&lt;br /&gt;To his wife Asha and his family: my deepest condolences.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-2414048047734029471?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=m07JZN_ZVKo:LL7t_8IgWig:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=m07JZN_ZVKo:LL7t_8IgWig:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/m07JZN_ZVKo" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/2414048047734029471/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=2414048047734029471" title="6 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2414048047734029471?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2414048047734029471?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/m07JZN_ZVKo/rajeev-motwani-rip.html" title="Rajeev Motwani, R.I.P" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">6</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/rajeev-motwani-rip.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0ICQXc-eyp7ImA9WxJXEE0.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-6187468909599465405</id><published>2009-06-03T00:06:00.003-06:00</published><updated>2009-06-03T00:12:40.953-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-06-03T00:12:40.953-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="nsf" /><title>New(?) NSF policies</title><content type="html">While everyone recovers from what appears to be a gripping STOC, I thought I'd flag two NSF-related items that came my way. &lt;br /&gt;&lt;br /&gt;The first, &lt;a href="http://science-professor.blogspot.com/2009/06/postdoc-mentoring.html"&gt;via FSP&lt;/a&gt;, is on post-doc mentoring. Now I've always been told that the NSF frowns on inserting post-doc support in proposals, but apparently they don't mind it, and they have a new policy in place for proposals including postdoc support:&lt;br /&gt;&lt;blockquote&gt;Starting this year, NSF proposals that request funding for postdoctoral researchers must include a statement about how the postdoc(s) will be mentored. For a brief time this mentoring statement was supposed to be part of the body of the proposal, but, perhaps in response to complaints, the postdoc mentoring text is now a supplementary document, up to a page in length. As with the required Broader Impacts component of proposals, NSF is serious about the mentoring statement: proposals that request funding for postdocs but that do not contain the mentoring supplement will not even be reviewed.&lt;/blockquote&gt;&lt;br /&gt;I'm curious as to what experience people have had with postdoc support in proposals, and whether this is really new, or just a restatement of things that were already tacitly known. &lt;br /&gt;&lt;br /&gt;The second item affects us more broadly, and perniciously. A new policy guideline of May vintage appears to suggest (it's somewhat ambiguous) that equipment purchased for grants must be MUCH more closely tied to grant activities than previously expected. Specifically, it essentially rules out the purchase of machines for general PI use (like laptops etc), and raises a number of questions about how to purchase software for old machines, how to upgrade new machines etc. Again, I'm curious to know if people have already run afoul of this new rule, and what the deal is.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-6187468909599465405?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=-aRUEemjjGw:pVH5sMMlD4Y:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=-aRUEemjjGw:pVH5sMMlD4Y:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/-aRUEemjjGw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/6187468909599465405/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=6187468909599465405" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/6187468909599465405?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/6187468909599465405?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/-aRUEemjjGw/new-nsf-policies.html" title="New(?) NSF policies" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">0</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/06/new-nsf-policies.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0ICRX45cSp7ImA9WxJQE0o.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-4770838524053098287</id><published>2009-05-26T14:36:00.003-06:00</published><updated>2009-05-26T14:59:24.029-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-05-26T14:59:24.029-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="godel" /><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="expanders" /><title>more on the zig zag product..</title><content type="html">I did an impromptu lunch presentation with my research group on expanders to celebrate the Godel Prize, and in the process dug up some useful reading. There's nothing here that's new to folks familiar with the main results, but if you want a reading list to help with grasping the significance of the results, read on.&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Luca Trevisan's series on expanders and their relation to Cayley graphs (&lt;a href="http://lucatrevisan.wordpress.com/2006/12/06/expanders-and-groups/"&gt;one&lt;/a&gt;, &lt;a href="http://lucatrevisan.wordpress.com/2006/12/08/group-representation/"&gt;two&lt;/a&gt;, &lt;a href="http://lucatrevisan.wordpress.com/2006/12/13/eigenvalues-and-expansion/"&gt; three&lt;/a&gt;,&lt;a href="http://lucatrevisan.wordpress.com/2006/12/22/characters-and-expansion/"&gt; four&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;&lt;a href="http://terrytao.wordpress.com/2008/01/11/distinguished-lecture-series-ii-avi-wigderson-expander-graphs-constructions-and-applications/"&gt;Terry Tao's summary of expanders&lt;/a&gt; inspired by an &lt;a href="http://www.math.ucla.edu/dls/2008/zigzag.pdf"&gt;Avi Wigderson talk&lt;/a&gt;.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://geomblog.blogspot.com/2007/01/soda-2007-day-2_09.html"&gt;My note from SODA 2007&lt;/a&gt; on the Alon/Schwartz/Schapira improved analysis of the replacement product.&lt;/li&gt;&lt;li&gt;Dieter van Melkebeek's &lt;a href="http://pages.cs.wisc.edu/%7Edieter/Courses/2006s-CS880/"&gt;course on expanders&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;There are some aspects of this story that are worth drawing out. First of all, it's actually possible (I think) to get the SL = L result without using the zig zag product directly. In fact, stray comment on a post by Lance suggests as much, and with the new analysis of the replacement product this might be even more likely.&lt;br /&gt;&lt;br /&gt;This doesn't mean that the zig-zag product is useless of course. In fact, there's a wonderful 'return to start' story here, which I'll attempt to convey. Essentially, as Luca describes, many early expander constructions proceeded via taking some special non-Abelian group and constructing its &lt;a href="http://en.wikipedia.org/wiki/Cayley_graph"&gt;Cayley grap&lt;/a&gt;h, which was then shown to be an expander. The zig-zag product is described "combinatorially" as a construction that takes two graphs and makes  a third out of them, and one advantage of this representation is that it gives more explicit expander constructions. But it turned out that at the core of this hides a group operator ! In a certain sense, the zig zag product of the Cayley graph of two groups is the Cayley graph of the semidirect product of the groups !  This is a result of &lt;a href="http://www.math.ias.edu/%7Eavi/PUBLICATIONS/MYPAPERS/ALW01/proc.pdf"&gt;Alon, Lubotsky and Wigderson&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-4770838524053098287?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=zTGa4VPkP48:5Scg8GONu44:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=zTGa4VPkP48:5Scg8GONu44:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/zTGa4VPkP48" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/4770838524053098287/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=4770838524053098287" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4770838524053098287?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4770838524053098287?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/zTGa4VPkP48/more-on-zig-zag-product.html" title="more on the zig zag product.." /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">2</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/05/more-on-zig-zag-product.html</feedburner:origLink></entry><entry gd:etag="W/&quot;Dk8DQHg-fip7ImA9WxJRGUw.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-5904507136002961266</id><published>2009-05-20T20:36:00.004-06:00</published><updated>2009-05-21T08:07:51.656-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-05-21T08:07:51.656-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="godel" /><category scheme="http://www.blogger.com/atom/ns#" term="awards" /><title>Godel Prize</title><content type="html">The &lt;a href="http://sigact.acm.org/prizes/godel/"&gt;ACM site&lt;/a&gt; isn't updated yet (here's the &lt;a href="http://www.acm.org/press-room/news-releases/goedel-prize-09/view"&gt;press release&lt;/a&gt;), but &lt;a href="http://mybiasedcoin.blogspot.com/2009/05/godel-prize-to-salil-vadhan.html"&gt;Michael M&lt;/a&gt; informs us that Vadhan, Reingold and Wigderson just won the Godel prize, &lt;strike&gt;most likely&lt;/strike&gt; for their &lt;a href="http://www.wisdom.weizmann.ac.il/~reingold/publications/zigzag.ps"&gt;paper on the zig-zag graph product&lt;/a&gt;, &lt;strike&gt;which had a major role in&lt;/strike&gt; and for Reingold's proof that &lt;a href="http://www.wisdom.weizmann.ac.il/~reingold/publications/sl.ps"&gt;SL = L&lt;/a&gt;. These were inspiration for  Dinur's &lt;a href="http://www.cs.huji.ac.il/~dinuri/mypapers/combpcp.pdf"&gt;combinatorial PCP theorem&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;Congratulations !&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-5904507136002961266?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=BNKDK9ZIwEQ:SW5hralwc6o:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=BNKDK9ZIwEQ:SW5hralwc6o:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/BNKDK9ZIwEQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/5904507136002961266/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=5904507136002961266" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/5904507136002961266?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/5904507136002961266?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/BNKDK9ZIwEQ/godel-prize.html" title="Godel Prize" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">2</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/05/godel-prize.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUEHSXk4cSp7ImA9WxJREUg.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-1194349391005572092</id><published>2009-05-12T12:30:00.003-06:00</published><updated>2009-05-12T12:40:38.739-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-05-12T12:40:38.739-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="research" /><category scheme="http://www.blogger.com/atom/ns#" term="game theory" /><category scheme="http://www.blogger.com/atom/ns#" term="distributions" /><title>Joint distributions aren't that bad</title><content type="html">I was reading &lt;a href="http://agtb.wordpress.com/2009/05/10/correlated-equilibria-some-cs-perspectives/"&gt;Noam Nisan's exposition of correlated equilibria&lt;/a&gt;. It's interesting to note that finding a correlated equilibrium is easy: you have to solve a linear program in the relevant variables of the underlying distribution. This is in contrast to Nash equilibria. &lt;br /&gt;&lt;br /&gt;What struck me is that this is a good example of where a joint distribution isn't that bad. In machine learning especially, trying to "learn" a joint distribution is hard firstly because the number of variables blows up, and because of nasty correlations that one has to account for. In fact, the area of &lt;a href="http://people.csail.mit.edu/tommi/papers/Jaa-var-tutorial.ps.gz"&gt;variational methods&lt;/a&gt; often uses the trick of  replacing a joint distribution by the "closest" product distribution, and optimizing over that instead. &lt;br /&gt;&lt;br /&gt;Here though, the "replacing by a product distribution" takes you from a correlated equilibrium problem to a Nash equilibrium problem, and now the individual probabilities actually multiply, yielding a nonlinear problem that's PPAD-Complete.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-1194349391005572092?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=htql6T-_84k:g-lQkqQnKYk:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=htql6T-_84k:g-lQkqQnKYk:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/htql6T-_84k" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/1194349391005572092/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=1194349391005572092" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/1194349391005572092?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/1194349391005572092?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/htql6T-_84k/joint-distributions-arent-that-bad.html" title="Joint distributions aren't that bad" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">1</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/05/joint-distributions-arent-that-bad.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C04BR347fyp7ImA9WxJREEU.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-7664567331570618012</id><published>2009-05-11T16:44:00.002-06:00</published><updated>2009-05-11T16:45:56.007-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-05-11T16:45:56.007-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="community" /><category scheme="http://www.blogger.com/atom/ns#" term="publishing" /><title>Physicists understand the web better than us, part 28596</title><content type="html">&lt;a href="http://physics.aps.org/"&gt;A new physics review site&lt;/a&gt;. via the &lt;a href="http://feedproxy.google.com/~r/TheQuantumPontiff/~3/bhsgemuaD04/physics_article_too_entangled.php"&gt;baconmeister&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;Physicists are drowning in a flood of research papers in their own fields and coping with an even larger deluge in other areas of physics. The Physical Review journals alone published over 18,000 papers last year. How can an active researcher stay informed about the most important developments in physics?&lt;br /&gt;&lt;br /&gt;Physics highlights exceptional papers from the Physical Review journals. To accomplish this, Physics features expert commentaries written by active researchers who are asked to explain the results to physicists in other subfields. These commissioned articles are edited for clarity and readability across fields and are accompanied by explanatory illustrations.&lt;br /&gt;&lt;br /&gt;Each week, editors from each of the Physical Review journals choose papers that merit this treatment, aided by referee comments and internal discussion. We select commentary authors from around the world who are known for their expertise and communication skills and we devote much effort to editing these commentaries for broad accessibility.&lt;br /&gt;&lt;br /&gt;Physics features three kinds of articles: Viewpoints are essays of approximately 1000–1500 words that focus on a single Physical Review paper or PRL letter and put this work into broader context. Trends are concise review articles (3000–4000 words in length) that survey a particular area and look for interesting developments in that field. Synopses (200 words) are staff-written distillations of interesting and important papers each week. In addition, we intend to publish selected Letters to the Editor to allow readers a chance to comment on the commentaries and summaries.&lt;br /&gt;&lt;br /&gt;Physics provides a much-needed guide to the best in physics, and we welcome your comments (physics@aps.org). &lt;/blockquote&gt;&lt;br /&gt;What an excellent idea !&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-7664567331570618012?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=sRmajNkuVBM:eHnw9M2xryo:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=sRmajNkuVBM:eHnw9M2xryo:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/sRmajNkuVBM" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/7664567331570618012/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=7664567331570618012" title="3 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/7664567331570618012?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/7664567331570618012?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/sRmajNkuVBM/physicists-understand-web-better-than.html" title="Physicists understand the web better than us, part 28596" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">3</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/05/physicists-understand-web-better-than.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUYFQH4zfip7ImA9WxJSFkk.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-4595351096010059904</id><published>2009-05-06T14:10:00.004-06:00</published><updated>2009-05-06T15:58:31.086-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-05-06T15:58:31.086-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="miscellaneous" /><title>Kindle DX</title><content type="html">&lt;a href="http://weblog.fortnow.com/2009/05/my-kindle.html"&gt;Much&lt;/a&gt; well-deserved &lt;a href="http://scienceblogs.com/pontiff/2009/05/kindle_dx_drooool.php"&gt;drooling&lt;/a&gt; over the Kindle DX. The killer app is native PDF support: as &lt;a href="http://mat.tepper.cmu.edu/blog/?p=595"&gt;Michael Trick pointed out&lt;/a&gt;, the earlier Kindles didn't do too well with math support (the native Kindle document format is not PDF). I could easily see myself using the DX at conferences, rather than lugging around my laptop, or (worse) printing out copies of papers I wanted.&lt;br /&gt;&lt;br /&gt;The Kindle has a nice feature that you can email documents to a specific address and have them synced automatically to the device, but it comes at a price ($0.10/document). If you use a direct transfer over USB though, it's free of course.&lt;br /&gt;&lt;br /&gt;What I don't understand is why this has taken so long to happen: it seems to me that the academic market is the killer market for the Kindle: can you imagine transferring ALL your PDF papers to the Kindle for reading ? not to mention books ?&lt;br /&gt;&lt;br /&gt;p.s for those of you who will no doubt point out that other readers exist that can read PDF, and are puzzled by all the hype over the Kindle, I leave you to your Archos MP3 players and Opera browsers.&lt;br /&gt;&lt;br /&gt;p.p.s I spotted my first Kindle in the wild a month ago at a conference. It was rather cute looking. the DX will be much larger of course.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-4595351096010059904?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=f2wzmIbH4ew:lNDYqVKxX2I:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=f2wzmIbH4ew:lNDYqVKxX2I:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/f2wzmIbH4ew" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/4595351096010059904/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=4595351096010059904" title="6 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4595351096010059904?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/4595351096010059904?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/f2wzmIbH4ew/kindle-dx.html" title="Kindle DX" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">6</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/05/kindle-dx.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0cNSXs-fSp7ImA9WxJSEUg.&quot;"><id>tag:blogger.com,1999:blog-6555947.post-2130061245275038988</id><published>2009-04-30T23:14:00.003-06:00</published><updated>2009-04-30T23:18:18.555-06:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-04-30T23:18:18.555-06:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="community" /><category scheme="http://www.blogger.com/atom/ns#" term="publishing" /><title>New open access proceedings archive</title><content type="html">&lt;a href="http://processalgebra.blogspot.com/2009/04/eptcs-new-open-access-proceedings.html"&gt;Via Luca Aceto&lt;/a&gt;, news of a new effort to create an &lt;a href="http://www.eptcs.org/"&gt;open access electronic proceedings&lt;/a&gt;. The idea is that you organize your conference/workshop, and apply for inclusion of your proceedings in the EPTCS: you handle the reviewing, they take the papers and manage the long term archiving (via the arxiv).&lt;br /&gt;&lt;br /&gt;I'm imagining this will only be useful for small conferences/workshops, but it's a good way of making sure you don't have a dead-link problem a few years later when the person who hosted the website for the conference leaves their institution and forgets to hand over maintainence to someone else. Also, the arxiv imprimatur will make sure the papers will get more visibility than otherwise.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6555947-2130061245275038988?l=geomblog.blogspot.com'/&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=2-Ewi7P-Kj4:w0X7kKYTm4k:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=yIl2AUoC8zA" border="0"&gt;&lt;/img&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/~ff/TheGeomblog?a=2-Ewi7P-Kj4:w0X7kKYTm4k:63t7Ie-LG7Y"&gt;&lt;img src="http://feeds.feedburner.com/~ff/TheGeomblog?d=63t7Ie-LG7Y" border="0"&gt;&lt;/img&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/TheGeomblog/~4/2-Ewi7P-Kj4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://geomblog.blogspot.com/feeds/2130061245275038988/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="https://www.blogger.com/comment.g?blogID=6555947&amp;postID=2130061245275038988" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2130061245275038988?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6555947/posts/default/2130061245275038988?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/TheGeomblog/~3/2-Ewi7P-Kj4/new-open-access-proceedings-archive.html" title="New open access proceedings archive" /><author><name>Suresh</name><uri>http://www.blogger.com/profile/15898357513326041822</uri><email>noreply@blogger.com</email><gd:extendedProperty name="OpenSocialUserId" value="05767624388557425525" /></author><thr:total xmlns:thr="http://purl.org/syndication/thread/1.0">5</thr:total><feedburner:origLink>http://geomblog.blogspot.com/2009/04/new-open-access-proceedings-archive.html</feedburner:origLink></entry></feed>
