<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>johnmu.com - technical website tips</title>
	
	<link>http://johnmu.com</link>
	<description>John Mueller's technical website tips and tricks</description>
	<lastBuildDate>Thu, 03 Dec 2009 12:42:37 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/johnmucom" /><feedburner:info uri="johnmucom" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Hackers stealing your PageRank</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/Czvl38Z1n_4/</link>
		<comments>http://johnmu.com/pagerank-hacker-using-suomi-co-in/#comments</comments>
		<pubDate>Sun, 07 Dec 2008 23:44:54 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Hack]]></category>

		<guid isPermaLink="false">http://johnmu.com/?p=155</guid>
		<description><![CDATA[The last time I wrote about a hacked site, it was using a redirect that sent some users to a different site. This kind of hack is pretty common (even though it&#8217;s usually not as complex as mentioned in that post), it leverages the sad fact that users are often easy to trick and not [...]]]></description>
				<content:encoded><![CDATA[<p>The last time I <a href="http://johnmu.com/hack-hidden-redirect/">wrote about a hacked site</a>, it was using a redirect that sent some users to a different site. This kind of hack is pretty common (even though it&#8217;s usually not as complex as mentioned in that post), it leverages the sad fact that users are often easy to trick and not browsing with protection (or <a href="http://googleonlinesecurity.blogspot.com/2008/07/are-you-using-latest-web-browser.html">a current browser</a>).</p>
<p>A different angle of attack is to redirect only search engine crawlers to a different site. By doing this, they can make it look like the pages of a website moved to a new domain name. In general, when search engines find redirects like that, they will more or less pass the &#8220;value&#8221; that a page had on to the new URL &#8212; that generally also applies to PageRank. So in a sense, they are trying to steal the value that a webmaster has built up over time. </p>
<p>In this particular case, a &#8220;massive amount&#8221; of sites were hacked and likely redirected through suomi.co.in.<br />
<span id="more-155"></span><br />
The webmaster generally doesn&#8217;t notice this kind of hack because there&#8217;s nothing that would alert him to a problem. Only search engine crawlers would get redirected, normal users (including the webmaster) would see the page normally. </p>
<p><strong>The first symptom that you would see is hard to interpret: <a href="http://www.google.com/support/forum/p/Webmasters/thread?tid=7f05dc5d8154229e&#038;hl=en">URLs from the website are just not indexed anymore</a></strong>. URLs not being indexed is something that could happen because of any number of reasons, so how do we find out more?</p>
<p>One of the first things I like to do in a case like this is to access the site with a search engine crawler&#8217;s user agent. This gives you a rough look at how the website reacts to a search engine crawler (although it&#8217;s not complete, it&#8217;s often pretty close). There are two relatively easy ways to do this:</p>
<ol>
<li>Use an online tool such as <a href="http://web-sniffer.net/">Web-Sniffer</a>. It&#8217;s pretty easy to use and is somewhat close to an actual crawler.</li>
<li>Use <a href="http://www.mozilla.com/firefox/">FireFox</a> with the <a href="https://addons.mozilla.org/en-US/firefox/addon/59">User Agent Switcher</a> plugin. If you use this plugin, you&#8217;ll have to add the user agent yourself. I usually use the current Googlebot user agent string:<br />
<blockquote><p>Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)</p></blockquote>
<p>Note: if you use Firefox for this, <strong>make sure that your Firefox installation is up to date and locked down properly</strong> in case you run into a site serving malware like this. Sometimes it even makes sense to use a virtual machine for this.
</li>
<li>(I wish there were a half-&#8221;li&#8221; <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  ) There&#8217;s also &#8220;wget&#8221;, which is easy for those of you who prefer use console tools. I usually use the above user agent string with wget.</li>
</ol>
<p>If you access the site using one of these tools, you&#8217;ll often be able to spot these redirects (or other issues that a site might be having with regards to being accessed by search engine crawlers). It&#8217;s rare that someone uses cloaking by IP address for things like this. In a <a href="http://www.google.com/support/forum/p/Webmasters/thread?tid=7f05dc5d8154229e&#038;hl=en">recent thread in the Webmaster Help forums</a>, &#8220;webado&#8221; spotted the redirect using Web-Sniffer. </p>
<p><img src="http://johnmu.com/wp-content/stuff/thacki.png" alt="" title="Hacked site redirecting to suomi.co.in" width="500" height="346" class="alignnone size-full wp-image-165" /></p>
<p>In this particular case, the URL was redirected to <strong>http://suomi.co.in/</strong> , from where it was redirected to a page that they wanted to promote with the original site&#8217;s &#8220;value&#8221;. I&#8217;ve seen the same kind of redirect going through <strong>http://ahtung.co.in/</strong>. </p>
<p>The webmaster responded with a note from his hoster in the thread:</p>
<blockquote><p>Note from my host server (support @ hostgator.com)<br />
I have removed the file &#8220;.htaccess&#8221; from the directory /home/aceuropa which was causing the redirect.  The logs show a massive amount of .htaccess files being edited over the last couple of days.  I would highly suggest changing your password to something more secure.  Please let us know if you have any further questions or concerns. </p></blockquote>
<p>(It&#8217;s great to see a hoster act so quickly!)</p>
<p>There&#8217;s another way to spot this kind of hack with Google Webmaster Tools: <strong>When you submit a Sitemap file, Google will show warnings for URLs that redirect.</strong> By design, you should be listing the final URL in your Sitemap file, so if the URL is redirecting for our crawlers (as in this case), we&#8217;ll show a warning in your account. </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=Czvl38Z1n_4:hrl_Rk5tILQ:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=Czvl38Z1n_4:hrl_Rk5tILQ:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=Czvl38Z1n_4:hrl_Rk5tILQ:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=Czvl38Z1n_4:hrl_Rk5tILQ:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=Czvl38Z1n_4:hrl_Rk5tILQ:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/Czvl38Z1n_4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/pagerank-hacker-using-suomi-co-in/feed/</wfw:commentRss>
		<slash:comments>47</slash:comments>
		<feedburner:origLink>http://johnmu.com/pagerank-hacker-using-suomi-co-in/</feedburner:origLink></item>
		<item>
		<title>Seeing nofollow links in Google Chrome</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/zWdAuyIkoeQ/</link>
		<comments>http://johnmu.com/seeing-nofollow-links-in-google-chrome/#comments</comments>
		<pubDate>Fri, 05 Sep 2008 22:31:15 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[CSS]]></category>
		<category><![CDATA[Javascript]]></category>

		<guid isPermaLink="false">http://johnmu.com/?p=149</guid>
		<description><![CDATA[Here&#8217;s a simple trick to view nofollow links in Google Chrome. Just drag and drop the following button to your bookmark bar and hit it whenever you want to see links with the rel=nofollow HTML microformat: Nofollow? This bookmarklet inserts a tiny bit of CSS into the top of the page you&#8217;re currently viewing. The [...]]]></description>
				<content:encoded><![CDATA[<p>Here&#8217;s a simple trick to view nofollow links in Google Chrome. Just drag and drop the following button to your bookmark bar and hit it whenever you want to see links with the rel=nofollow HTML microformat:</p>
<p><!-- --></p>
<p><a href="javascript:function%20highlightNofollow(){var%20newStyle=document.createElement('style');newStyle.type='text/css';newStyle.appendChild(document.createTextNode('a[rel~=nofollow]{border:1px%20dashed%20#852!%20important;background-color:#fcc!%20important;}'));document.getElementsByTagName('head')[0].appendChild(newStyle);};highlightNofollow();" style="background-color: #eef; border: solid 1px #aaa; color: #446; padding: .5em 2em; text-decoration: none; font-weight: bold; text-shadow: 1px 1px 1px #88a; font-size: 1.1em; -webkit-box-shadow: 2px 2px 2px #666;" title="Nofollow?">Nofollow?</a></p>
<p><!-- --></p>
<p>This bookmarklet inserts a tiny bit of CSS into the top of the page you&#8217;re currently viewing. The CSS is similar to that which is used in <a href="http://www.mattcutts.com/blog/seeing-nofollow-links/">other nofollow highlighting methods</a>:<br />
<span id="more-149"></span></p>
<div id="ig-sh-1" class="syntax_hilite">	<div class="toolbar">		<div class="language-name">css</div>		<a href="#" class="view-different">&lt; view <span>plain text</span> &gt;</a>	</div>	<div class="code"><ol class="css" style="font-family:monospace;"><li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">a<span style="color: #00AA00;">&#91;</span>rel~<span style="color: #00AA00;">=</span>nofollow<span style="color: #00AA00;">&#93;</span> <span style="color: #00AA00;">&#123;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; <span style="color: #000000; font-weight: bold;">border</span><span style="color: #00AA00;">:</span><span style="color: #933;">1px</span> <span style="color: #993333;">dashed</span> <span style="color: #cc00cc;">#852</span>! important<span style="color: #00AA00;">;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; <span style="color: #000000; font-weight: bold;">background-color</span><span style="color: #00AA00;">:</span><span style="color: #cc00cc;">#fcc</span>! important<span style="color: #00AA00;">;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #00AA00;">&#125;</span></div></li>
</ol>	</div></div>
<p>Try it out on a <a href="http://en.wikipedia.org/wiki/Google_Chrome">page with nofollowed links</a>!</p>
<p>By the way, this bookmarklet also works in Opera &#038; Firefox (but there are simpler ways to handle it in Firefox). </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=zWdAuyIkoeQ:C_QhwkCeJpo:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=zWdAuyIkoeQ:C_QhwkCeJpo:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=zWdAuyIkoeQ:C_QhwkCeJpo:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=zWdAuyIkoeQ:C_QhwkCeJpo:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=zWdAuyIkoeQ:C_QhwkCeJpo:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/zWdAuyIkoeQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/seeing-nofollow-links-in-google-chrome/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		<feedburner:origLink>http://johnmu.com/seeing-nofollow-links-in-google-chrome/</feedburner:origLink></item>
		<item>
		<title>Confirm that you’re using Analytics on all pages</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/clM7LHRJlno/</link>
		<comments>http://johnmu.com/analytics-everywhere/#comments</comments>
		<pubDate>Tue, 20 May 2008 23:27:09 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Tricks]]></category>

		<guid isPermaLink="false">http://johnmu.com/analytics-everywhere/</guid>
		<description><![CDATA[Here&#8217;s something from my mailbox &#8211; someone wanted to know how he could crawl his site and confirm that all of his pages really have the Google Analytics tracking-code on them. WordPress users have it easy, there are plugins that handle it automatically. Sometimes it&#8217;s worth asking nicely &#8211; let me show you how I [...]]]></description>
				<content:encoded><![CDATA[<p>Here&#8217;s something from my mailbox &#8211; someone wanted to know how he could crawl his site and confirm that all of his pages really have the <a href="http://www.google.com/analytics/">Google Analytics</a> tracking-code on them. WordPress users have it easy, there are plugins that handle it automatically. Sometimes it&#8217;s worth asking nicely <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  &#8211; let me show you how I did it. As a bonus, I&#8217;ll also show how you can check the AdSense ID on your pages, if you&#8217;re worried that you copy/pasted it incorrectly.</p>
<p>This is pretty much cross-platform, but as a Windows-user you&#8217;ll have to grab and install two files first:</p>
<ul>
<li><a href="http://users.ugent.be/~bpuype/wget/">wget</a> &#8211; a tool to download copies of web pages</li>
<li><a href="http://sourceforge.net/project/showfiles.php?group_id=9328">UnxTools</a> &#8211; a collection of popular Unix/Linux tools for the hacker in you</li>
</ul>
<p>Extract the ZIP files, copy the contents somewhere where you can find it and make sure that the appropriate folders are in your &#8220;path&#8221; (the files you&#8217;ll need for UnxTools are in &#8220;&#8230;\usr\local\wbin&#8221;). We&#8217;ll need to access these tools through the command line. I have a feeling I may need to elaborate on that for Windows users <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  &#8212; let me know if that&#8217;s the case.</p>
<p>First, we&#8217;ll mirror our site on our local machine (this assumes that your site is crawlable; if it isn&#8217;t, then fix it first <img src='http://johnmu.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />  ):<br />
<span id="more-135"></span></p>
<ol>
<li>Open a command box or terminal window (on Windows, hit Start / Run &#8230; and enter &#8220;cmd&#8221;)</li>
<li>Go to or create a temporary folder</li>
<li>Run the following command to mirror your site:
<p><code>wget --mirror --accept=html,htm,php,asp,aspx http://domain.com/</code></p>
<p>This command mirrors pages with .html, .htm, .php, .asp and .aspx extensions on http://domain.com/. It&#8217;ll create a folder for the domain and put all the files in it. Dynamic URLs will get adjusted so that they can be used as file names.</li>
<li>Wait &#8230; until it&#8217;s all downloaded &#8230; if it feels endless, you might have endless URLs, perhaps an infinite calendar script or something similar? It&#8217;s worth fixing!</li>
</ol>
<p>Alrighty, now that we have a copy of your site, let&#8217;s check things out. </p>
<h3>Finding pages without Analytics</h3>
<p>We can find pages without the Analytics tracking code by listing all pages which do not have certain content in them:</p>
<p><code>grep -r -L "google-analytics.com" *.*</code></p>
<p>This command goes through all subfolders (the &#8220;-r&#8221; option) and lists the files that do not contain a match (&#8220;-L&#8221;) for &#8220;google-analytics.com&#8221;. That could be extended to just about anything <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . </p>
<p>How about pages that don&#8217;t have a &#8220;description&#8221; meta tag?</p>
<p><code>grep -r -L "meta name=.description" *.*</code></p>
<p>The &#8220;.&#8221; (period) matches any character &#8212; in this case, it is used to match the &#8221; (double-quote). </p>
<h3>Finding pages with AdSense (and the ID used)</h3>
<p>Finding pages that contain a certain text is even easier:</p>
<p><code>grep -r "google_ad_client" *.*</code></p>
<p>Note that all we did was drop the &#8220;-L&#8221; (and change the text, obviously). It will show the lines that match this pattern in all of your pages, which includes the AdSense ID. </p>
<p>Similar to the earlier check for missing &#8220;description&#8221; meta tags, assuming you have the contents of that tag all in one line, you can easily find all of these meta tags with:</p>
<p><code>grep -r "meta name=.description" *.*</code></p>
<p>What would you like to search for today?</p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=clM7LHRJlno:6d6aUOn3g3U:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=clM7LHRJlno:6d6aUOn3g3U:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=clM7LHRJlno:6d6aUOn3g3U:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=clM7LHRJlno:6d6aUOn3g3U:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=clM7LHRJlno:6d6aUOn3g3U:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/clM7LHRJlno" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/analytics-everywhere/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		<feedburner:origLink>http://johnmu.com/analytics-everywhere/</feedburner:origLink></item>
		<item>
		<title>Running Firefox in parallel</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/VIUeKIbvX-I/</link>
		<comments>http://johnmu.com/firefox-multiplied/#comments</comments>
		<pubDate>Fri, 01 Feb 2008 23:14:39 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Tricks]]></category>

		<guid isPermaLink="false">http://johnmu.com/firefox-multiplied/</guid>
		<description><![CDATA[Sometimes it would just be great to have multiple instances of Firefox running at the same time. Some web applications just love to eat memory in Firefox, some web pages go crazy if you have JavaScript enabled and sometimes you just want different sets of cookies to let you manage two accounts at the same [...]]]></description>
				<content:encoded><![CDATA[<p>Sometimes it would just be great to have multiple instances of Firefox running at the same time. Some web applications just love to eat memory in Firefox, some web pages go crazy if you have JavaScript enabled and sometimes you just want different sets of cookies to let you manage two accounts at the same time. </p>
<p>I&#8217;ve been trying to do that for years and did the most exotic things to make it happen. I&#8217;ve used four different browsers in parallel and I&#8217;ve even used a virtual PC running within my PC (that kind of defeats the desire to use less memory, but it feels neat anyway). In the end, a collegue in the office, who happens to use emacs as his main web browser <img src='http://johnmu.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />  , pointed me into the right direction. </p>
<p>Now I have three completely independant instances of Firefox running at the same time!</p>
<p><img src='http://johnmu.com/wp-content/stuff/moz3.gif' alt='3 little Firefoxen, running on a desktop' /></p>
<p>So what&#8217;s the trick?<br />
<span id="more-131"></span><br />
Firefox has <a href="http://kb.mozillazine.org/Command_line_arguments">command line options</a> to let you start multiple profiles and specify a certain one. In our case, we&#8217;re going to change the command line to:</p>
<p><strong>&#8220;C:\Program Files\Mozilla Firefox\firefox.exe&#8221; -no-remote -P NewProfileName</strong></p>
<p>To get started, check the name of your current profile. On Windows you can find it in &#8220;c:\Documents and Settings\[user-name]\Application Data\Mozilla\Firefox\Profiles&#8221;. It will generally have a few characters and numbers, a period and then the profile name (in my case it was something like &#8220;36fc232a.default&#8221;). Use this to adjust the settings of the icon you use to start up Firefox. On Windows, right-click on the icon and select &#8220;Properties&#8221;; you can add the options in the field called &#8220;Target&#8221;:</p>
<p><img src='http://johnmu.com/wp-content/stuff/moz-set1.gif' alt='Firefox profile settings' /></p>
<p>If you click on that icon now, it should start up Firefox just as before (ok, this is not the neat part yet <img src='http://johnmu.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />  ). </p>
<p>Now make a copy of the icon (I right-click drag it into a folder and select &#8220;Copy&#8221;) and change the command line options (and file name) again, only this time choose a different profile name. If you want to use a copy of your existing profile (with all cookies, bookmarks, themes and add-ons), you can do that by going into the folder where your profiles are stored (mentioned above) and copying your default profile. Now when you start up Firefox with that icon, it will bring the profile manager since it can&#8217;t find that new profile. Create a new profile and use the exact name you used in the options. You will then have a choice of either creating a completely new profile or using an existing profile folder. </p>
<p>Now you have two instances of Firefox running at the same time. They&#8217;re completely separate, so if one crashes, the other will continue normally. If one starts using too much memory, you can close it and restart it without impacting the other one. If you have conflicts with add-ons or want to use different cookie sets, just use a separate instance. </p>
<p>Since the various instances will generally look the same and be hard to keep apart, I just applied different themes to them. The &#8220;Safari-style&#8221; theme is my private one, the blue one is used for all my work-apps and the normal one is used for all kinds of testing. </p>
<p>This trick should work on all platforms with Firefox, not that I tried it out so try it at your own risk <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . Now if only I could migrate my IE profile back to Firefox &#8230; </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=VIUeKIbvX-I:ufi8Ujzsgh0:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=VIUeKIbvX-I:ufi8Ujzsgh0:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=VIUeKIbvX-I:ufi8Ujzsgh0:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=VIUeKIbvX-I:ufi8Ujzsgh0:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=VIUeKIbvX-I:ufi8Ujzsgh0:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/VIUeKIbvX-I" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/firefox-multiplied/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		<feedburner:origLink>http://johnmu.com/firefox-multiplied/</feedburner:origLink></item>
		<item>
		<title>Go hack yourself – recovering your FTP password</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/DWNq7N7j26w/</link>
		<comments>http://johnmu.com/go-hack-your-ftp/#comments</comments>
		<pubDate>Sat, 12 Jan 2008 16:05:20 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Tricks]]></category>

		<guid isPermaLink="false">http://johnmu.com/go-hack-your-ftp/</guid>
		<description><![CDATA[All of the websites I put together at the moment are used for playing around and testing things. It&#8217;s fun to set up a site, try some things out, delete it or just let it sit and then &#8211; usually much later &#8211; start over and try something else. The only problem is that by [...]]]></description>
				<content:encoded><![CDATA[<p>All of the websites I put together at the moment are used for playing around and testing things. It&#8217;s fun to set up a site, try some things out, delete it or just let it sit and then &#8211; usually much later &#8211; start over and try something else. The only problem is that by the time I am ready to start over, I have forgotten my password. I can find my user name, it&#8217;s in the FTP client and visible in my hosting control panel, but the password is not visible anywhere. The secure way would be to just pick a new password, but let&#8217;s assume you need your old one <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  . The following will also work for email passwords stored in your email client, by the way. </p>
<p>What we&#8217;ll do is &#8220;sniff&#8221; the connection that your FTP client builds up, we&#8217;ll take a look at the packets sent out and received. Remember that other people can do this as well &#8211; say if you&#8217;re on an insecure wireless connection on the road &#8212; use secure connections and protocols whenever you can!<br />
<span id="more-126"></span><br />
You&#8217;ll have to get a copy of <a rel="oh come on" href="http://www.ethereal.com/download.html">Ethereal</a> (freeware), a universal network analysis tool (there are many similar tools available, I like the flexibility of Ethereal). Download it, install it and start it up. </p>
<p>To get started, select the menu item <em>Capture</em> and <em>Start</em>, then choose your ethernet interface (WLAN, cable, etc) and let it start. You are now recording your complete network traffic, you 1337 self-h4x0r <img src='http://johnmu.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />  . Depending on what you&#8217;re doing at the moment, it may record a lot of traffic. We&#8217;ll filter it later on, so don&#8217;t worry about that. </p>
<p><img src='http://johnmu.com/wp-content/stuff/thger6.png' alt='Ethereal capture in progress' /></p>
<p>Now start up your FTP client (make sure you&#8217;re not using a secure FTP connection) and connect to your server. When you connect to your server like that, you will send your user name and password over the network and Ethereal record that for you. Once you have that, you can stop capturing in Ethereal. </p>
<p><img src='http://johnmu.com/wp-content/stuff/ether1c.png' alt='Sniffed Ethereal connection' /></p>
<p>If you scroll through the data you collect like that, you&#8217;ll quickly notice that there&#8217;s a lot going over those wires. Let&#8217;s just look at the data going to and from our FTP server. You&#8217;ll have to get the IP address of your server (which you can usually do in a shell/command box by typing &#8220;nslookup ftp.yourservername.com&#8221;). In the filter box on top, enter: <strong>ip.addr eq nnn.nnn.nnn.nnn</strong> (where the &#8220;nnn&#8217;s&#8221; are the IP address of your server). </p>
<p><img src='http://johnmu.com/wp-content/stuff/ether-2-eq-ip.png' alt="Sniff your server's IP address" /></p>
<p>Once you only look at the data going to and from your server, you&#8217;ll see the authentication information right away:</p>
<p><img src='http://johnmu.com/wp-content/stuff/etherr3.png' alt='Username and password, hacked' /></p>
<p>Now that you see how easy it is to hack yourself, make sure that others can&#8217;t do the same with your account:</p>
<ul>
<li>If you&#8217;re using a wireless connection, <strong>always assume that others can listen in</strong> (even if you&#8217;re using your own access point with WEP or WPA encryption).</li>
<li>Make sure that you <strong>use a secure version of FTP</strong>. In general, they will encrypt your authentication information so that it will not be readable on your network. Double-check it with Ethereal, if you want to be sure.</li>
<li><strong>Change your FTP/email passwords</strong> after you have used them on an insecure connection like a hotel or airport wireless.</li>
<li>If you use a web-based email service, make sure that you are accessing the site with <a href="http://mail.google.com/support/bin/answer.py?answer=8155">HTTPS</a> and not HTTP. Most web-mail services allow that (though they may not activate it by default since it is a bit slower and is usually not needed on your home network). </li>
<li>Even if your FTP (or email) client encrypts passwords in the settings, they can still be read with the right tools.</li>
</ul>
<p>Stay safe!</p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=DWNq7N7j26w:hYoEljnuD4g:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=DWNq7N7j26w:hYoEljnuD4g:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=DWNq7N7j26w:hYoEljnuD4g:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=DWNq7N7j26w:hYoEljnuD4g:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=DWNq7N7j26w:hYoEljnuD4g:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/DWNq7N7j26w" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/go-hack-your-ftp/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		<feedburner:origLink>http://johnmu.com/go-hack-your-ftp/</feedburner:origLink></item>
		<item>
		<title>How to use Google webmaster tools stats with Excel</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/Gty6WWTB8UM/</link>
		<comments>http://johnmu.com/webmaster-tools-script-1/#comments</comments>
		<pubDate>Thu, 29 Nov 2007 00:34:42 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Tricks]]></category>

		<guid isPermaLink="false">http://johnmu.com/webmaster-tools-script-1/</guid>
		<description><![CDATA[Google&#8217;s webmaster tools has a neat feature that lets you download your query and click statistics (once you have verified ownership of your site). The data you can get from there is quite comprehensive, but hard to break down for use in Excel. As a fun exercise I put together a small Python-script that takes [...]]]></description>
				<content:encoded><![CDATA[<p>Google&#8217;s <a href="http://www.google.com/webmasters/tools/">webmaster tools</a> has a neat feature that lets you download your query and click statistics (once you have verified ownership of your site). The data you can get from there is quite comprehensive, but hard to break down for use in Excel. As a fun exercise I put together a small Python-script that takes the CSV file downloaded from your webmaster tools account and turns it into new CSV files for queries and for clicks (both with the position numbers as well). </p>
<p>Python is a neat little programming language, I like it more and more as I use it <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  .<br />
<span id="more-123"></span><br />
Here&#8217;s how to get started:</p>
<ol>
<li>If you do not have Python installed, go and <a href="http://www.python.org/download/">download and install Python</a>. I assume most Apple OSX will have it installed, but I don&#8217;t have a Mac so I can&#8217;t say for sure. It&#8217;ll almost certainly be installed if you&#8217;re one of the 3 Linux-users who have visited my blog. If you&#8217;re using Windows, take the version with the installer (it&#8217;s easier) and make sure that the folder where you installed Python is in your &#8220;path&#8221;. </li>
<li>Grab my <a href="http://johnmu.com/files/wmtextract01.zip">wmtextract01.zip</a> and extract it into a folder. You should have three files: wmtextract.py (the Python script) and ProcessAll.bat + ProcessAll.py.</li>
<li>Copy your query stats CSV files into that folder as well.</li>
<li>Double-click on ProcessAll.py (or ProcessAll.bat if you&#8217;re on Windows and don&#8217;t have Python set up to run scripts directly)</li>
<li>The script will now process all CSV files in the same folder, create a new folder called &#8220;output&#8221; and place the new CSV files there.</li>
<li>Open the new CSV files in Excel (or Open Office or even Google Docs + Spreadsheets)</li>
<li>Enjoy <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ol>
<p>Here are some more ideas for the CSV files when you have them in Excel:</p>
<ul>
<li>Select everything (Ctrl-A) and set up an &#8220;AutoFilter&#8221; (menu item Data / Filter / AutoFilter). Now you can filter your stats however you want them. Want to only see the queries for which you rank #1? How about the queries that people from Switzerland used to find your site?</li>
</ul>
<p><img src='http://johnmu.com/wp-content/stuff/excel-webmaster-tools.gif' alt='excel-webmaster-tools.gif' /></p>
<ul>
<li>Set the Location to &#8220;All locations&#8221; and search type to &#8220;All searches&#8221;, now select everything and sort by the last column (menu Data / Sort / has header row / sort by Column E, ascending). Now select the last two columns (D and E) and click on the chart icon. Choose an &#8220;XY scatter&#8221; chart and let it create it. This chart shows you the ranking of your site for the search queries. There are some problems with the chart like this (keywords can be listed several times), but I think it&#8217;s neat anyway <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  </li>
</ul>
<p><img src='http://johnmu.com/wp-content/stuff/webmaster-ranking.gif' alt='webmaster-ranking.gif' /></p>
<p>What&#8217;s the neatest information you ever found in your webmaster tools query stats?</p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=Gty6WWTB8UM:-dXRvffaXAg:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=Gty6WWTB8UM:-dXRvffaXAg:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=Gty6WWTB8UM:-dXRvffaXAg:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=Gty6WWTB8UM:-dXRvffaXAg:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=Gty6WWTB8UM:-dXRvffaXAg:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/Gty6WWTB8UM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/webmaster-tools-script-1/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		<feedburner:origLink>http://johnmu.com/webmaster-tools-script-1/</feedburner:origLink></item>
		<item>
		<title>Google Webmaster Groups statistics for September 2007</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/rcnEz0AfvD4/</link>
		<comments>http://johnmu.com/statistics-2007-september/#comments</comments>
		<pubDate>Tue, 23 Oct 2007 20:25:05 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://johnmu.com/statistics-2007-september/</guid>
		<description><![CDATA[I finally got around to getting the statistics for September 2007 for the Google Webmaster Help Groups finished up. I have to admit the numbers for September aren&#8217;t the best, especially the counts for the posts by Googlers. Getting started at Google took quite some time and a lot of learning . And wow &#8211; [...]]]></description>
				<content:encoded><![CDATA[<p>I finally got around to getting the statistics for September 2007 for the <a href="http://groups.google.com/group/Google_Webmaster_Help/topics">Google Webmaster Help Groups</a> finished up. I have to admit the numbers for September aren&#8217;t the best, especially the counts for the posts by Googlers. Getting started at Google took quite some time and a lot of learning <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . And wow &#8211; look at the Googlers post in October (great work, everyone!). </p>
<p>On a slightly sader note, I don&#8217;t think I can continue to provide these statistics here. I hope I can work something out as a replacement though, we&#8217;ll see.. or maybe I would be better off just posting in the group instead <img src='http://johnmu.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />  .</p>
<p>Without further ado, here are &#8230; </p>
<h2>The numbers</h2>
<ul>
<li>Number of new threads = 1157</li>
<li>Number of new posts = 5902</li>
<li>Average number posts/new thread = 4.81</li>
<li>Number of posts by new users = 915 (15.5%)</li>
<li>Number of threads by new users = 696 (60.2%)</li>
<li>Average number of posts in threads by new users = 4.8</li>
<li>Number of new threads started by Googlers = 2</li>
<li>Number of new posts by Googlers = 35</li>
</ul>
<p>Feel free to compare to <a href="http://johnmu.com/statistics-2007-august/">August</a> and <a href="http://johnmu.com/statistics-2007-july/">July 2007</a>. Comparing those numbers to the previous months there&#8217;s a visible drop, especially in the number of posts (-1774 or 23%) and threads (-172 or 13%). I hope we can push that back up soon (and answer more questions in the FAQs so that fewer threads are needed <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  ).</p>
<h2>Top posters</h2>
<p>Thank you, top posters!! It&#8217;s great to have you in our community helping so many people to get their issues solved. You&#8217;re the best! Come on back, JLH, we&#8217;re missing you.</p>
<ol>
<li><a href="http://www.webado.net/">webado</a> = 746</li>
<li><a href="http://cass-hacks.com/">cass-hacks</a> = 207</li>
<li><a href="http://www.isham-research.co.uk/">Phil Payne</a> = 195</li>
<li><a href="http://www.jlh-design.com/">JLH</a> = 178</li>
<li><a href="http://www.asymptoticdesign.com/">cristina</a> = 168</li>
<li><a href="http://www.seobuzzbox.com/">Admin Aaron</a> = 163</li>
<li><a href="http://www.tesol-direct.com/">Robbo</a> = 157</li>
<li><a href="http://blog.aitechsolutions.net/">abracadabra</a> = 145</li>
<li>seo101 = 117</li>
<li><a href="http://www.dmovers.com/">burchman519</a> = 94</li>
<li><a href="http://www.icegiant.co.uk">IceGiant</a> = 80</li>
<li><a href="http://www.utheguru.com/">dockarl</a> = 67</li>
<li><a href="http://www.redcardinal.ie/">Red Cardinal</a> = 53</li>
<li><a href="http://www.travellerspoint.com/">Sam I Am</a> = 51</li>
<li>Randy P. = 49</li>
</ol>
<p>seo101 and Randy P. &#8211; feel free to give me a site to link to, even if it isn&#8217;t yours <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  .<br />
<span id="more-120"></span></p>
<h2>Top thread starters</h2>
<ol>
<li>burchman519 = 10</li>
<li>Admin Aaron = 9</li>
<li>tal7080 = 8</li>
<li>Tomcat619 = 8</li>
<li>Alaa Maanawi = 7</li>
<li>Chibcha = 7</li>
<li>ShaneMC = 7</li>
<li>JLH = 6</li>
<li>ARC = 5</li>
<li>gl-science.com = 5</li>
<li>Phil Payne = 5</li>
<li>Vic = 5</li>
<li>yorkieron = 5</li>
<li>azholkover = 4</li>
<li>Blue Fin = 4</li>
</ol>
<h2>Threads with Googlers participating</h2>
<p>I hope we push this category off the charts in the next months!!</p>
<ul>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/f364cbd9a74fd4e9">Crawl, Index, Rank: &quot;Craig and Webado and others deserve medals&quot;</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/ae38dea0defcc680">Crawl, Index, Rank: 11 urls not found</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e6e0179af669ab4f">Crawl, Index, Rank: Adsense Ads Time</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/955e8752518e5b68">Crawl, Index, Rank: confusion over redirection of old domains</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/42ddfcbf960b8c72">Crawl, Index, Rank: DNS error in Google Webmaster tools</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/a9df680a79504c7a">Crawl, Index, Rank: Duplicate Content Penalty For Affiliate Sales Page on Same Server</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/79caa0bdda66ec03">Crawl, Index, Rank: Effect of Parked Domain on Rankings</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/bc49bda973aecc57">Crawl, Index, Rank: Have my site and/or google been hacked ?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/db4ffae61e9b1985">Crawl, Index, Rank: No Google Love</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/17c72e77f691a38c">Crawl, Index, Rank: Pagerank of 0?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/72231e3bdd7a31f8">Crawl, Index, Rank: Popular Picks &#8212; What would *you* like to know more about?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/b3308623a05e0fd3">Crawl, Index, Rank: robots.txt and domino</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/1e84dd9424583620">Crawl, Index, Rank: submiting website.</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/902ada01db3413a3">Random Chit-Chat: bye</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/6957368d08592296">Sitemaps: Add-on domain sitemap</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/a6d5ac28e650644c">Sitemaps: Authenticated pages</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/a8d312fb4447f4f4">Sitemaps: sitemap and extra characters</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/4a1083036eb5ce9c">Sitemaps: Sitemap throwing erroneous error</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/d8b080b7df4dacee">Sitemaps: We experienced a temporary error processing your Sitemap. Please try again later.</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/80ee9d67b22f99c8">Suggestions: Helping webmasters with duplicate content</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/e0a68791512dd273">Suggestions: ROFLMAO &#8211; dear Google!</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/b8c1ee393f02acbc">Webmaster Tools: Analytics Goals/Funnels with different domains.</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/ae950ce4c7e45283">Webmaster Tools: Business description not what submitted Hijacked?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/b67504cd52261404">Webmaster Tools: Crack software sold on sites on blacklist buy reached by Google keyword search</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/8b6b02cdc8583973">Webmaster Tools: DNS/URL timeout for Googlebot crawler</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/c037152c721dab2d">Webmaster Tools: i cant view pictures from suddenlink</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/bd56ff8452e7f6e6">Webmaster Tools: Not Verified</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/2e536c3adc88bb51">Webmaster Tools: Re-Inclusion After Removal for &quot;Hidden Text&quot;</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/16c73cbca5899d19">Webmaster Tools: ReInclusion Request-Removal of More Hidden Text-2nd Request</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/e652132e6bd9a997">Webmaster Tools: Removing pages and directories</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/7d8c954e6c16a6eb">Webmaster Tools: robots.txt unreachable yet it has been OK for years</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/9d314c9bf94e2f39">Webmaster Tools: URLs Removal denied</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/b89946300f1562a5">Webmaster Tools: Using Optimizer on index page</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/0c6c7968b14c7f8f">Webmaster Tools: verifying a template website</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/cce5e2faa9f10c42">Webmaster Tools: Why is Google&#39;s cache for my site another domain entirely?</a></li>
</ul>
<h2>New threads started by Googlers</h2>
<ul>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/72231e3bdd7a31f8">Crawl, Index, Rank: Popular Picks &#8212; What would *you* like to know more about? (Adam Lasnik)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/80ee9d67b22f99c8">Suggestions: Helping webmasters with duplicate content (Maile Ohye)</a></li>
</ul>
<h2>New threads started by regulars (>100 posts)</h2>
<p>Lots of regulars are great to see. I love it when you come back and start a thread for something special.</p>
<ul>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e07516a8ca433e84">Crawl, Index, Rank: A wee bit perplexed (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/fcec060f5aa019a4">Crawl, Index, Rank: Additional Googlers&#8230; (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/868b926a0beb2fba">Crawl, Index, Rank: Any suggestions? (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e3710e2e9133ced6">Crawl, Index, Rank: ARGH!?! Why, why, why Google? (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/4be947464763e6f8">Crawl, Index, Rank: Bad Neighborhood Checker Question (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/2b4c56402fb77b40">Crawl, Index, Rank: Brand New URL in Google Sitemap Not Indexed!!!!! (webado)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/a6263133e098e7e4">Crawl, Index, Rank: Could this mean my site is penalized (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/221a12acdda0b6c0">Crawl, Index, Rank: crawl Errors for pages that doesn&#39;t exist (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/838f59c6fa9695f5">Crawl, Index, Rank: Datacenter Freakin&#39; Question (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/7160b99eb916a712">Crawl, Index, Rank: Don&#39;t expect PageRank update soon (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/da530ee2651f42b3">Crawl, Index, Rank: Dramatic drop in pagerank (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/c769bd3a7cd84642">Crawl, Index, Rank: Duplicate blocks of text (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/51d7da0a5f99590c">Crawl, Index, Rank: google &#8211; Pagerank (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3f3ef314fbe61828">Crawl, Index, Rank: Google has forgotten about me? (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/8d4df649c10af85f">Crawl, Index, Rank: Help &#8211; Google have stopped indexing all the sites on our Class C (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/b5f48dc65233d2ad">Crawl, Index, Rank: Help me repair my hidden text! (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/5a03f1bce1db405d">Crawl, Index, Rank: I thought this was bad practice. (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3a794f2e02d046b5">Crawl, Index, Rank: Is Google selling crawlable links? (marketingtitan)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/343994030b70da9d">Crawl, Index, Rank: Is there Redemption? &#8211; 1 year ban (mrg)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/cd740055a596d4e3">Crawl, Index, Rank: Lost Homepage in index and cache (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/15c74e908ebdd1f6">Crawl, Index, Rank: Minus 30 penalty (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3553fa43c99b5d71">Crawl, Index, Rank: Paid Link Report #1 (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/72231e3bdd7a31f8">Crawl, Index, Rank: Popular Picks &#8212; What would *you* like to know more about? (Adam Lasnik)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/7ba80c48302d0bd5">Crawl, Index, Rank: Posts Held in Moderation? (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/9b06b599d42291ee">Crawl, Index, Rank: Robots.txt &#8211; do conflicting disallows/sitemap work? (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/66cacea09311006f">Crawl, Index, Rank: Sitemap errors and warnings (Chris Gunn)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/9bdc43cd6c281a70">Crawl, Index, Rank: This one&#39;s for the regulars (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/06abd22b7a6c41fd">Crawl, Index, Rank: Traffic for Vicodin? (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/a0e4cb3fa0eeb976">Crawl, Index, Rank: What is your take on this?? (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/5f41643502419045">Crawl, Index, Rank: What other outside influences can affect your SERP? (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/f10948ee6b2a9c23">Crawl, Index, Rank: What&#39;s this funny URL? (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dd7efe36e9978fc5">Crawl, Index, Rank: What&#39;s your take on this? (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/63c2deb70058ec51">Crawl, Index, Rank: Why is Google indexing sblogsite.com &#8211; maybe because they&#39;re an Adsense publisher? (Red Cardinal)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/82ff42b299636d1c">Crawl, Index, Rank: Why would these links have two different cached dates? (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/5897bffba31df709">Random Chit-Chat: A small PR (Public Relations) problem for Google? (Red Cardinal)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/b8ceefcce433e727">Random Chit-Chat: babe (yorkieron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/902ada01db3413a3">Random Chit-Chat: bye (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/b739b67dfe4a4045">Random Chit-Chat: Cass Hacks (yorkieron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/0e141c16a99e2b19">Random Chit-Chat: Craig (yorkieron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/60963fcc9f075e36">Random Chit-Chat: Daterbases (yorkieron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/bf36dcc65140cb02">Random Chit-Chat: Dear Chris Gunn (Sebastian)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/5c2f042439728a1d">Random Chit-Chat: Funny But True (yorkieron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/d7aecf2f3adcb76c">Random Chit-Chat: Googles Going to the moon (surf_doggie)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/f290afcc810faac8">Random Chit-Chat: Interview with JLH! (cristina)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/a056a5625fca62fa">Random Chit-Chat: Interview with JohnMu! (cristina)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/f22d46e09ef6e9d6">Random Chit-Chat: JLH removed Matt Cutts from the Interweb, er Google (Sebastian)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/85e70f687a20d578">Random Chit-Chat: JohnMu interviews world famous Richard &quot;Red Cardinal&quot; Hearne (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/b258fb80a303583b">Random Chit-Chat: New tools interface! (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/2c14cff2aa8360a3">Random Chit-Chat: Statistics for August are online <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/ba0032cf7f548da2">Random Chit-Chat: The Adam Lasnik Bucket Hat changed my life (IceGiant)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/173a5f3115fe1b0f">Random Chit-Chat: Welcome Cuttlets (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/5cace75c926a1249">Random Chit-Chat: You never know what you&#39;ve got until the sword cuts both ways&#8230; (IceGiant)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/1061a4bda9fd6d3d">Sitemaps: DNS Errors on sitemap (roysnj)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/e143835b97c0519a">Suggestions: Idea For Google Webmaster Help (Admin Aaron)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/bdd56c9aa6121619">Suggestions: Malformed markup notification (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/a8b0992cf16ba897">Suggestions: Multiple subdomain verification (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/e0a68791512dd273">Suggestions: ROFLMAO &#8211; dear Google! (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/7d8019c14b0e5a10">Webmaster Tools: Multiple subdomain verification option? (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/7d8c954e6c16a6eb">Webmaster Tools: robots.txt unreachable yet it has been OK for years (silverstall)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/15971e651e71bc38">Webmaster Tools: Webmaster Tools UI (surf_doggie)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/078a6b9d563630ab">Webmaster Tools: wondering what url they want to remove cache (JohnMu)</a></li>
</ul>
<h2>Top most active threads</h2>
<ol>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/702d4b27298ffdea">Crawl, Index, Rank: Thoroughly sabotaged (50 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/80078243d3573402">Crawl, Index, Rank: Do Outgoing Links Increase Ranking? (48 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e3710e2e9133ced6">Crawl, Index, Rank: ARGH!?! Why, why, why Google? (45 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/1aa7c55326fd19b4">Crawl, Index, Rank: mystified by penalty &#8230; went from #1 to #726 overnight (42 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/42ddfcbf960b8c72">Crawl, Index, Rank: DNS error in Google Webmaster tools (40 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/8b6b02cdc8583973">Webmaster Tools: DNS/URL timeout for Googlebot crawler (37 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/39a5221075d9ac56">Crawl, Index, Rank: proxy hijacked pages (but no action from google) <img src='http://johnmu.com/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' />  (35 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/5c3878b7b7b1b7eb">Crawl, Index, Rank: Help with SEO ! (34 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/60e3395dfeb9ab3c">Crawl, Index, Rank: Breaking Out Of Frames Used By Google Image Search With Javascript (30 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/72231e3bdd7a31f8">Crawl, Index, Rank: Popular Picks &#8212; What would *you* like to know more about? (29 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/ae995facb121e9e0">Crawl, Index, Rank: spam question (27 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/220780d2156aada2">Random Chit-Chat: Has Anyone noticed how not so relevant Googles serp&#39;s are lately? (27 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/0a26c2e804e0f950">Crawl, Index, Rank: Duplicate Content on Different CC TLDs (25 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/1c382fca3aff49fe">Crawl, Index, Rank: Former PR6 site, now PR0 and individual pages not being indexed for past 6 months (25 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e98ca243e9a0a507">Crawl, Index, Rank: Need Help Please! &#8211; My Site Is Going Down The Google Toilet! (25 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/ad8bf456d474bd79">Sitemaps: Sitemap (25 new replies)</a></li>
</ol>
<h2>Top most linked Domains</h2>
<ol>
<li>google.com = 657</li>
<li>ne-design.net = 583</li>
<li>validator.w3.org = 220</li>
<li>example.com = 153</li>
<li>groups.google.com = 152</li>
<li>tool.motoricerca.info = 127</li>
<li>oyoy.eu = 124</li>
<li>home-medical-equipment-depot.com = 93</li>
<li>mysite.com = 85</li>
<li>kunstpladsen.dk = 74</li>
<li>doggypanache.com = 74</li>
<li>sitemaps.org = 73</li>
<li>expeditionsail.com = 71</li>
<li>tanklesswater.com = 59</li>
<li>w3.org = 57</li>
</ol>
<p>I always find this list interesting. Besides reference sites, examples (mysite.com is up and coming <img src='http://johnmu.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' />  ) and tools, the sites being linked to usually do not get listed for more than one month. I hope that&#8217;s because the issues were resolved (but I know sometimes it&#8217;s not that simple and I&#8217;m glad to see perserverence among you all). </p>
<h2>Top most linked Google answers</h2>
<ol>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66353">Webmaster Help Center &#8211; Hidden text and links</a> = 54</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35769">Webmaster Help Center &#8211; Webmaster Guidelines</a> = 49</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40349">Webmaster Help Center &#8211; How can I create a Google-friendly site?</a> = 13</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=61050">Webmaster Help Center &#8211; Block or remove pages using meta tags</a> = 11</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40364">Webmaster Help Center &#8211; How do I block Googlebot?</a> = 8</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=47334">Webmaster Help Center &#8211; How do you compile the list of links shown below some search results?</a> = 7</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=59819">Webmaster Help Center &#8211; How can I ensure my content is eligible for removal from the Google index?</a> = 6</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35312">Webmaster Help Center &#8211; How does Google modify web pages for mobile viewing?</a> = 6</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35128">Webmaster Help Center &#8211; HTTP errors/ 401/407 authentication error</a> = 5</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34397">Webmaster Help Center &#8211; How do I add my site to Google&#39;s search results?</a> = 5</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34431">Webmaster Help Center &#8211; Does Google index dynamic pages?</a> = 4</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34464">Webmaster Help Center &#8211; My URL changed, so how can I get Google to index my new site?</a> = 4</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35157">Webmaster Help Center &#8211; URLs not followed /Redirect error</a> = 3</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=44231">Webmaster Help Center &#8211; What&#39;s a preferred domain?</a> = 3</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66356">Webmaster Help Center &#8211; Link schemes</a> = 3</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34450">Webmaster Help Center &#8211; Why isn&#8217;t my site returning when I search for results from a particular country?</a> = 3</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35843">Webmaster Help Center &#8211; How do I request reconsideration of my site?</a> = 3</li>
<li><a href="http://www.google.com/support/bin/answer.py?answer=45449">Why do some of my search results say &quot;This site may harm your computer?&quot;</a> = 2</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66736">Webmaster Help Center &#8211; Why should I report paid links to Google?</a> = 2</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=61062">Webmaster Help Center &#8211; How do I use the URL removal request tool?</a> = 2</li>
</ol>
<p>Looking back at September, what were the most common issues that remained present in your mind? </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=rcnEz0AfvD4:L5FmrBtBr_Y:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=rcnEz0AfvD4:L5FmrBtBr_Y:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=rcnEz0AfvD4:L5FmrBtBr_Y:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=rcnEz0AfvD4:L5FmrBtBr_Y:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=rcnEz0AfvD4:L5FmrBtBr_Y:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/rcnEz0AfvD4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/statistics-2007-september/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		<feedburner:origLink>http://johnmu.com/statistics-2007-september/</feedburner:origLink></item>
		<item>
		<title>Being #1 with “Untitled Document” and Flash</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/5sW0b82vSZM/</link>
		<comments>http://johnmu.com/untitled-document/#comments</comments>
		<pubDate>Sat, 13 Oct 2007 20:58:30 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://johnmu.com/untitled-document/</guid>
		<description><![CDATA[Untitled Document We&#8217;ve all seen it &#8211; &#8220;untitled document&#8221; is a popular page name, probably the most popular one out there. I wonder who decided that &#8220;untitled document&#8221; was better than no title at all? There are a lot of those pages out there, do they even know that a good title can do wonders? [...]]]></description>
				<content:encoded><![CDATA[<p><img src='http://johnmu.com/wp-content/stuff/untitled-document.jpg' alt='untitled Document' style="float:left;" /><strong>Untitled Document</strong></p>
<p>We&#8217;ve all seen it &#8211; &#8220;<a href="http://www.google.com/search?q=intitle%3A%22untitled+document%22">untitled document</a>&#8221; is a popular page name, probably the most popular one out there. I wonder who decided that &#8220;untitled document&#8221; was better than no title at all? There are a lot of those pages out there, do they even know that a good title can do wonders? </p>
<p>Being &#8220;untitled&#8221; doesn&#8217;t make your pages uncrawlable though. If you wanted to go all out, you could make sure that your page has no indexable content at all and heck, just use Flash to display the whole homepage while we&#8217;re at it. </p>
<p>Of course doing that, you would think that it would probably destroy your site&#8217;s chances of being shown in search results. I suppose it generally would, but imagine if your site was still #1 in the results for your niche.<br />
<span id="more-117"></span><br />
So I was in Kirkland and wanted to visit some local toy stores before I left. I&#8217;m still not sure if I go there for myself or to actually bring stuff home &#8211; having kids means that you finally get to go out and buy toys again <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . Anyway, I try the old faithful &#8220;toy store + location&#8221; query and browse the local search results to get a first idea. Google&#8217;s neat local search one-box on top of the normal results is always a good starting point. </p>
<p>Here are the results for &#8220;<a href="http://www.google.com/search?q=toy+store+kirkland&#038;btnG=Search">toy store kirkland</a>&#8220;:<br />
<img src='http://johnmu.com/wp-content/stuff/kirkland-toy-store.jpg' alt='kirkland-toy-store' /></p>
<p>The #1 listing is one I actually checked out in person, it was easy to find and had lots of fun stuff (not as nice as the one in Los Gatos from my visit to the bay area though). </p>
<p><a href="http://www.treetoptoys.com/">Tree Top Toys</a> &#8211; a nice name with one of my keywords in it &#8211; has a site that is nothing more than an &#8220;Untitled Document&#8221; combined with a Flash-based homepage (sadly, even their old frames-based website was probably better for SEO). It ranks on top for a good local query though, imagine that. </p>
<p>When you look at the local search <a href="http://maps.google.com/maps?hl=en&#038;q=toy+store&#038;near=Kirkland,+WA&#038;fb=1&#038;view=text&#038;latlng=47674330,-122129701,11298439279698530468&#038;dtab=0">entry for their site</a>, it shows that they have a couple of reviews (data imported from other sites), a photo (also imported) and that&#8217;s about it. From the looks of it (I don&#8217;t know too much about local search) they&#8217;re at that position because they&#8217;re close to the area and have some reviews, nothing special. </p>
<p>It looks like it doesn&#8217;t take much to hit the top spot in local search <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> , and with that top local spot, you could be on top of the search results. Traditional SEO (working on on-site and off-site factors) isn&#8217;t even required (but it would probably be a good idea once the competition gets stronger) &#8211; you don&#8217;t even need paid links <img src='http://johnmu.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> . </p>
<p>How much work would it be to go out and talk to local businesses to inform them of the possibilities, to get them interested in actually putting their information online for the local search engines (e.g., getting verified and adding information and photos manually) and to help them to inform customers of feedback possibilities online? It doesn&#8217;t take much, you could set up a good, short presentation within a few hours. The good-will and reputation that you could build that way might even get you the one or other contract for traditional SEO work (or web-design, etc.). </p>
<p>As mobile search gets more popular, local search is only going to get more and more important. Helping small businesses in your community to get a foot in the door does not take much. Take the time and show people how SEO &#8211; getting set up for local search &#8211; is something that can make a difference in a positive way. </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=5sW0b82vSZM:3qrzUVuDroU:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=5sW0b82vSZM:3qrzUVuDroU:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=5sW0b82vSZM:3qrzUVuDroU:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=5sW0b82vSZM:3qrzUVuDroU:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=5sW0b82vSZM:3qrzUVuDroU:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/5sW0b82vSZM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/untitled-document/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://johnmu.com/untitled-document/</feedburner:origLink></item>
		<item>
		<title>Opportunities in Search</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/uHOrmvR1S2o/</link>
		<comments>http://johnmu.com/opportunities-in-search/#comments</comments>
		<pubDate>Sat, 22 Sep 2007 09:01:25 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://johnmu.com/opportunities-in-search/</guid>
		<description><![CDATA[So I went to visit Google in Mountain View &#8230; &#8230; and learned that every second sentence has to be prefixed with &#8220;so&#8221;. Wait, that&#8217;s not all. I&#8217;m sure you&#8217;re all just reading this to hear about the secret information they&#8217;ve been feeding me, heh. Sorry, you&#8217;ll have to join Google yourself to find out [...]]]></description>
				<content:encoded><![CDATA[<h2>So I went to visit Google in Mountain View &#8230;</h2>
<p>&#8230; and learned that every second sentence has to be prefixed with &#8220;so&#8221;. Wait, that&#8217;s not all.</p>
<p>I&#8217;m sure you&#8217;re all just reading this to hear about the secret information they&#8217;ve been feeding me, heh. Sorry, you&#8217;ll have to join Google yourself to find out more about that part. It&#8217;s been really interesting so far, so many documents to read and digest, so many neat people to meet and chat with, so much good food to eat (good thing I&#8217;m only here for a week). The Google campus is really neat, but it&#8217;s also good to get out and do something else, like getting some neat toys for the kids (bribe my way back to getting them to recognize me when I&#8217;m home)&#8230;</p>
<h2>Where are you in local search?</h2>
<p>So I&#8217;m off to find a neat toy store that has more than the average plastic junk. Of course I&#8217;ll try to use Google local search to help me find one, I&#8217;m sure there are lots of really great stores around here&#8230;<br />
<span id="more-114"></span><br />
<img src='http://johnmu.com/wp-content/stuff/local1.jpg' alt='toy stores near Mountain View, CA' style="float:left;"/>&#8230; only I can&#8217;t find out enough about them to be sure that I want tot go there. How far can you get with [<a href="http://www.google.com/search?q=%22toy+store%22+%22mountain+view%22">"toy store" "mountain view"</a>]? Some of the stores mentioned in the <a href="http://maps.google.com/maps?view=text&#038;q=toy+store+loc%3A+Mountain+View%2C+CA">map view</a> show more than just the location and some scraped user &#8220;reviews&#8221; (used lightly, when you read reviews which mention &#8220;conveniently located in &#8230;&#8221; and &#8220;&#8230; any seasonal holiday such as Easter, Valentine&#8217;s Day, etc.&#8221; &#8211; yeah sure, that&#8217;s how I would describe <strike>my</strike> a store) and an <a href="http://www.daplus.us/ShowPhoto.aspx?abi=1C1DE5BF09E4C6BE00F7A204FC7581B08382080339DB8142BB4AABEE6D355113&#038;Partner=400240">image copied from a page</a> mentioning the store. </p>
<p>What if you want to narrow your search down? How the heck do you put &#8220;good&#8221; into words? There are people out there who think cheap plastic junk makes for a good child&#8217;s toy, so I can&#8217;t just add the word &#8220;<a href="http://maps.google.com/maps?view=map&#038;q=good+toy+store+loc:+Mountain+View,+CA">good</a>&#8221; (what&#8217;s up with &#8220;Uncle Frank&#8217;s BBQ&#8221; in there?). In the end I go for &#8220;wooden toys&#8221; or &#8220;wooden toys store&#8221; and try several locations in the area, checking the results of several local web directories <img src='http://johnmu.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  that were included in the search results to pick out a handful of stores (which I write down on paper to feed into my GPS). I end up heading to <a href="http://www.woodenhorsetoys.com/">The Wooden Horse</a> in Los Gatos which was really fun (and made me buy way too much). </p>
<p>If you&#8217;re new to an area (or just visiting like I was), it can make sense to try to find good stuff through a search engine that can search based on location. I can understand that it&#8217;s a bit spotty way off in <strike>Sweden</strike> Switzerland (ok, in Sweden as well), but heck, we&#8217;re in the middle of Google&#8217;s backyard. </p>
<p>The average smaller brick and mortal store still has a web presence that is <a rel="really nofollow" href="http://www.kids-treasures.com/">terrible</a>. It doesn&#8217;t take much to get those websites cleaned up and ready for the minimal search engine friendliness. It doesn&#8217;t take much to get the business <a href="http://www.google.com/local/add">registered with Google local search</a>. Do it! <strong>Help your local businesses to get it done, help the small guy get his business listed properly.</strong> Put a nice image in there, add a good description (without the &#8220;etc&#8221;), put the real hours in there and make sure that all of that is also shown on the website in a readable fashion. Do a good deed and help that mom &#038; pop, where you get a special treatment, to get found online. It doesn&#8217;t take much, once you know what to do. </p>
<p>There is a lot of opportunity in local search, but it won&#8217;t happen by itself. This is where having a clean website really pays off for a local business &#8211; when the information is presented in a machine readable way a lot of this can be automated and made available correctly. It doesn&#8217;t take much work to add the rest, and suddenly even <a href="http://www.youtube.com/watch?v=5nNOBB41xdA">cars are sending customers</a>. </p>
<h2>Why can&#8217;t I browse your site on my phone?</h2>
<p>Sometimes I forget my notes or &#8211; gasp &#8211; can&#8217;t get a wireless connection with my laptop and have to look things up on the way. Why is it still the exception to have a site that can be used with a mobile phone browser? Some people might be using the iPhone, but the general public isn&#8217;t (especially those who from undeveloped countries like Switzerland). Why is it still so hard to access sites on the phone? I just want to look up the address or the times they&#8217;re open, argh! Ironically even some of the sites that are running mobile ads are not ready to be browsed with a phone. </p>
<p>Saying that nobody is using a phone to access your site now is no excuse &#8212; if your site doesn&#8217;t work on a phone, nobody will use your site that way. Make it easy for a customer to find you on the phone, make it easy for them to view at least some minimal information about your site / business while on the road. The longer people wait to get this done right, the more business those who can do it will get until then <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . Opportunity is waiting!</p>
<h2>Spoiled by web 2.0 &#8211; easily adding neat stuff for free</h2>
<p><img src='http://johnmu.com/wp-content/stuff/map2.jpg' alt='obfuscated mapping' style="float:left;" />Well, this is kind of getting long but I&#8217;m still in Silicon Valley. I really need to get out, run up some hill and get some fresh air into my lungs. Searching for [<a href="http://www.google.com/search?q=hike+mountain+view">hike mountain view</a>] (don&#8217;t those brackets look neat?) leads me to <a href="http://www.bahiker.com/">http://www.bahiker.com/</a>, which has a fancy map that I can click on. Looks good so far, I bet I can find something there. Going to the right area, I end up on a <a href="http://www.bahiker.com/southbay.html">page with a map</a> and a gazillion links (less than 100, so we&#8217;re still good, lol) all bunched up on the map.</p>
<p>I want something nice, a couple of miles long that goes up and down a bit and is somewhere in the area. The only way to find any of that is to click my way through almost every item that is somewhere in that general location. I wonder how much work went into making that map &#8211; the one that is almost useless (it can only give you a very rough location).</p>
<p>Maybe I&#8217;m just too Google-oriented now, but adding a nice map that can be annotated is really simple now (I&#8217;m sure you can do the same with some of the other online mapping services), eg:</p>
<p><iframe width="100%" height="480" frameborder="0" scrolling="no" marginheight="0" marginwidth="0" src="http://maps.google.com/maps/ms?ie=UTF8&amp;hl=en&amp;msa=0&amp;msid=117337835690318917693.00043ab4be256c7bfd8da&amp;t=h&amp;om=1&amp;s=AARTsJqCdvCq-ZQSLngdSIAkmwGuPMP0ig&amp;ll=37.474645,-122.278637&amp;spn=0.008174,0.013733&amp;z=16&amp;output=embed"></iframe><br /><small><a href="http://maps.google.com/maps/ms?ie=UTF8&amp;hl=en&amp;msa=0&amp;msid=117337835690318917693.00043ab4be256c7bfd8da&amp;t=h&amp;om=1&amp;ll=37.474645,-122.278637&amp;spn=0.008174,0.013733&amp;z=16&amp;source=embed" style="color:#0000FF;text-align:left">View Larger Map</a></small></p>
<p>By using a system like Google Maps you can have a map that lets the user zoom in and check locations before actually going in and reading all about the details. You can even set up your own maps to be public, findable on Google Maps and Google Earth. How neat is that? Add a few images to the site, perhaps even a video and you&#8217;re all set to cover all bases for <a href="http://googlewebmastercentral.blogspot.com/2007/05/taking-advantage-of-universal-search.html">universal search</a>. </p>
<p>To cover even more (and get another entry in the search results for your site <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  ) you could even set up a Google Group for the site or set up a forum on your site (for more control over the look and feel, and to display your own ads). Using Google Groups you can get that done really quickly. For this site you could start a separate thread for each location and link to that from the pages themselves. As a user, it&#8217;s great to see multiple opinions about something before actually packing up and going someplace. User generated content is great &#8212; you only have to enable it, your users can fill it up for you (provided your site is compelling enough). </p>
<p>Another opportunity is kind of lost with regards to the ads on that site &#8212; they&#8217;re way on the bottom, a place where nobody would ever bother looking, let alone clicking. You can make more money with Adsense if you test where ads work best and use them there. If done right, the ads could even add value to the page. That alone doesn&#8217;t really provide value in search, but it could provide more motivation to keep the site &#8220;modern&#8221;. </p>
<p><strong>There are opportunities in search all around &#8211; look around, take advantage or help others to take advantage of them!</strong></p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=uHOrmvR1S2o:Zep5FndR0DA:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=uHOrmvR1S2o:Zep5FndR0DA:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=uHOrmvR1S2o:Zep5FndR0DA:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=uHOrmvR1S2o:Zep5FndR0DA:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=uHOrmvR1S2o:Zep5FndR0DA:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/uHOrmvR1S2o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/opportunities-in-search/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://johnmu.com/opportunities-in-search/</feedburner:origLink></item>
		<item>
		<title>Interview with Richard Hearne (“Red Cardinal”)</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/2f0aFR0Kb2o/</link>
		<comments>http://johnmu.com/interview-red-cardinal/#comments</comments>
		<pubDate>Thu, 06 Sep 2007 20:55:56 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[People]]></category>

		<guid isPermaLink="false">http://johnmu.com/interview-red-cardinal/</guid>
		<description><![CDATA[Hi Richard, welcome to my blog! When I look through the top posters in the Google Webmaster Help groups, you&#8217;re almost always in there &#8211; it&#8217;s great to have you there and your posts bring in a lot of background knowledge that I&#8217;m sure many site owners appreciate. It&#8217;s interesting that you are &#8211; as [...]]]></description>
				<content:encoded><![CDATA[<p>Hi Richard, welcome to my blog! When I look through the top posters in the Google Webmaster Help groups, you&#8217;re almost always in there &#8211; it&#8217;s great to have you there and your posts bring in a lot of background knowledge that I&#8217;m sure many site owners appreciate. It&#8217;s interesting that you are &#8211; as far as I can tell &#8211; the only one of the top posters who is <a href="http://www.redcardinal.ie/">professionally active</a> in the website-area.</p>
<p><em>Hi John. Thanks so much for asking me to participate &#8211; it&#8217;s a great honour to talk with you. Odd that you mention about my professional background. It has dawned on me that I might be the only professional SEO on the group (I wasn&#8217;t sure about this), and I&#8217;m slightly surprised that there isn&#8217;t more participation by other professionals. I&#8217;m quite sure that there are lurkers from the SEM industry, as the group is an excellent educational resource. More thoughts on this aspect a little later. Back to your questions now.</em></p>
<p><strong>Why do you spend so much of your time in the Google Webmaster Help groups &#8211; isn&#8217;t that almost like giving away the work that you would normally charge for? What&#8217;s in it for you?</strong></p>
<p>Funny, I&#8217;m a little embarrassed that I don&#8217;t get more time to contribute on the group. I&#8217;m not sure why that is, but I suppose I feel a small amount of ownership having posted as regularly as possible. I actually went to my profile recently to try and see how I first came across the group. I can&#8217;t say with 100% confidence how I found the group, but I was able to uncover what it was that sent me there. I was trying to find out whether Google would give .EU cTLD any special treatment in terms of country level searches. My first interaction with the group was with another &#8216;regular&#8217;, and that individual gave his time and knowledge freely. There&#8217;s something very endearing to actions which are without motive.<br />
<span id="more-113"></span><br />
I&#8217;ve been very lucky both personally and professionally in the past few years. I live a comfortable lifestyle and I&#8217;m getting opportunities to work on projects that even I would never have imagined not so long ago. In a way I owe some of my success to the group. There is only one resource I know of that provides more insight into Search Engines in general, and Google in particular. That resource is action &#8211; learning by doing. The group comes in a very close second place. The knowledge shared there is absolutely priceless. The individuals who contribute are, by and large, equal to and above many &#8216;professionals&#8217; in the SEM field. As I said, the group is as close to the coalface as you can get without actually doing everything yourself.</p>
<p>What&#8217;s in it for me? A few things. The chance to mingle and converse with some very clever and insightful people (Random Chit-Chat can contain some gems), the ability to broaden my knowledge base, and, quite simply, the ability to commit the odd good deed or two and feel good about yourself.</p>
<p><strong>As a professional website optimizer, for search engines, usability and accessibility, how do you rate the answers given in the groups? Do you feel that site owners are generally being given good advice?</strong></p>
<p>I&#8217;m neck-deep in this kind of stuff professionally. I know that in some cases individuals and companies have saved or made themselves huge sums of money by logging into a Google Group and asking a question. I can safely say that some of the advice given there for free would attract substantial fees had it come from a large agency of superstar SEO. I think that speaks volumes for those &#8216;regulars&#8217; who help people out day-in, day-out.</p>
<p>In my opinion the quality of advice given is generally of a very high quality. And I think the Group self-regulates itself pretty well, so if you see bad advice, more often than not it will be debunked fairly soon after by a regular. I can&#8217;t say that I&#8217;ve seen many instances where bad advice has been given (excepting the period where certain negative forces existed in the Group).</p>
<p>This might cause some grief, but I&#8217;m going to state for the record that I don&#8217;t believe that compliant mark-up makes much of a difference. I am very committed to clean mark-up, but I think modern crawlers can just about munch through anything, and bad mark-up is rarely the primary cause of ranking issues (indexation perhaps more so). Connected to this, I not a subscriber to the broken META validation problem. I know the code breaks validation, but if it really was an issue I think Google would do something about it. I actually shot Phil Payne an email on this once, and while I have the utmost respect for both Phil’s experience and view on this issue, I still just cannot bring myself to buy into this particular fact (or myth?).</p>
<p>I think the greatest shame about the group is the architecture of the application itself &#8211; some absolutely fantastic information gets buried by the crap platform that is &#8216;Groups&#8217;. I&#8217;ve been on it for near a year now, and I still can&#8217;t figure out a good way to sort the wheat from the chaff (and then to find the wheat later on&#8230; *sigh*).</p>
<p><strong>You have a lot of <a href="http://www.redcardinal.ie/search-engine-optimisation/19-10-2006/10-steps-to-getting-into-google-and-staying-there/">really</a> <a href="http://www.redcardinal.ie/webdev/12-11-2006/internet-marketing-strategies/">great</a> <a href="http://www.redcardinal.ie/search-engine-optimisation/16-10-2006/13-deadly-google-sins/">content</a> on your site &#8211; is there a reason why you don&#8217;t seem to promote that content in the Google groups?</strong></p>
<p>Thank you. The beauty of the Group is that it is impartial and there are virtually never hidden agendas. I think that if I or any other poster was to start promoting their wares on the Group it would be a very negative development. Besides, I&#8217;ve never been one for overtly promoting myself or my business (inside the Group, or out), I prefer to talk about those things I have a passion for, and it just so happens that SEO, on-line marketing and on-line business are topics dear to me.</p>
<p>Oh, and just in case that&#8217;s misread by anyone &#8211; I&#8217;m not saying that it&#8217;s bad to point at your own content, just that I have to be a little more careful than others given the potential for perceived conflicts.</p>
<p><strong>In the groups and in forums everywhere, the question of whether or not it&#8217;s worth it to make sure that a site is valid (X)HTML code and complies with the generally accepted usability and accessibility guidelines is always a hot topic. On your blog you often mention such errors in sites that you review, why is valid code, usability and accessibility so important to you as a SEO?</strong></p>
<p>As a child I used to love Lego. Every time I got something new I&#8217;d rip open the box, discard the instructions and build from the picture. (Ended with my progression to Tecnics&#8230;) But seriously, you can obviously see that I don&#8217;t read ahead, hence I&#8217;ve sort of answered this above.</p>
<p>The valid code issue comes into play for me because it&#8217;s just so easy to manipulate good mark-up (tables for tabular data, not layout please). In terms of usability &#8211; well SEO is about achieving high ranks in the SERPs, but traffic is rather pointless if you cannot convert it. I&#8217;ve turned away quite a few jobs because I know that the site owner wont re-develop her site, and all the traffic in the world wont make any difference to the bottom line. In the past 6 months a large proportion of my work has been in usability and conversion optimisation actually.</p>
<p>Accessibility is a no-brainer for me. You needn&#8217;t conform with every point from the strictest guidelines, but why not give as wide an audience to your content? The added bonus is that crawlers rarely if ever have issues with well coded accessible websites. It&#8217;s a win-win.</p>
<p><strong>Why is the focus of many of your blog postings on sites for and in Ireland? (I love the local touch with the unique and interesting content about search and websites in general.)</strong></p>
<p>I suppose it&#8217;s a comfort zone thing. Most people tend to write about what they know best. And besides, I love to stir things up when I see websites that are making those stupid mistakes that require more effort than doing things well. (On an aside, controversy can be a very strong marketing tool, but manage wisely <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  )</p>
<p><strong>Is there anything I can pass on to Google, from you in particular, as a professional SEO, SEM and someone very active in the Google Groups?</strong></p>
<p>*sigh* where do I start&#8230;</p>
<ol>
<li>Webmaster console used to be great, but the information is becoming so stale that it borders on useless. (Part of me thinks this is all part of the great anti-SEO crusade Google is currently on.)</li>
<li>Stop crapping out my searches with spyware interstitials &#8211; I&#8217;m logged in, you can identify me, I&#8217;m not infected. (Further tactic in Google&#8217;s anti-SEO crusade.)</li>
<li>More Blue badges in the Group please &#8211; if you guys can take time out to hang in WMW, surely it&#8217;s not too much to expect a little interaction in your own &#8216;Official&#8217; support forum?</li>
<li>On a local note &#8211; given the huge base here in Ireland (&#8216;Paddyplex&#8217;) why isn&#8217;t Google more active in the local web community? MS puts you guys to aboslute shame with the local support they give to grass roots. Not even a peep out of Google.</li>
</ol>
<p><strong>Is there anything you&#8217;d like to add?</strong></p>
<p>Very, very well done on your new job. Something tells me &#8216;perfect fit&#8217; applies here. The only negative might be that Google&#8217;s gain will be the wider community&#8217;s loss&#8230; Hand on heart, you&#8217;re definitely one of the kindest and most knowledgeable people I&#8217;ve met on my short travels across the Interweb.</p>
<p><strong>Thanks for your time, Richard!</strong></p>
<p>No, thank you John.</p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=2f0aFR0Kb2o:lSCJs_NBGdM:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=2f0aFR0Kb2o:lSCJs_NBGdM:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=2f0aFR0Kb2o:lSCJs_NBGdM:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=2f0aFR0Kb2o:lSCJs_NBGdM:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=2f0aFR0Kb2o:lSCJs_NBGdM:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/2f0aFR0Kb2o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/interview-red-cardinal/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://johnmu.com/interview-red-cardinal/</feedburner:origLink></item>
		<item>
		<title>Google Webmaster Groups statistics for August 2007</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/p8GzSlmm1Ps/</link>
		<comments>http://johnmu.com/statistics-2007-august/#comments</comments>
		<pubDate>Sun, 02 Sep 2007 21:55:53 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://johnmu.com/statistics-2007-august/</guid>
		<description><![CDATA[Another month goes by, here are the statistics for August (and some comparisons to July in brackets) 2007. The numbers Number of new threads = 1329 [+5.3%] Number of new posts = 7676 [-1.0%] Average number posts/new thread = 5.48 [-4.5%] Number of posts by new users = 1061 (13.8%) [+30.0%] Number of threads by [...]]]></description>
				<content:encoded><![CDATA[<p>Another month goes by, here are the statistics for August (and some comparisons to <a href="http://johnmu.com/statistics-2007-july/">July</a> in brackets) 2007.</p>
<p><strong>The numbers</strong></p>
<ul>
<li>Number of new threads = 1329 <em>[+5.3%]</em></li>
<li>Number of new posts = 7676 <em>[-1.0%]</em></li>
<li>Average number posts/new thread = 5.48 <em>[-4.5%]</em></li>
<li>Number of posts by new users = 1061 (13.8%) <em>[+30.0%]</em></li>
<li>Number of threads by new users = 813 (61.2%) <em>[+13.4%]</em></li>
<li>Average number of posts in threads by new users = 4.9 <em>[-2.0%]</em></li>
<li>Number of new threads started by Googlers = 4 <em>[+33.3%]</em></li>
<li>Number of new posts by Googlers = 54 <em>[-36.5%]</em></li>
</ul>
<p>More posts by new users is nice to see &#8211; I hope that&#8217;s because the &#8220;feel&#8221; of the Groups has improved and not just because of strange things going on in the index <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  .<br />
<span id="more-112"></span><br />
<strong>Top posters</strong></p>
<ol>
<li><a href="http://www.webado.net/">webado</a> = 701</li>
<li><a href="http://cass-hacks.com/">cass-hacks</a> = 438</li>
<li><a href="http://www.utheguru.com/">dockarl</a> = 325</li>
<li><a href="http://www.asymptoticdesign.com/">cristina</a> = 317</li>
<li><a href="#">JohnMu</a> = 305</li>
<li><a href="http://www.isham-research.co.uk/">Phil Payne</a> = 303</li>
<li><a href="http://www.jlh-design.com/">JLH</a> = 287</li>
<li><a href="http://blog.aitechsolutions.net/">abracadabra</a> = 216</li>
<li><a href="http://coplien.com/">djc</a> = 185</li>
<li><a href="http://www.tesol-direct.com/">Robbo</a> = 124</li>
<li><a href="http://sebastians-pamphlets.com/">Sebastian</a> = 96</li>
<li>seo101 = 92</li>
<li><a href="http://www.travellerspoint.com/">Sam I Am</a> = 90</li>
<li><a href="http://www.dmovers.com/">burchman519</a> = 78</li>
<li>Randy P. = 74</li>
<li><a href="http://www.kennels.co.uk/">beckysharpe</a> = 68</li>
<li><a href="http://www.golf-holeinone.com/">roysnj</a> = 67</li>
<li><a href="http://www.sitebyjames.com/">MrOmnicron</a> = 64</li>
<li>kklynnt = 60</li>
<li><a href="http://www.redcardinal.ie/">Red Cardinal</a> = 47</li>
</ol>
<p>It&#8217;s great to see some new names in there! If you&#8217;re missing a link, put one (a clean one) in your profile and drop me a note. </p>
<p><strong>Top thread starters</strong></p>
<ol>
<li>Alaa Maanawi = 15</li>
<li>JohnMu = 13</li>
<li>GeminiZin = 9</li>
<li>Dhanjal = 8</li>
<li>burchman519 = 7</li>
<li>JLH = 7</li>
<li>bobflack = 6</li>
<li>cass-hacks = 6</li>
<li>bornblue = 5</li>
<li>dss = 5</li>
<li>easytreasure = 5</li>
<li>hsanchezp = 5</li>
<li>Roh.it = 5</li>
<li>xor = 5</li>
<li>bonsai-resources = 4</li>
</ol>
<h2>Threads with Googlers participating</h2>
<ul>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/41bc90b4e6eb5aeb">Crawl, Index, Rank: Can&#39;t find my own domain</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/1c3b16bcdebbca48">Crawl, Index, Rank: Do I need the www. when I list the url?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/45eb12862b201a5f">Crawl, Index, Rank: Flash site</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/001bbf60a84671e1">Crawl, Index, Rank: Google&#39;s &quot;Don&#39;t Be Evil&quot; slogan</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/8d13bd197dff2f63">Crawl, Index, Rank: Googlebot admin error</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/79bd6656b11a5981">Crawl, Index, Rank: How to optimize Ajax site?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/b2b10c0c42fd6b90">Crawl, Index, Rank: Huge Drop In Ranking&#8230; Was I penalized?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/706ebcc6b97e8639">Crawl, Index, Rank: index question</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/1d598286c79b49ea">Crawl, Index, Rank: Is my site being penalized?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/0963595c1b4461e0">Crawl, Index, Rank: Lower case headlines/no snippets in Google index</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3f2a0b91c00bf67a">Crawl, Index, Rank: making money with google</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/71bc2cbba5d72dd2">Crawl, Index, Rank: Need Dreamweaver behavior help!</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/ca4241192883abcb">Crawl, Index, Rank: POSTING DELAYS</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/245de4f3db578c50">Crawl, Index, Rank: Search Results</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/624c1460a8c17392">Crawl, Index, Rank: Site www.conexmetals.com</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e289cf3b1a37a844">Crawl, Index, Rank: Surprise.com</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/9b94ea3dd4ab3448">Crawl, Index, Rank: Violations of the Google webmaster guidelines</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/dd7f65d589023380">Random Chit-Chat: Anybody using the sexy X-Robots-Tags yet?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/8463e669e7c571bd">Random Chit-Chat: Blue Badge</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/19dae58b42ef76a6">Random Chit-Chat: How to become a Webmaster Help superstar</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/b1ec3204aaa8184f">Random Chit-Chat: This American Life is hiring a webmaster!</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/7d63c3028a2488b0">Sitemaps: Sitemap status pending issue</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/f4ecb7e0316ef750">Webmaster Tools: 403 Forbidden meta tag and html page</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/8c35625434d1a1d1">Webmaster Tools: Code in place, site not verifying&#8230;</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/5ee6610f1b78acfd">Webmaster Tools: Google autofill not fully working with my form</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/374fe78085780e16">Webmaster Tools: Logout Error in Google Webmaster tools</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/31825712d89d6460">Webmaster Tools: Removal from index denied&#8230; but I&#39;m doing what they ask!</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/905a17da524eacf7">Webmaster Tools: Sub domain verification meta tag</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/84f047d9ef3f210c">Webmaster Tools: temporary error processing your Sitemap</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/cc66d3e7424f006d">Webmaster Tools: Verification Issues?</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/2a6e8791d89786d5">Webmaster Tools: Verification Problem</a></li>
</ul>
<h2>New threads started by Googlers</h2>
<ul>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/ca4241192883abcb">Crawl, Index, Rank: POSTING DELAYS (Susan Moskwa)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/19dae58b42ef76a6">Random Chit-Chat: How to become a Webmaster Help superstar (Susan Moskwa)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/b1ec3204aaa8184f">Random Chit-Chat: This American Life is hiring a webmaster! (Susan Moskwa)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/cc66d3e7424f006d">Webmaster Tools: Verification Issues? (Jonathan Simon)</a></li>
</ul>
<p>I hope these two lists can be removed in the future (because they&#8217;re too long). <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  </p>
<h2>New threads started by regulars (>100 posts)</h2>
<ul>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/84c498bdb260dbbe">Crawl, Index, Rank: A Cool Tool &#8211; SEOQuake (dockarl)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/7331a57cb401b0b5">Crawl, Index, Rank: Become a Member / Posting a Question (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/0e90e1a76ee71544">Crawl, Index, Rank: Can somebody from Google please, please fix the script that shows conten of cached pages? (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/652540915430414b">Crawl, Index, Rank: fitflex.com &#8211; how to attain more quality links (Nate121)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/f0ad62c3fe6e9d33">Crawl, Index, Rank: Going on 4 months; what more can we try?! (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/5dd4351656fed534">Crawl, Index, Rank: Google Duplicate Proxy Exploit &#8211; automated (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/e4f21e08152405c3">Crawl, Index, Rank: How to walk when you can&#39;t crawl? (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/55f8d3d44cb9bc93">Crawl, Index, Rank: http://www.google.net (MrGamma)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/dcb82140541b13d6">Crawl, Index, Rank: Lost our google rankings we re-designed our site (djc)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/27f68ae0c595b16f">Crawl, Index, Rank: Multi National sites +domain names (surf_doggie)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/a408bfd0ab881fac">Crawl, Index, Rank: Oh For Crying out loud.. UtheGuru&#39;s SECOND best performing Page gone too.. (dockarl)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/fa936bea22142ba0">Crawl, Index, Rank: PageRank of non-indexed pages. (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/b005b512c0155cd2">Crawl, Index, Rank: Please stop the me too threadjacks (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3ab25852d907a107">Crawl, Index, Rank: site: search doesn&#39;t return index page, search for product does &#8211; penalty sign? (dockarl)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/ba558fc1a4c0a53f">Crawl, Index, Rank: Specific reason for drop in SERPs? (abracadabra)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/9f5edeeccdd497e0">Crawl, Index, Rank: Spyware interstitials (Red Cardinal)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/8f1e2857a9b702b3">Crawl, Index, Rank: www.analytics-marketing.com (seo101)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/9c335456b1f18ee6">Random Chit-Chat: &quot;I&#39;m not interested in helping spammers succeed but those who may have done something and not known any better.&quot; &#8211; an interview with JLH (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/dd7f65d589023380">Random Chit-Chat: Anybody using the sexy X-Robots-Tags yet? (Sebastian)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/8463e669e7c571bd">Random Chit-Chat: Blue Badge (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/af538c55a5d83b8b">Random Chit-Chat: chmod 777 and 775 hack attempt (djc)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/c1b4197efc47d0dc">Random Chit-Chat: DA*#!!! Punching holes in walls as company culture (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/012052b67ac0f54d">Random Chit-Chat: Do you ever get donations? (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/1166bbaf99dda9e0">Random Chit-Chat: Doesn&#39;t time fly when you&#39;re&#8230;? (IceGiant)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/620901d5310d14f0">Random Chit-Chat: Dress like a Googler &#8211; part deux (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/a20f2a09dbe1595a">Random Chit-Chat: Dress Like a Googler! (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/f63c2006d4313397">Random Chit-Chat: End of an era: dnsreprot is no more &#8211; or at least isn&#39;t free (webado)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/c87cce84b2723963">Random Chit-Chat: First impressions of an unknown site (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/7b70f80b25fb3c7c">Random Chit-Chat: How to protect your site from proxies (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/9911ad6edd2fd495">Random Chit-Chat: I&#39;ve never seen so many links on a page &#8211; statistics for July (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/c0886ce28efe021d">Random Chit-Chat: Interesting account of a hacked site (djc)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/3dbfc147b06dc4b4">Random Chit-Chat: Is blocking Firefox cloaking? (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/070937aa84fd9c2f">Random Chit-Chat: ITU-T September meeting attendees (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/f1749855516ce16f">Random Chit-Chat: Next interview online <img src='http://johnmu.com/wp-includes/images/smilies/icon_sad.gif' alt=':-(' class='wp-smiley' /> ) (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/c78bec18a4e16daf">Random Chit-Chat: Old sites not ever indexed by google posts all of a sudden (abracadabra)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/c30b51ee1897cf1b">Random Chit-Chat: Random Chit-Chat is boring lately (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/79a16ca3c94d73d6">Random Chit-Chat: site: operator no longer ordered by &quot;importance&quot;? (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/452e87783bf4880a">Random Chit-Chat: The Doc is in! (-terviewed) (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/55292b1cbddec290">Random Chit-Chat: To _ or not to _ or, is &#8211; better? (cass-hacks)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/3802c8c9ba16ef4f">Random Chit-Chat: Top Poster Pool (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/66964432e0bd00d0">Random Chit-Chat: Web Content Accessibility (cristina)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/1315bf341e375a19">Random Chit-Chat: What do you think about &quot;pagerank&quot;? (JohnMu)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/9a44a5d0cd020b7d">Random Chit-Chat: You can own oyoy for only $49,000.00 (JLH)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Sitemap/browse_thread/thread/1719c4190bbb8b36">Sitemaps: &quot;Too many redirects&quot; &#8211; giggle (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/afe7a02fc396a544">Suggestions: If possible show HTML source at verification by meta tag (cristina)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/0b98c5c1b69e67b6">Suggestions: Problem, but might constitute a suggestion&#8230; (marketingtitan)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/7b2710df3eac8a7d">Suggestions: RELATED (or similar) command (burchman519)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/2b6ae7b5e5f9e998">Suggestions: Starring audit trail (Phil Payne)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/cb788cc742201ae4">Webmaster Tools: 000&#39;s of URLs reported as restricted by robots.txt&#8230; but they&#39;re not (Red Cardinal)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/b550e45ca120402a">Webmaster Tools: Can Google follow Yahoo&#39;s lead here? (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/1bda75ad0289b369">Webmaster Tools: Error, but still &quot;We accessed your home page successfully.&quot; ? (Sam I Am)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/9aaa223cf8c3e765">Webmaster Tools: Site root page not in index &#8211; sign of a penalty? (dockarl)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/cc66d3e7424f006d">Webmaster Tools: Verification Issues? (Jonathan Simon)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/181ea5d12a78b721">Webmaster Tools: Which is wrong? Webmaster Tools or Webmaster Help Center? (Sam I Am)</a></li>
</ul>
<p>This list is getting kind of long &#8230; is it interesting enough for anyone out there? </p>
<h2>Top most active threads</h2>
<ol>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3944cbb843ce391d">Crawl, Index, Rank: Supplemental Index&#8230;Now What? (114 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/a408bfd0ab881fac">Crawl, Index, Rank: Oh For Crying out loud.. UtheGuru&#39;s SECOND best performing Page gone too.. (66 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/fd36df41cf2ad275">Crawl, Index, Rank: 100+ days in the penalty box (61 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Tools/browse_thread/thread/cc66d3e7424f006d">Webmaster Tools: Verification Issues? (60 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/351a4a1737336fb5">Crawl, Index, Rank: Here&#39;s one for Phil Payne (53 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/3238914c52ff7b18">Suggestions: Website link on Googles links to trojan.exploit.131 (50 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3ab25852d907a107">Crawl, Index, Rank: site: search doesn&#39;t return index page, search for product does &#8211; penalty sign? (41 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/702d4b27298ffdea">Crawl, Index, Rank: Thoroughly sabotaged (39 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/d968239fdd075c0f">Crawl, Index, Rank: Congrats to those penalized-now-released &#8211; but I&#39;m still penalized! <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  (39 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3404691a54537c4e">Crawl, Index, Rank: Being Removed for No Apparent Reason (38 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/c7072683ad5cc933">Crawl, Index, Rank: Sitemap Error Messages (37 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/1315bf341e375a19">Random Chit-Chat: What do you think about &quot;pagerank&quot;? (37 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/3588e6c471c70abe">Crawl, Index, Rank: Really lost on meta refresh! debate: Phil vs Sebastian (36 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/8463e669e7c571bd">Random Chit-Chat: Blue Badge (36 new replies)</a></li>
<li><a target="_blank" href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/61662dcd2c731833">Crawl, Index, Rank: Different results from google.com and google.co.uk (35 new replies)</a></li>
</ol>
<h2>Top most linked Domains</h2>
<ol>
<li>google.com = 681</li>
<li>groups.google.com = 354</li>
<li>our-web-site.com = 329</li>
<li>youreviewelectronics.com = 245</li>
<li>validator.w3.org = 201</li>
<li>venusandmarz.blogspot.com = 195</li>
<li>skeel.info = 170</li>
<li>n2news.com = 167</li>
<li>cheesecakefantasy.com = 156</li>
<li>dir.yahoo.com = 140</li>
<li>uvexs.com = 122</li>
<li>venusandmarz.com = 116</li>
<li>twiztedtattoosupplies.com = 111</li>
<li>positivemoneyideas.com = 106</li>
<li>asymptoticdesign.com = 105</li>
<li>tool.motoricerca.info = 104</li>
<li>oyoy.eu = 101</li>
<li>redroller.com = 99</li>
<li>tomris.com = 89</li>
<li>domain.com = 88</li>
</ol>
<h2>Top most linked Google answers</h2>
<ol>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35769">Webmaster Help Center &#8211; Webmaster Guidelines</a> = 44 [-30%]</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66356">Webmaster Help Center &#8211; Link schemes</a> = 13 [+18%]</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35843">Webmaster Help Center &#8211; How do I request reconsideration of my site?</a> = 13 [+85%]</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=47334">Webmaster Help Center &#8211; How do you compile the list of links shown below some search results?</a> = 11 [+10%]</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66353">Webmaster Help Center &#8211; Hidden text and links</a> = 7 [-50%]</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40364">Webmaster Help Center &#8211; How do I block Googlebot?</a> = 7</li>
<li><a href="http://www.google.com/support/jobs/bin/answer.py?answer=48264">Webmaster Trends Analyst &#8211; Seattle/Kirkland</a> = 5</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34431">Webmaster Help Center &#8211; Does Google index dynamic pages?</a> = 5</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=61062">Webmaster Help Center &#8211; How do I use the URL removal request tool?</a> = 4</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66736">Webmaster Help Center &#8211; Why should I report paid links to Google?</a> = 4</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66359">Webmaster Help Center &#8211; Duplicate content</a> = 3</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35302">Webmaster Help Center &#8211; Block or remove your entire website using a robots.txt file</a> = 3</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40360">Webmaster Help Center &#8211; How do I use a robots.txt file to control access to my site?</a> = 2</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40349">Webmaster Help Center &#8211; How can I create a Google-friendly site?</a> = 2</li>
<li><a href="http://www.google.com/support/googleanalytics/bin/answer.py?answer=55494">Google Analytics Help Center &#8211; How do I create a filter?</a> = 2</li>
</ol>
<p>Hope you enjoyed the numbers <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  . </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=p8GzSlmm1Ps:SCuonvNdEgA:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=p8GzSlmm1Ps:SCuonvNdEgA:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=p8GzSlmm1Ps:SCuonvNdEgA:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=p8GzSlmm1Ps:SCuonvNdEgA:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=p8GzSlmm1Ps:SCuonvNdEgA:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/p8GzSlmm1Ps" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/statistics-2007-august/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		<feedburner:origLink>http://johnmu.com/statistics-2007-august/</feedburner:origLink></item>
		<item>
		<title>A set of command-line Windows website tools</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/dBteahUIgAc/</link>
		<comments>http://johnmu.com/web-toolbox-1/#comments</comments>
		<pubDate>Thu, 30 Aug 2007 22:51:42 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Tricks]]></category>

		<guid isPermaLink="false">http://johnmu.com/web-toolbox-1/</guid>
		<description><![CDATA[If you have to do things over and over again, it&#8217;s a good idea to use a tool to make things easier. Windows is a bit limited (or very &#8211; when compared to Linux) when it comes to batch file scripts and &#8220;wget&#8221; is limited to what it can do right out the box, so [...]]]></description>
				<content:encoded><![CDATA[<p>If you have to do things over and over again, it&#8217;s a good idea to use a tool to make things easier. Windows is a bit limited (or very &#8211; when compared to Linux) when it comes to batch file scripts and &#8220;wget&#8221; is limited to what it can do right out the box, so I sat down and wrote a few command line tools to help me with some of the website checks that I like to do. </p>
<p>The tools I included in this set can do the following:</p>
<ul>
<li>Check the result codes for a URL (and follow in the case of a redirect) &#8211; or for a list of URLs</li>
<li>Create a list of the links found on a URL (or just particular ones)</li>
<li>Create a list of the links and anchor texts found on a URL (or just particular ones)</li>
<li>Create a simple keyword analysis of the indexable content on a URL</li>
</ul>
<p><span id="more-111"></span><br />
You can get the down from here (requires the Windows .NET runtime v1.1):</p>
<ul>
<li><a href="http://johnmu.com/files/WebToolbox.zip">WebToolbox.zip</a> (140kb)</li>
</ul>
<p><strong>WebResult</strong></p>
<p>This tool accesses a URL and shows the result code that was returned. If the status is a redirect, it will display the redirection location and optionally follow it to check the final result code. It may be used with a list of URLs. The output is tab-delimited.</p>
<p>Usage:<br />
<em>WebResult [options] (URL|urllist.txt)<br />
Options:<br />
 &#8211;referer|-r [referrer] (default: none)<br />
 &#8211;user-agent|-u [user-agent] (default: &#8220;WebResult&#8221;)<br />
 &#8211;follow-redirect|-f (default: not)<br />
 &#8211;headers|-h (displays the full response headers)<br />
 &#8211;verbose|-v</em></p>
<p>Example:<br />
Check for correct canonical redirect:<br />
 Webresult http://johnmu.com/<br />
 Webresult http://www.johnmu.com/</p>
<p><strong>WebLinks</strong></p>
<p>This tool lists the links that are found on a URL. Note that it has an integrated HTML/XHTML parser &#8211; if the code on the page is not fully compliant, there is a chance of the parser not recognizing all links (it is fairly fail-safe, though). </p>
<p>This tool can use a cached version of the URL (from either this tool or one of the other ones) to save bandwidth. The cached versions are saved in the user&#8217;s temp-folder. </p>
<p>You have the choice of only listing domain outbound or insite links (to help simplify the output). Additionally links with the HTML microformat &#8220;rel=nofollow&#8221; may be marked as such. The output is in alphabetical order. </p>
<p>Usage:<br />
<em>WebLinks [options] (URL|urllist.txt)<br />
Options:<br />
 &#8211;referer [referrer] (default: none)<br />
 &#8211;user-agent [user-agent] (default: &#8220;WebLinks&#8221;<br />
 &#8211;insite-only|-i (default: both in + out)<br />
 &#8211;outbound-only|-o (default: both in + out)<br />
 &#8211;ignore-nofollow|-n (default: off)<br />
 &#8211;cache|-c (default: off)<br />
 &#8211;verbose|-v (default: off)</em></p>
<p>Example:<br />
Check the outbound links on a site.<br />
 WebLinks -o http://johnmu.com/</p>
<p><strong>WebAnchors</strong></p>
<p>This tool lists the links and anchor text as found on a URL. It uses the same HTML/XHTML parser as WebLinks. It can be used to find certain links (based on the URL, domain name, URL-snippets, or even parts of the anchor text). If the anchor for a link is an image, it will use the appropriate ALT-text, etc.</p>
<p>Usage:<br />
<em>WebAnchors [options] (URL|urllist.txt)<br />
Options:<br />
 &#8211;referer|-r [referrer] (default: none)<br />
 &#8211;user-agent|-u [user-agent] (default: &#8220;WebLinks&#8221;<br />
 &#8211;find-url|-f http://URL<br />
 &#8211;find-domain|-d DOMAIN.TLD<br />
 &#8211;find-anchor|-a TEXT<br />
 &#8211;find-url-snippet|-s TEXT<br />
 &#8211;url-only|-o (default: show anchor text as well)<br />
 &#8211;skip-nofollow|-n (default: off)<br />
 &#8211;cache|-c (default: off)<br />
 &#8211;verbose|-v (default: off)</em></p>
<p>Example:<br />
Check the links with &#8220;Google&#8221; in the anchor text.<br />
 WebAnchors -s &#8220;Google&#8221; http://johnmu.com/</p>
<p><strong>WebKeywords</strong></p>
<p>This tool does a simple keyword analysis on the indexable content of a URL. It also uses the above HTML/XHTML parser to extract the indexable text. It is possible to get single-word keywords or to use multi-word-phrases. The output is tab-delimited for re-use. </p>
<p>Usage:<br />
<em>WebKeywords [options] (URL|urllist.txt)<br />
Options:<br />
 &#8211;referer|-r [referrer] (default: none)<br />
 &#8211;user-agent|-u [user-agent] (default: &#8220;WebLinks&#8221;<br />
 &#8211;verbose|-v (default: off)<br />
 &#8211;words|-w [NUM] (phrases with number of words, default: 1)<br />
 &#8211;ignore-numbers|-n (default: off)<br />
 &#8211;cache|-c (cache web page, default: off)</em></p>
<p>Example:<br />
Extract 3-word keyphrases from a page:<br />
 Webkeywords -w 3 http://johnmu.com/</p>
<p><strong>Combined usage of these tools</strong></p>
<p>Find common keyphrases on sites linked from a page (uses a temporary file to store the URLs):</p>
<p>    webanchors -c -o -a &#8220;Google&#8221; http://johnmu.com >temp.txt<br />
    webkeywords -c -w 3 temp.txt</p>
<p>Check result codes of all URLs linked from a page:</p>
<p>    weblinks -c http://johnmu.com >temp.txt<br />
    webresult temp.txt >links.tsv</p>
<p>Compare result codes for multiple accesses:</p>
<p>    echo. >results.tsv<br />
    for /L %i IN (1,1,100) DO webresult http://johnmu.com/ >>results.tsv</p>
<p>  or more complicated to test a hack based on the referrer (all on one line):</p>
<p>    for /L %i IN (1,1,100) DO webresult -u &#8220;Mozilla/5.0 (Windows; U) Gecko/20070725 Firefox/2.0.0.6&#8243; -r http://www.google.com/search?q=johnmu http://johnmu.com/ >>results.tsv</p>
<p>I&#8217;d love to hear about your usage of these tools <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=dBteahUIgAc:-oN8HTdHCEI:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=dBteahUIgAc:-oN8HTdHCEI:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=dBteahUIgAc:-oN8HTdHCEI:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=dBteahUIgAc:-oN8HTdHCEI:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=dBteahUIgAc:-oN8HTdHCEI:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/dBteahUIgAc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/web-toolbox-1/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		<feedburner:origLink>http://johnmu.com/web-toolbox-1/</feedburner:origLink></item>
		<item>
		<title>Interview with Craig “cass-hacks”</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/zfwgjSV30bg/</link>
		<comments>http://johnmu.com/interview-with-craig/#comments</comments>
		<pubDate>Tue, 28 Aug 2007 17:57:41 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[People]]></category>

		<guid isPermaLink="false">http://johnmu.com/interview-with-craig/</guid>
		<description><![CDATA[Hi Craig, welcome to my blog ! Craig is, for those that haven&#8217;t noticed, an alien from some solar system far away. At least that&#8217;s the conclusion I came to after reading his introduction, the overview page on his site and his &#8220;my first computer&#8221; posts. I&#8217;m pretty sure that he&#8217;s either alien or very, [...]]]></description>
				<content:encoded><![CDATA[<p>Hi Craig, welcome to my blog <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  ! Craig is, for those that haven&#8217;t noticed, an alien from some solar system far away. At least that&#8217;s the conclusion I came to after reading his <a href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/msg/f3c10217c39031c8">introduction</a>, the <a href="http://cass-hacks.com/overview/">overview page</a> on his site and his &#8220;<a href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/msg/9bd3dc23504de8e7">my first computer</a>&#8221; posts. I&#8217;m pretty sure that he&#8217;s either alien or very, very creative (as in creative writing), I mean seriously, &#8220;<em>I built my own computer when I was 12.</em>&#8220;?! Craig has been a frequent contributor in the Google Groups, bringing in a lot of background knowledge, helping with stylesheets, javascript and all sorts of other issues that arrive on a regular schedule. </p>
<p><em>I know that wasn&#8217;t a question but I would like to comment anyway.  Although you are not the first to suggest I am not of this world, serious or not, I feel it is not so much a question of identifying the &#8220;where&#8221;, but identifying the &#8220;when&#8221;. </em></p>
<p><em>I think had I lived 150 to 200 years ago, I wouldn&#8217;t seem as much an alien as I do to so many people.  More often than not, people who I communicate with over a period of time before ever meeting in person say something similar, I seem odd to them because they try to identify me with a place and fail but after meeting me in person, understand it is not a matter of identifying a place, but a place in time. </em></p>
<p><em>Many people are still put off after realizing that but a few people are able to take it in stride.  You can tell a lot about a person by how they react to extreme situations and I guess I can be a bit extreme at times.  <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </em></p>
<p><em>Someone once called me an &#8220;anachronistic anomaly&#8221;.  That seems to describe me as well as any other description I have heard, at least descriptions appropriate for mixed company.  <img src='http://johnmu.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </em><br />
<span id="more-110"></span><br />
<strong>So Craig, with a brain the size of a planet, I&#8217;m sure you have some really smart and cool things to do. What drives you to spend so much time in the Google webmaster help groups?</strong></p>
<p>Good question, as in the best question have no real answers.  <img src='http://johnmu.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  The closest I think I can come to a real answer though is that I enjoy observing how things work.  One of my first memories is of my parents taking me and my two sisters to a zoo where there was a carousel.  While my sisters were busy watching the pretty horses, which were just carved and painted wood, I was watching the gears and shafts and cams and wheels looking to see how it all worked.</p>
<p>Later, much later, when I was working with particle accelerators, some the size of 5 story buildings, there would be some sort of problem but one had to have a pretty good idea of what it was because as often seemed the case and as Murphy&#8217;s Law would have it, problems usually occurred in the least accessible spot and it could take up to a couple of days just to get to where the problem might be.</p>
<p>If the problem wasn&#8217;t there, all that time was wasted.  But, it also wasn&#8217;t good enough just to know where the problem was, one also had to have an idea of how to fix it and maybe more importantly, how to keep it from happening again and again.  All of what went into getting proficient at that was observing what one could of available data from what one could see and then coming up with a reasonable scenario as to what the cause might be where one couldn&#8217;t see and then testing that scenario as much as possible before putting any plan into action.</p>
<p>In Google&#8217;s Webmaster Tools Help Group, I am able to observe a lot of different situations and the more I see of a given situation, the more I have to go on to try to come up with possible scenarios to understand what may be happening.  So I guess what drives me is what has always driven me, a desire to observe and understand.</p>
<p><strong>How did you find the Google Webmaster Help groups in the first place? Looking at your <a href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/ab06b5b082db6a8d/">first posts</a> it doesn&#8217;t look like you had any particular problem that needed to be solved.</strong></p>
<p>I found the group through the Google Webmaster Tools which I found through the &#8220;Add URL&#8221; page.  I had just launched my first publicly accessible web site and had heard of submitting URLs to the various search engines so I asked &#8220;Professor Google&#8221; how to do it for the search engines I knew about the most and found what I was looking for.  From there, I played with the Webmaster Tools for a very short time which was primarily due to there being no real data to look at when a site is first indexed and then started digging into the help files and was directed to the Groups forum. It was not so much that I was having any particular problem at the time, or since, but more so, someone felt it worthwhile to publish all that information for some reason, not reading it would seem to be a serious waste of both their time and mine.</p>
<p>You are right though, I didn&#8217;t have any particular problem nor do I think I would have asked had I one. I have been around long enough on various technical forums and the like to know that there is rarely a question that hasn&#8217;t been answered or doesn&#8217;t have an answer somewhere although very possibly being &#8220;hidden&#8221; and in need of being dug for.</p>
<p>On the other hand, I also know that for some questions, there are no answers or at least no answers likely to be forthcoming so before asking too much, I&#8217;d want to know what questions are even likely to receive an answer of any use.</p>
<p>But, search engines at that time I had very little experience with, other than as a search user and having already dealt with large amounts of data, it intrigued me as to how one might deal with essentially archiving the entire Internet and more importantly, making that archive available in an intelligent and useful manner.  Large amounts of data don&#8217;t impress me as I&#8217;ve dealt with huge databases of tera and peta-record size but the easy, intelligent and fast access to the contained data is the real challenge.</p>
<p><strong>What was it that grabbed your attention about the web? Why did you decide to put together <a href="http://cass-hacks.com/">your own website</a>?</strong></p>
<p>I wouldn&#8217;t say I was particularly &#8220;grabbed&#8221; by the web.  It just seemed like a much easier platform to develop applications for.  I&#8217;ve written in almost every language from machine code to C++ and at one time burning EEPROMs just to be able to test a section of code out.  With PHP, Javascript and MySQL, I can whip up an application in a matter of hours. It may and very likely will look like hell but the basic functionality is there, sort of a proof of concept if you will.</p>
<p>As for cass-hacks specifically though, I&#8217;d built a lot of toys of various levels of usefulness over a period of time and although any one specific toy may not be all that useful, the processes that go into making them work is always useful because a given toy&#8217;s functionality is limited to what it was designed to do as well as a little bit being extensible for other purposes if designed well but the processes that go into making any toy work can be used over and over again to build whatever one can imagine.  Also, every language has a lot of very simple syntax that is pretty boring to look at but can become interesting to the point of being exciting when combined in ways one might not originally have thought of.</p>
<p>Although straying a bit from the mark, I think the most interesting project I have documented on my site so far is one that gets the least amount of traffic.  That project is a <a href="http://cass-hacks.com/articles/discussion/js_load_notice/">user notification system</a> that is actually &#8220;agent&#8221; based, i.e. artificial life or as is commonly referred to as artificial intelligence, AI.  Many people think that &#8220;AI&#8221; is some complex rule processor that attempts to simulate intelligent thought but that is only science fiction and pretty much had been given up on many years ago. Most of the work done in this area over the past couple of decades has been &#8220;Agent based&#8221;, creating simple little entities programmed to do very simple tasks and then releasing them to do what they were programmed to do.  Where this ties in with what I have been talking about though is that once I came up with the method of implementing the functionality I wanted to support, it took me all of about 20 minutes to do it using DOM, CSS and Javascript whereas trying to do the same thing in just about any other programming environment would have taken days.</p>
<p><strong>Once you have worked with different technologies, you usually get a grasp for the general problems that could come up when implementing them. What unexpected difficulties did you run into while working on your first site(s)?</strong></p>
<p>This is going to be a boring answer.  <img src='http://johnmu.com/wp-includes/images/smilies/icon_sad.gif' alt=':-(' class='wp-smiley' /> )  None.</p>
<p>I guess from my past experience, I do things a little different than many people.  I start out with a list of requirements for a given task and then look into the various methods of satisfying the requirements, with all their possible positives and minuses and then choose the available &#8220;tools&#8221; that allow me to do the most with the least.  By the time I actually get to building something, it is sort of boring because then it is most often just a matter of &#8220;plugging and chugging&#8221;, a phrase I got from a Calculus professor in the past which basically means, set up the equations, plug in the variable data and then chug through the calculations. Once you got to the &#8220;Plug and Chug&#8221; stage, it was all pretty much done.</p>
<p><strong>If you came to a situation where you absolutely had to get a website to rank high for competitive terms, which methods would you apply first?</strong></p>
<p>Probably the first thing I would do is go out and hire an SEO.  <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />   Sorry, boring answer.  OK, first, I&#8217;d have some limitations on whether or not I even attempted it in the first place.  I&#8217;d have to be interested in and/or have some experience in the subject matter because getting different sites to rank well is not the same for all sites.  Second, I&#8217;d take a look at what the past experience of the site has been and how it is doing currently and then I&#8217;d look at what are the short term and long term goals.  I guess what all that means is that getting a website to rank high for competitive terms only, is a waste of time, energy and money.</p>
<p>But, if I didn&#8217;t care about all that and had someone else&#8217;s money to waste, I&#8217;d first make sure the site/page was even capable of ranking for the terms in the first place by making sure the terms even existed on any of the pages.  Then I&#8217;d make sure there was as much information from as many different directions as possible on the subject of the target terms and then I&#8217;d work to get enough links to the site as necessary so as to make sure the page(s) was(were) even available for searches in the first place.</p>
<p>What I can&#8217;t do though is make people search for the targeted terms.  So many people talk about wanting to rank well for this that and the other thing but so often is the case, no one is really searching for what is being targeted. I know some people use keyword generators to find out what people are searching for but I also feel that people who then decide what content to put on their site based solely on what will gain the most traffic are doing a disservice to both themselves as well as their potential visitors.</p>
<p><strong>You seem to have seen a lot of corporate environments and worked in a lot of groups, is there anything about Google that was completely unexpected to you?</strong></p>
<p>I feel another boring answer coming on. No, not really.  Google, like all companies, is made up of people.  Companies may have their policies but it is people that put them into action.  A company could have the most negative policies in the world but due to the people in its employ, the company is seen in a much more positive light than a company that may have the most altruistic policies in the world with assholes implementing them. </p>
<p>Google seems to be the best of both worlds though, company policies seeming to tend toward ensuring equality for all involved with people implementing them that also seem genuinely concerned about the people they actually serve, the users of their various products and services.  Were it not the case, I wouldn&#8217;t be sticking around because it wouldn&#8217;t make sense supporting someone else in being an asshole when I can enjoy being a much bigger one all by myself, why share?  On the other hand, when I see a situation, much like with Google, where many people feel the need to view Google as evil or have ulterior motives where having any would be counterproductive, if I can in any way help someone to possibly see the other side of things, I feel I have done some good.</p>
<p>Were it not the case of Google being a basically positive company with obviously positive people working for it, there wouldn&#8217;t be so many of them out there putting themselves in the public eye and speaking as much for themselves as they do in efforts to try to explain as much as they can about the company they work for and with.</p>
<p><strong>Turning the tables on Google, assume you had full access to everything and all the help that you needed, what would you change?</strong></p>
<p>It wouldn&#8217;t really be a matter of &#8220;turning the tables&#8221; and although I definitely feel another boring answer coming on, I don&#8217;t know enough about what goes on internally to want to change anything.  How could I know that what I wanted to change wouldn&#8217;t actually make things worse unless I knew why what I wanted to change was the way it was in the first place?</p>
<p>On the other hand, were I to have the opportunity, I would like to improve on some things, mainly things that I have been exposed to.  I&#8217;d love to revamp the Webmaster tools and make them more timely and informative to the extent possible. Getting rid of tools that are of little use while expanding on others that may seem of little use but could be much more valuable if the data they offered was expanded and made more accessible to searching through.  Also, I&#8217;d love to rewrite the Google Groups application as it seems to have the worst of all possible worlds. </p>
<p>Its use of Javascript, has to be about the most counterproductive as I have ever seen.  There are also a number of things that could be done using Javascript, but aren&#8217;t currently, that could make the Groups much easier to use.  About the only thing the Groups application has gotten right, in my opinion, is making it so that the functions of the Groups application work with Javascript enabled or disabled, which is actually a big accomplishment considering so many of the Javascript applications similar to it don&#8217;t work at all without Javascript.</p>
<p>Also, and I don&#8217;t know how much can be done in this area as I don&#8217;t know how it is currently implemented but one thing I would like to tackle would be improving the reliability of the various functions of the Groups application as it gets downright discouraging to use more often than I would like any application I was responsible for to be.</p>
<p><strong>Is there anything more you&#8217;d like to add at the moment?</strong></p>
<p>Other than thanking you for what has been my first interview in a LOOOOONNNNNGGGG time, I can&#8217;t think of anything I&#8217;d like to add.</p>
<p><strong>Thanks for your time, Craig!</strong></p>
<p>Although I&#8217;ve had a feeling this interview was coming, and dreading it, it wasn&#8217;t as painful as I thought so I thank you for making the process not too terribly intolerable! <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=zfwgjSV30bg:1jLD8HzoNwI:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=zfwgjSV30bg:1jLD8HzoNwI:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=zfwgjSV30bg:1jLD8HzoNwI:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=zfwgjSV30bg:1jLD8HzoNwI:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=zfwgjSV30bg:1jLD8HzoNwI:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/zfwgjSV30bg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/interview-with-craig/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://johnmu.com/interview-with-craig/</feedburner:origLink></item>
		<item>
		<title>The website hack you’d never find</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/pqo4SREp1Mo/</link>
		<comments>http://johnmu.com/hack-hidden-redirect/#comments</comments>
		<pubDate>Thu, 23 Aug 2007 22:03:34 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[Hack]]></category>

		<guid isPermaLink="false">http://johnmu.com/hack-hidden-redirect/</guid>
		<description><![CDATA[Warning: do not try the URLs here unless your system is locked down properly. I suggest using a &#8220;virual machine&#8221; (I use VMware) to test things like this. The hack itself is complicated, the system is simple &#8211; skip the complicated part if you&#8217;re in a hurry. It all started with a posting like this: [...]]]></description>
				<content:encoded><![CDATA[<p><strong>Warning: do not try the URLs here unless your system is locked down properly. I suggest using a &#8220;virual machine&#8221; (I use VMware) to test things like this. The hack itself is complicated, the system is simple &#8211; skip the complicated part if you&#8217;re in a hurry.</strong></p>
<p>It all started with a <a href="http://groups.google.com/group/Google_Webmaster_Help-Requests/browse_thread/thread/3238914c52ff7b18">posting</a> like this:</p>
<blockquote><p>When I do a google search for [Jonathan Wentworth Associates] the first result is:</p>
<p><em>Jonathan Wentworth Associates, LTD<br />
Welcome to Jonathan Wentworth Associates, a respected resource for world-class orchestral soloists,<br />
conductors, opera, chamber music, chamber orchestras, &#8230;<br />
www.jwentworth.com/ &#8211; 19k &#8211; Cached &#8211; Similar pages &#8211; Note this</em></p>
<p>The: &#8220;Jonathan Wentworth Associates, LTD&#8221; is highlighted and is a link to the web site.  If you place the mouse over the link, it shows http://www.jwentworth.com.  However, if you click the link it immeately attempts to download the trojan.  My McAfee immediatly blocked it.</p></blockquote>
<p>Looking at the page in question, it doesn&#8217;t appear to be hacked, it doesn&#8217;t appear to have any kind of scripts injected, etc. However, using LiveHTTPHeaders with Firefox, while doing the same steps (search, click on the top result) you see the following:<br />
<span id="more-108"></span></p>
<blockquote><p>GET / HTTP/1.1<br />
Host: www.jwentworth.com<br />
HTTP/1.x 302 Found<br />
Location: http://85.255.117.38/ind.htm?src=324&#038;surl=www.jwentworth.com&#038;sport=80&#8230;<br />
<br />
GET /ind.htm?src=324&#038;surl=www.jwentworth.com&#038;sport=80&#038;suri=%2F HTTP/1.1<br />
Host: 85.255.117.38<br />
Referer: http://www.google.com/search?q=Jonathan+Wentworth+associates<br />
HTTP/1.x 302 Found<br />
Location: http://www.jwentworth.com/</p></blockquote>
<p>Without going through Google, the page is returned right away, just like it should. Search engine crawlers also get it like that. After the step through Google however, the site does a 302 redirect to some IP-Address and then returns to the original site.  The average browser won&#8217;t see that, but if you&#8217;re quick you might spot it in the status-bar. A search engine crawler or any user who knew the address would get there without a redirect and not notice a thing.</p>
<p>Strange.</p>
<p>That&#8217;s something that deserves to be looked at more closely. What&#8217;s on that server? How could I be able to see it?</p>
<p>I had seen something similar a few months back which redirected me to an affiliate site the first time I went to that site through a Google referrer (in my case, the gmail.google.com referrer was enough). It would only trigger once per IP-Address. This looks like a similar hack.</p>
<p>When I was able to download the files, I had a nice collection of:</p>
<ul>
<li>an <strong>encrypted javascript</strong> file that downloaded exploits based on browser and operating system</li>
<li>an <strong>exploit</strong> from free-spy-cam.net</li>
<li>an <strong>affiliate sales</strong> page for an <strong>antivirus</strong> software. Oh the irony. &#8220;We just infected you, buy our antivirus to get clean.&#8221; That is, if that software isn&#8217;t infected with something else.</li>
<li>an affiliate signup link on that page</li>
</ul>
<p>A search engine crawler will never see these things. A user, coming in from Google, will get redirected and if the IP address is not known, it will trigger a few exploits based on the system the user has and then display an affiliate ad page. The next time the user comes, the redirect will happen but the normal page will be shown.</p>
<p><strong>Spotting the hack on your site</strong></p>
<p>It would be good to know how you could spot a hack like this on your site. In general, you wouldn&#8217;t be able to. You can check for this particular hack, but it might not trigger every time &#8230; not to mention that there are likely way too many hacks that you would need to check for.</p>
<p>A simple way to check for it would be to use wget to access the page, and check for strange redirects, eg:</p>
<blockquote><p>>wget &#8211;user-agent Firefox &#8211;save-headers &#8211;referer &#8220;http://www.google.com/search?q=duuude&#8221; &#8220;http://www.jwentworth.com/&#8221;</p></blockquote>
<p>However, as mentioned, that might not work every time.</p>
<p><strong>The technical details</strong></p>
<p>(skip this part, if you are lost already <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  )</p>
<p>The original spotting of the anomaly was using LiveHTTPHeaders with Firefox, while doing the steps: search, click on the top result. You see the following:</p>
<blockquote><p>GET / HTTP/1.1<br />
Host: www.jwentworth.com<br />
(&#8230;)<br />
Referer: http://www.google.com/search?q=Jonathan+Wentworth+associates<br />
<br />
HTTP/1.x 302 Found<br />
Date: Thu, 23 Aug 2007 06:38:04 GMT<br />
Server: Apache/1.3.37 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/<br />
1.2 mod_bwlimited/1.4 PHP/4.4.6 FrontPage/5.0.2.2635.SR1.2 mod_ssl/<br />
2.8.28 OpenSSL/0.9.7a<br />
Location: http://85.255.117.38/ind.htm?src=324&#038;surl=www.jwentworth.com&#038;sport=80&#8230;<br />
(&#8230; added space to prevent linking &#8230;)<br />
<br />
GET /ind.htm?src=324&#038;surl=www.jwentworth.com&#038;sport=80&#038;suri=%2F HTTP/1.1<br />
Host: 85.255.117.38<br />
(&#8230;)<br />
Referer: http://www.google.com/search?q=Jonathan+Wentworth+associates<br />
HTTP/1.x 302 Found<br />
Date: Thu, 23 Aug 2007 06:38:05 GMT<br />
(&#8230;)<br />
Location: http://www.jwentworth.com/
</p></blockquote>
<p>A strange redirect like that is a really bad sign. How can we check the URL that is given to see what they are sending? Apparently it can only be triggered once per IP-address and I had already used that chance earlier. In order to view the initial page, I had to find an IP address that was not yet registered with the remote server (at least that&#8217;s my explanation). I used a proxy server from one of the lists online. Using the proxy server and wget, I was able to access the page:</p>
<blockquote><p>&gt;set http_proxy=81.63.140.37:3128<br />
<br />
&gt;wget &#8211;user-agent &#8220;Firefox&#8221; &#8211;save-headers &#8220;http://85.255.117.38/ind.htm?src=324&#038;surl=www.jwentworth.com&#038;sport=80&#038;suri=%2Findex%2Ehtml&#8221;<br />
<br />
Connecting to 81.63.140.37:3128&#8230; connected.<br />
Proxy request sent, awaiting response&#8230; 200 OK<br />
Length: unspecified<br />
20:43:23 (79.20 KB/s) &#8211; `ind.htm@src=324&#038;surl=www.jwentworth.com&#038;sport=80&#038;suri=%<br />
2Findex.html.2&#8242; saved [414]</p></blockquote>
<p>The page that was returned was a normal frameset:</p>
<div id="ig-sh-2" class="syntax_hilite">	<div class="toolbar">		<div class="language-name">HTML4</div>		<a href="#" class="view-different">&lt; view <span>plain text</span> &gt;</a>	</div>	<div class="code"><ol class="html4strict" style="font-family:monospace;"><li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">HTML</span>&gt;&lt;<span style="color: #000000; font-weight: bold;">HEAD</span>&gt;&lt;<span style="color: #000000; font-weight: bold;">TITLE</span>&gt;&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">TITLE</span>&gt;&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">HEAD</span>&gt;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">frameset</span> framespacing<span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span> <span style="color: #000066;">border</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span> <span style="color: #000066;">rows</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;*,1&quot;</span> <span style="color: #000066;">frameborder</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span>&gt;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">frame</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;m&quot;</span> <span style="color: #000066;">src</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;/site.htm?lng=1&amp;trg=cln&amp;oip=0&amp;trk=zszuyhbinthnpzt&quot;</span> <span style="color: #000066;">scrolling</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;no&quot;</span> <span style="color: #000066;">noresize</span> <span style="color: #000066;">marginwidth</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span> <span style="color: #000066;">marginheight</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span>&gt;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">frame</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;b&quot;</span> <span style="color: #000066;">src</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;about:blank&quot;</span> <span style="color: #000066;">marginwidth</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span> <span style="color: #000066;">marginheight</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;0&quot;</span> <span style="color: #000066;">scrolling</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;auto&quot;</span>&gt;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">noframes</span>&gt;&lt;<span style="color: #000000; font-weight: bold;">BODY</span>&gt;</span>Frames not supported by your browser.<span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">BODY</span>&gt;&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">noframes</span>&gt;</span></div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;"><span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">frameset</span>&gt;&lt;<span style="color: #000000; font-weight: bold;">body</span>&gt;&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">body</span>&gt;&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">html</span>&gt;</span></div></li>
</ol>	</div></div>
<p>The second frame was kind of funny, &#8220;about:blank&#8221;? The first one was a bit more interesting though: <strong>http://85.255.117.38/site.htm?lng=1&#038;trg=cln&#038;oip=0&#038;trk=zszuyhbinthnpzt</strong><br />
Notice the &#8220;trk&#8221; parameter.</p>
<p>Accessing that page with Opera within a VMware virtual machine running Windows 2000 (heh, paranoid is good), I was able to access that page.  I saved it for analysis (and had Ethereal running on the side just to be sure). I tried to refresh and it returned 404. You could only view the page once.</p>
<p><img src='http://johnmu.com/wp-content/stuff/showhack.jpg' alt='showhack.jpg' /></p>
<p>Looking at the files you see some interesting things:</p>
<p>- an encrypted javascript file<br />
- an exploit from free-spy-cam.net<br />
- an affiliate sales page for the antivirus software<br />
- an affiliate signup link on that page</p>
<p>The <a href="http://johnmu.com/files/hack1_WARNING.zip">ZIP-File</a> contains a full copy of the files as downloaded by the Opera browser. Check the files at your own risk, they contain the full exploit.</p>
<p>The encrypted javascript file looks like this (pulled apart and reformatted; called &#8220;__cntr000.htm&#8221; in the ZIP file):</p>
<div id="ig-sh-3" class="syntax_hilite">	<div class="toolbar">		<div class="language-name">js</div>		<a href="#" class="view-different">&lt; view <span>plain text</span> &gt;</a>	</div>	<div class="code"><ol class="javascript" style="font-family:monospace;"><li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&lt;script language=JavaScript&gt;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">function dc(sed) {</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; l=sed.length;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; var b=1024,i,j,r,p=0,s=0,w=0,t=Array(63,56,60,51,15,9,10,13,36 (...) 52,16);</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; soot=sed;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; for(j=Math.ceil(l/b);j&gt;0;j--) {</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp;r='';</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp;for(i=Math.min(l,b);i&gt;0;l--,i--) {</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp; &nbsp;saam=t[soot.charCodeAt(p++)-48];</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp; &nbsp;sttp=saam&lt;&lt;s;w|=sttp;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">(...)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp;dd1=&quot;document&quot;;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp;dd2=&quot;write(r)&quot;;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; &nbsp;eval(dd1+&quot;.&quot;+dd2)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">(...)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">dc(&quot;AVbFxuGqAk7s5OpH (...) G2ovPVoP9dATq_&quot;)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&lt;/script&gt;</div></li>
</ol>	</div></div>
<p>The contents of the file are encrypted with some variation of Base64 encoding. You can decode the javascript by replacing:<br />
<em>     eval(dd1+&#8221;.&#8221;+dd2)</em><br />
with<br />
<em>     document.write(&#8220;&lt;xmp&gt;&#8221; + r + &#8220;&lt;/xmp&gt;&#8221;);</em></p>
<p>Doing that will display the full contents of the encrypted data (called &#8220;__cntr000-decoded.htm&#8221; in the ZIP file).</p>
<div id="ig-sh-4" class="syntax_hilite">	<div class="toolbar">		<div class="language-name">js</div>		<a href="#" class="view-different">&lt; view <span>plain text</span> &gt;</a>	</div>	<div class="code"><ol class="javascript" style="font-family:monospace;"><li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">(...)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; var WinOS=Get_Win_Version(IEversion);</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; PatchList = clientInformation.appMinorVersion;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; switch (WinOS)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; {</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp;case &quot;wXPw&quot;:</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; XP_SP2_patched=0;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; FullVersion=clientInformation.appMinorVersion;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; PatchList=FullVersion.split(&quot;;&quot;);</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; for (var i=0; i &lt; PatchList.length; i++) { if (PatchList[i]==&quot;SP2&quot;) { XP_SP2_patched=1; } }</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; if (XP_SP2_patched==1) { ExploitNumber=9; }</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">(...)</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">&nbsp; &nbsp; location.href=&quot;cnte-eshdvvw.htm?trk=zszuyhbinthnpzt&quot;;</div></li>
<li style="font-weight: normal; vertical-align:top;"><div style="font: normal normal 1em/1.2em monospace; margin:0; padding:0; background:none; vertical-align:top;">(...)</div></li>
</ol>	</div></div>
<p>It is yet another javascript that triggers an exploit based on the operating system (it even test for XP service pack 2) and browser that the user is using. The exploit is also tagged with the &#8220;trk&#8221; parameter and couldn&#8217;t be downloaded separately. You can bet that&#8217;s it&#8217;s not a picture of your favorite celebrity, however.</p>
<p><strong>Next steps</strong></p>
<p>You could follow these up with:</p>
<ul>
<li>Checking the <a href="http://whois.domaintools.com/85.255.117.38">whois of the payload-server</a> and notifying the hoster (in this case probable fruitless)</li>
<li>Checking the sales page, search for the affiliate ID and the setups running and complain to the affiliate networks about this webmaster</li>
<li>Mirror a copy of the original server for analysis</li>
<li>Obviously move to a different server, perhaps even a different hoster</li>
</ul>
<p><strong>Summary</strong></p>
<p>The hacker had managed to patch the server side code (most likely the Apache server) so that<br />
- search engines see the normal page<br />
- new users from search engines are hacked with several exploits and shown an ad for anti-virus software</p>
<p>Spotting something like this on your own sites is close to impossible. The search engine crawlers would not notice anything.</p>
<p>Recognizing something like this algorithmically on Google&#8217;s side would be possible with the Googlebar-data. Assuming all shown URLs are recorded, they could compare the URL clicked in the search results with the URL finally shown on the user&#8217;s browser (within the frames). At the same time, the setup could be used to detect almost any kind of cloaking.</p>
<p>Scary stuff.</p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=pqo4SREp1Mo:dhHR7yux67Y:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=pqo4SREp1Mo:dhHR7yux67Y:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=pqo4SREp1Mo:dhHR7yux67Y:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=pqo4SREp1Mo:dhHR7yux67Y:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=pqo4SREp1Mo:dhHR7yux67Y:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/pqo4SREp1Mo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/hack-hidden-redirect/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		<feedburner:origLink>http://johnmu.com/hack-hidden-redirect/</feedburner:origLink></item>
		<item>
		<title>Interview with Matt / “Dockarl”</title>
		<link>http://feedproxy.google.com/~r/johnmucom/~3/P3R-9n6piLg/</link>
		<comments>http://johnmu.com/interview-with-dockarl/#comments</comments>
		<pubDate>Wed, 22 Aug 2007 08:21:47 +0000</pubDate>
		<dc:creator>John Mueller</dc:creator>
				<category><![CDATA[People]]></category>

		<guid isPermaLink="false">http://johnmu.com/interview-with-dockarl/</guid>
		<description><![CDATA[Hi &#8220;Doc&#8221;, it&#8217;s cool to have you here! It&#8217;s great that the web removes barriers like the physical distance from here in Switzerland to Australia. Matt has been one of the regular contributors to the Google Webmaster Help Groups since January 2007. He has a diverse background: Agriculture and Computers, an interesting mixture, or how [...]]]></description>
				<content:encoded><![CDATA[<p><img src='http://johnmu.com/wp-content/stuff/dockarl1.JPG' alt='Matt at Google' style="float:left;" /> Hi &#8220;Doc&#8221;, it&#8217;s cool to have you here! It&#8217;s great that the web removes barriers like the physical distance from here in Switzerland to Australia. Matt has been one of the regular contributors to the Google Webmaster Help Groups since January 2007. He has a diverse background: Agriculture and Computers, an interesting mixture, or how he puts it in <a href="http://groups.google.com/groups/profile?enc_user=o6OJJjIAAABqX0ba682YTKfOntXCz4UiS0I-G8MSm5BjSJQUJ687ToPT_UQzvfikPsfcDjCkbYAvPp-8dun_mzBPI6iup1z0">his profile</a>: &#8220;I know about cows and computers&#8221; <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .</p>
<p><strong>Looking at your first posts, I see a desperate webmaster, someone even screaming for &#8220;HELP!!!&#8221; in the thread titles. How did you find the Google Webmaster Help groups and what made you decide to originally post about your problems there?</strong></p>
<p>Hmm.. how did I find the groups &#8211; I think I might have searched &#8220;How to contact Google&#8221; and came across the webmaster help groups there. I had to &#8211; I&#8217;d come across a problem that I just couldn&#8217;t get an answer to by doing a regular Google search, I knew it was an unusual problem and, like many other webmasters, I figured I might be able to find a real, living, breathing Googler somewhere to talk about the problem.</p>
<p><strong>Did you get a satisfactory answer to your original questions in the groups? What elements were vital to that outcome?</strong></p>
<p>Well, for some reason the answers to that post (it was back in 2006) have been &#8216;lost in the system&#8217; but I did get a lot of hypotheticals from the regular group members &#8211; but nothing that helped, unfortunately.<br />
<span id="more-105"></span><br />
How that came about is a very long story, but hell, you&#8217;ve asked, so I&#8217;ll tell you <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . The person who owned the intellectual property we had been laboring to develop for the last two years had turned nasty &#8211; and was annoyed that we used their name on our website (and outranked them for it). My business partner and I were receiving ~20+ calls a day between us from the person. The phone calls started to elevate to the extent that we considered them threatening, and we were forced to call the police.</p>
<p>In the wash-up we just decided that &#8211; as a family business &#8211; we weren&#8217;t prepared to have to explain to my business partners kids (both under 5) why mum was crying and the police were &#8216;coming for a visit&#8217; on a Saturday morning &#8211; so we decided to remove the name in question to stop further stress, even though we had every right to use it.</p>
<p>So I took the quickest path possible, made the changes to the website and asked Google to remove the cache. It had unintended consequences &#8211; it totally removed the &#8216;snippets&#8217; from our website (our listings were title only), and we were left with a huge traffic decline. This, on top of everything else was absolutely crippling to the business. So, by the time I posted here I was getting a bit desperate &#8211; and it&#8217;s one reason I&#8217;m generally patient with people that come to the groups angry.</p>
<p>In the end, unfortunately no one here could give me the answer to the problem &#8211; it was out of their control. I hadn&#8217;t realized that a cache removal would remain in effect for 6 months. The main element that was vital to my outcome was Vanessa Fox (the beaut person that she is) who saw my post and stepped in and tweaked the system to let my site back in.</p>
<p><strong>You&#8217;re a webmaster, you had issues with your site and Google and posted in the groups. If a webmaster came up to you and asked if it would be worthwhile to post about his problems there, what would you tell them? Would it make any difference if the webmaster was new to webmastering?</strong></p>
<p>That&#8217;s an easy question. We&#8217;ve got a great community of beaut people here &#8211; you just don&#8217;t spend hours helping people gratis unless you&#8217;re passionate about it, so we tend to be universally &#8216;nice&#8217; to people, especially newbies. I&#8217;d say &#8216;Go ahead, write your question, try to be succinct about it and TRY NOT TO PANIC!&#8217;. I&#8217;d also make sure that they knew that the people helping would more than likely be knowledgeable volunteers, so make sure you check your frustration at the door <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><strong>What was it that made you stick around in the Google Groups, not only to ask more questions but also to help answer other people&#8217;s questions? What makes the Groups special compared to other forums?</strong></p>
<p>Well I think that JLH and yourself made the effort to email me and help with some problems I was having with a hobby site of mine called &#8216;utheguru&#8217; &#8211; that was an awesome gesture and made me feel at home. That kind of thing, along with the occasional guest post by a Googler, is what makes this forum special</p>
<p>In parallel to that, things had degenerated a lot further with our business to the extent that lawyers had become involved, and I had to put my PhD (and hence, income) on hold to spend my time dealing with that. I was looking for a stress release, and I&#8217;ve always been the kind of person that finds learning natural, cathartic and relaxing &#8211; so I got hooked.</p>
<p>If I&#8217;m honest, I also figured it was a way I could work towards another goal of mine &#8211; working with Google.</p>
<p>As an undergrad student, I read Page and Brin&#8217;s paper, and thought &#8211; &#8220;wow, that&#8217;s a neat idea&#8221;. The whole concept of Pagerank and linkages is something that&#8217;s really been around in science for hundreds of years. A good scientific paper is one that references other authors widely, and a reputable scientist is one that has papers referenced by many others. The CONCEPT of Pagerank is really nothing new in science &#8211; it just took a neat idea by those two fellows to convert the concept into something that could transcend academia and become relevant to that new thing called &#8216;the Internet&#8217;. Google became popular, first, amongst scientists &#8211; that&#8217;s something I observed and there was certainly alot of buzz about it within that sector of society before it ever became the household name it is now.</p>
<p>I&#8217;ve been a Google user ever since, and I&#8217;m fascinated by the system itself, how it works, the company, the culture &#8211; everything about Google appeals to me.</p>
<p>Further to the reasons Google fascinates me (you didn&#8217;t ask but I&#8217;m gonna tell you anyway.. haha), before the rather wild ride of backless lingerie began, I&#8217;d worked for some time as a Scientist with the Sugar industry (especially on the field / mechanisation side), and one of the major things I worked on there was reward algorithms &#8211; trying to use disparate manufacturing measures at the mill end of the system to send &#8216;quality&#8217; signals to harvester operators. Hmmm.. how do I explain this &#8211; well, I&#8217;ve gotta go into a little background detail&#8230;</p>
<p>Sugarcane harvesters chop up cane into little lengths, about 8 inches long, called billets. Along with the cane, the leaf material is also chopped up. If that leaf material reaches the mill, it can have a bad affect on the quality of the sugar produced, and it also makes the cane more expensive to process and transport. So, the harvesting machines have big 6 foot metal fans which rotate at about 1000 rpm &#8211; that&#8217;s a phenomenal tip speed. These fans sit above the cane right after it&#8217;s been chopped, and their aim is to remove the leaf material. Unfortunately, a whole complex set of interactions conspire to result in a situation where if you try &#8216;too hard&#8217; to remove the leaf material, you also end up losing about 20% of the cane you harvest through those fans &#8211; but it&#8217;s invisible. A billet that&#8217;s gone through an extractor fan ends up looking something like dessicated coconut &#8211; and there is no way of knowing the losses exist unless you do scientific trials to prove it.</p>
<p>I&#8217;d done the trials &#8211; all through North Queensland, in Papua New Guinea &#8211; all over the place. We had proved the losses existed, and the cost to the industry was in the billions of dollars per year, let alone the environmental impact. But because you can&#8217;t actually SEE the losses, you have a hard time convincing people that they actually exist. We got to the stage where my team and I had convinced the industry that there was a serious problem, and the next step was obviously &#8220;How do we stop it&#8221;. We knew that there was a &#8216;sweet spot&#8217; where those losses could be reduced to around 5% depending upon the way the harvester was operated. Since we didn&#8217;t have the ability to measure what was happening in the field on a real time basis, we had no choice but to use indirect measurements in the mill &#8211; like fibre, the sweetness of the cane etc, to try and infer what was happening in the field &#8211; to measure &#8216;quality&#8217; of the job.</p>
<p>That became my focus, and I learnt along the way that when you&#8217;re trying to make a reward system based upon derived measures, the tiniest little change to your algorithm can have huge impacts upon the system you&#8217;re trying to model. Also, if you&#8217;re offering &#8220;rewards&#8221; based upon indirect measurements, you actually end up becoming an intrinsic part of the system you&#8217;re trying to model &#8211; in clearer terms, the whole system tends to change or adapt to maximize &#8220;profits&#8221;, which can play havoc with the &#8220;accuracy&#8221; of your algorithm.</p>
<p>It sounds completely unrelated, but that&#8217;s actually Google (and the spam struggle) in a nutshell. That&#8217;s one of the reasons I&#8217;m fascinated with it and feel at home here in the groups where occasionally we get questions that make me think quite deeply about the challenges Google must face &#8211; and we get the opportunity to debate our views <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  This <a href="http://groups.google.com/group/Google_Webmaster_Help-chit-chat/browse_thread/thread/1315bf341e375a19/">thread about pagerank</a> where Craig and I duked it out with full respect for each others opinion is one example I can think of that I&#8217;ve enjoyed.</p>
<p><strong>You studied Agriculture and set up a shop to make and sell <a href="http://www.backlesslingerie.com/">backless lingerie</a>. I bet all the guys in the groups have visited your full site (for SEO reasons, I&#8217;m sure <img src='http://johnmu.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  ). How did that ever come about?</strong></p>
<p>Ha &#8211; not only did I study Ag, but I managed to convince the government here to award me a scholarship to do a coursework Master&#8217;s degree in Computer and Comms engineering. I ended up with a few awards and an aggregate score of over 93% &#8211; without an undergrad engineering degree &#8211; I think that surprised everyone, even me <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . But I guess it&#8217;s only natural &#8211; most people do best when they&#8217;re doing something they love. I&#8217;ve always been fascinated with those applications where IT, Engineering and Science intersect and meet &#8216;the real world&#8217; &#8211; that&#8217;s kind of Googly.</p>
<p>An example &#8211; I can remember the time when I was about 12 years old that I blew up the family commodore 64 trying to get it to drive solenoids to water the garden for Mum. I didn&#8217;t realise at the time that you need a transistor and a relay if you want to drive something hefty like a solenoid with a TTL output <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>But apart from being a bit of a terror, I&#8217;ve also always been a traveler and got along easily with folks. As such, when I was writing my Masters thesis, I figured I&#8217;d go stay with some mates overseas &#8211; I had a load of frequent flyer points I wanted to use, they all offered to put me up for free, so I figured it was an opportunity too good to miss. The only &#8216;gotcha&#8217; was that I was to provide the beer &#8211; Norway was a hoot &#8211; my oh my &#8211; the Vikings ARE NOT dead!</p>
<p>I ended up (between parties) writing most of my Master&#8217;s degree tapping away on my laptop, perched on the edge of a fjord whilst staying with my Norwegian Marine Biologist friend in Northern Norway for a few months mid 2005 &#8211; the 24 hour sunlight was GREAT.</p>
<p>On the way back I dropped in to see my Indian mate in Tirupur (the south of India, in a state called Tamil Nadu) and ended up spending a few months there too. Tirupur is a big textile producing area, and I made friends with some of the big players there.</p>
<p>When I finally arrived back in Australia I mentioned that to my Brother in Law (a solicitor) and he said &#8220;well, I&#8217;ve got some clients that are looking to manufacture a neat new product they&#8217;ve developed&#8221; &#8211; so, before I knew it, I was off to India where I learnt all about ladies underwear, mobilon and thread density. We quickly got a few test shipments under our belt.</p>
<p>Upon returning my brother and I were asked if we&#8217;d like to get more deeply involved with the sale and promotion of the product &#8211; somehow I let myself be convinced. There began the roller coaster ride &#8211; I became manufacturer (traveled to China as well for that part several times), web developer, email wrangler, undy packer, book keeper, promoter and media spokesperson. It was crazy work and it was unpaid &#8211; the cost of manufacture and promotion sucked away much of my savings and any profit the product brought in before it ever had a chance to reach my pocket &#8211; although attending the modeling shoots was fun, and the POSSIBILITY that it might become something big was intoxicating!</p>
<p>But &#8211; a word from the wise &#8211; ever heard of Ali Baba and the 40 Thieves? Those folk were in the rag trade <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  Get involved at your peril.</p>
<p><strong>One of your sites has recently had <a href="http://groups.google.com/group/Google_Webmaster_Help-Indexing/browse_thread/thread/a408bfd0ab881fac/">a strange kind of trouble</a> with Google&#8217;s index, with all sorts of possible explanations but no resolution so far. For the average webmaster these kinds of situations are incomprehensible and terribly frustrating. What would you tell the webmaster when stuck in a rut like that &#8211; keep working on the problem or let it sit for a while?</strong></p>
<p>First I&#8217;d ask them to think about whether they&#8217;d made any big changes to their site recently &#8211; to try and hone in on whether it might be something they&#8217;d caused themselves, rather than anything algorithmic.</p>
<p>Next, if I&#8217;d decided it might indeed be a penalty, I&#8217;d usually give them a copy of the webmaster guidelines and say &#8220;What do you think it might be?&#8221; &#8211; people usually have a fairly good idea about what they might have done wrong if a potential penalty is involved. I&#8217;d then ask them to write out a list of potential issues, and correct them + submit a reconsideration request and wait a month. If that didn&#8217;t work, time to put on the &#8220;mad scientist&#8221; hat and get methodical about things.</p>
<p>First I&#8217;d probably use Google to do a search for other people experiencing the problem. From there I&#8217;d approach these groups. If that drew blanks, I&#8217;d then start tweaking things with their site &#8211; but softly softly &#8211; one change at a time, waiting at least a week between changes so that I&#8217;d have a fair idea what &#8216;the cure&#8217; was for future reference.</p>
<p>If that didn&#8217;t work I&#8217;d probably just start to assume that they were the victim of Google collateral damage &#8211; hell, we all know it happens, and I&#8217;d be submitting some attention grabbing posts to this group to try and &#8216;elevate it&#8217; to the attention of Googlers, so that they could use their gadgetry to try and work out what the story was.</p>
<p>At that stage things are out of your hands, and you just hope that perhaps you&#8217;ve alerted Google to a potential &#8220;Googlebug&#8221; that might stop others from experiencing the same kinds of issues.</p>
<p><strong>Assuming you had full access to Google&#8217;s servers and some web designers + programmers to help you, what would you change?</strong></p>
<p>Hmmm.. looking back through my prep notes for my Google interview here&#8230;</p>
<p>I think I&#8217;d start with the problem of penalties. I&#8217;d be sitting down with the alg team and trying to thrash out a way that we could actually help those &#8216;ma and pa&#8217; webmasters that have accidentally shot themselves in the foot &#8211; and to do so without giving the spammers a leg up.</p>
<p>I&#8217;d write out a list of things that we considered &#8216;top secret&#8217; and another of those factors that were &#8216;out of the bag&#8217;, and I&#8217;d set about implementing changes to Google webmaster tools to alert folks to little things &#8211; like obviously hidden text &#8211; that might be resulting in a penalty and which they might not know about. Those kind of issues, to my mind anyway, are already well known amongst spammers and you can&#8217;t lose much by letting people know about them.</p>
<p>As for the more complex things, like, for example, keyword density (it&#8217;s a simple one, I know, but let&#8217;s start there) &#8211; you know, things that aren&#8217;t black or white &#8211; things where there were shades of grey, I&#8217;d be making tools to show them which side of the line they are tending towards &#8211; like a gauge, or traffic lights.</p>
<p>&#8220;We think your site is looking a little spammy &#8211; here&#8217;s an orange alert&#8221;.</p>
<p>Naturally, the alg team would then say to me &#8220;Well Matt, that&#8217;s all well and good, but if we start giving folks that kind of info, we&#8217;re essentially giving the spammers a great tool which they can use to test the limits of our alg, too&#8221;. I&#8217;d then say to them, well, why don&#8217;t we use cluster analysis to break sites down into 100 different categories of &#8216;spamminess&#8217; &#8211; the traffic lights would just show how spammy you are relative to others in your &#8216;spamminess cluster&#8217; &#8211; so really, if we give a green light to a known spammer, all we are telling him is that he&#8217;s kind of ok compared to the other spammers within his uber spammer group &#8211; but he needn&#8217;t know that <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>For the spammers, the lights system would achieve nothing. For the ma&#8217;s and pa&#8217;s that are relatively innocuous, having a red light could be a huge help &#8211; just knowing you have a penalty lets you know that it&#8217;s actually something you can track down and correct.</p>
<p>But I suspect the other engineers would raise a whole load of reasons that my approach wouldn&#8217;t work &#8211; but I love the dynamics of a group, and part of the enjoyment of working in one is often the synergy that you find when you&#8217;re sitting down with a whole bunch of folks with common interests and intellect thrashing out a new idea &#8211; that&#8217;s how a lump of coal turns into a diamond.</p>
<p>That would be a plum position to be in.</p>
<p>After that I&#8217;d probably start gravitating towards the alg design / testing side of things &#8211; as that&#8217;s something I&#8217;m fascinated with &#8211; setting up mega test networks and conducting sensitivity analysis and pre-testing of new algorithm ideas would be lots of fun and extraordinarily satisfying &#8211; I love taking good ideas and helping make them better.</p>
<p>I&#8217;ve also thought I&#8217;d like to make a tool that shows a graphical representation of the linking structure of a site &#8211; with things like nofollow, noindex as an overlay &#8211; that could be a great troubleshooting tool for lots of problems too.</p>
<p>But, to be honest, most of my programming experience is at the nuts and bolts level &#8211; A GUI to me is a command line and a prompt &#8211; I&#8217;ve got a lot of engineer in me. I&#8217;d be able to write the crawlers and mangle the database, but I&#8217;d have to leave the bells and whistles to someone else <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p><strong>You&#8217;ve done a lot of different things (so far, including <a href="http://www.utheguru.com/my-recent-interview-with-google">an interview with Google</a>). If you could rewind back to when you started studying, do you think you would do anything differently knowing what you know now (other than obviously buying some good stock)?</strong></p>
<p>Cool! A rewind button!</p>
<p>Firstly, I wouldn&#8217;t have flown Qantas to my big interview &#8211; it was a debacle start to finish &#8211; they lost my bags (clothes, books, notes) my flights out (and back) were both delayed 12 hours or more and diverted because of tech probs &#8211; in short, I arrived sleep deprived and not feeling prepared, and I think I only hit my feet during the interview just after lunch. It was like an out-of-body experience.. grrr&#8230;.</p>
<p>Secondly &#8211; I wouldn&#8217;t have studied Agriculture.</p>
<p>We had loads of fun out there, but my natural aptitudes are IT / Science / Engineering. My ag degree included a lot of that, but I tended to get let down by the sheer boredom of prac sessions that included watching grass grow &#8211; honestly.</p>
<p>I&#8217;m the kind of person that thrives on a challenge &#8211; so I did poorly at the &#8220;watching grass grow&#8221; practical subjects, and tended to dux the more academic subjects that others found a tad difficult &#8211; like advanced stats, biometry etc &#8211; I did the wrong degree for my skillset and, like it or not, time is a depreciating commodity.</p>
<p>I&#8217;m an extremely outdoors person, and I thought back then that if I studied IT or engineering I&#8217;d be stuck in front of a computer all day &#8211; but I now realize that that&#8217;s not really the case at all. Shucks, if I&#8217;m honest with myself, I LIKE spending time in front of the computer. I&#8217;ve come to realise that it&#8217;s the life / work balance that&#8217;s important &#8211; if you don&#8217;t have one, you tend to lose out on the other.</p>
<p>So with Ag, I just ended up naturally gravitating towards work that required me to be &#8216;stuck&#8217; in front of a computer all day anyway, but getting paid poorly for it, so the opportunities to go outside and do adventurous things in your spare time were limited.</p>
<p>I&#8217;ve had some massive, great interesting experiences with the route I chose back then, most of which I don&#8217;t regret, but if I&#8217;d done IT or Eng instead of Ag, I think I&#8217;d be in a better place, career wise. You mention &#8220;good stock&#8221; &#8211; it&#8217;s funny that, because luckily I realized early that this wasn&#8217;t what I wanted to do long term, and tended to invest my wages well &#8211; so I&#8217;ve managed to have a decent lifestyle during the recent &#8216;challenges&#8217; which is LUCKY <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p><strong>Is there anything you&#8217;d like to add?</strong></p>
<p>John Congrats on the new job, and I&#8217;m looking forward to achieving a dream like that myself soon, too &#8211; good on you mate! <img src='http://johnmu.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p><strong>Thank you very much for your time and the replies, Matt! </strong></p>
<hr/>Copyright &copy; 2013 <strong><a href="http://johnmu.com">johnmu.com</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact johnmu.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/johnmucom?a=P3R-9n6piLg:5eD8fdICDsU:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/johnmucom?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=P3R-9n6piLg:5eD8fdICDsU:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=P3R-9n6piLg:5eD8fdICDsU:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/johnmucom?a=P3R-9n6piLg:5eD8fdICDsU:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/johnmucom?i=P3R-9n6piLg:5eD8fdICDsU:F7zBnMyn0Lo" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/johnmucom/~4/P3R-9n6piLg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://johnmu.com/interview-with-dockarl/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		<feedburner:origLink>http://johnmu.com/interview-with-dockarl/</feedburner:origLink></item>
	</channel>
</rss><!-- Dynamic Page Served (once) in 0.566 seconds --><!-- Cached page served by WP-Cache -->
