<?xml version="1.0" encoding="ISO-8859-1"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>All Things SEM / SEO Blog</title>
	
	<link>http://www.allthingssem.com</link>
	<description>A blog about, you guessed it, search engine marketing. And yeah, that includes search engine optimization too.</description>
	<pubDate>Wed, 17 Jun 2009 13:58:14 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/AllThingsSEM" type="application/rss+xml" /><feedburner:emailServiceId>AllThingsSEM</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><feedburner:feedFlare href="http://add.my.yahoo.com/rss?url=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://us.i1.yimg.com/us.yimg.com/i/us/my/addtomyyahoo4.gif">Subscribe with My Yahoo!</feedburner:feedFlare><feedburner:feedFlare href="http://www.newsgator.com/ngs/subscriber/subext.aspx?url=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://www.newsgator.com/images/ngsub1.gif">Subscribe with NewsGator</feedburner:feedFlare><feedburner:feedFlare href="http://feeds.my.aol.com/add.jsp?url=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://o.aolcdn.com/favorites.my.aol.com/webmaster/ffclient/webroot/locale/en-US/images/myAOLButtonSmall.gif">Subscribe with My AOL</feedburner:feedFlare><feedburner:feedFlare href="http://www.bloglines.com/sub/http://feeds.feedburner.com/AllThingsSEM" src="http://www.bloglines.com/images/sub_modern11.gif">Subscribe with Bloglines</feedburner:feedFlare><feedburner:feedFlare href="http://www.netvibes.com/subscribe.php?url=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://www.netvibes.com/img/add2netvibes.gif">Subscribe with Netvibes</feedburner:feedFlare><feedburner:feedFlare href="http://fusion.google.com/add?feedurl=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://buttons.googlesyndication.com/fusion/add.gif">Subscribe with Google</feedburner:feedFlare><feedburner:feedFlare href="http://www.pageflakes.com/subscribe.aspx?url=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://www.pageflakes.com/ImageFile.ashx?instanceId=Static_4&amp;fileName=ATP_blu_91x17.gif">Subscribe with Pageflakes</feedburner:feedFlare><feedburner:feedFlare href="http://www.live.com/?add=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM" src="http://tkfiles.storage.msn.com/x1piYkpqHC_35nIp1gLE68-wvzLZO8iXl_JMledmJQXP-XTBOLfmQv4zhj4MhcWEJh_GtoBIiAl1Mjh-ndp9k47If7hTaFno0mxW9_i3p_5qQw">Subscribe with Live.com</feedburner:feedFlare><feedburner:feedFlare href="http://www.addtoany.com/?linkname=All%20Things%20SEM%20%2F%20SEO%20Blog&amp;linkurl=http%3A%2F%2Ffeeds.feedburner.com%2FAllThingsSEM&amp;type=feed" src="http://www.addtoany.com/addfr-b.gif">Add to Any Feed Reader</feedburner:feedFlare><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
		<title>An Agency SEO's Perspective on Nofollow</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/mU1Ndc_tabI/</link>
		<comments>http://www.allthingssem.com/agency-seo-perspective-on-nofollow/#comments</comments>
		<pubDate>Tue, 16 Jun 2009 22:39:28 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<category><![CDATA[google]]></category>

		<category><![CDATA[nofollow]]></category>

		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=526</guid>
		<description><![CDATA[Alright I'm giving in. I can't help but write a piece about the recent announcement from Matt Cutts regarding <a href="http://www.mattcutts.com/blog/pagerank-sculpting/">how nofollow links are treated by Google</a>. There's been a lot of coverage on this so I'll skip rehashing it and instead provide some perspective from the side of the agency SEO in the trenches working with client sites. For simplicity, I'm going to interview myself.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>Alright I&#039;m giving in. I can&#039;t help but write a piece about the recent announcement from Matt Cutts regarding <a href="http://www.mattcutts.com/blog/pagerank-sculpting/">how nofollow links are treated by Google</a>. There&#039;s been a lot of coverage on this so I&#039;ll skip rehashing it and instead provide some perspective from the side of the agency SEO in the trenches working with client sites. For simplicity, I&#039;m going to interview myself.</p>
<p><strong>Are you embarrassed about having recommended the nofollow attribute?</strong></p>
<p>Not at all. Matt Cutts stated that the change in how nofollow is treated happened in the last year which means that recommending it was good for around 3 years. That seems pretty good to me. I know people still recommended meta keywords&#8230;</p>
<p>Also, adding the nofollow attribute was at worst ineffective. There is no penalty and it&#039;s generally a really easy thing to implement. So why not include along with dozens of other recommendations since a site is going to be updated anyway?</p>
<p><strong>Nofollow has been called an unnatural change and a hack to make to a site, so why did you do it in the first place?</strong></p>
<p>Unnatural or not, the online space is competitive and I believe you need to examine all of the tools available to you. If the potential reward outweighs the potential risk, that&#039;s a pretty compelling argument for me to move ahead with an idea.</p>
<p>And don&#039;t forget, although it was never labeled a top priority change, using nofollow was highlighted by Matt Cutts as an approved technique.</p>
<p><strong>Shouldn&#039;t you have noticed the change in how nofollow is handled?</strong></p>
<p>Ideally, yes. But there are so many variables being changed with the sites I work on that isolating the impact to just one change would be difficult to do. At some point you have trust previous results and the findings of reputable sources.</p>
<p>I think of it like what doctors have to do. They read research and rely on the findings of that research. They don&#039;t repeat every experiment before believing the results. Sure, SEO isn&#039;t quite that scientific, but the same sort of thinking holds with the idea of relying on research from reputable sources and augmenting that research with your own findings.</p>
<p><strong>Why don&#039;t you recommend a site restructuring instead?</strong></p>
<p><span class="callout">&#034;The boilerplate recommendation to re-organize a site optimally for users and search engines is out of touch with reality.&#034;</span>The boilerplate recommendation to re-organize a site optimally for users and search engines is out of touch with reality. People who make such recommendations are either playing it safe or have simply not worked on a site of significant size. Aside from the enormous resources needed to redesign a site, it is next to impossible to get different business units to decide on what will appear on the home page let alone work out a complete site architecture. Everyone wants their link and their copy in the most prominent place. So if a full restructuring is going to take a year, why not make tweaks in the meantime?</p>
<p>The nofollow at least provided the opportunity to help search engines direct users to the most appropriate destination pages despite the mess that often comes with committee-based decisions.</p>
<p><strong>Aren&#039;t there more effective SEO-related tactics than nofollow?</strong></p>
<p>Oh absolutely. Again, I point to the typical large company engaged in an SEO project. Getting anything done is hard enough. So you go for what is likely to be the easiest first while working on the long-term changes. </p>
<p>Never worked with a big company? Here&#039;s an example of how hard it is sometimes: I&#039;ve been trying to get a sitemap.xml and robots.txt file uploaded to a client&#039;s web server for 3+ months. These are changes that are invisible to end-users. Can you imagine what it&#039;s going to take to make content changes!?</p>
<p><strong>Do you find it odd how Google didn&#039;t relay the change with nofollow for over a year?</strong></p>
<p>Absolutely. But some big thinkers have already weighed in on that including <a href="http://searchengineland.com/pagerank-sculpting-is-dead-long-live-pagerank-sculpting-21102">Danny Sullivan</a>, <a href="http://www.seobook.com/worthless-hype">Aaron Wall</a> and <a href="http://www.wolf-howl.com/seo/pagerank-javascript-smx/">Michael Gray</a>. </p>
<p><strong>Will you be telling clients to remove the nofollow attribute?</strong></p>
<p>At the moment, no. At worst keeping it simply prevents link juice from flowing to pages that have been identified as low-quality anyway.</p>
<p><strong>Will you recommend the nofollow attribute to future clients?</strong></p>
<p>For the time being, no. As much to avoid having overly lengthy discussions with <a href="http://www.allthingssem.com/seo-client-types/">certain types of SEO clients</a>.</p>
<p><em>I&#039;d like to thank Marios for taking the time to speak to me today. For more commentary on the search engine marketing industry, check out Marios&#039; <a href="http://www.allthingssem.com/">SEM / SEO blog</a>.</em></p>
<p>Note: Although I&#039;m speaking <em>as</em> an agency SEO, I&#039;m not necessarily speaking <em>for</em> an SEO agency <img src='http://www.allthingssem.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/getting-the-most-from-your-seo-agency/" rel="bookmark" title="May 4, 2009">Getting The Most From Your SEO Agency</a></li>
<li><a href="http://www.allthingssem.com/seo-client-types/" rel="bookmark" title="April 14, 2009">SEO Clients: The Good, The Bad, The Ugly, and Umm&#8230; 2 More</a></li>
<li><a href="http://www.allthingssem.com/seomoz-seo-guides/" rel="bookmark" title="July 17, 2007">SEOmoz SEO Guides</a></li>
<li><a href="http://www.allthingssem.com/seo-quiz/" rel="bookmark" title="September 25, 2007">SEO Quiz</a></li>
<li><a href="http://www.allthingssem.com/hide-seo-under-the-cloak-of-usability/" rel="bookmark" title="March 1, 2009">Hide SEO Under the Cloak of Usability</a></li>
</ul>
<p><!-- Similar Posts took 18.208 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>

	Tags: <a href="http://technorati.com/tag/google" title="google" rel="tag">google</a>, <a href="http://technorati.com/tag/nofollow" title="nofollow" rel="tag">nofollow</a>, <a href="http://technorati.com/tag/seo" title="seo" rel="tag">seo</a><br />
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/mU1Ndc_tabI" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/agency-seo-perspective-on-nofollow/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/agency-seo-perspective-on-nofollow/</feedburner:origLink></item>
		<item>
		<title>Twitter is like&#x2026; Metaphors for Newbies and Veterans Alike</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/Sre63vG_u_U/</link>
		<comments>http://www.allthingssem.com/twitter-is-like/#comments</comments>
		<pubDate>Thu, 07 May 2009 13:21:49 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Social Media Optimization]]></category>

		<category><![CDATA[social media]]></category>

		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=523</guid>
		<description><![CDATA[Describing Twitter to people that haven't used it isn't particularly easy. So I thought I'd take a stab at helping people understand what's it's like by comparing it to every day things. And with that, Twitter is like...<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>Describing Twitter to people that haven&#039;t used it isn&#039;t particularly easy. So I thought I&#039;d take a stab at helping people understand what&#039;s it&#039;s like by comparing it to every day things. And with that, Twitter is like&#8230;</p>
<p><strong>1. A gaggle of teen-age girls.</strong> All talking at the same time with no one actually listening. Not that it matters, because no one is saying anything important anyway.</p>
<div style="float: right; margin-left: 5px; margin-bottom: 5px"><img src="http://www.allthingssem.com/images/alone-in-a-crowd.png" alt="Alone in a Crowd"/></div>
<p><strong>2. Walking through a crowded room.</strong> You catch snippets of conversations, but can&#039;t really tell what people are talking about because you didn&#039;t hear the beginning of the conversation and you&#039;re not around when it ends.</p>
<p><strong>3. A buffet.</strong> You get to listen to a lot of different people in a short amount of time, but in the end you have to admit that you would have had a richer experience had you just focused on one person.</p>
<p><strong>4. Speed dating.</strong> Some people are boring right from the start while others seem interesting. Sadly, they only seem interesting for the first 140 characters but the rules say you can&#039;t stop listening to them or they&#039;ll think you&#039;re mean.</p>
<p><strong>5. Scanning radio stations.</strong> You never manage to catch a conversation when it is starting, but if you scan through the tweets fast enough and long enough you can maybe piece together the list of conversation topics for the day.</p>
<p><strong>6. The housing market a few years ago.</strong> You spend too much time on it, are twittering because everyone else is, and the bubble will eventually burst.</p>
<p><strong>7. Birds at a bird feeder.</strong> All the fluttering about looks exciting and you can&#039;t stop yourself from joining. But then everyone disperses as quickly as they gathered and you realize people were just after a their share of the seeds.</p>
<p><strong>8. Going out of your way to expose yourself to too much information in too little time.</strong> Very much like grabbing every pamphlet you can from every slightly interesting place you visit and then never remembering where you went or what the hell these leafletey things are on about. Submitted by Robert K of <a href="http://www.crearecommunications.co.uk/">Creare Group</a>.</p>
<p>Got another metaphor? I&#039;ll happily add it.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/twitter-will-die/" rel="bookmark" title="May 28, 2008">Twitter Will Die And I&#039;ll Tell You Why</a></li>
<li><a href="http://www.allthingssem.com/marios-tziortzis-teaches-me-a-lesson/" rel="bookmark" title="November 25, 2007">Marios Tziortzis Teaches Me a Lesson</a></li>
<li><a href="http://www.allthingssem.com/2500-sphinns-patterns-trends/" rel="bookmark" title="January 21, 2008">2,500 Hot Sphinns Reveal Patterns and Trends</a></li>
<li><a href="http://www.allthingssem.com/a-day-in-the-life-of-a-digg/" rel="bookmark" title="June 13, 2006">A Day in the Life of a Digg</a></li>
<li><a href="http://www.allthingssem.com/diggbar-iframe-foiled-by-standard-frame-buster/" rel="bookmark" title="April 3, 2009">DiggBar iFrame Foiled by Standard Frame Buster?</a></li>
</ul>
<p><!-- Similar Posts took 14.466 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>

	Tags: <a href="http://technorati.com/tag/social media" title="social media" rel="tag">social media</a>, <a href="http://technorati.com/tag/twitter" title="twitter" rel="tag">twitter</a><br />
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/Sre63vG_u_U" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/twitter-is-like/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/twitter-is-like/</feedburner:origLink></item>
		<item>
		<title>Getting The Most From Your SEO Agency</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/sORZs82j5BE/</link>
		<comments>http://www.allthingssem.com/getting-the-most-from-your-seo-agency/#comments</comments>
		<pubDate>Mon, 04 May 2009 12:59:47 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<category><![CDATA[seo]]></category>

		<category><![CDATA[seo agency]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=521</guid>
		<description><![CDATA[A lot of SEO clients aren't getting the most out of their SEO engagements. Not because the SEO agency is doing anything to be deceptive or being uncooperative in any way, but simply because the SEO client hasn't put much thought or effort into what they should be doing. In an earlier post I described <a href="http://www.allthingssem.com/seo-client-types/">several different SEO client types</a> along with highlighting the good vs. the bad. In this post I'm going to expand on the idea of what makes an SEO client a good client which in turn results in the client getting the biggest bang for their buck.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>A lot of SEO clients aren&#039;t getting the most out of their SEO engagements. Not because the SEO agency is doing anything to be deceptive or being uncooperative in any way, but simply because the SEO client hasn&#039;t put much thought or effort into what they should be doing. In an earlier post I described <a href="http://www.allthingssem.com/seo-client-types/">several different SEO client types</a> along with highlighting the good vs. the bad. In this post I&#039;m going to expand on the idea of what makes an SEO client a good client which in turn results in the client getting the biggest bang for their buck.</p>
<p>The overall problem I see time and time again is that clients behave as if they don&#039;t need to really be engaged in the SEO campaign. They show up for the weekly meetings where they proceed to zone out; nod their heads in agreement like a bobblehead on a car dashboard; and set new standards for what it means to push paper. Instead, a truly involved client is one that:</p>
<h2>Considers the impact of SEO recommendations&#8230;</h2>
<p>&#8230; and alerts the SEO agency to potential obstacles. Or, if that knowledge is not on hand, invites others (e.g. editorial or technical staff) from the company to the meetings where recommendations are discussed. Doing so can reduce the amount of time wasted circulating SEO recommendations for review, approval, and implementation. With more than one client I&#039;ve had recommendations circulating for over 3 months just because the right people weren&#039;t involved at the beginning.</p>
<h2>Actively identifies opportunities&#8230;</h2>
<p>&#8230; for the SEO agency to shine. No matter how good the SEO agency is, there&#039;s no chance of them knowing all of the projects and activities going on within the client&#039;s company. Having these highlighted could bring to light areas where the SEO agency can have a significant impact which is a win-win for both parties. The most common type of failure here, in my experience, is when a new site is in the works and the SEO agency doesn&#039;t hear about it until it is in the staging environment and ready for deployment.</p>
<h2>Builds trust with the SEO agency</h2>
<p>There aren&#039;t too many things that can as effectively kill the success of an SEO project than a client that is constantly asking for detailed justification of every recommendation. It&#039;s like asking your doctor for a summary of all of the research he&#039;s read regarding every ailment you&#039;ve ever had. Sure, the SEO agency <em>should</em> be able to provide this information at any time, but having them do so repeatedly uses up resources that could otherwise be allocated to actually doing something productive. Establishing trust isn&#039;t necessarily easy, but if both sides don&#039;t trust each other after a couple of months into a yearlong campaign, that&#039;s a really bad sign. </p>
<h2>Champion changes to their web site</h2>
<p>These changes could be structural or content-based. In some cases this means the client may need to help the SEO agency argue for a change to corporate policy by bringing different parties together. I worked with one company that had a formal policy of seeking out sites that had linked to them without permission and requesting that those links be removed. Shocking, I know. The client wasn&#039;t willing to put my team in touch with anyone to discuss the negative impact of this policy and to explore possible changes.</p>
<h2>Is willing to learn</h2>
<p>The more a client knows, the less the SEO agency has to explain. So if a concept has been covered a few times, the SEO agency should be able to build on that concept to constantly elevate the sophistication of the SEO effort. If, instead, the client doesn&#039;t care enough to get a handle on the concepts, I can pretty much guarantee the SEO agency won&#039;t get support for more advanced techniques like building links through social media marketing. There&#039;s only so many times you can change a page&#039;s title&#8230;</p>
<p>If you ever land an SEO client that does most or even all of the above, I recommend you do everything you can to keep them happy. If not, watch your back because I&#039;m going to come in and steal the perfect client from you. What? You think I&#039;m kidding!?
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/seo-client-types/" rel="bookmark" title="April 14, 2009">SEO Clients: The Good, The Bad, The Ugly, and Umm&#8230; 2 More</a></li>
<li><a href="http://www.allthingssem.com/seo-pilot-projects-are-a-bad-idea/" rel="bookmark" title="January 14, 2008">SEO Pilot Projects Are a Bad Idea</a></li>
<li><a href="http://www.allthingssem.com/agency-seo-perspective-on-nofollow/" rel="bookmark" title="June 16, 2009">An Agency SEO&#039;s Perspective on Nofollow</a></li>
<li><a href="http://www.allthingssem.com/seo-is-about-beating-other-web-sites/" rel="bookmark" title="February 1, 2007">SEO is About Beating Other Web Sites</a></li>
<li><a href="http://www.allthingssem.com/hide-seo-under-the-cloak-of-usability/" rel="bookmark" title="March 1, 2009">Hide SEO Under the Cloak of Usability</a></li>
</ul>
<p><!-- Similar Posts took 15.587 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>

	Tags: <a href="http://technorati.com/tag/seo" title="seo" rel="tag">seo</a>, <a href="http://technorati.com/tag/seo agency" title="seo agency" rel="tag">seo agency</a><br />
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/sORZs82j5BE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/getting-the-most-from-your-seo-agency/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/getting-the-most-from-your-seo-agency/</feedburner:origLink></item>
		<item>
		<title>SEO Clients: The Good, The Bad, The Ugly, and Umm&#x2026; 2 More</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/O9a83B-WxPc/</link>
		<comments>http://www.allthingssem.com/seo-client-types/#comments</comments>
		<pubDate>Tue, 14 Apr 2009 22:44:21 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=514</guid>
		<description><![CDATA[In an ideal world, clients would hire an SEO agency and actually follow the direction from that agency. Sadly, this is often not the case. In fact, with large companies, like the ones I've been working with for years, I sometimes wonder if there'd be any difference if they just cut a check and went a away for twelve months. OK, so I'm exaggerating, but it does seem like there's a disproportionate amount of wheel spinning in place of actual progress with some projects. What's more, it seems like the client, during the pitch process, gave plenty of clues about what type of client they were going to be. So I've put together a guide to bucketing prospects that should help in deciding whether to take a particular project on.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>In an ideal world, clients would hire an SEO agency and actually follow the direction from that agency. Sadly, this is often not the case. In fact, with large companies, like the ones I&#039;ve been working with for years, I sometimes wonder if there&#039;d be any difference if they just cut a check and went a away for twelve months. OK, so I&#039;m exaggerating, but it does seem like there&#039;s a disproportionate amount of wheel spinning in place of actual progress with some projects. What&#039;s more, it seems like the client, during the pitch process, gave plenty of clues about what type of client they were going to be. So I&#039;ve put together a guide to bucketing prospects that should help in deciding whether to take a particular project on.</p>
<div style="float: right; margin-left: 5px"><img src="http://www.allthingssem.com/images/good-bad-ugly.jpg" alt="The Good, The Bad, and The Ugly"/></div>
<h2>1. No SEO Knowledge, But Eager to Learn</h2>
<p>These are generally my favorite clients. Not because I like to be the know-it-all (although that doesn&#039;t hurt), but because clients with no knowledge are generally receptive to anyone willing to fill in the gaps. So with a list of recommendations you simply need to provide some high-level rationale for the recommendations and generally the client won&#039;t push back. If you can also turn-around answers to questions such clients have without ever showing frustration for having answered the same thing 17 times, you&#039;ll remain their favorite for a very long time.</p>
<p><strong>Identifying Traits:</strong> Asks a lot of basic questions. Often the same question multiple times.<br />
<strong>What To Do:</strong> Land this client! Don&#039;t tell anyone.<br />
<strong>What Not To Do:</strong> Don&#039;t take advantage of the client&#039;s lack of knowledge because there&#039;s a good chance you&#039;ll regret it. A client without SEO knowledge eventually becomes one with SEO knowledge. If you&#039;re twice as expensive as everyone else, you better be providing twice the value.</p>
<h2>2. Some SEO Knowledge, Overly Eager to Demonstrate Expertise</h2>
<p>I can hear my SEO readers snickering now. They understand exactly what I mean by the client that knows <em>some</em> SEO coupled with an uncontrollable urge to convince everyone that they&#039;re experts. These are the folks that read an article, attended a conference, or skimmed through a book and feel they have grasped all there is to SEO. The problem is that their knowledge is out of date (e.g. just put your keywords in the meta keywords tag) or they haven&#039;t thought through the implications of an idea (e.g. nofollow all external links). The challenge with such clients is that any SEO recommendation that is not inline with the client&#039;s knowledge must be vigorously defended and even then a compromise with a mediocre outcome is common. And with a mediocre effort comes mediocre results.</p>
<p><strong>Identifying Traits:</strong> Tells you about SEO experience during first conversation. Tries to one-up you at least once in the first 3 conversations.<br />
<strong>What To Do:</strong> If the SEO budget is big, suck it up. If not, walk away.<br />
<strong>What Not To Do:</strong> Don&#039;t let your ego get the best of you.</p>
<h2>3. Unreasonable Expectations</h2>
<p>Whether accompanied by SEO knowledge or not, there are some clients that have crazy expectations. Somewhere along the line these folks concluded that an SEO effort requires little work and/or can quadruple traffic in 30 days. If only that were the case&#8230; Many in the industry will say that these clients just need to have their expectations managed. A seemingly simple thing to do, but my experience has been that clients that are actually receptive to having their expectations managed aren&#039;t the ones that have unreasonable expectations. Know what I mean?</p>
<p><strong>Identifying Traits:</strong> Asks questions along the lines of how soon before traffic will double.<br />
<strong>What To Do:</strong> Be very clear about what success is in the contract. Sure you <em>may</em> double traffic, but only a 25% increase in the first 6 months is actually required.<br />
<strong>What Not To Do:</strong> Don&#039;t agree to all of the client&#039;s demands. You will fail.</p>
<h2>4. Limited Development Resources</h2>
<p>These sorts of clients are often pleasant to work with except for the tiny fact that nothing you recommend actually gets implemented. In some cases, the client is aware that their lack of resources is the reason for a lack of progress and so a renewal is quite possible. In other cases, the client gets frustrated as the contract end approaches and so there is no hope of a renewal.</p>
<p><strong>Identifying Traits:</strong> During the pitch, gently inquire about the process for implementing web site changes. Any mention about submitting a ticket to the web team or having to prioritize workloads is a red flag.<br />
<strong>What To Do:</strong> Take the project, but identify goals other than ones directly connected to site traffic. For example, having the client&#039;s team &#034;SEO Certified&#034; is a good goal that doesn&#039;t require as much from the client as making site changes. A renewal will get you closer to having recommendations actually implemented and so eventually everyone wins.<br />
<strong>What Not To Do:</strong> Don&#039;t point fingers at and complain about whichever team is the bottleneck. Doing so will just make you more enemies.</p>
<h2>5. Resistant to Change</h2>
<p>The worst clients are those that won&#039;t adapt to a changing online space. They don&#039;t want to change how they write titles, they don&#039;t want to target keywords, and they certainly don&#039;t want to get involved with social media. Umm&#8230; so why did you hire an SEO agency?</p>
<p><strong>Identifying Traits:</strong> During the pitch mention a type of recommendation you expect to make. Look for any humming and hawing from the prospect about the recommendation going against corporate policy.<br />
<strong>What To Do:</strong> Take the gig and hope your assessment of the client is wrong. Sometimes it is. Do your best to evangelize SEO within the company.<br />
<strong>What Not To Do:</strong> Don&#039;t count on a renewal.</p>
<p>Have more types you want to add to the list? Send descriptions my way. If they&#039;re sufficiently different than the above, I&#039;d be happy to give you a link in exchange. Happy prospecting to the rest of you!
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/getting-the-most-from-your-seo-agency/" rel="bookmark" title="May 4, 2009">Getting The Most From Your SEO Agency</a></li>
<li><a href="http://www.allthingssem.com/agency-seo-perspective-on-nofollow/" rel="bookmark" title="June 16, 2009">An Agency SEO&#039;s Perspective on Nofollow</a></li>
<li><a href="http://www.allthingssem.com/seo-is-about-beating-other-web-sites/" rel="bookmark" title="February 1, 2007">SEO is About Beating Other Web Sites</a></li>
<li><a href="http://www.allthingssem.com/seo-droid/" rel="bookmark" title="April 21, 2008">Diminishing Returns on Investment Dollars</a></li>
<li><a href="http://www.allthingssem.com/is-seo-creative/" rel="bookmark" title="January 8, 2007">Is SEO Creative?</a></li>
</ul>
<p><!-- Similar Posts took 15.066 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/O9a83B-WxPc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/seo-client-types/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/seo-client-types/</feedburner:origLink></item>
		<item>
		<title>WordPress Plugin Reveals a Black Hat's Network</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/6dReRApFhx8/</link>
		<comments>http://www.allthingssem.com/wordpress-plugin-reveals-black-hats-network/#comments</comments>
		<pubDate>Wed, 08 Apr 2009 19:37:41 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Black Hat SEO]]></category>

		<category><![CDATA[black-hat]]></category>

		<category><![CDATA[matt-cutts]]></category>

		<category><![CDATA[seo]]></category>

		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=517</guid>
		<description><![CDATA[A while back John Andrews wrote a post about <a href="http://www.johnon.com/512/wordpress-seo-plugins.html">trusting WordPress plugins</a> and how that really isn't such a good idea. I agree with everything John wrote, but up until today I hadn't come across a plugin that actually did anything that "crossed the line".<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>A while back John Andrews wrote a post about <a href="http://www.johnon.com/512/wordpress-seo-plugins.html">trusting WordPress plugins</a> and how that really isn&#039;t such a good idea. I agree with everything John wrote, but up until today I hadn&#039;t come across a plugin that actually did anything that &#034;crossed the line&#034;.</p>
<p>The plugin that I&#039;ve been digging into has code that randomly determines whether to inject a link into the outputted text. Not only is the insertion frequency dynamic, but so is the link that is injected. That is, the link is pulled from a web site that when queried, spits out a different link. A nice automated link building tool, wouldn&#039;t you say?</p>
<p>After a few minutes of requesting links from this site, it became apparent to me that the black hat has just made it easy for me to identify 18 of his sites.</p>
<p>So what would you do with a list of domains all being supported by black hat link building techniques? Take the opportunity to learn a thing or two? Call <a href="http://www.mattcutts.com/blog/">Matt Cutts</a>?
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/would-you-be-a-black-hat-seo-if-you-knew-how/" rel="bookmark" title="December 6, 2007">Would You Be A Black Hat SEO If You Knew How?</a></li>
<li><a href="http://www.allthingssem.com/why-does-black-hat-seo-work/" rel="bookmark" title="December 28, 2007">Why Does Black Hat SEO Work?</a></li>
<li><a href="http://www.allthingssem.com/black-hat-seo-terms/" rel="bookmark" title="April 22, 2008">5 Black Hat SEO Terms You Need Know</a></li>
<li><a href="http://www.allthingssem.com/would-you-believe-a-black-hat-seo/" rel="bookmark" title="November 19, 2007">Would You Believe a Black Hat SEO?</a></li>
<li><a href="http://www.allthingssem.com/spam-or-clever-marketing/" rel="bookmark" title="November 25, 2006">Spam or Clever Marketing?</a></li>
</ul>
<p><!-- Similar Posts took 14.904 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>

	Tags: <a href="http://technorati.com/tag/black-hat" title="black-hat" rel="tag">black-hat</a>, <a href="http://technorati.com/tag/matt-cutts" title="matt-cutts" rel="tag">matt-cutts</a>, <a href="http://technorati.com/tag/seo" title="seo" rel="tag">seo</a>, <a href="http://technorati.com/tag/wordpress" title="wordpress" rel="tag">wordpress</a><br />
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/6dReRApFhx8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/wordpress-plugin-reveals-black-hats-network/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/wordpress-plugin-reveals-black-hats-network/</feedburner:origLink></item>
		<item>
		<title>The Keyword and the SEM Campaign</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/rD6pEvIQtAA/</link>
		<comments>http://www.allthingssem.com/keyword-and-the-sem-campaign/#comments</comments>
		<pubDate>Sun, 05 Apr 2009 14:47:33 +0000</pubDate>
		<dc:creator>Mark Simmons</dc:creator>
		
		<category><![CDATA[Search Engine Marketing]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=516</guid>
		<description><![CDATA[Often times, marketers overlook the importance of a campaign?s keywords in favor of catchy creative, fancy testing techniques or their reliance on bid management software to sort it all out. Now, I?m not saying that these individual components are not important. Everything plays a role in the success of a campaign. However, keywords make up the fundamental basis of a strong campaign.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p><em>This is a guest post by Mark Simmons, a seasoned SEM, SEO, SMM professional and freelance consultant based in Raleigh, NC. As managing partner of marCis Interactive Consulting, Mark specializes in creating comprehensive online marketing strategies and providing highly-effective campaign management services. He can be reached at <a href="mailto:mark@marcisconsulting.com">mark@marcisconsulting.com</a> for more info.</em></p>
<p>Often times, marketers overlook the importance of a campaign?s keywords in favor of catchy creative, fancy testing techniques or their reliance on bid management software to sort it all out. Now, I?m not saying that these individual components are not important. Everything plays a role in the success of a campaign. However, keywords make up the fundamental basis of a strong campaign.</p>
<p>Keywords are vital because they indicate user intent. We can determine if the consumer is in the research phase looking for general information, looking for a specific product or brand and when they are ready to make a purchase. Using this information, we can effectively group the keywords into relevant categories in order to creative appropriate (and timely) ads, improving our chances of creating a positive user experience. </p>
<p>All too often, inexperienced markets will take a quantity over quality approach. Judging a campaign by the large number of keywords it contains does not indicate its potential for success. On the contrary, smaller, highly-targeted sets of keywords are the best way to control spend and drive ROI. Once best performers are identified, you can experiment with variations of these keywords instead of making random guesses. Keywords can also be used to beef up your other marketing channels. Successful PPC keywords can be used as content in email, mobile or banner creative.</p>
<p>A successful SEM campaign makes use of many tools and strategies. Start with solid keyword research and add some intuitive thought to get into the mind of your potential customer. You?ll find that using this approach from the beginning can lessen the learning curve and speed up your path to a positive ROI.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/dayparting-ppc-campaigns/" rel="bookmark" title="February 6, 2007">Dayparting PPC Campaigns</a></li>
<li><a href="http://www.allthingssem.com/minimum-20k-for-ppc-campaign/" rel="bookmark" title="January 30, 2007">Minimum $20,000 for PPC Campaign</a></li>
<li><a href="http://www.allthingssem.com/50-web-analytics-metrics-worth-tracking/" rel="bookmark" title="October 21, 2006">50 Web Analytics Metrics Worth Tracking?</a></li>
<li><a href="http://www.allthingssem.com/google-adwords-quality-score/" rel="bookmark" title="February 19, 2007">Google AdWords Quality Score</a></li>
<li><a href="http://www.allthingssem.com/dont-forget-about-your-landing-pages/" rel="bookmark" title="February 14, 2007">Don&#039;t Forget About Your Landing Pages</a></li>
</ul>
<p><!-- Similar Posts took 15.299 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/rD6pEvIQtAA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/keyword-and-the-sem-campaign/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/keyword-and-the-sem-campaign/</feedburner:origLink></item>
		<item>
		<title>DiggBar iFrame Foiled by Standard Frame Buster?</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/ljuRrRduvC8/</link>
		<comments>http://www.allthingssem.com/diggbar-iframe-foiled-by-standard-frame-buster/#comments</comments>
		<pubDate>Sat, 04 Apr 2009 02:27:38 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Social Media Optimization]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=515</guid>
		<description><![CDATA[Patrict Altoft&#039;s reaction to the new Diggbar is that it has killed linkbait because the URL that bloggers are now likely to grab is one that points to Digg.com. However, I&#039;m pretty sure a simple frame buster like the one available as a WordPress plugin would set things back to the way they were. Anyone [...]<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>Patrict Altoft&#039;s reaction to the new Diggbar is that it has <a href="http://www.blogstorm.co.uk/digg-just-totally-killed-linkbait/">killed linkbait</a> because the URL that bloggers are now likely to grab is one that points to Digg.com. However, I&#039;m pretty sure a simple frame buster like the <a href="http://wordpress.org/extend/plugins/sem-frame-buster/">one available as a WordPress plugin</a> would set things back to the way they were. Anyone want to try it out?
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/sphinn-button-after-first-sphinn/" rel="bookmark" title="December 29, 2007">How to Show the Sphinn Button Only AFTER the First Sphinn</a></li>
<li><a href="http://www.allthingssem.com/social-media-for-firefox/" rel="bookmark" title="June 20, 2008">Social Media for Firefox</a></li>
<li><a href="http://www.allthingssem.com/a-day-in-the-life-of-a-digg/" rel="bookmark" title="June 13, 2006">A Day in the Life of a Digg</a></li>
<li><a href="http://www.allthingssem.com/hiding-old-sphinn-buttons/" rel="bookmark" title="February 25, 2008">Hiding the Sphinn Button When a Post Fails to Go Hot</a></li>
<li><a href="http://www.allthingssem.com/how-not-to-reach-out-to-bloggers/" rel="bookmark" title="October 7, 2007">How NOT to Reach Out to Bloggers</a></li>
</ul>
<p><!-- Similar Posts took 10.187 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/ljuRrRduvC8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/diggbar-iframe-foiled-by-standard-frame-buster/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/diggbar-iframe-foiled-by-standard-frame-buster/</feedburner:origLink></item>
		<item>
		<title>Accidental 404 Errors&#x2026; Ugh.</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/7gsh7M86gas/</link>
		<comments>http://www.allthingssem.com/accidental-404-errors/#comments</comments>
		<pubDate>Tue, 03 Mar 2009 11:24:25 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<category><![CDATA[404]]></category>

		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=512</guid>
		<description><![CDATA[You wouldn't think it would be possible to accidentally issue a 404 error code for every page request to a site. But I'm a particularly talented individual and that's just what I did. You see, apparently I had seen it all so it seemed like a good idea to create a brand new problem to keep myself busy.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>You wouldn&#039;t think it would be possible to accidentally issue a 404 error code for every page request to a site. But I&#039;m a particularly talented individual and that&#039;s just what I did. You see, apparently I had seen it all so why not go ahead and create brand new problems!?</p>
<p>I make it a point to log into my master analytics account and check out traffic levels for the sites I regularly work on. I typically just look at traffic changes over the last few weeks to identify problems. It takes deeper analysis to actually affect the performance of a site, but traffic is a great diagnostic metric. At the end of January, after a week long hiatus, I saw this:</p>
<div style="text-align: center;"><img src="http://www.allthingssem.com/images/allthingssem-traffic-decline.gif" alt="Traffic Decline on AllThingsSEM.com" /><br /><small>Figure 1: Traffic Drops Precipitously</small></div>
<p>Some more digging revealed this wonderful graph showing a spike in 404 errors.</p>
<div style="text-align: center;"><img src="http://www.allthingssem.com/images/allthingssem-404-errors.gif" alt="404 Errors on AllThingsSEM.com" /><br /><small>Figure 2: 404 Errors Spike</small></div>
<p>To confirm the issue I logged into Google&#039;s Webmaster Tools. Sure enough, about 75% of the URLs in my XML sitemap were flagged as having generated a 404. The remaining 25% probably weren&#039;t crawled during the week before I found the problem.</p>
<p>What&#039;s an SEO to do when a site gets de-indexed due because of accidentally issued 404 errors? Obviously the first order of business is to fix the problem. Turns out there&#039;s a funky bug in WordPress which took me about 4 hours to uncover. The much harder task still lay ahead of me. How do I get Google to re-index my site? I had 5 ideas immediately pop into my head:</p>
<ol>
<li>Submit an updated XML sitemap with every date stamp set to the current date. Done.</li>
<li>Create some new content with links to the de-indexed content knowing that the FeedBurner pinging service will bring Google in quickly. Done.</li>
<li>Take advantage of the $1 SEOmoz Pro subscription I was evaluating and ask the pros there for their opinion. Hey, my head is not so big that I won&#039;t seek ideas from other SEOs. The response I got was that I need to be patient and let things play out. Probably the right answer, but I was hoping there was something I could actually do.</li>
<li>Crawl my site with a browser from a few locations and hope the AdSense ads trigger a visit from Google. A few servers and the iMacros plugin took care of this a few times a day for a few days. Note: I wasn&#039;t clicking ads, just crawling my own site. Seemed like an OK thing to do.</li>
<li>Redirect some indexed URLs to the de-indexed URLs and hope the content is re-indexed. Seemed a little shady so I passed on this one.</li>
</ol>
<p>Google was crazy slow with re-indexing my site. Every week I saw single digit percentage increases in organic traffic. Only 4 weeks later did I know my site was back in business &#8212; a more than doubling in traffic from one day to the next along with a return to near 100% inclusion in Google&#039;s index.</p>
<div style="text-align: center;"><img src="http://www.allthingssem.com/images/allthingssem-traffic-increase.gif" alt="Traffic Increase on AllThingsSEM.com" /><br /><small>Figure 3: Traffic Climbs</small></div>
<p>So while I&#039;m pleased to have recovered, this little excursion has cost me about 5 weeks of revenue. Ugh. If only Google Analytics had alert functionality I likely would&#039;ve caught this much earlier.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/duplicate-content-and-seo/" rel="bookmark" title="June 13, 2006">Duplicate Content and SEO</a></li>
<li><a href="http://www.allthingssem.com/50-web-analytics-metrics-worth-tracking/" rel="bookmark" title="October 21, 2006">50 Web Analytics Metrics Worth Tracking?</a></li>
<li><a href="http://www.allthingssem.com/seomoz-crawl-test-seo-tool-review/" rel="bookmark" title="July 10, 2007">SEOmoz Crawl Test - SEO Tool Review</a></li>
<li><a href="http://www.allthingssem.com/google-toolkit/" rel="bookmark" title="November 17, 2005">Google Toolkit</a></li>
<li><a href="http://www.allthingssem.com/combat-plagiarism-with-seo/" rel="bookmark" title="July 19, 2006">Combat Plagiarism with SEO</a></li>
</ul>
<p><!-- Similar Posts took 16.998 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>

	Tags: <a href="http://technorati.com/tag/404" title="404" rel="tag">404</a>, <a href="http://technorati.com/tag/seo" title="seo" rel="tag">seo</a><br />
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/7gsh7M86gas" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/accidental-404-errors/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/accidental-404-errors/</feedburner:origLink></item>
		<item>
		<title>Hide SEO Under the Cloak of Usability</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/aA8-sWBzxRc/</link>
		<comments>http://www.allthingssem.com/hide-seo-under-the-cloak-of-usability/#comments</comments>
		<pubDate>Sun, 01 Mar 2009 18:32:35 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=508</guid>
		<description><![CDATA[Sometimes SEO is a tough sell. You wouldn't think so because the benefits are quite significant and easy to measure. In such cases, approaching SEO in a more stealthy err... subtle manner can bring about the desired effects. For example, if the person or team you're talking to has demonstrated trust in the field of usability, you can piggy-back off of that trust to get your optimization recommendations implemented.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>Sometimes SEO is a tough sell. You wouldn&#039;t think so because the benefits are quite significant and easy to measure. In such cases, approaching SEO in a more stealthy err&#8230; subtle manner can bring about the desired effects. For example, if the person or team you&#039;re talking to has demonstrated trust in the field of usability, you can piggy-back off of that trust to get your optimization recommendations implemented.</p>
<p>Over at the UserEffect blog there&#039;s a great check list of usability tips. Someone has already done half the work. Your job is to take such lists and assess the SEO impact of each item. Then take the list and remove anything that, although may be great from a usability perspective, may distract from the SEO effort. Ideally, you can keep everything in, but if you&#039;ve done SEO consulting for a while you know that the ideal is rare.</p>
<p>The following is UserEffect&#039;s list of 25 usability items. The headings are the same as the original list, but below each heading I&#039;ve identified whether there is an SEO benefit and marked whether I&#039;d keep it. For more detail about the usability task associated with each heading, check out the original <a href="http://www.usereffect.com/topic/25-point-website-usability-checklist">list of 25 usability tips</a>.</p>
<h2>Section I. Accessibility</h2>
<p><strong>1. Site Load-time Is Reasonable</strong><br />
The reasons for extended load times would determine for me whether I&#039;d harp on this one or not. Lengthy HTML code is worse than a lengthy Flash movie at the bottom of the page which likely won&#039;t impact rankings.</p>
<p><strong>2. Adequate Text-to-Background Contrast</strong><br />
Unless we&#039;re talking about the old search engine spam technique of white text on a white background, this one is out.</p>
<p><strong>3. Font Size/Spacing Is Easy to Read</strong><br />
If we&#039;re not talking about hiding content with really, really small text, this one is out.</p>
<p><strong>4. Flash and Add-Ons Are Used Sparingly</strong><br />
Yes! Flash continues to be the bane of many SEO&#039;s existence. I&#039;m got one client that wants to do everything in Flash. Entire sites are built in Flash so I&#039;ve got just one URL to target keywords with. And of course, the Flash has all of the useful content in so it&#039;s largely devalued by search engines. </p>
<p><strong>5. Images Have Appropriate ALT Tags</strong><br />
This is basic image optimization so keep this one. Toss in keyword-rich file names and captions to make the effort more worthwhile. Ann Smarty has a good post on <a href="http://www.seosmarty.com/image-seo/">image seo</a>.</p>
<p><strong>6. Site Has Custom Not-found/404 Page</strong><br />
This will help keep the search engine indexes clean. Not to mention that you can&#039;t register a site with Google Webmaster Tools without a proper 404 server code.</p>
<h2>Section II. Identity</h2>
<p><strong>7. Company Logo Is Prominently Placed</strong><br />
Not a big deal on the SEO front. </p>
<p><strong>8. Tagline Makes Company&#039;s Purpose Clear Answer &#034;What do you do?&#034;</strong><br />
I&#039;d kick this one out for SEO although who in their right mind wouldn&#039;t see the value in this?</p>
<p><strong>9. Home-page Is Digestible In 5 Seconds</strong><br />
Digestible, yes. But that doesn&#039;t mean it should be just two lines long. Keep the structure clean and use headings (good for SEO) to make it scannable.</p>
<p><strong>10. Clear Path to Company Information</strong><br />
Not a big one for SEO.</p>
<p><strong>11. Clear Path to Contact Information</strong><br />
Neither is this.</p>
<h2>Section III. Navigation</h2>
<p><strong>12. Main Navigation Is Easily Identifiable</strong><br />
Identifiable and search engine friendly aren&#039;t necessarily the same here. So I guess the SEO value depends on what the recommendation is to make the navigation identifiable. </p>
<p><strong>13. Navigation Labels Are Clear &#038; Concise</strong><br />
If HTML text based, this is a good one for SEO. Clear and concise can also include keywords.</p>
<p><strong>14. Number of Buttons/Links Is Reasonable Psychologists</strong><br />
If you&#039;re a PageRank sculptor you agree with this one. Rand Fishkin of SEOmoz covered <a href="http://www.seomoz.org/blog/pagerank-sculpting-parsing-the-value-and-potential-benefits-of-sculpting-pr-with-nofollow">PageRank sculpting</a> not too long ago. His focus was with the nofollow attribute, but the logic applies when you&#039;re consider what links to include and exclude on a page.</p>
<p><strong>15. Company Logo Is Linked to Home-page</strong><br />
If you do #7, you might as well make the logo a link.</p>
<p><strong>16. Links Are Consistent and Easy to Identify</strong><br />
This might have some impact on reducing bounce rates which in turn might send a signal of quality to search engines so keep this one.</p>
<p><strong>17. Site Search Is Easy to Access</strong><br />
Keyword research anyone? Aaron Wall over at SEOBook even suggests using this data for <a href="http://www.seobook.com/using-site-search-keyword-data-create-related-content">content creation</a> ideas.</p>
<h2>Section IV. Content</h2>
<p><strong>18. Major Headings Are Clear and Descriptive</strong><br />
SEOs love headings. Use the h tags and you&#039;re all set.</p>
<p><strong>19. Critical Content Is Above The Fold</strong><br />
A keeper. You want search engines to &#034;understand&#034; a page too.</p>
<p><strong>20. Styles and Colors Are Consistent</strong><br />
Search engines are color blind. Although I wonder if they&#039;re working on this. If they know a searcher is male is there more value in sending me to a site with a palette more appealing to males? Hmmm&#8230;</p>
<p><strong>21. Emphasis (bold, etc.) Is Used Sparingly</strong><br />
I don&#039;t do a whole lot of bolding these days. I wouldn&#039;t dwell on this one.</p>
<p><strong>22. Ads and Pop-ups Are Unobtrusive</strong><br />
Pop-ups can result in disabled ads with the AdWords program. Their impact on SEO is still an area where much speculation exists. Last year the folks at SEOOptimise <a href="http://www.seoptimise.com/blog/2008/05/do-popups-have-a-negative-effect-on-google-rankings.html">posed this very question</a>. At this point I wouldn&#039;t call pop-ups an SEO deal breaker, but trying to get rid of them could very well make some enemies with the marketing team.</p>
<p><strong>23. Main Copy Is Concise and Explanatory</strong><br />
Explanatory is a big one. Tell the search engines what a page is about and they&#039;ll give you the appropriate visitors.</p>
<p><strong>24. URLs Are Meaningful and User-friendly</strong><br />
Meaningful and friendly can easily mean keyword-rich so this is a good one. Another post by Ann Smarty, but this time on Search Engine Journal is a quick read about  <a href="http://www.searchenginejournal.com/seo-best-practices-for-url-structure/7216/">optimizing URLs</a>.</p>
<p><strong>25. HTML Page Titles Are Explanatory</strong><br />
It&#039;s funny that what many consider to be the single strongest on page element for SEO is at the bottom of a usability list. Definitely keep this one.</p>
<p>You&#039;ve now become the usability&#039;s expert&#039;s best friend. After all, who else in the company are you 80% in agreement with!? If you&#039;re lucky you&#039;ll start getting e-mails asking for your opinion on usability recommendations which if they can be supported by SEO are more likely to get implemented. Similarly, you can take your SEO ideas and run them by the usability expert for help with repositioning them in a manner that will resonate with whoever holds the purse strings.</p>
<p>Want to do the same exercise with a bigger list? Jacob Neilson&#039;s <a href="http://www.useit.com/homepageusability/">Homepage Usability: 50 Websites Deconstructed</a> has 113 usability tips.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/css-rollovers-for-seo/" rel="bookmark" title="September 26, 2005">CSS Rollovers for SEO</a></li>
<li><a href="http://www.allthingssem.com/seo-checklist/" rel="bookmark" title="October 5, 2004">SEO Checklist</a></li>
<li><a href="http://www.allthingssem.com/seomoz-crawl-test-seo-tool-review/" rel="bookmark" title="July 10, 2007">SEOmoz Crawl Test - SEO Tool Review</a></li>
<li><a href="http://www.allthingssem.com/tweaking-the-cutline-wordpress-theme/" rel="bookmark" title="January 1, 2007">Tweaking the Cutline WordPress Theme</a></li>
<li><a href="http://www.allthingssem.com/seo-pilot-projects-are-a-bad-idea/" rel="bookmark" title="January 14, 2008">SEO Pilot Projects Are a Bad Idea</a></li>
</ul>
<p><!-- Similar Posts took 16.918 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/aA8-sWBzxRc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/hide-seo-under-the-cloak-of-usability/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/hide-seo-under-the-cloak-of-usability/</feedburner:origLink></item>
		<item>
		<title>Rankings Don't Matter Except&#x2026;</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/TXGcExsU9cw/</link>
		<comments>http://www.allthingssem.com/rankings-dont-matter-except/#comments</comments>
		<pubDate>Sun, 08 Feb 2009 13:42:43 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=505</guid>
		<description><![CDATA[The first success metric that Target Marketing magazine highlights in their February 2009 spotlight on SEO is, 
&#034;&#8230;the web site&#8230;PortugalSpainBoth.com traveled from zero page one rankings on major search engines to 506 pages ones in nine months &#8212; all through organic search.&#034;
Hmmm&#8230;
Related Posts
Keyword Rich Domains
Eliminating Toolbar PageRank Would Be A Mistake
3 Rankings You Wouldn&#039;t Want [...]<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>The <strong>first success metric</strong> that Target Marketing magazine highlights in their <a href="http://www.targetmarketingmag.com/article/seo-helps-spain-portugal-navigate-sea-travel-sites-steer-american-tourists-iberian-peninsula-402441_1.html">February 2009 spotlight on SEO</a> is, </p>
<p>&#034;&#8230;the web site&#8230;PortugalSpainBoth.com traveled from zero page one rankings on major search engines to 506 pages ones in nine months &#8212; all through organic search.&#034;</p>
<p>Hmmm&#8230;
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/keyword-rich-domains/" rel="bookmark" title="June 5, 2007">Keyword Rich Domains</a></li>
<li><a href="http://www.allthingssem.com/eliminating-toolbar-pagerank-would-be-a-mistake/" rel="bookmark" title="January 3, 2008">Eliminating Toolbar PageRank Would Be A Mistake</a></li>
<li><a href="http://www.allthingssem.com/3-rankings-you-wouldnt-want-to-share/" rel="bookmark" title="October 5, 2007">3 Rankings You Wouldn&#039;t Want to Share</a></li>
<li><a href="http://www.allthingssem.com/google-universal-is-messing-up-my-ranking-reports/" rel="bookmark" title="September 18, 2007">Google Universal is Messing Up My Ranking Reports</a></li>
<li><a href="http://www.allthingssem.com/search-engine-optimization-seo/" rel="bookmark" title="September 28, 2004">Search Engine Optimization (SEO)</a></li>
</ul>
<p><!-- Similar Posts took 15.665 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/TXGcExsU9cw" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/rankings-dont-matter-except/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/rankings-dont-matter-except/</feedburner:origLink></item>
		<item>
		<title>Shreddies = Flatulence Underwear</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/U_GxDcnKJPk/</link>
		<comments>http://www.allthingssem.com/shreddies-flatulence/#comments</comments>
		<pubDate>Sat, 07 Feb 2009 00:05:45 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Results Analysis]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=504</guid>
		<description><![CDATA[I was asked today if I ever ate Shreddies cereal as a kid. I couldn&#039;t quite remember what Shreddies were and I was actually getting them mixed up with Shredded Wheat. No problem. So I fired up Google and did a search. Check out the top paid result and the second organic result. What an [...]<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>I was asked today if I ever ate Shreddies cereal as a kid. I couldn&#039;t quite remember what Shreddies were and I was actually getting them mixed up with Shredded Wheat. No problem. So I fired up Google and did a search. Check out the top paid result and the second organic result. What an unusual reputation management problem&#8230;</p>
<p><img src="http://www.allthingssem.com/images/search-results-shreddies.png" alt="Shreddies Search Results" /><br clear="all" />
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/tracking-universal-results-and-rankings/" rel="bookmark" title="January 22, 2009">Tracking Universal Results and Rankings Revisted</a></li>
<li><a href="http://www.allthingssem.com/killing-your-1-google-rankings/" rel="bookmark" title="October 19, 2007">Killing Your #1 Google Rankings</a></li>
<li><a href="http://www.allthingssem.com/penalized-by-google/" rel="bookmark" title="July 13, 2007">Penalized by Google</a></li>
<li><a href="http://www.allthingssem.com/a-blank-page-ranks-3/" rel="bookmark" title="March 17, 2007">A Blank Page Ranks #3</a></li>
<li><a href="http://www.allthingssem.com/google-maps-ooops/" rel="bookmark" title="July 26, 2007">Google Maps Ooops</a></li>
</ul>
<p><!-- Similar Posts took 10.144 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/U_GxDcnKJPk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/shreddies-flatulence/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/shreddies-flatulence/</feedburner:origLink></item>
		<item>
		<title>SEMMYs: Does Anyone Read the Articles?</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/fDPC84AeIjw/</link>
		<comments>http://www.allthingssem.com/semmys-does-anyone-read-the-articles/#comments</comments>
		<pubDate>Sun, 01 Feb 2009 01:29:00 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Social Media Optimization]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=501</guid>
		<description><![CDATA[So I&#039;ve been catching up with posts in my feed reader and there are a plethora of requests from SEMMY nominees to vote for their articles. Many of those requesting votes link directly to the voting form where there is no link to the actual article. Oversight? I don&#039;t know. 
My own voting patterns got [...]<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>So I&#039;ve been catching up with posts in my feed reader and there are a plethora of requests from <a href="http://www.semmys.org/">SEMMY</a> nominees to vote for their articles. Many of those requesting votes link directly to the voting form where there is no link to the actual article. Oversight? I don&#039;t know. </p>
<p>My own voting patterns got me thinking as to whether anyone is actually reading the articles or if people are just voting for the authors they already like? I&#039;m hoping it&#039;s the former. Time for a quick poll&#8230;</p>
<div style="width: 300px;">Note: There is a poll embedded within this post, please visit the site to participate in this post's poll.</div>
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/2500-sphinns-patterns-trends/" rel="bookmark" title="January 21, 2008">2,500 Hot Sphinns Reveal Patterns and Trends</a></li>
<li><a href="http://www.allthingssem.com/ego-spam/" rel="bookmark" title="November 21, 2007">Ego Spam</a></li>
<li><a href="http://www.allthingssem.com/twitter-will-die/" rel="bookmark" title="May 28, 2008">Twitter Will Die And I&#039;ll Tell You Why</a></li>
<li><a href="http://www.allthingssem.com/sphinn-many-voices/" rel="bookmark" title="February 8, 2008">Sphinn - Many Voices or Just a Few?</a></li>
<li><a href="http://www.allthingssem.com/a-day-in-the-life-of-a-digg/" rel="bookmark" title="June 13, 2006">A Day in the Life of a Digg</a></li>
</ul>
<p><!-- Similar Posts took 10.607 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/fDPC84AeIjw" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/semmys-does-anyone-read-the-articles/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/semmys-does-anyone-read-the-articles/</feedburner:origLink></item>
		<item>
		<title>Tracking Universal Results and Rankings Revisted</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/eJ2PETZYEWE/</link>
		<comments>http://www.allthingssem.com/tracking-universal-results-and-rankings/#comments</comments>
		<pubDate>Thu, 22 Jan 2009 11:00:48 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Results Analysis]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=498</guid>
		<description><![CDATA[Over the last month or so there have been multiple posts (<a href="http://yoast.com/track-seo-rankings-google-analytics/">Joost's was the most popular it seems</a>) about how to track Google rankings via Google Analytics. At first I was quite excited by this idea and I applauded the out of the box thinking. After all, what better way is there to get a picture of what is really appearing in the SERPs than to look at actual traffic? That excitement diminished somewhat when I got through the first article and realized that all I could do was determine if a keyword was on page 1, 2, or 3 of the SERPs. Still, I thought the information might be useful so I made a mental note to test out the Google Analytics filter sometime.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>Over the last month or so there have been multiple posts (<a href="http://yoast.com/track-seo-rankings-google-analytics/">Joost&#039;s was the most popular it seems</a>) about how to track Google rankings via Google Analytics. At first I was quite excited by this idea and I applauded the out of the box thinking. After all, what better way is there to get a picture of what is really appearing in the SERPs than to look at actual traffic? That excitement diminished somewhat when I got through the first article and realized that all I could do was determine if a keyword was on page 1, 2, or 3 of the SERPs. Still, I thought the information might be useful so I made a mental note to test out the Google Analytics filter sometime.</p>
<p>Then very recently <a href="http://www.searchcowboys.com/seo/256">another article</a> made the rounds describing a process for tracking organic search visits from universal results. This was particularly interesting to me because I&#039;ve yet to find rank checking software that recognizes universal results (if you know of one, please let me know). For example, every rank checking software I&#039;ve tried fails to report that a #1 ranking is actually #11 because there are 10 map listings above it. </p>
<p>I finally got through some basic testing today with both filters. I used two sites: one that is purely informational and one that is ecommerce focused. I followed the instructions and set up new profiles and applied the appropriate filters. Here are my results from the larger dataset of the two I have.</p>
<h2>Google Rankings</h2>
<p>Out of 35,535 organic search visits from Google, 16 visits were from keywords on page 2 and 7 visits were from keywords on page 3. The rest were either from page 1 or the information wasn&#039;t available for Google Analytics to capture. That works out to 99.9% of the keywords driving traffic being on page 1. That shouldn&#039;t be surprising given all of the research indicating that users don&#039;t go past the first page of results and instead prefer to refine their query. In conclusion I&#039;d say that there&#039;s little value in this filter.</p>
<h2>Google Universal Results Traffic</h2>
<p>Out of 42,853 organic search visits from Google, 42,853 were flagged as not originating from a universal result. This could mean that the result was from a traditional blue link or that the universal result link didn&#039;t provide the necessary information for Google Analytics to track it. That number works out to about 98.4% of visits not attributable to universal results. Curiously, the most popular universal search result is &#039;spelling&#039; errors at 469 visits. Next up was what I think are map/local related listings at 79 visits.</p>
<p>As with the Google Rankings test, the numbers for Google Universal Results Traffic aren&#039;t too impressive. However, I think results will vary greatly depending on the site and so I&#039;d recommend not dismissing this filter. Instead, I&#039;d set it, let it run for a bit, and then assess potential. If at the first time around there&#039;s nothing interesting in the data, check back quarterly in case Google includes a new result type or starts to include a greater percentage of blended results into the mix.</p>
<p>Anyone have other Google Analytics filter ideas? Send my way and I&#039;d be happy to test them out.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/google-maps-ooops/" rel="bookmark" title="July 26, 2007">Google Maps Ooops</a></li>
<li><a href="http://www.allthingssem.com/ranking-for-anything-everything-and-nothing/" rel="bookmark" title="October 23, 2007">Ranking for Anything, Everything, and Nothing</a></li>
<li><a href="http://www.allthingssem.com/penalized-by-google/" rel="bookmark" title="July 13, 2007">Penalized by Google</a></li>
<li><a href="http://www.allthingssem.com/shreddies-flatulence/" rel="bookmark" title="February 6, 2009">Shreddies = Flatulence Underwear</a></li>
<li><a href="http://www.allthingssem.com/bundling-content-to-target-keywords/" rel="bookmark" title="May 8, 2007">Bundling Content to Target Keywords</a></li>
</ul>
<p><!-- Similar Posts took 13.134 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/eJ2PETZYEWE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/tracking-universal-results-and-rankings/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/tracking-universal-results-and-rankings/</feedburner:origLink></item>
		<item>
		<title>If Only Circuit City Had Asked Me&#x2026;</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/KthNeB3GH-4/</link>
		<comments>http://www.allthingssem.com/if-only-circuit-city-asked-me/#comments</comments>
		<pubDate>Tue, 20 Jan 2009 21:30:16 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Marketing]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=496</guid>
		<description><![CDATA[So, CircuitCity.com, a massive web site went down recently. All access (even to the robots.txt file) to the site is redirected to a single page that talks about the chain's closing of stores in the US. Of course, I couldn't help but think of better ways to make use of this domain instead of shutting it down.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>So, CircuitCity.com, a massive web site went down recently. All access (even to the robots.txt file) to the site is redirected to a single page that talks about the chain&#039;s closing of stores in the US. Of course, I couldn&#039;t help but think of better ways to make use of this domain instead of shutting it down.</p>
<ol>
<li>Arrange with a competitor such as Best Buy to redirect the traffic to them. This has got to be worth something, no?</li>
<li>Replace every link on CircuitCity.com with affiliate links. True, this would be a great technical undertaking, but I?m sure some of the newly unemployed web developers would&#039;ve pitched in their time for a cut.</li>
<li>Keep the site as is but put up some ads (e.g. AdSense). Both CPM or CPC should work.</li>
</ol>
<p>I suppose any money coming in would just go out to investors and lawyers so maybe this would all be moot&#8230;</p>
<p>I&#039;d also like to use this post to mark the shuttering of the doors to learn what there is to learn from the disappearance of the site. At the moment, Google is reporting 1,170,000 indexed pages. There&#039;s a 302 redirect in place rather than a 301 redirect or even a 404 not found. I wonder how long before Google drops all of the content.</p>
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/100000-dollars-from-google/" rel="bookmark" title="June 10, 2006">100,000 Dollars from Google</a></li>
<li><a href="http://www.allthingssem.com/more-money-from-your-web-site/" rel="bookmark" title="June 5, 2006">More Money From Your Web Site</a></li>
<li><a href="http://www.allthingssem.com/dont-forget-about-your-landing-pages/" rel="bookmark" title="February 14, 2007">Don&#039;t Forget About Your Landing Pages</a></li>
<li><a href="http://www.allthingssem.com/adwords-for-site-traffic/" rel="bookmark" title="August 1, 2006">AdWords for Site Traffic</a></li>
<li><a href="http://www.allthingssem.com/money-in-free-ringtones/" rel="bookmark" title="June 17, 2006">Money in Free Ringtones</a></li>
</ul>
<p><!-- Similar Posts took 14.690 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/KthNeB3GH-4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/if-only-circuit-city-asked-me/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/if-only-circuit-city-asked-me/</feedburner:origLink></item>
		<item>
		<title>10 Graphs Reveal Web Spam Patterns</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/XsvqS6IJsXQ/</link>
		<comments>http://www.allthingssem.com/graphs-reveal-web-spam-patterns/#comments</comments>
		<pubDate>Mon, 19 Jan 2009 00:54:43 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=495</guid>
		<description><![CDATA[They say a picture is worth a 1,000 words. So here's 10,000 words of web spam data from a research paper titled <a href="http://www.allthingssem.com/research-papers/detecting-spam-web-pages-through-content-analysis.pdf">Detecting Spam Web Pages through Content Analysis</a> by Alexandros Ntoulas et. al.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>They say a picture is worth a 1,000 words. So here&#039;s 10,000 words of web spam data from a research paper titled <a href="http://www.allthingssem.com/research-papers/detecting-spam-web-pages-through-content-analysis.pdf">Detecting Spam Web Pages through Content Analysis</a> by Alexandros Ntoulas et. al.</p>
<p><strong>Note:</strong> The pink line represents the probability of spam.</p>
<h2>1. Top Level Domain</h2>
<p>Relatively speaking, there is more spam on .biz domains than on other domains.<br />
<img src="http://www.allthingssem.com/images/web-spam-top-level-domain.png" alt="Web Spam: Top Level Domain"/></p>
<h2>2. Language</h2>
<p>And apparently spam is quite popular with the French.<br /> <br />
<img src="http://www.allthingssem.com/images/web-spam-language.png" alt="Web Spam: Language"/></p>
<h2>3. Compression Ratio</h2>
<p>Repeated (e.g. keyword stuffing) words generally lead to better compression rates.<br />
<img src="http://www.allthingssem.com/images/web-spam-compression-ratio.png" alt="Web Spam: Compression Ratio"/></p>
<h2>4. Visible Content</h2>
<p>Not to be confused with hidden content, visible content in this context basically refers to the code to text ratio. Guess what? A page with a higher ratio of code is actually less likely to be spam.<br />
<img src="http://www.allthingssem.com/images/web-spam-fraction-visible-content.png" alt="Web Spam: Fraction of Visible Content"/></p>
<h2>5. Number of Words</h2>
<p>Longer documents are more likely to be spam.<br />
<img src="http://www.allthingssem.com/images/web-spam-number-of-words.png" alt="Web Spam: Number of Words"/></p>
<h2>6. Average Word Length</h2>
<p>Long words, generally formed by combining other words such as freemp3, result in an above average word length.<br />
<img src="http://www.allthingssem.com/images/web-spam-average-word-length.png" alt="Web Spam: Average Word Length"/></p>
<h2>7. Number of Words in Title</h2>
<p>This one shouldn&#039;t surprise anyone. Stuffing the keyword title is common with spammers.<br />
<img src="http://www.allthingssem.com/images/web-spam-number-of-words-in-title.png" alt="Web Spam: Number of Words in Title"/></p>
<h2>8. Words in Anchor Text</h2>
<p>Too large a proportion of text in link text is a good indicator for web spam.<br />
<img src="http://www.allthingssem.com/images/web-spam-fraction-anchor-words.png" alt="Web Spam: Fraction of Text as Anchor Words"/></p>
<h2>9 &#038; 10. Most Frequent Words in Corpus Common with Text</h2>
<p>The fraction of words on a page that are contained in the set of 200 or 500 words that occur most frequently in the English portion of the research paper authors&#039; 105 million document corpus.</p>
<p><img src="http://www.allthingssem.com/images/web-spam-fraction-200-common-corpus.png" alt="Web Spam: Fraction of 200 Most Frequent Words in Corpus Common with Text"/></p>
<p><img src="http://www.allthingssem.com/images/web-spam-fraction-500-common-corpus.png" alt="Web Spam: Fraction of 500 Most Frequent Words in Corpus Common with Text"/><br />
<br clear="all"/>
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/web-spam-taxonomy/" rel="bookmark" title="January 6, 2009">Web Spam Taxonomy</a></li>
<li><a href="http://www.allthingssem.com/frank-takes-on-pagerank/" rel="bookmark" title="November 17, 2008">fRank Takes on PageRank</a></li>
<li><a href="http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/" rel="bookmark" title="January 8, 2009">Detecting Splogs with Self-Similarity Analysis</a></li>
</ul>
<p><!-- Similar Posts took 10.739 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/XsvqS6IJsXQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/graphs-reveal-web-spam-patterns/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/graphs-reveal-web-spam-patterns/</feedburner:origLink></item>
		<item>
		<title>Detecting Cloaking Algorithmically Is Not Easy</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/VsEhchh0pZU/</link>
		<comments>http://www.allthingssem.com/detecting-cloaking-algorithmically/#comments</comments>
		<pubDate>Wed, 14 Jan 2009 11:33:24 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=493</guid>
		<description><![CDATA[Recently Matt Cutts <a href="http://www.mattcutts.com/blog/webspam-in-2009/">posted a question on his blog</a> asking for what people thought Google's Web Spam team should focus on next. Mixed in amongst the answers were requests to eliminate cloaking. Some even went so far as to list offending sites. What's interesting to me is that since cloaking isn't new, is there something tricky with its detection that has kept Google from eliminating it from their results?<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>Recently Matt Cutts <a href="http://www.mattcutts.com/blog/webspam-in-2009/">posted a question on his blog</a> asking for what people thought Google&#039;s Web Spam team should focus on next. Mixed in amongst the answers were requests to eliminate cloaking. Some even went so far as to list offending sites. What&#039;s interesting to me is that since cloaking isn&#039;t new, is there something tricky with its detection that has kept Google from eliminating it from their results?</p>
<p>I can install a <a href="https://addons.mozilla.org/en-US/firefox/addon/59">Firefox add-on</a> to set my browser&#039;s user-agent to match that of the GoogleBot and spend the rest of the day identifying cloaking with near 100% accuracy. So what&#039;s been keeping the PhD-filled Google complex from wrapping what I can do into an automated process? I had no idea what the answer to that question was until I read through a research paper titled <a href="http://www.allthingssem.com/research-papers/cloaking-and-redirection-a-preliminary-study.pdf">Cloaking and Redirection: A Preliminary Study</a> by Boening Wu and Brian D. Davison of Lehigh University.</p>
<h2>Cloaking in Brief</h2>
<p>Cloaking on the web, in case this is the first time you&#039;ve come across the term, refers to the delivery of content to users that is different than what is delivered to search engines. The motivation behind doing so is to obtain rankings in search engines while driving subsequent visitors to some action without providing the promised content. For example, a site may require visitors to sign up before seeing the content, but those very same visitors wouldn&#039;t ever see the content if it didn&#039;t rank in search engines. Cloaking to the rescue &#8212; show the full content to search engines, but swap in a registration form for human visitors.</p>
<h2>Methods for Detecting Cloaking</h2>
<p>At first blush it seems that detecting a cloaked page is a simple matter of comparing the content delivered to a search engine vs. the content delivered to a browser. Unfortunately, this comparison turns out to be a non-trivial task. Some reasons for this are that some sites:</p>
<ol>
<li>Change their content frequently (e.g. news sites) so comparing two different copies would yield a false positive.</li>
<li>Rotate through content and show something different for each request (e.g. a page the profiles different people).</li>
<li>Serve &#034;clean&#034; versions of their content to search engines (e.g. by removing advertising). There&#039;s no malicious intent with this sort of activity.</li>
<li>Include dynamic elements such as a time stamps that make every version of the page content unique.</li>
</ol>
<p>Back in 2003, M. Njork filed a patent for a system and method for identifying cloaked web servers. He proposed using a browser toolbar installed by users to compare pages to what was stored by a search engine. The problem with his proposed solution is that it doesn&#039;t take into account any of the 4 items above. Fortunately, I don&#039;t believe such a toolbar ever made it to the masses.</p>
<p>Wu and Davison, authors of the research paper I mentioned above, proposed a few alternate methods for detecting cloaking. The first looks at the terms on the page using the following algorithm:</p>
<ul>
<li>Capture three copies of a web page. Two by a crawler (C1, C2) and one by a browser (B1).</li>
<li>Parse the HTML into terms and count only the number of unique terms.</li>
<li>Compare the counts for C1 and C2 (call this NCC) along with C1 and B1 (call this NBC).</li>
<li>If NBC is greater than NCC, mark the page as cloaking candidate. Note that the threshold used for what constitutes a significant difference between NBC and NCC can be changed to achieve desired level of precision vs. recall.</li>
</ul>
<p>Another approach compares the link counts on the pages in a similar fashion as to the steps described above for comparing terms. The results revealed that link comparisons identify fewer instances of cloaking than term comparisons, but the results are more accurate.</p>
<p>The final method requires an additional copy of the browser version of the page bring the total to 4: C1, C2, B1, and B2. The assumption for this third method is that deliberate cloaking will return a set of specific terms (chosen by the spammer) to search engines, but never to users. So, if C1 and C2 have common terms that don&#039;t appear at all in B1 or B2, it&#039;s likely the page is cloaking. Again, a threshold would be required to reduce false positives for insignificant differences.</p>
<p>Unfortunately, all three of these methods are flawed (even the others say so) and will result in many pages being marked as engaged in cloaking because they fail to distinguish between &#034;acceptable&#034; cloaking and that which results in web spam. In addition, search engines would be required to capture 3 or 4 copies of the same page which would add to the already daunting task of crawling and storing web content. </p>
<p>So what do you say? Should we cut Google a little slack?
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/web-spam-taxonomy/" rel="bookmark" title="January 6, 2009">Web Spam Taxonomy</a></li>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/frank-takes-on-pagerank/" rel="bookmark" title="November 17, 2008">fRank Takes on PageRank</a></li>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/graphs-reveal-web-spam-patterns/" rel="bookmark" title="January 18, 2009">10 Graphs Reveal Web Spam Patterns</a></li>
</ul>
<p><!-- Similar Posts took 11.631 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/VsEhchh0pZU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/detecting-cloaking-algorithmically/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/detecting-cloaking-algorithmically/</feedburner:origLink></item>
		<item>
		<title>Detecting Splogs with Self-Similarity Analysis</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/aR5tFsIJHcc/</link>
		<comments>http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/#comments</comments>
		<pubDate>Thu, 08 Jan 2009 13:53:50 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=490</guid>
		<description><![CDATA[A while back Google's search results were littered with splogs (spam blogs). It was common to search for a term, click on the first result, and land on a page that had advertising fill the area above the fold and useless content below. I'm not sure when, but the problem must have reached critical mass because Google cleaned up the results. And as with most things Google, the cleanup was very likely algorithmic and automated. Ever wonder how they might have accomplished this feat? A research paper from May 8, 2007 titled <a href="http://www.allthingssem.com/research-papers/splog-detection-using-self-similarity-analysis.pdf">Splog Detection Using Self-Similarity Analysis on Blog Temporal Dynamics</a> may be the answer.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>A while back Google&#039;s search results were littered with splogs (spam blogs). It was common to search for a term, click on the first result, and land on a page that had advertising fill the area above the fold and useless content below. I&#039;m not sure when, but the problem must have reached critical mass because Google cleaned up the results. And as with most things Google, the cleanup was very likely algorithmic and automated. Ever wonder how they might have accomplished this feat? A research paper from May 8, 2007 titled <a href="http://www.allthingssem.com/research-papers/splog-detection-using-self-similarity-analysis.pdf">Splog Detection Using Self-Similarity Analysis on Blog Temporal Dynamics</a> may be the answer.</p>
<p>Blogs differ from regular web sites suggesting that the splog detection process needs to be different than what is done to detect <a href="http://www.allthingssem.com/web-spam-taxonomy/">typical web spam</a>. With web spam, content is usually static whereas splogs typically employ an automated framework to regularly generate new content. This constant change is what enables temporal analysis of both the content and link structure which in turn reveals splog-identifying patterns or anomalies.</p>
<p>Why go through all this trouble? Splogs not only degrade the end-users experience, but they are a significant tax on network and storage resources. Splogs have a established a significant foothold as can be seen from some data the authors collected using primary and secondary research:</p>
<ul>
<li>identified 2.7 million splogs out of 20.3 million (over 10% splogs)</li>
<li>an average of 44 of the top 100 blogs search results in the three popular blog search engines came from splogs</li>
<li>75% of new pings come from splogs</li>
<li>50% of claimed blogs pinging the website weblogs.com are splogs</li>
</ul>
<h2>Characteristics of a Splog</h2>
<p>When evaluating a splog, the evaluation is done at a site level rather than at a page level. Also, if spam elements such as trackback spam on an otherwise legitimate blog don&#039;t make that blog a splog. With this in mind, here are the typical characteristics of a splog:</p>
<ul>
<li>Content that is generated algorithmically including random gibberish, copied snippets from other sites, and weaved content.</li>
<li>No added value e.g. the content is nonsensical or is a duplicate of another site.</li>
<li>A hidden agenda likely one tied to generating advertising revenue. Arguably most blogs have some agenda other than the desire to share and communicate so this item alone doesn&#039;t mean a blog is a splog.</li>
</ul>
<h2>Self-Similarity Characteristics</h2>
<p>The machine-generated nature of splogs make them good candidates for statistical assessment. This assessment can include looking at:</p>
<ol>
<li>Post Time: Two measures that capture regularity in posting time (micro) e.g. posts go live in the morning before the blogger&#039;s &#034;real&#034; job as well as a macro time view e.g. a large gap in posting due to a vacation.</li>
<li>Post Content: A measure of the topic drift by the blogger. Commonly a blogger will remain focused on a topic, but will sometimes write about other topics.</li>
<li>Post Links: The links on the blog can be telling. A large proportion directed to a particular domain, for example, suggests a relationship with that destination domain.</li>
</ol>
<p>The math associated with gathering this data and developing the signatures for identifying a blog vs. splog are quite complex. For details, I recommend downloading the PDF I linked to above. However, as you can see below, the authors did provide strong evidence, in the form of visuals, that the patterns that splogs produce are distinctive.</p>
<p><img src="http://www.allthingssem.com/images/self-similarity-plots-for-post-time-contents-links.png" alt="Self Similarity Plots for Post Time, Contents, and Links"/><br clear="all"/>
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/graphs-reveal-web-spam-patterns/" rel="bookmark" title="January 18, 2009">10 Graphs Reveal Web Spam Patterns</a></li>
<li><a href="http://www.allthingssem.com/web-spam-taxonomy/" rel="bookmark" title="January 6, 2009">Web Spam Taxonomy</a></li>
<li><a href="http://www.allthingssem.com/detecting-cloaking-algorithmically/" rel="bookmark" title="January 14, 2009">Detecting Cloaking Algorithmically Is Not Easy</a></li>
</ul>
<p><!-- Similar Posts took 11.508 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/aR5tFsIJHcc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/</feedburner:origLink></item>
		<item>
		<title>Using Link Structures to Classify Web Spam</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/E9oNVu2SZJs/</link>
		<comments>http://www.allthingssem.com/using-link-structures-to-classify-spam/#comments</comments>
		<pubDate>Wed, 07 Jan 2009 11:25:12 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=488</guid>
		<description><![CDATA[In an earlier post I summarized content from a research paper that provided a <a href="http://www.allthingssem.com/web-spam-taxonomy/">Web Spam Taxonomy</a>. That paper is a few years old, but I believe it still provided a good foundation for discussions regarding web spam. In this post, I'm going to walk through a document titled <a href="http://www.allthingssem.com/research-papers/improving-web-spam-classifiers-using-link-structure.pdf">Improving Web Spam Classifiers Using Link Structure</a> written by Qingqing Gan and Torsten Suel of the CIS Department at Polytechnic University in Brooklyn, NY. In the world of information retrieval research, this research paper is quite current having been publishing in the May, 2007.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>In an earlier post I summarized content from a research paper that provided a <a href="http://www.allthingssem.com/web-spam-taxonomy/">Web Spam Taxonomy</a>. That paper is a few years old, but I believe it still provided a good foundation for discussions regarding web spam. In this post, I&#039;m going to walk through a document titled <a href="http://www.allthingssem.com/research-papers/improving-web-spam-classifiers-using-link-structure.pdf">Improving Web Spam Classifiers Using Link Structure</a> written by Qingqing Gan and Torsten Suel of the CIS Department at Polytechnic University in Brooklyn, NY. In the world of information retrieval research, this research paper is quite current having been publishing in the May, 2007.</p>
<p>Spam, in the context of this research paper, falls into two categories: content spam and link spam. There has been a lot of research focused on dealing with web spam (some of which I will get to in other posts) such as:</p>
<ul>
<li>Propogating distrust by reversing links aka BadRank.</li>
<li>Promoting trust from good sites in order to demote spam aka TrustRank.</li>
<li>Using statistical analysis to identify spam since spam pages typically vary from the average page.</li>
<li>Developing a SpamRank metric that uses PageRank value distribution in the in-coming pages.</li>
</ul>
<p>This research paper describes a basic classifier (consisting of 20+ content and link-based features) which is then enhanced in two different ways by integrating additional neighborhood features. </p>
<h2>Content Features</h2>
<ul>
<li>Number of words in a page.</li>
<li>Average length of words in a page.</li>
<li>Fraction of words drawn from globally popular words.</li>
<li>Fraction of globally popular words used in a page.</li>
<li>Fraction of visible content. This sounds like code to content ratio.</li>
<li>Number of words in the page title.</li>
<li>Amount of anchor text in a page to help detect pages stuffed with links.</li>
<li>Compression rate of the page, using gzip. Not sure what this one is for, but I&#039;m guessing normal text compresses a certain amount while spam doesn&#039;t.</li>
</ul>
<h2>Link Features</h2>
<ul>
<li>Percentage of pages in most populated level.</li>
<li>Top level page expansion ratio.</li>
<li>In-links per page.</li>
<li>Out-links per page.</li>
<li>Out-links per in-link.</li>
<li>Top-level in-link portion.</li>
<li>Average level of in-links.</li>
<li>Average level of out-links.</li>
<li>Percentage of in-links to most popular level.</li>
<li>Percentage of out-links from most emitting level.</li>
<li>Cross-links per page.</li>
<li>Top-level internal in-links per page on this site.</li>
<li>Average level of page in this site.</li>
</ul>
<h2>Global Features</h2>
<ul>
<li>Number of hosts in the domain. The more, the more likely spam.</li>
<li>Ratio of pages in this host to pages in the domain.</li>
<li>Number of hosts on the same IP address. Makes you wonder about IP sharing hosts like MediaTemple, no?</li>
</ul>
<p>Combined, the above features would provide a score for any given page with some measure of confidence that it is or isn&#039;t spam. To refine this score, the authors also included neighborhood data. That is, by looking at what sites are pointing into and what sites are pointed out to, the classification accuracy improves. Namely, that spam sites generally point to a lot of other spam and spam sites generally receive a disproportionate number of links from other spam sites. So combining the feature data with the neighborhood data results in spam score. But the authors wanted to improve on this basic classification and did so in two ways. One of the methods is decribed below, the other isn&#039;t because I couldn&#039;t wrap my head around the description. I suggest downloading the PDF if you&#039;re interested.</p>
<h2>Relabeling</h2>
<p>Simply put, the relabeling approach aims to flip the label of a site from non-spam to spam or vice-versa. To do this, each site in a neighborhood is given a label with a certain level of confidence. The label from each site comes about by using the feature set listed above. This neighborhood label is then compared to the site&#039;s label. If the labels disagree and there is more confidence in the neighborhood&#039;s, the site&#039;s label is switched to match that of the neighborhood. Otherwise the label stays the same. </p>
<p>Boiled down, this translates into your site being judged by the friends it keeps.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/graphs-reveal-web-spam-patterns/" rel="bookmark" title="January 18, 2009">10 Graphs Reveal Web Spam Patterns</a></li>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/web-spam-taxonomy/" rel="bookmark" title="January 6, 2009">Web Spam Taxonomy</a></li>
<li><a href="http://www.allthingssem.com/frank-takes-on-pagerank/" rel="bookmark" title="November 17, 2008">fRank Takes on PageRank</a></li>
<li><a href="http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/" rel="bookmark" title="January 8, 2009">Detecting Splogs with Self-Similarity Analysis</a></li>
</ul>
<p><!-- Similar Posts took 11.828 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/E9oNVu2SZJs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/using-link-structures-to-classify-spam/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/using-link-structures-to-classify-spam/</feedburner:origLink></item>
		<item>
		<title>Web Spam Taxonomy</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/z-5-7yt5DO8/</link>
		<comments>http://www.allthingssem.com/web-spam-taxonomy/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 11:31:49 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=486</guid>
		<description><![CDATA[I came across an interesting research paper the other day titled <a href="http://www.allthingssem.com/research-papers/web-spam-taxonomy.pdf">Web Spam Taxonomy</a>. How could I resist that title!? The paper was written by Zoltan Gyongyi and Hector Garcia-Molina while in the Computer Science Department of Stanford University. The authors also acknowledge many discussions with an anonymous collaborator at a major search engine as a source of information.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>I came across an interesting research paper the other day titled <a href="http://www.allthingssem.com/research-papers/web-spam-taxonomy.pdf">Web Spam Taxonomy</a>. How could I resist that title!? The paper was written by Zoltan Gyongyi and Hector Garcia-Molina while in the Computer Science Department of Stanford University. The authors also acknowledge many discussions with an anonymous collaborator at a major search engine as a source of information.</p>
<p>In the context of this research paper, all SEOs are spammers since their activities are intended to boost rankings of a page without actually improving the quality of that page. The authors call out techdictionary.com as an example of a web spam. They also mention SEO Inc. and Bruce Clay as important voices in the web spam arena. Curious choices.</p>
<p>Web spam is detrimental to search engines in two ways because it:</p>
<ol>
<li>reduces the quality of search results</li>
<li>increases the cost of each processed query due to the storage and retrieval of useless pages</li>
</ol>
<p>The techniques for generating web spam have two classifications: boosting and hiding. Boosting focuses on improving rankings while hiding focuses on preventing the search engines from detecting the boosting techniques.</p>
<h2>Term Spamming</h2>
<p>Possibly the simplest form of spamming, term spamming centers around the fundamental TFIDF metric (term frequency / inverse document frequency). In practical terms this metric means that if a keyword appears 4 times out of a total of 40 keywords then the TF (term frequency) is 0.1 while the IDF (inverse document frequency) is 10. The TFIDF score of a page for a given query is computed over all common terms as the sum of the products of the TF and IDF values. With TFIDF in mind, spammers either aim to target a small number of keywords with a high TFIDF score or to receive a non-zero TFIDF for a lot of terms. </p>
<p>Page elements that make good targets for spam include the body, title, meta tag, anchor text, and URLs. Some techniques include:</p>
<ul>
<li>Repetition of keywords in the page elements to increase relevance for a small number of queries.</li>
<li>Dumping a large number of unrelated terms to increase the number of queries (mostly long tail) that the document would be relevant for.</li>
<li>Weaving which uses copied text to surround the spam keywords. Certainly wouldn&#039;t pass a visual inspection, but could trick a search engine.</li>
<li>Phrase stitching which combines different snippets of content to create new content.</li>
</ul>
<h2>Link Spamming</h2>
<p>It doesn&#039;t take long before someone new to SEO comes across articles about how links are key to rankings. So it&#039;s no wonder that link spamming exists. For spammers, links are categorized as those that are:</p>
<ol>
<li>inaccessible i.e. can&#039;t be modified</li>
<li>accessible, but not owned i.e. can be modified indirectly such as comments in a forum</li>
<li>owned i.e. complete control to modify</li>
</ol>
<p>Targeting the HITS algorithm&#039;s emphasis on hub and authority scores is a matter of creating pages that link to high quality sites to increase that page&#039;s hub score. One common technique is to mirror a directory. Using many high hub score pages to point to a chosen target can then result in that target page obtaining a high authority score. This, as you can imagine, can be accomplished with accessible and owned pages.</p>
<p>Google&#039;s PageRank algorithm, despite its success, is susceptible to manipulation as well. PageRank places great importance on a page&#039;s incoming links. Generally, the more links the better although there is increased benefit from &#034;high quality&#034; links. Enter the link farm &#8212; an artificial construction of interlinked sites that in its purest form has:</p>
<ol>
<li>All available owned pages part of the link farm</li>
<li>All accessible pages pointing to the spam farm</li>
<li>All links pointing outside of the spam farm suppressed</li>
<li>All pages within the farm having some outgoing links</li>
</ol>
<p>Aside from link farms, all methods of obtaining inbound links are fair game. For example, anyone that has successfully toyed with social media has already figured out one of the ways links can be accumulated i.e. use useful/interesting content to attract links. A spammer goes one step further to then use that link equity to boost low quality pages. Other techniques include infiltrating a web directory, posting links in forums / blog comments, participating in link exchanges, and buying expired domains.</p>
<h2>Content Hiding</h2>
<p>Once popular, but not particularly effective anymore, is hiding content using background and foreground colors that match. Hiding links is only slightly harder and can be achieved with 1&#215;1 pixel images. CSS brought with it a few new tricks such as setting page elements to be not visible along with other tricks like negative indents.</p>
<h2>Cloaking</h2>
<p>A more sophisticated hiding technique is cloaking. Sending different content to search engines vs. visitors is what cloaking is all about. Detecting the user-agent and responding accordingly is one method, but IP cloaking is much harder to detect.</p>
<h2>Redirection</h2>
<p>Allowing search engines to index a page, but redirecting human visitors to another page is another hiding technique. As with cloaking, the idea is to show search engines something while showing human visitors something else. The most common techniques are a meta refresh and a JavaScript redirect. Neither are executed by search engines, but both are apparently detectable now.</p>
<p>This web spam taxonomy, despite being somewhat dated (the document itself has no date, but the bibliography references other papers from 2005 so it&#039;s no older than that), makes for a nice segue into a series of web spam detection posts I have in the works. Stay tuned!
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/graphs-reveal-web-spam-patterns/" rel="bookmark" title="January 18, 2009">10 Graphs Reveal Web Spam Patterns</a></li>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/detecting-cloaking-algorithmically/" rel="bookmark" title="January 14, 2009">Detecting Cloaking Algorithmically Is Not Easy</a></li>
<li><a href="http://www.allthingssem.com/history-of-latent-semantic-indexing/" rel="bookmark" title="April 8, 2007">The History of Latent Semantic Indexing</a></li>
</ul>
<p><!-- Similar Posts took 12.663 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/z-5-7yt5DO8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/web-spam-taxonomy/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/web-spam-taxonomy/</feedburner:origLink></item>
		<item>
		<title>Are Search Engines Crawling Links in HTML Comments?</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/FOndL6sDVU8/</link>
		<comments>http://www.allthingssem.com/search-engines-crawling-html-comment-links/#comments</comments>
		<pubDate>Wed, 03 Dec 2008 12:59:14 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Experiments]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=482</guid>
		<description><![CDATA[I was looking through a client's data in Google Webmaster Tools recently and noticed tens of thousands of bad internal links. All of these links could be boiled down to a handful of patterns which struck me as odd because I should've noticed these bad links before. A little digging revealed that Google has been aggressively crawling links that are in JavaScript code.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>I was looking through a client&#039;s data in Google Webmaster Tools recently and noticed tens of thousands of bad internal links. All of these links could be boiled down to a handful of patterns which struck me as odd because I should&#039;ve noticed these bad links before. A little digging revealed that Google has been aggressively crawling links that are in JavaScript code.</p>
<p>I could sort of see search engines following fully qualified links (i.e. with http and a full path) in JavaScript code since there&#039;s a good chance those links will lead to a real page. However, the links I found were part of a concatenation of strings that required execution of the JavaScript to actually be valid. Here&#039;s what I mean:</p>
<p>link = &#039;http://&#039; + somevariable + &#039;/somepage.php&#039; + someothervariable;</p>
<p>Google decided that it would be a good idea to check out &#039;/somepage.php&#039; which of course is invalid without the variables before and after it.</p>
<p>So what this long-winded intro leads me to is the question of what is the current state of affairs at Google (and the other search engines) when it comes to links in HTML comments? I should have an answer to that in a few days&#8230;</p>
<p><strong>Update: January 8, 2009</strong><br />
My experiments suggest Google isn&#039;t crawling links that are within HTML comment tags.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/noscript-and-seo/" rel="bookmark" title="February 18, 2007">NoScript and SEO</a></li>
<li><a href="http://www.allthingssem.com/duplicate-content-experiment/" rel="bookmark" title="May 24, 2007">Duplicate Content Experiment</a></li>
<li><a href="http://www.allthingssem.com/do-deep-folders-stop-search-engines/" rel="bookmark" title="January 4, 2008">Do Deep Folders Stop Search Engines?</a></li>
<li><a href="http://www.allthingssem.com/seo-for-images/" rel="bookmark" title="March 31, 2007">SEO for Images</a></li>
<li><a href="http://www.allthingssem.com/exploiting-google-geographic-targeting/" rel="bookmark" title="November 7, 2007">Exploiting Google&#039;s Geographic Targeting</a></li>
</ul>
<p><!-- Similar Posts took 15.131 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/FOndL6sDVU8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/search-engines-crawling-html-comment-links/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/search-engines-crawling-html-comment-links/</feedburner:origLink></item>
		<item>
		<title>Is BrowseRank The New PageRank?</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/4vK_XDq6--o/</link>
		<comments>http://www.allthingssem.com/is-browserank-the-new-pagerank/#comments</comments>
		<pubDate>Fri, 21 Nov 2008 11:38:26 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=480</guid>
		<description><![CDATA[I've been on a research paper reading binge recently. I've got about 10 or so under my belt now in just the last couple of weeks. I've discovered they make great reading on my train ride to work. Relatively short and to the point. Sure they're often full of crazy math formulas, but those are easy to gloss over and instead concentrate on the discussion. Many of the papers were written years ago. Despite their age, the information is... ummm you know... informative. I mean that. My most recent reading, <a href="http://www.allthingssem.com/research-papers/browserank.pdf">BrowseRank: Letting Web Users Vote for Page Importance</a>, is actually from 2008 which makes it both informative and relevant to future SEO efforts.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>I&#039;ve been on a research paper reading binge recently. I&#039;ve got about 10 or so under my belt now in just the last couple of weeks. I&#039;ve discovered they make great reading on my train ride to work. Relatively short and to the point. Sure they&#039;re often full of crazy math formulas, but those are easy to gloss over and instead concentrate on the discussion. Many of the papers were written years ago. Despite their age, the information is&#8230; ummm you know&#8230; informative. I mean that. My most recent reading, <a href="http://www.allthingssem.com/research-papers/browserank.pdf">BrowseRank: Letting Web Users Vote for Page Importance</a>, is actually from 2008 which makes it both informative and relevant to future SEO efforts.</p>
<p>BrowseRank is a measure developed by Microsoft with the purpose of outperforming Google&#039;s PageRank. Such an effort shouldn&#039;t come as a surprise as there is always a better way to build a mousetrap. Even Google has acknowledged that PageRank is no longer as effective as it once was given their devaluing of it when it comes to ranking search results. More specifically, BrowseRank set out to address two shortcomings with PageRank:</p>
<ol>
<li>PageRank relies on the link graph, but this link graph is easily manipulated by spammers who automate the creation of pages and links in numbers that are almost unimaginable. Of course these links are worthless for determining true page quality or value.</li>
<li>PageRank ignores the time spent on a page. That is, if a user spends 5 minutes reading a page, that page is likely more valuable than one that takes just 10 seconds to read. PageRank is primarily concerned with where a user ends up via clicking on links, but it completely misses out on the feedback available by time spent on a page.</li>
</ol>
<p>Using a dataset of 950 million unique URLs and 3 billion data points consisting of URL visited, time visited, and whether the visit originated from a link, the Microsoft researchers determined that BrowseRank outperformed both PageRank and TrustRank. This improvement was seen at both the web site-level and the page-level. That&#039;s pretty impressive given that just three metrics were used. Improvement, in this case, means a higher ranking for pages that users actually find engaging such as MySpace, Facebook, and YouTube rather than pages that are just linked to a lot such as Adobe (e.g. to download Acrobat Reader).</p>
<p>In addition, BrowseRank was more effective at filtering spam because it doesn&#039;t rely on easy-to-manipulate link data (critical to PageRank) or a potentially faulty seed list (critical to TrustRank).</p>
<p>I think this idea of user feedback is already in play at search engines. You can see strong evidence of this just by looking at what might come of the data already being collected by toolbars. And acquisitions of social media sites which provide strong, real-time signals of user interest are surely not just plays for additional advertising revenue. More tangible evidence is apparent with Google&#039;s almost immediate ranking of hot submissions on social bookmarking sites which wouldn&#039;t be happening as fast if PageRank had to be calculated.</p>
<p>Can you imagine the world of SEO where links had no direct ranking value? It&#039;d be like every link on the web suddenly had a nofollow slapped on it. Links would then return to their original purpose of providing a jump point for additional information. An equally important to question to ask is whether your business is capable of surviving with such a change.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/frank-takes-on-pagerank/" rel="bookmark" title="November 17, 2008">fRank Takes on PageRank</a></li>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/web-spam-taxonomy/" rel="bookmark" title="January 6, 2009">Web Spam Taxonomy</a></li>
<li><a href="http://www.allthingssem.com/detecting-cloaking-algorithmically/" rel="bookmark" title="January 14, 2009">Detecting Cloaking Algorithmically Is Not Easy</a></li>
</ul>
<p><!-- Similar Posts took 11.577 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/4vK_XDq6--o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/is-browserank-the-new-pagerank/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/is-browserank-the-new-pagerank/</feedburner:origLink></item>
		<item>
		<title>InLinks Will Fail Like Others Have Before It</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/L3_Vlc0KtHc/</link>
		<comments>http://www.allthingssem.com/inlinks-will-fail-like-others-have-before-it/#comments</comments>
		<pubDate>Thu, 20 Nov 2008 11:48:22 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=481</guid>
		<description><![CDATA[There's a lot of hubhub about a new paid link program from the MediaWhiz folks. I'm on the side of everyone that thinks these links will eventually be detected by Google. My reasons are simple: if the system works it will attract enough use that such use will leave footprints. It's sort of like an investment system that works until so many people know about it that the market adjusts to account for the new trading patterns.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>There&#039;s a lot of hubbub about a new paid link program from the MediaWhiz folks. I&#039;m on the side of everyone that thinks these links will eventually be detected by Google. My reasons are simple: if the system works it will attract enough use that such use will leave footprints. It&#039;s sort of like an investment system that works until so many people know about it that the market adjusts to account for the new trading patterns.</p>
<p>There are two flaws with the system that I see. Admittedly, I&#039;m going off what I&#039;ve read. I&#039;m neither an Inlinks publisher or advertiser so I have no first hand knowledge.</p>
<p>1. Advertisers will not be smart about their text link usage. They&#039;ll target a small handful of key words leaving obvious footprints in their wake. Only a small percentage will actually spend money on poor links to balance out their purchases.</p>
<p>2. Advertisers will be at the whim of what other links the publishers accept. Even if the advertiser takes all the necessary steps to establish anchor text that is natural looking, their efforts will be in vain when the same publisher accepts other paid link offers from advertisers that are not savvy enough to hide their tracks. </p>
<p>I guess only time will tell&#8230;
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/google-toolkit/" rel="bookmark" title="November 17, 2005">Google Toolkit</a></li>
<li><a href="http://www.allthingssem.com/100000-dollars-from-google/" rel="bookmark" title="June 10, 2006">100,000 Dollars from Google</a></li>
<li><a href="http://www.allthingssem.com/css-rollovers-for-seo/" rel="bookmark" title="September 26, 2005">CSS Rollovers for SEO</a></li>
<li><a href="http://www.allthingssem.com/how-to-monetize-freeware/" rel="bookmark" title="February 21, 2007">How to Monetize Freeware</a></li>
<li><a href="http://www.allthingssem.com/keyword-ranking-checker/" rel="bookmark" title="May 28, 2006">Keyword Ranking Checker</a></li>
</ul>
<p><!-- Similar Posts took 14.165 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/L3_Vlc0KtHc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/inlinks-will-fail-like-others-have-before-it/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/inlinks-will-fail-like-others-have-before-it/</feedburner:origLink></item>
		<item>
		<title>Statistically Speaking, That Page Is Spam</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/scRPE0upM7M/</link>
		<comments>http://www.allthingssem.com/statistically-speaking-that-page-is-spam/#comments</comments>
		<pubDate>Wed, 19 Nov 2008 13:49:53 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=479</guid>
		<description><![CDATA[In a previous post I covered a Microsoft Research paper that discussed how static factors could be used to improve search ranking results above and beyond what PageRank alone could do. Rolled together, these factors formed what the authors of the paper called fRank to measure the quality of a web page. In this post I'm going to cover another research paper that looks at the other end of the quality spectrum. That is, what can be done algorithmically to identify a given page or domain as spam? Note that the basis of this post is from a 2004 SIGIR Paper titled Spam, Damn Spam, and Statistics.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<p>In a previous post I covered a Microsoft Research paper that discussed how <a href="http://www.allthingssem.com/frank-takes-on-pagerank/">static factors could be used to improve search ranking results</a> above and beyond what PageRank alone could do. Rolled together, these factors formed what the authors of the paper called fRank to measure the quality of a web page. In this post I&#039;m going to cover another research paper that looks at the other end of the quality spectrum. That is, what can be done algorithmically to identify a given page or domain as spam? Note that the basis of this post is from a 2004 SIGIR Paper titled <a href="http://www.allthingssem.com/research-papers/spam-damn-spam-and-statistics.pdf">Spam, Damn Spam, and Statistics</a>.</p>
<p>We&#039;ve all seen spam pages. More often than not we can point to a page and call it spam with a high degree of certainty. Our brains are good at quickly identifying elements of a page that immediately indicate it as spam. Alas, computers are less capable with such acts of intuition, but a certain category of spam can be easily and methodically classified as such. </p>
<p>Although numerous and beyond the scope of this research paper, some approaches to spam include:</p>
<ol>
<li>Loading pages with popular, but irrelevant keywords. Commonly called keyword stuffing.</li>
<li>Synthesizing many pages each with a narrow topic focus which in turn redirect to the page that needs to receive the traffic. Commonly called doorway pages.</li>
<li>Synthesizing many pages knowing that each will obtain a minimum PageRank which can all be channeled to the key page.</li>
</ol>
<p>The first approach is easily detected using term vector analysis. The second two are the subject of the research paper.</p>
<p>Using two datasets of 150 million URLs and 429 million URLs, respectively, the Microsoft researchers set out to demonstrate that spam pages exhibit statistical anomalies that could be used to accurately separate spam from non-spam pages. These statistical anomalies include:</p>
<ul>
<li>Length of host name. The longer the more likely it&#039;s spam.</li>
<li>Host name resolutions to the same IP. The more that resolved to the same IP, the more the pages are spam.</li>
<li>Host-machine ratio for links. The more outbound links to hosts that converge to a small set of IPs, the more likely those pages are spam.</li>
<li>The distribution of links embedded on a page vs. those pointing to a page. This distribution should follow a Zipfian Distribution and outliers are likely spam.</li>
<li>A variation in the words used across many pages while the number of words remains constant.</li>
<li>A high rate of page mutation. That is, pages that change content frequently. Note that news sites, which have just a few constantly changing pages with many otherwise static pages, do not get flagged by this measure.</li>
<li>Excessive replication of content across domains.</li>
</ul>
<p>For me there are two takeaways from this research. The first is that spam detection is not just about detecting what a page is, but also what it isn&#039;t. That is, a page that isn&#039;t like others in some via a statistical measure stands out. </p>
<p>The second takeaway is that this paper supports the notion that there is such a thing as over-optimizing a page, but the threshold is likely changing over time. By that I mean some overly exuberant SEO may push a site out of the statistical norms and into the looks like spam bucket. At the same time, the statistical norms are in flux because as more and more sites are optimized, what is normal changes with those optimization efforts.
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/graphs-reveal-web-spam-patterns/" rel="bookmark" title="January 18, 2009">10 Graphs Reveal Web Spam Patterns</a></li>
<li><a href="http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/" rel="bookmark" title="January 8, 2009">Detecting Splogs with Self-Similarity Analysis</a></li>
<li><a href="http://www.allthingssem.com/is-browserank-the-new-pagerank/" rel="bookmark" title="November 21, 2008">Is BrowseRank The New PageRank?</a></li>
<li><a href="http://www.allthingssem.com/frank-takes-on-pagerank/" rel="bookmark" title="November 17, 2008">fRank Takes on PageRank</a></li>
</ul>
<p><!-- Similar Posts took 11.228 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/scRPE0upM7M" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/statistically-speaking-that-page-is-spam/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/statistically-speaking-that-page-is-spam/</feedburner:origLink></item>
		<item>
		<title>fRank Takes on PageRank</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/D5f4IX2Ic64/</link>
		<comments>http://www.allthingssem.com/frank-takes-on-pagerank/#comments</comments>
		<pubDate>Tue, 18 Nov 2008 01:58:33 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEO Theories and Research]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/?p=478</guid>
		<description><![CDATA[Where have I been? That's the question that my readers (both of you, not including my brother) may have asked in the last couple of months. I've been where I've always been, but I've been reading much more than I've been writing. Some of that reading has been research papers of the sort put out by the International World Wide Web Conference Committee or the Special Interest Group on Information Retrieval. Fancy names for fancy groups putting out fancy research papers.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<div style="float:right; padding-left: 5px; text-align: center;"><img title="PageRank vs. Yoda" alt="Judge me by my PageRank, do you?" src="http://www.allthingssem.com/images/yoda-pagerank.jpg" /><br /><a href="http://blaugh.com/2007/06/11/link-popularity-vs-pagerank-vs-yoda/">From bLaugh</a></div>
<p>Where have I been? That&#039;s the question that my readers (both of you, not including my brother) may have asked in the last couple of months. I&#039;ve been where I&#039;ve always been, but I&#039;ve been reading much more than I&#039;ve been writing. Some of that reading has been research papers of the sort put out by the International World Wide Web Conference Committee (<a href="http://www.iw3c2.org/">IW3C2</a>) or the Special Interest Group on Information Retrieval (<a href="http://www.sigir.org/">SIGIR</a>).</p>
<p>One of these papers came out of Microsoft and is called <a href="http://www.allthingssem.com/research-papers/beyond-pagerank.pdf">Beyond PageRank: Machine Learning for Static Ranking</a>. Ooh. Anything with PageRank in the title, must be good, right? Well, actually in this case yes. The paper was an excellent read and I highly recommend it despite it being from 2006. Because I have a memory like a sieve, I&#039;ve decided to note some of the highlights for future reference.</p>
<p>A good query-independent ranking or static ranking algorithm is key for search engine success and provides:</p>
<ol>
<li>A general indication of the overall quality of a page.</li>
<li>The ability for the search engine to quickly stop searching for results once a particular threshold of quality has been passed.</li>
<li>A clue to setting the priority for what pages should be crawled first.</li>
</ol>
<p>It&#039;s generally accepted that Google&#039;s PageRank is the best method for the static ranking of Web pages, but the authors of this page have set out to demonstrate otherwise. Their argument is stated  as such:</p>
<blockquote><p>&#034;There are a number of simple URL- or page-based features that significantly outperform PageRank (for the purposes of statically ranking web pages) despite ignoring the structure of the web. We combine these and other static features using machine learning to achieve a ranking system that is significantly better than PageRank (in pairwise agreement with human labels).&#034;</p></blockquote>
<p>Pretty bold statement, right?</p>
<p>I won&#039;t attempt to describe what is meant by a machine learning approach, but some of the benefits cited by the authors include:</p>
<ul>
<li>Multiple measures that make it hard for malicious users to manipulate (especially of the measures are kept secret).</li>
<li>An algorithm that learns allows for a feature to be de-emphasized should it become subject to manipulation.</li>
<li>Taking advantage of advances in machine learning field e.g. it is apparently possible to adjust the ranking model ahead of the spammer&#039;s attempts to circumvent it.</li>
</ul>
<p>And so RankNet was born &#8212; the Microsoft researchers&#039; implementation of a &#034;modified standard neural network back-prop algorithm&#034;. And from it a new measure call fRank (for feature-based ranking).</p>
<p>The paper includes details of various experiments which are worth reading, but the gist the results is that fRank performs significantly better than PageRank despite a lack of information about the web graph. As a side benefit, fRank tends to bias pages that web users actually prefer rather than those preferred by web authors when compared to PageRank. I had to mull that one over for a while.</p>
<p>And what simple measures in combination beat the all mighty PageRank? </p>
<ul>
<li>Popularity as measured by the number of times it was visited by users over time. The MSN Toolbar provided this data. Yes, the MSN Toolbar, as with other toolbars, could very well be a factor in rankings.</li>
<li>Anchor text length and number of unique words in that text. I&#039;m not sure what length is optimal, but I guess the authors determine such a value.</li>
<li>Page elements such as number of words in the body and the frequency of the most common term.</li>
<li>PageRank as computed on 5 billion pages.</li>
<li>Domain-level elements such as the number of outlinks on any page and the average PageRank.</li>
</ul>
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/statistically-speaking-that-page-is-spam/" rel="bookmark" title="November 19, 2008">Statistically Speaking, That Page Is Spam</a></li>
<li><a href="http://www.allthingssem.com/is-browserank-the-new-pagerank/" rel="bookmark" title="November 21, 2008">Is BrowseRank The New PageRank?</a></li>
<li><a href="http://www.allthingssem.com/using-link-structures-to-classify-spam/" rel="bookmark" title="January 7, 2009">Using Link Structures to Classify Web Spam</a></li>
<li><a href="http://www.allthingssem.com/detecting-splogs-with-self-similarity-analysis/" rel="bookmark" title="January 8, 2009">Detecting Splogs with Self-Similarity Analysis</a></li>
<li><a href="http://www.allthingssem.com/web-spam-taxonomy/" rel="bookmark" title="January 6, 2009">Web Spam Taxonomy</a></li>
</ul>
<p><!-- Similar Posts took 11.310 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/D5f4IX2Ic64" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/frank-takes-on-pagerank/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/frank-takes-on-pagerank/</feedburner:origLink></item>
		<item>
		<title>7 Tips for SEMs to Survive a Recession</title>
		<link>http://feedproxy.google.com/~r/AllThingsSEM/~3/q2aC3wSC18A/</link>
		<comments>http://www.allthingssem.com/tips-for-surviving-a-recession/#comments</comments>
		<pubDate>Sun, 02 Nov 2008 22:09:13 +0000</pubDate>
		<dc:creator>Marios Alexandrou</dc:creator>
		
		<category><![CDATA[SEM and SEO Careers]]></category>

		<guid isPermaLink="false">http://www.allthingssem.com/tips-for-seos-to-survive-a-recession/</guid>
		<description><![CDATA[<div style="float:right; padding-left: 5px; text-align: center;"><img src="http://www.allthingssem.com/images/will-seo-for-food.jpg"/><br clear="all"/><a href="http://blaugh.com/2007/01/02/malcontent-blogger/">Original cartoon from bLaugh</a></div>Even if you don't own property and don't often pay attention to economic-related news, you can't possibly have missed credit woes that many financial institutions are dealing with these days. Not just in the US, but across the globe. But things in the SEO world are just peachy so why worry about such things, right? Well, having been a web developer prior to the dot-com boom and subsequent bust, I highly recommend you prepare for a day when SEO jobs became hard to come by and new contracts all but disappear.<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
]]></description>
			<content:encoded><![CDATA[<div style="float:right; padding-left: 5px; text-align: center;"><img src="http://www.allthingssem.com/images/will-seo-for-food.jpg"/><br clear="all"/><a href="http://blaugh.com/2007/01/02/malcontent-blogger/">Original cartoon from bLaugh</a></div>
<p>Even if you don&#039;t own property and don&#039;t often pay attention to economic-related news, you can&#039;t possibly have missed credit woes that many financial institutions are dealing with these days. Not just in the US, but across the globe. But things in the SEM world are just peachy so why worry about such things, right? Well, having been a web developer prior to the dot-com boom and subsequent bust, I highly recommend you prepare for a day when SEM jobs became hard to come by and new contracts all but disappear.</p>
<p>The first thing you need to accept is that companies don&#039;t necessarily react to bad times in a logical way. The PPC folks would argue that their PPC campaigns will never be canceled because they all have a positive ROI. SEOs are no different and they would argue something along the lines that SEO delivers <em>free</em> traffic and so who would be foolish enough to scale back efforts? Politics, panic, fear, nepotism, and a whole host of other things can conspire against you and leave you without a job. If you accept this possibility, then hopefully you&#039;ll be open to the following suggestions. For those &#034;old timers&#034; out there that have seen a dip or two in their professional lives, feel free to add your suggestions via the comments form.</p>
<h2>1. Diversify Your Skills</h2>
<p>To make it harder for you to be laid off, you need to make yourself more useful than the next guy relative to your compensation level. The more you&#039;re paid, the more value you need to bring to the table. First and foremost, you should be GREAT at your primary responsibilities. Beyond that, having <a href="http://blog.seoptimise.com/2007/12/6-most-marketable-seo-skills.html">some secondary skills</a> could go a long way to increasing your value (perceived or real). For example, if you&#039;re an expert SEO that also has some PPC experience, web development experience, or even sales experience, you&#039;re in a much better position than if SEO is all you do.</p>
<h2>2. Get Involved</h2>
<p>Have you been dragging your feet when assignments come up that are outside of your job description? Now is a good time to change that approach and get involved. Similar to diversifying your skills, diversifying who you work with and what you work on will make you a more valuable resource. Just be careful to not take too much on. Whining about having too much work to do and missing deadlines isn&#039;t going to work all that well.</p>
<h2>3. Save Some Money, OK!?</h2>
<p>This isn&#039;t so you can retire early. Instead, having some backup funds will help you avoid acts of desperation such as jumping ship just when things are turning around or completely moving out of SEM which will make your resume look weaker when the market rebounds. As is often the case with investing, hanging on while others are panicking can result in great opportunities.</p>
<h2>4. Have a Backup Plan</h2>
<p>A backup plan for what you can do when you can&#039;t do SEM should seem like an obvious suggestion. What may not be obvious is the quality of life bonus you get from having such a plan. When you can go to work in the morning knowing that even if you get there and management has changed the locks without telling you, you&#039;ll still be able to pay the bills goes a long way to eliminating stress that would otherwise weigh on you day in and day out.</p>
<h2>5. Additional Income Streams</h2>
<p>A co-worker once asked me if every SEO dreams of building web sites while sitting on a couch and making thousands of dollars via Google AdSense. I laughed and said, &#034;yes&#034;. AdSense may not be right for you, but surely there&#039;s something out there that is of interest that you can use to earn some side money. The key is make it something that doesn&#039;t rely on others paying you e.g. consulting on the side isn&#039;t the a good choice, but running affiliate programs is. An alternate income stream probably won&#039;t disappear at the same time as your primary income stream so it&#039;ll help smooth out any transition and allow you to, as in item 2, avoid the same acts of desperation. Need an example? Check out Gyutae Park who is an <a href="http://www.winningtheweb.com/about/">SEO by day and money-making blogger by night</a>. Need more ideas? Check out this <a href="http://www.doshdosh.com/ways-to-make-money-online/">list from the DoshDosh blog</a> .</p>
<h2>6. Establish Your Network Now</h2>
<p>The more people you know, the more likely you&#039;ll be able to continue working at the things that interested you. It&#039;s important to build this network before you need something though. No one likes to get an e-mail from someone who hasn&#039;t said a peep to you for over a year, but who now needs a job. Participate in the community that piques your interest and build some equity today while things are good. That community could connect you to the right people in troubled times in a way that going through recruiters and using job boards can&#039;t even come close to equaling. In addition, if your current employer is already paying for you to go to conferences, <a href="http://www.seomoz.org/blog/12-tips-to-optimize-your-networking-at-search-conferences">make the most of the networking opportunities</a>.</p>
<h2>7. Bring in a Client</h2>
<p>If you&#039;re at an agency, bringing in a client buys you a get out of jail card. Yes, the revenue from the new client going to have a positive impact on your employer&#039;s bottom-line, but that&#039;s not the only benefit. You&#039;ll also get an added layer of protection because ending your employment would surely become known to the client who may become less inclined to continue the newly formed relationship with your agency.</p>
<p>So there you have it. Seven steps to coming out on the other side of a recession in the SEM market relatively unscathed. I&#039;d like to say that following the steps will be as easy as writing them, but that would be a lie. It takes effort to not become complacent when everything is going well.</p>
<p><em>Note: This post has been expanded from a previous post now that the bottom has really fallen out of the economy.</em>
<p><strong>Related Posts</strong></p>
<li><a href="http://www.allthingssem.com/a-career-in-seo-smart-move/" rel="bookmark" title="March 22, 2007">A Career in SEO? Smart move.</a></li>
<li><a href="http://www.allthingssem.com/3-reasons-why-seo-and-sem-teams-should-work-together/" rel="bookmark" title="November 14, 2006">3 Reasons Why SEO and SEM Teams Should Work Together</a></li>
<li><a href="http://www.allthingssem.com/cant-seo-and-sem-get-along/" rel="bookmark" title="January 28, 2007">Can&#039;t SEO and SEM Get Along?</a></li>
<li><a href="http://www.allthingssem.com/seo-interview-questions/" rel="bookmark" title="December 26, 2007">55 SEO Interview Questions</a></li>
<li><a href="http://www.allthingssem.com/will-trade-food-for-seo/" rel="bookmark" title="November 1, 2007">Will Trade Food for SEO</a></li>
</ul>
<p><!-- Similar Posts took 16.113 ms --></p>
<p>This is a post from the <a href="http://www.allthingssem.com/">All Things SEM and SEO Blog</a> &copy;2009 Marios Alexandrou. All Rights Reserved.</p>
<img src="http://feeds.feedburner.com/~r/AllThingsSEM/~4/q2aC3wSC18A" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.allthingssem.com/tips-for-surviving-a-recession/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.allthingssem.com/tips-for-surviving-a-recession/</feedburner:origLink></item>
	</channel>
</rss>
