<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/rss2full.xsl" type="text/xsl" media="screen"?><?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/itemcontent.css" type="text/css" media="screen"?><!-- generator="wordpress/2.3.2" --><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Judah Phillips at Web Analytics Demystified</title>
	<link>http://judah.webanalyticsdemystified.com</link>
	<description>Judah Phillips, Web Analytics Practitioner at Web Analytics Demystified</description>
	<pubDate>Fri, 18 Jul 2008 22:56:13 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.2</generator>
	<language>en</language>
			<geo:lat>42.370519</geo:lat><geo:long>-71.084434</geo:long><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/judahphillips" type="application/rss+xml" /><feedburner:emailServiceId>860250</feedburner:emailServiceId><feedburner:feedburnerHostname>http://www.feedburner.com</feedburner:feedburnerHostname><item>
		<title>Performance, Performance, Performance</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/339410651/performance-performance-performance.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/07/performance-performance-performance.html#comments</comments>
		<pubDate>Fri, 18 Jul 2008 22:56:13 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Ad Servers]]></category>

		<category><![CDATA[Audience Measurement]]></category>

		<category><![CDATA[Behavioral Targeting]]></category>

		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Methodology]]></category>

		<category><![CDATA[Multichannel]]></category>

		<category><![CDATA[Performance Management]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/07/performance-performance-performance.html</guid>
		<description><![CDATA[From an article I wrote for MediaPost a few weeks ago:
Reach and frequency and the core concepts of traditional media planning and advertising.  For a given site, program, channel, radio station, billboard, newspaper section, a target audience (the reach) is exposed to a certain number of occurrences of the media (the frequency).  On the web, [...]]]></description>
			<content:encoded><![CDATA[<p>From an article I wrote for MediaPost a few weeks ago:</p>
<p>Reach and frequency and the core concepts of traditional media planning and advertising.  For a given site, program, channel, radio station, billboard, newspaper section, a target audience (the reach) is exposed to a certain number of occurrences of the media (the frequency).  On the web, these concepts manifest themselves in metrics collected and reported from a number of recognizable services.  Audience measurement firms, like comScore and Nielsen, web analytics firms, like Omniture and Unica, to companies somewhere in between, like Quantcast and Google, all have reach and frequency data.  Many new media metrics can be used to proxy frequency- from time-based measures, espoused by audience measurement firms, to concepts like visitor retention or the repeat visitor rate cited by web analytics firms.  On the reach side, companies refer to concepts like &#8220;unique visitors.&#8221;</p>
<p>These data, of course, available in free tools or in for pay tools are certainly helpful for planning campaigns.  But reach measures can be dirty (cookies, unduplicated unique users, estimates from panels, coverage error).  Frequency measures can be just as dirty (problems recording time in single page visits or visits on the last page, do page views really matter with AJAX and rich media, cookies again, and so on).  We all are aware of the challenges.</p>
<p>Thus using basic reach and frequency measures for planning or evaluating a campaign does not suffice.   So advertisers and agencies target demographics, like gender, age, income, education, and job title.  It&#8217;s a given that advertising in the Robb Report reaches a different audience segment than advertising in Popular Mechanics. </p>
<p>These brave new days we have &#8220;behavioral&#8221; tracking too.  By taking into account visitor activity across sessions, such as past actions taken on a site or a roster of previous purchases, we can attempt to deduce what a person or segment responds to or is interested in based on their behavior.</p>
<p>Even with reach, frequency, demographics, and behavioral data to help guide advertising and media buying, we are missing an important attribute for maximizing the potential success of our campaigns.  We do not have an available tool, whether free or paid, for advertising or buying media on or across sites according to measures of past performance.  Such measures include ad clickthrough rates, conversion rates, goal completion rates, delivered impressions, and perhaps even harder to quantify financial measures such as ROI, ROAS, and ROMI.</p>
<p>Sure, historic, tacit knowledge of campaign performance exists and is used by agencies or publishers.  However, there is no shared industry source that can help us answer &#8220;how has a site for display advertisement historically performed toward goals based on the reach, frequency, demographic and behavior of its audience segments?&#8221;  Interestingly, a company minting money right now, named Google, can masterfully demonstrate performance in paid search campaigning and help advertisers unify it with segmented reach, frequency, and demographics.</p>
<p>Outcomes based performance measurement unified with reach, frequency, demographics, and behavior is what is missing in audience measurement tools, not frequently reported externally by web analytics tools or ad serving tools, and not available in ad planning tools.  When advertisers can target display ads, or even video ads, to desired audience segments by reach, frequency, demographics, behavior in the context of known performance, media planning will be more effective.  </p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=R202QJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=R202QJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=bn6LmJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=bn6LmJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=hA95kj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=hA95kj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=lRLiaj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=lRLiaj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=lITk6J"><img src="http://feeds.feedburner.com/~f/judahphillips?i=lITk6J" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=nrkaQj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=nrkaQj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=GTUnFJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=GTUnFJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=mXpLej"><img src="http://feeds.feedburner.com/~f/judahphillips?i=mXpLej" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=EgRhHJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=EgRhHJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=HdUXxj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=HdUXxj" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/339410651" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/07/performance-performance-performance.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/07/performance-performance-performance.html</feedburner:origLink></item>
		<item>
		<title>X Change: X Citing X Cogitation!!</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/332436653/xchange-xciting-xcogitation.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/07/xchange-xciting-xcogitation.html#comments</comments>
		<pubDate>Fri, 11 Jul 2008 06:25:14 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Conferences]]></category>

		<category><![CDATA[General]]></category>

		<category><![CDATA[Measurement]]></category>

		<category><![CDATA[Other People]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/07/xchange-is-xciting.html</guid>
		<description><![CDATA[Alright, I had to have fun with the title. :) We&#8217;re about 4 weeks ago from the newest and most unique analytics conference on the scene: X Change, hosted this year by Semphonic and Web Analytics Demystified. 
If you missed the first year in Napa, you gotta head to San Fran this year!  Allow me to explain how [...]]]></description>
			<content:encoded><![CDATA[<p>Alright, I had to have fun with the title. :) We&#8217;re about 4 weeks ago from the newest and most unique analytics conference on the scene: <a href="http://www.semphonic.com/conf/index.asp">X Change</a>, hosted this year by <a href="http://www.semphonic.com">Semphonic</a> and <a href="http://www.webanalyticsdemystified.com">Web Analytics Demystified</a>. </p>
<p>If you missed the first year in Napa, you gotta head to San Fran this year!  Allow me to explain how X Change differentiates as I see it:</p>
<ul>
<li><strong>Conversational</strong>. You don&#8217;t sit in a room and listen to people drone on in front of their powerpoints.  People sit in Socratic circles and talk about a topic of interest in &#8220;huddles.&#8221;  The huddle leader will bring up a topic, perhaps riff on some hard-learned experience or data point related to the topic, and ask for commentary from the participants.  The conversation then flows, like Jazz, until there&#8217;s a cadence, then the huddle leader phrases a few more notes and progression begins again&#8230;  Its atypical format depends on participants for success.  No one is going to sit there and read you slides and provide one-sided opinions.  You won&#8217;t just be sitting there listening (unless you want to).  The best huddles are interactive and encourage active participation in the pursuit of shared knowledge, not passive reception of an individual&#8217;s knowledge.</li>
<li><strong>Focused</strong>.  The huddle topics are highly specific and deeply relevant to the real world practice of web analytics today - from attribution to mobile measurement to integration to privacy to team structure, the huddle leaders selected topics that interest them to share with the participants. The focused conversational format should lead to symbiotic exchanges of information directly relevant to your job.</li>
<li><strong>Small</strong>.  100 people, 20 huddle leaders.  You get to make meet interesting people and build working relationships with them.  Cool folks like <a href="http://www.bobpage.net">Bob Page</a>, <a href="hhttp://www.linkedin.com/profile?viewProfile=&amp;key=11360164&amp;fromSearch=0&amp;sik=1215754218884&amp;split_page=1&amp;rd=in&amp;authToken=4q47jSJGqZNiN2OJBKwNt98gR91hldvhkR1jAcPdQp5cz8UdzsOdPgScj0ScP4N&amp;authType=NAME_SEARCH&amp;goback=%2Esrp_1_1215754218884_in">Rachel Scotto</a>, <a href="http://www.theanalyticsguru.com">Marshall Sponder</a>, <a href="http://weblogs.jupiterresearch.com/analysts/jlovett/">John Lovett</a>, <a href="http://www.facebook.com/people/Jared_Waxman/314129">Jared Waxman</a>, &#8220;Bob&#8221; <a href="http://www.linkedin.com/profile?viewProfile=&amp;key=4260798&amp;fromSearch=0&amp;sik=1215754218068&amp;split_page=1&amp;rd=in&amp;authToken=1bnqnimVunpWtY_esrKgrUi4digkljnQldgkUQhkgQc3cRdPwQgPoUejsMdz8Q&amp;authType=NAME_SEARCH&amp;goback=%2Esrp_1_1215754218068_in">Dylan Lewis</a> will be leading huddles and hanging out.  The Web Analytics Tuesday event will probably be bigger than the whole X Change conference!</li>
<li><strong>Exclusive</strong>.  The huddle leaders were hand selected.  In attendance will be industry leaders, corporate executives, industry analysts.  All of the attendees work with analytics.  And for gosh sake, it is at the Ritz in one of America&#8217;s most beautiful and eccentric cities. </li>
</ul>
<p>I think X Change is a unique experience and a worthwhile event where you get to really connect, and well, exchange (!) expertise with your peers and go home with new knowledge.  At least I did last year.  I&#8217;ll be leading a couple of huddles, one of the web analytics team and one on knowing when you&#8217;ve outgrown you analytics tool, so say hello when you see me. </p>
<p><a target="_blank" href="http://www.semphonic.com/conf/"><font color="#336699">Make sure you check out the official web site at Semphonic and sign up today.  The event will sell out soon.  15% discounts are available for Web Analytics Association members.</font></a> </p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=hDT3hJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=hDT3hJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=FkOtIJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=FkOtIJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=M2s0oj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=M2s0oj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=5A90mj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=5A90mj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=oknbGJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=oknbGJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=npSnCj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=npSnCj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=Iku4CJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=Iku4CJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=Jx4Phj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=Jx4Phj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=85AuDJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=85AuDJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=QIiXLj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=QIiXLj" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/332436653" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/07/xchange-xciting-xcogitation.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/07/xchange-xciting-xcogitation.html</feedburner:origLink></item>
		<item>
		<title>AVG Fixes LinkScanner!!</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/329004497/avg-fixes-linkscanner.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/07/avg-fixes-linkscanner.html#comments</comments>
		<pubDate>Mon, 07 Jul 2008 11:02:59 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Measurement]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<category><![CDATA[Web Analytics Tools]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/07/avg-fixes-linkscanner.html</guid>
		<description><![CDATA[AVG has released an updated version that corrects the LinkScanner bot issue (build 138, July 4), which we&#8217;ve all noticed slamming our servers and analytics data over the last several weeks:

We have modified the Search-Shield component of the product to
only notify users of malicious sites.Search-Shield no longer
scans each search result online for new exploits, which was
causing the [...]]]></description>
			<content:encoded><![CDATA[<p>AVG has released an <a href="http://www.avg.com.au/index.cfm?section=news&amp;feature=104">updated version that corrects the LinkScanner bot issue</a> (build 138, July 4), which we&#8217;ve all noticed slamming our servers and analytics data over the last several weeks:</p>
<blockquote>
<pre>We have modified the Search-Shield component of the product to
only notify users of malicious sites.Search-Shield no longer
scans each search result online for new exploits, which was
causing the spikes that web masters addressed with us. However,
it is important to note that AVG still offers full protection
against potential exploits through the Active Surf-Shield
component of our product, which checks every page for malicious
content as it is visited, but before it is opened.</pre>
</blockquote>
<p>As you&#8217;ve just read in the quote above, AVG has stopped scanning each page that returns in a SERP for users of their free tool.  Instead pages will be scanned by proxy after a user clicks on the link. </p>
<p>For paid users, it&#8217;s a little different.  SERP&#8217;s will still be scanned but via a pure database approach (not the DDOS approach :), which means the sites listed in SERP&#8217;s will be compared to a black list of known &#8220;bad&#8221; sites.  The blacklist is based on internal AVG research and from the real-time results reported by users who have opted-into AVG&#8217;s &#8220;prevalence reporting system&#8221; (a feature of AVG 8).  This means AVG is still scanning sites, but on a very limited basis, thus the detrimental effects on analytics should be very minimal and only caused by users who participate in prevelance reporting.  Still some data pollution will occur&#8230;  </p>
<p>AVG hasn&#8217;t confirmed that they&#8217;ve released a fix to the <a href="http://judah.webanalyticsdemystified.com/2008/06/update-on-avg-linkscanner.html">&#8220;noscript&#8221; issue I mentioned</a>.  I do know they are working on it and have fixed the problem in internal builds.  Regardless, if the LinkScanner is working in the way they say it is, the problem will be negligible (but some data pollution will still occur ;).</p>
<p>Kudos to AVG Corporate, Roger Thompson, Pat Bitton, Greg Mosher, and all the other engineers who listened to the community on the web and worked quickly to fix the problem.  Now let&#8217;s hope the the build 138 update works as described. Time will tell.</p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=2ELgYJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=2ELgYJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=7IYQkJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=7IYQkJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=uCj5Zj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=uCj5Zj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=HFLl8j"><img src="http://feeds.feedburner.com/~f/judahphillips?i=HFLl8j" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=CT6DUJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=CT6DUJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=cfP8Zj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=cfP8Zj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=yww7sJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=yww7sJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=e9RYvj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=e9RYvj" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=m3E0kJ"><img src="http://feeds.feedburner.com/~f/judahphillips?i=m3E0kJ" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=xi3arj"><img src="http://feeds.feedburner.com/~f/judahphillips?i=xi3arj" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/329004497" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/07/avg-fixes-linkscanner.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/07/avg-fixes-linkscanner.html</feedburner:origLink></item>
		<item>
		<title>AVG LinkScanner Obfuscates User Agent!</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/323425503/avg-linkscanner-obfuscates-user-agent.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-obfuscates-user-agent.html#comments</comments>
		<pubDate>Mon, 30 Jun 2008 11:19:48 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Due Diligence]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Reporting]]></category>

		<category><![CDATA[Spiders and Bots]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<category><![CDATA[Web Analytics Tools]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-obfuscates-user-agent.html</guid>
		<description><![CDATA[AVG has obfuscated their user agent.  One of the current agents for customers of their free and paid tool now cloaks itself as IE6:

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)

In addition to the easily detectable user agents:

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813)
User Agent:Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813)  
User Agent:Mozilla/4.0 (compatible; MSIE 6.0; [...]]]></description>
			<content:encoded><![CDATA[<p>AVG has obfuscated their user agent.  One of the current agents for customers of their free and paid tool now cloaks itself as IE6:</p>
<blockquote>
<pre>Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)</pre>
</blockquote>
<p>In addition to the easily detectable user agents:</p>
<blockquote>
<pre>Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813)</pre>
<pre>User Agent:Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813)  </pre>
<pre>User Agent:Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)</pre>
</blockquote>
<p>This news is not good.  If you filter SV1 agent, you risk filtering legitimate traffic from the IE6 browser.  A few folks have commented to me that one should filter the user agent anyway, because 1) IE6 is in decline and 2) most IE6 users have .NET installed, which will show in the user agent.  Still filtering it makes me a little uneasy.</p>
<p>Is this the death toll for log file analysis and services provided by ABCe (since they can&#8217;t filter this user agent either)?  Maybe it is.  AVG is touting that agent lacks HTTP Accept-Encoding, which is just dandy, but that information isn&#8217;t normally captured in logs.</p>
<p>So the current situation is this:</p>
<ol>
<li><strong>AVG has two user agents</strong>.  Both are filterable, but the SV1 agent is problematic to filter because you risk filtering legitimate traffic.</li>
<li><strong>Both agents in the current version request gifs in noscript tags, inflating counts in page tag implementations with noscript configurations</strong>.  AVG claims they will fix this issue.</li>
<li><strong>The bot uses&#8221;mad&#8221; bandwidth</strong>.  I&#8217;ve heard stories of bandwidth increasing 100x normal levels.  Some webmasters are serving dummy files to the recognizable user agents, some aren&#8217;t serving content to IE 6 browsers (crazy), and some are redirecting the bot back to AVG (thus inflating AVG&#8217;s bandwidth, LOL!).</li>
<li><strong>Evidence points to this bot NOT inflating clicks from paid</strong> <strong>search (i.e. PPC) and thus NOT committing click fraud.</strong>   But it doesn&#8217;t remain out of the realm of possibility that the scanner may be accessing an ad vendor click redirector and causing a click.  Not trying to spread FUD here, just making a point. </li>
<li><strong>AVG is looking at option of checking either an external db (hosted by AVG) or a local cache to verify sites in SERP&#8217;s have been &#8220;scanned by AVG,&#8221;</strong> instead of repeatedly scanning sites every time they are listed in SERP, to reduce the bandwidth issue and minimize fraudulent entries in log files.</li>
<li><strong>AVG is thinking about enabling white listing of sites</strong>, so they are skipped by the scanner.</li>
<li><strong>AVG is thinking about exposing a meta-tag</strong> that instructs the scanner to ignore the site.</li>
</ol>
<p>Good luck with this nasty bot!  <a href="http://www.avg-watch.org/">Interestingly, here&#8217;s how you smurf a site with the AVG LinkScanner. </a></p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=iEwbtI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=iEwbtI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=BHnlfI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=BHnlfI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=behpdi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=behpdi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=n8znOi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=n8znOi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=UfrriI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=UfrriI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=fdFsAi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=fdFsAi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=SrXvII"><img src="http://feeds.feedburner.com/~f/judahphillips?i=SrXvII" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=ta2C6i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=ta2C6i" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=iGNyII"><img src="http://feeds.feedburner.com/~f/judahphillips?i=iGNyII" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=ZRupOi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=ZRupOi" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/323425503" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-obfuscates-user-agent.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-obfuscates-user-agent.html</feedburner:origLink></item>
		<item>
		<title>Update on AVG LinkScanner</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/320731361/update-on-avg-linkscanner.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/06/update-on-avg-linkscanner.html#comments</comments>
		<pubDate>Thu, 26 Jun 2008 11:17:22 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Due Diligence]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Reporting]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/06/update-on-avg-linkscanner.html</guid>
		<description><![CDATA[Here&#8217;s the deal.  AVG LinkScanner doesn&#8217;t execute javascript nor take cookies.  I had that confirmed by the Chief Research Officer at AVG, Roger Thompson. 
So why is the AVG user agent showing up in that data collected from certain page tag configurations?  The AVG LinkScanner currently requests gifs in noscript tags!
A best practice in web analytic&#8217;s page tag configuration is [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s the deal.  AVG LinkScanner doesn&#8217;t execute javascript nor take cookies.  I had that confirmed by the Chief Research Officer at AVG, Roger Thompson. </p>
<p>So why is the AVG user agent showing up in that data collected from certain page tag configurations?  The AVG LinkScanner currently requests gifs in noscript tags!</p>
<p>A best practice in web analytic&#8217;s page tag configuration is to use the noscript tag to serve the gif to non-javascript executing browsers.  Here&#8217;s some commonly seen (obscured) code for doing that:</p>
<blockquote>
<pre>&lt;<font color="#800000">noscript&gt;
&lt;div&gt;&lt;img alt=&#8221;foo&#8221; id=&#8221;bar&#8221; width=&#8221;1&#8243; height=&#8221;1&#8243; src=&#8221;http://
foo.bar.com/xyzab57yw10000s1s8g0boozt_9t1x/foo.gif?baruri=/
nojavascript&amp;xy.js=No&amp;xy.tv=1.2.3&#8243; mce_src=&#8221;http://
foo.bar.com/xyzab57yw10000s1s8g0boozt_9t1x/foo.gif?baruri=/
nojavascript&amp;xy.js=No&amp;xy.tv=1.2.3&#8243;div&gt;
&lt;/noscript&gt;</font></pre>
<pre><font color="#800000">&lt;NOSCRIPT&gt;
&lt;IMG
src=&#8221;//foo.bar.com/xyz.gif?Log=1&amp;URL=/javascript_disabled&#8221; mce_src=&#8221;//foo.bar.com/xyz.gif?Log=1&amp;URL=/javascript_disabled&#8221;
</font><font color="#800000">BORDER=&#8221;0&#8243; WIDTH=&#8221;1&#8243; HEIGHT=&#8221;1&#8243; /&gt;
&lt;/NOSCRIPT&gt;</font></pre>
<pre><font color="#800000">&lt;noscript&gt;
&lt;img src=http://pt.foobar.com/images/xyz.gif?js=0</font><font color="#800000">&#8221; height=&#8221;1&#8243;
width=&#8221;1&#8243;
border=&#8221;0&#8243; hspace=&#8221;0&#8243; vspace=&#8221;0&#8243; alt=&#8221;"&gt; </font></pre>
</blockquote>
<p>Thus, if you are using noscript tags in your page tag *and* someone with the AVG Linkscanner views a SERP (search engine results page)  from Google/Yahoo/MSN that lists your site, the traffic from the LinkScanner will be counted. </p>
<p>Of course the simple solution to fix this problem is to exclude the user agent: </p>
<blockquote>
<pre>Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813)</pre>
</blockquote>
<p>If don&#8217;t have full control over your page tag based web analytics implementation (i.e. hosted), you need to verify that your vendor has excluded this agent.   And you should have them audit your data going back to April, and refund/credit you any money.  Good luck with that though! <img src='http://judah.webanalyticsdemystified.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>How big is the problem?  Well, it depends! <img src='http://judah.webanalyticsdemystified.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>The amount of AVG traffic will vary dramatically by site.  Your site must show up in the SERP&#8217;s on computers of visitors that have AVG LinkScanner installed, and you must be using noscript tags to serve the gif.</p>
<p>I&#8217;ve made AVG aware of this issue.  And frankly, they&#8217;ve been a fantastic company to work with, so I&#8217;m sticking with them (for now ;).  First they allowed me to join a private Google group to discuss my findings, both the Head of Global Communications and Chief Research Officer quickly responded to all my emails (good social media response), and their engineers are looking into this issue so that they can fix it&#8230;  That&#8217;s pretty impressive and quick response.  So cheers to them!</p>
<p>It&#8217;s worth mentioning that the LinkScanner isn&#8217;t _<em>supposed</em>_ to request images, so I do think this issue will get fixed.</p>
<p>Only time will tell whether or not AVG obfuscates the user agent so it looks just like a &#8220;normal&#8221; browser.  Let&#8217;s hope not! </p>
<p>What I do find interesting is that I&#8217;m already hearing that an agent exists with the string (Mozill<strong>ia</strong><font color="#800000">/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813). Note the &#8220;ia&#8221; mispelling of Mozilla as <a href="http://tech.blorge.com/Structure:%20/2008/06/14/avg-throttles-web-analytics/">incorrectly documented here</a>.  And it accepts cookies.  So AVG&#8217;s agent is already being spoofed.  Not good, not good.</font></p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=yO7t5I"><img src="http://feeds.feedburner.com/~f/judahphillips?i=yO7t5I" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=S8HiPI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=S8HiPI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=4oPFvi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=4oPFvi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=6YeCPi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=6YeCPi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=AdU8NI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=AdU8NI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=B9rxIi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=B9rxIi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=XEnPfI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=XEnPfI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=1UYadi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=1UYadi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=zwgyJI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=zwgyJI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=yHfani"><img src="http://feeds.feedburner.com/~f/judahphillips?i=yHfani" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/320731361" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/06/update-on-avg-linkscanner.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/06/update-on-avg-linkscanner.html</feedburner:origLink></item>
		<item>
		<title>AVG LinkScanner Bot Executes JavaScript?!?</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/316743159/avg-linkscanner-bot-executes-javascript.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-bot-executes-javascript.html#comments</comments>
		<pubDate>Sat, 21 Jun 2008 07:11:30 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[General]]></category>

		<category><![CDATA[Log File]]></category>

		<category><![CDATA[Measurement]]></category>

		<category><![CDATA[Page Tag]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Spiders and Bots]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[Web Analytics Tools]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/06/avg-link-scanner-executes-javascript.html</guid>
		<description><![CDATA[The  well-researched answer is &#8220;no.&#8221;  The AVG LinkScanner Bot appears to prefetch the js and the gif (and pretty much everything else on the page), which for certain tools and their tag configurations generates false page views and visits (and the derivatives thereof), just like it&#8217;s &#8220;legitimate&#8221; traffic. 
If your tag configuration is set up with [...]]]></description>
			<content:encoded><![CDATA[<p>The  well-researched answer is &#8220;no.&#8221;  The AVG LinkScanner Bot appears to prefetch the js and the gif (and pretty much everything else on the page), which for certain tools and their tag configurations generates false page views and visits (and the derivatives thereof), just like it&#8217;s &#8220;legitimate&#8221; traffic. </p>
<p>If your tag configuration is set up with noscript tags, AVG will fetch the content in the tags, including the gif, which means that:</p>
<ul>
<li>The bot may be infesting the data of customers of web analytics vendor who configure page tag-based data collection in this way. </li>
<li>The bot may be inflating the data in such products/services offered by various web analytics companies.</li>
<li>Customers may be paying for server calls generated by this bot.</li>
</ul>
<p>Vendors, of course, could easily filter the user agent to protect their customers:</p>
<blockquote>
<pre><font color="#800000">Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813) </font></pre>
</blockquote>
<p>But I haven&#8217;t heard a peep from any SaaS vendors about excluding the user agent, filtering already collected data, or refunding customers the cost of robotically generated server calls (regardless of AVG). Have you?</p>
<p>Think about this: many SaaS page tag vendors don&#8217;t provide detailed visitor-level data and user agent reporting.  That means that their customers have no ability to investigate this bot or detect it by filtering their reported data by the the true user agent.</p>
<p>I&#8217;ve been talking about <a href="http://judah.webanalyticsdemystified.com/2007/07/part-1-spiders-bots-page-views-and-web-analytics-oh-my.html">JS executing bots screwing with web data for about a year now</a>.  <a href="http://www.seomoz.org/ugc/bad-bots-confound-web-analytics-by-executing-javascript-tags">SEOMoz</a> and the folks at <a href="http://www.slicksurface.com/blog/2008-05/ways-to-identify-bad-bots-that-execute-analytics-tags">SlickSurface</a> confirmed it quite recently (quoting me no less in their fantastic analysis).  So they do exist&#8230;</p>
<p>Now let me tell you a little story.  Once upon a time I was at a conference called eMetrics when the CEO of a company came up to me and said &#8220;hey I read your blog about bot detection, and I looked in my web metrics tool for traffic with high page view to visit ratios.&#8221;  Then he narrated a story to me about how he found a bunch of traffic that had page view to visit ratios of 5,000 to 1.&#8221;  I said &#8220;do you use page tags&#8221; He said &#8220;that&#8217;s all my vendor provides, so yeah.&#8221;  And I said &#8220;you&#8217;ve found a javascript executing bot in your data.&#8221;  &#8220;I know&#8221; he said. &#8220;Well did you call your vendor and let them know?&#8221;  I said.  Now for the punch line:  he told me that the vendor (who shall remain nameless) told him &#8220;well, the traffic executed server calls&#8221;  And they wouldn&#8217;t give him a refund!</p>
<p>It&#8217;s worth mentioning that this bot definitely affects log file tools and packet sniffer tools.  Both must be configured to filter the AVG LinkScanner user agent.</p>
<p>Now here&#8217;s the rub for me.  I use AVG!!!  But I now find it increasingly difficult to support the company or continue using their products.  Why?  Because they are wearing a &#8220;bad hat&#8221; here:</p>
<ul>
<li><strong><a href="http://www.theregister.co.uk/2008/06/13/avg_scanner_skews_web_traffic_numbers/">First, they are fully aware of the affect of this bot on web analytics systems</a></strong>. <strike>They just don&#8217;t seem to care (yet).</strike>  UPDATE:  They have set up a Google Group to discuss this issue.  They must understand how companies of all types in all sectors use web analytics data to optimize their sites, set their marketing budgets, determine expected server load, and much more.  What do their Internet Marketers think? </li>
<li><strong>Second, the Link Scanner tool may have a short shelf life and may offer limited protection.</strong>  Malware creators will easily adjust. Check out what my friend Steve McInerney, a very smart security expert, said on the <a href="http://tech.groups.yahoo.com/group/webanalytics/message/17858">Web Analytics Association&#8217;s Yahoo Forum</a>:</li>
</ul>
<blockquote>
<blockquote>
<pre><font color="#800000">What strikes me about this particular solution by AVG is how
incredibly &#8230; stupid it is on several fronts.</font></pre>
<pre><font color="#800000">1. Noticeably impacting a users bandwidth is, technically, a security
breach in the first place, aka Denial of Service Attack.</font></pre>
<pre><font color="#800000">2. Some of us live in countries that have rather severe bandwidth
charges/limits and the like, whom shall I send my excess bandwidth
bill to?</font></pre>
<pre><font color="#800000">&#8230;(this) method is fundamentally
flawed. ie malware ignores any first request and only infects on a
second request - alternate cloaking. Whatever. This type of &#8220;solution&#8221;
only provides weak protection for a strictly limited period of time.</font></pre>
<pre><font color="#800000">&#8230;not just &#8220;no security&#8221; but bad
security. Because folk feel they are being protected when they are
not, and hence will take greater risks and hence inflict greater harm
on themselves. <img src='http://judah.webanalyticsdemystified.com/wp-includes/images/smilies/icon_sad.gif' alt=':-(' class='wp-smiley' /> </font></pre>
<pre><font color="#800000">Ignoring the balance of positive to harm that this problem inflicts on
the users who use this product.</font></pre>
</blockquote>
</blockquote>
<ul>
<li><strong>Third, AVG just doesn&#8217;t seem to &#8220;get it&#8221; yet.</strong>  They are potentially messing with the ability to drive commerce via data driven decision making, e-commerce analytics, site optimization, and online media measurement!  To quote <a href="http://www.theregister.co.uk/2008/06/19/avg_linkscanner_and_adwords/">The Register</a> &#8220;chief of research Roger Thompson - who designed the AVG LinkScanner - <strong><font color="#ff0000"><em><u>indicated he may do away with that unique user agent</u></em></font></strong>. His chief concern is security, and he doesn&#8217;t want webmasters or malware writers gaming his scanner. &#8220;In order to detect the really tricky - and by association, the most important - malicious content, we need to look just like a browser driven by a human being,&#8221; he argues.</li>
</ul>
<p>WebMasterWorld has some good stuff about to say <a href="http://www.webmasterworld.com/search_engine_spiders/3674410.htm">here</a>.  Read the Register&#8217;s first article <a href="http://www.theregister.co.uk/2008/06/13/avg_scanner_skews_web_traffic_numbers/">here</a>.  And check out the dude&#8217;s blog who <a href="http://osblues.com/2008/06/03/avg-destroys-web-analytics/">broke the news first</a> and responses from AVG <a href="http://osblues.com/2008/06/14/contact-from-avg/">here</a> and <a href="http://osblues.com/2008/06/18/avgs-roger-thompson-gets-in-touch/">here</a>.</p>
<p>Interesting stuff. So what do you all think? Have you seen evidence of this bot in user agent data from your page tag solutions that use the noscript tag for the image? </p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=9pK9tI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=9pK9tI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=IJSCbI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=IJSCbI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=h0ieLi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=h0ieLi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=B7BI1i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=B7BI1i" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=hG9IRI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=hG9IRI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=dP0Oai"><img src="http://feeds.feedburner.com/~f/judahphillips?i=dP0Oai" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=u5GYUI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=u5GYUI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=9Mvaqi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=9Mvaqi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=DVj8NI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=DVj8NI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=szik4i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=szik4i" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/316743159" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-bot-executes-javascript.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/06/avg-linkscanner-bot-executes-javascript.html</feedburner:origLink></item>
		<item>
		<title>Sunday Night Thinking on Mobile Analytics…</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/312646596/thinking-on-mobile-analytics.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/06/thinking-on-mobile-analytics.html#comments</comments>
		<pubDate>Mon, 16 Jun 2008 00:11:16 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Due Diligence]]></category>

		<category><![CDATA[Measurement]]></category>

		<category><![CDATA[Mobile]]></category>

		<category><![CDATA[Mobile Analytics]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<category><![CDATA[Web Analytics Tools]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/06/thinking-on-mobile-analytics.html</guid>
		<description><![CDATA[Mobile analytics for Internet-enabled wireless devices is a fairly hot topic for companies seeking to acquire customers, extend their brand, or expose content in &#8220;innovative&#8221; ways.  Obviously, the iPhone and Blackberry are pushing development in this area forward, but there really aren&#8217;t a lot of players in this space. 
Nedstat, CoreMetrics, and Omniture offer capabilities mixed [...]]]></description>
			<content:encoded><![CDATA[<p>Mobile analytics for Internet-enabled wireless devices is a fairly hot topic for companies seeking to acquire customers, extend their brand, or expose content in &#8220;innovative&#8221; ways.  Obviously, the iPhone and Blackberry are pushing development in this area forward, but there really aren&#8217;t a lot of players in this space. </p>
<p><a href="http://www.nedstat.com">Nedstat</a>, <a href="http://www.coremetrics.com">CoreMetrics</a>, and <a href="http://www.omniture.com">Omniture </a>offer capabilities mixed into their current offerings.  Nedstat even carves out some mobile specific reporting.  You can gain some insight into mobile activity from companies that enable log file processing, like <a href="http://www.unica.com">Unica</a> and <a href="http://www.webtrends.com">WebTrends</a>, but be prepared to configure a bunch of filters to isolate the data.</p>
<p>Lesser known companies pushing mobile offerings include: <a href="http://www.amethon.net">Amethon</a>, <a href="http://www.mobilytics.net/">Mobilytics</a>, <a href="http://www.bango.com">Bango</a>, <a href="http://www.tigtags.com">TigTags</a>, <a href="http://www.xiti.com">Xiti</a>, and <a href="http://www.admob.com">AdMob</a>.  Some of these mobile players are even offering capabilities where they cross-sell analytics as an integrated part of their ad networks, content delivery  and transactional processing systems, marketing and barcoding services, and even as infrastructure or network appliances.</p>
<p>On the audience measurement side, we&#8217;ve seen <a href="http://www.comscore.com">comScore</a> acquire M:Metrics, which was no surprise to me.</p>
<p>On the multivariate testing side, we see my friends at <a href="http://www.sitespect.com">SiteSpect</a> offering mobile MVT testing capabilities. </p>
<p>And I&#8217;ll bet we see Google get into this space within the next 6 months&#8230;  I&#8217;d even wager an announcement at eMetrics DC&#8230;</p>
<p>From what I can gather, when we&#8217;re talking about &#8220;mobile analytics&#8221; we&#8217;re talking about &#8220;mobile browser&#8221; activity across a variety of handsets, not everything that happens on the device. </p>
<p>Measurement issues in this area include:</p>
<ul>
<li><strong>Data Collection.</strong>  As many of you know, not all mobile browsers will execute javascript.  They cached the imagesThus, vendors offer us choices.  Folks like Mobilytics and Bango use an image-based data collection method, while Amethon offers a packet sniffer (they call it wireline detection), and we even have Omniture and Coremetrics talking about &#8220;no tag&#8221; implementations - what my good friend <a href="http://wam.typepad.com/">Phil Kemelor</a> mentioned on his <a href="http://www.cmswatch.com/Trends/1271-The-challenge-of-mobile-analytics----Part-2">CMS Watch blog</a> (&#8221;To compensate, you need to stuff the image tag with query strings that will collect the data you require for reporting.&#8221;)  Then we have Unica and WebTrends with log files.  Interestingly, packet sniffing has some advantages here because some devices pass unique id&#8217;s (such as the phone number) in the HTTP header or other unique id&#8217;s.</li>
<li><strong>Unique visitor identification due to lack of cookie support and IP addresses changing.</strong>  IP addresses change, I&#8217;m told, as they switch from tower to tower.   In addition many mobile devices will take the IP address of the gateway, making all the devices look the same &#8220;person.&#8221;  I&#8217;ve certainly seen evidence of the host changing pretty quickly during a mobile session. Compounding the difficulty in assessing &#8220;uniqueness&#8221; is that not all mobile devices support cookies.  In web analytics, cookies are used to define uniqueness.  The fallback method when you can&#8217;t use a cookie is IP address/user agent.  If you can&#8217;t set cookies and the IP address and user agents are the same, how do you identify uniqueness?   However, when you can detect a unique value in the header, you can easily detect uniqueness.</li>
<li><strong>Handset capability detection</strong>.  Does the device support WAP pushing, streaming video, ringtones, downloading video clips, and so on?</li>
<li><strong>Phone and Manufacturer identification</strong>.  Database from <a href="wurfl.sourceforge.net">WURFL</a> and <a href="http://deviceatlas.com/">DeviceAtlas</a> can be used to identify phone and manufacturer device attributes.  Larger vendors are further behind on integrating this data into their current offerings, whereas the smaller niche players are making use of it. </li>
<li><strong>Screen resolution detection</strong>.  The <a href="http://www.mmaglobal.com/">Mobile Marketing Association&#8217;s (MMA)</a> standards for the four &#8220;standard&#8221; screen sizes may carry enough weight to push this disdained piece of metrics trivia available from javascript based tagging in web analytics into a brighter spotlight.</li>
<li><strong>Traffic source detection</strong>.  Capabilities for traffic sources seem rudimentary.  I don&#8217;t just want to know about search and direct entry.  But I want detection of sources from my marketing and advertising campaigns, rss feeds, and email newsletters, if mobile visitors are coming in from those channels.   Interestingly, Bango solves the campaign tracking issue by pushing you to a Bango-specific URL.</li>
<li><strong>Geographic identification</strong>.  Where are the visitors viewing your site coming from?  And what does the mobile audience environment &#8220;look like&#8221; in each country.  From this information you can extrapolate country-specifics for site optimization.  But not all devices enable geographic detection because the gateway&#8217;s IP address is used or the IP address from the network is used, not a GPS signal.</li>
<li><strong>No standards.</strong>  There are few, if any, commonly supported mobile standards and no web data standards, so the problem is no standards for the devices and no standards for the tools.  There are no standards.  Did I mention that there are no standards. </li>
</ul>
<p>So I was thinking, what would I want to see in a mobile analytics solution?  Allow me to riff here.</p>
<ul>
<li><strong>Dashboards for KPI and specific-metric reporting</strong>.  Views, visits, visitors, referrers, popular pages, traffic sources, resolutions, geography, time-based reporting and custom defined KPI&#8217;s&#8230;.</li>
<li><strong>Support for multiple data collection methods</strong>.  Logs, no-js image tags , and packet sniffers.  Let me pick what I need for whatever application fits my goals.</li>
<li><strong>Support for mobile-specific constructs not present in historic web analytics data</strong>.  Manufacturers, operators, handsets, and device capabilities.</li>
<li><strong>Advertising-based reports</strong>.  CTR, CPM, eCPM, that stuff&#8230;</li>
<li><strong>Tracking for mobile downloads, installed applications, SMS, and MMS</strong>.  Seems like a no-brainer.</li>
<li><strong>API&#8217;s</strong>.  Closed systems are dead ends for integrated marketing, so give me an API or enable pre-built integrations with other systems, like CRM.</li>
<li><strong>Segmentation</strong>.  By country, by device, by network, by manufacturer, and so on.  It&#8217;s necessary.</li>
<li><strong>Repeat or return visitor identification</strong>.  Simple measures of recency and frequency, core to media buying and planning and to site optimization, should be a data point available in mobile analytics.</li>
<li><strong>Conversion and goal metrics</strong>.  Do visitors on mobile devices convert better, worse, the same?  Do they reach site goals?  Without tying performance data  and outcomes to mobile visitor activity, I&#8217;m left wondering&#8230;</li>
<li><strong>Value scoring for engagement or proxy scoring for revenue and ROI analysis</strong>.  I want to be able to score attributes or actions to approximate an engagement score or to identify value or indicate revenue. </li>
<li><strong>Non-human traffic and web-browser based detection and reporting.</strong>  Mobile pages are full of links.  The ads are links.  Mobile vendors must support detecting, filtering, and reporting, non human and web-based agents from pure mobile agents - otherwise the mobile data gets muddled and skewed.</li>
<li><strong>Data Export.</strong>  Must be able to export reports to Excel or Word, and email them.</li>
</ul>
<p>So there&#8217;s a quick blogviation on Mobile.  Am I right, wrong, what did I miss?  Let me know&#8230;</p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=09fzOI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=09fzOI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=vVEZgI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=vVEZgI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=UHEuYi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=UHEuYi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=hNM9gi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=hNM9gi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=MRe2WI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=MRe2WI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=TdneCi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=TdneCi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=8efzsI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=8efzsI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=fJN73i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=fJN73i" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=YPn8OI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=YPn8OI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=if1T4i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=if1T4i" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/312646596" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/06/thinking-on-mobile-analytics.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/06/thinking-on-mobile-analytics.html</feedburner:origLink></item>
		<item>
		<title>Why Don’t the Numbers Match?!?</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/306819456/why-dont-the-numbers-match-web-analytics.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/06/why-dont-the-numbers-match-web-analytics.html#comments</comments>
		<pubDate>Sat, 07 Jun 2008 03:50:57 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Audience Measurement]]></category>

		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Data Quality]]></category>

		<category><![CDATA[Measurement]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<category><![CDATA[Web Analytics Management]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/06/why-don%e2%80%99t-the-numbers-match.html</guid>
		<description><![CDATA[A question any practitioner of Internet-based analytics will be asked by many different stakeholders is “why don’t the numbers match?”  Counts of the identically named metrics from ad servers don’t match the web analytics tool, which don’t match the for-pay third party audience measurement tools, which don’t match the free audience measurement tools, which never [...]]]></description>
			<content:encoded><![CDATA[<p>A question any practitioner of Internet-based analytics will be asked by many different stakeholders is “why don’t the numbers match?”  Counts of the identically named metrics from ad servers don’t match the web analytics tool, which don’t match the for-pay third party audience measurement tools, which don’t match the free audience measurement tools, which never match any of the homegrown internal measurement tools.  And none of them ever match each other.</p>
<p>So it’s a good question certainly valid to ask.  The answers are even fairly easy to understand, but the root causes are often difficult to pinpoint and even harder, if possible at all, to remedy.  The fact of the matter is that data discrepancies in analytics result for a multitude of reasons, such as:</p>
<ul>
<li><strong>Different data collection methods</strong>.  We have a bunch of tools and services that collect web data using various, non-standardized, proprietary data collection methods.  Ad servers use javascript page tags.  Many web analytics tools use page tags too, but it’s not uncommon in web analytics to use additional methods, such as log files or packet sniffers.  Or perhaps a combination of these methods, called hybrid data collection.  And all the tools have different algorithms for processing the data collected.</li>
</ul>
<blockquote><p>On the audience measurement side, data is collected from self-selecting panels who install proprietary software (i.e. toolbars and so on) on their computers, perhaps at work or at their university, but most likely at home.  Then, the collected data from different panels is rolled-up and combined, and the limited subset of the Internet population that chooses to be monitored, in exchange for some incentive, is inflated and projected to the entire Internet audience using proprietary statistical methods.  We also have data collected from a limited set of geographically specific ISP’s.  And regardless of whether we’re talking about audience measurement or web analytics, the different data collection methods often, but not always, involve cookies and all their inherent issues of cookie deletion.  </p></blockquote>
<ul>
<li><strong>Unique data models</strong>.  Ad servers aren’t focused on counting page views and the other dimension of web analytics (visits, time, and so on).  Rather ad servers focus on serving and counting impressions served (and loads of related derivative calculations, like CTR, CPC, and view–thru).  Metrics are based on an ad request and an ad code.  Ads may or may not be targeted to a page, and instead to various constructs, like a “zone” or “keyword.”  What that means is that the “page” dimension may not even exist in your ad server’s data model.  In other words, you aren’t looking at impressions measured on a page, but rather at the number of impressions served in a different conceptual construct.  That’s one of the reasons why people say metrics and ad-serving systems “don’t measure the same thing.”</li>
<li><strong>Untagged pages</strong>.  Specific to technologies that collect data or serve ads using javascript page tags, there are challenges to ensuring and verifying complete coverage of page tags across every page on a site.  When the pages aren’t all tagged with the different tags for the assorted technologies, guess what?  The numbers won’t come close to falling within tolerable variances.  And questions and skepticism will ensue.</li>
<li><strong>Non-JS executing clients and ad blocking software</strong>.  Let’s imagine for the moment, your site is perfectly tagged for all technologies, so the numbers between your ad server will be close to your web analytics system, right?  Nope, regardless of data model issues, not all browsers execute javascript and many Firefox users have installed Ad Block Plus. </li>
<li><strong>Cookie issues</strong>.  When you’re counting based on cookies, third-party cookies get blocked (often by privacy software).  Many ad servers and web analytics tools still serve third party cookies, and many corporations have not tricked out their DNS to accommodate this issue.  And we all know how cookie deletion affects unique visitor counts, even if you use first-party cookies.</li>
<li><strong>Many other issues</strong>.  Latency from visitors moving off the page prior to the tag executing to latency in the call to pick up an ad from a third party while your ad server counts the traffic (so your ad count differs from the agency’s count), to refresh rates making it hard to correlate page views and impressions, to no rich media installed and no fallback, to robotic traffic not being filtered from logs or <a href="http://www.seomoz.org/ugc/bad-bots-confound-web-analytics-by-executing-javascript-tags">tags</a>, to certain types of user agents (such as mobile devices) not executing javascript… there’s a whole host of other factors that cause data discrepancies.</li>
</ul>
<p>And of course, there’s always the nebulous issue around the complete lack of consensus-based, enforceable standards for online measurement.  No industry organization can say what vendors or companies “must” do, only what they “should” do… And no industry body is going to get successful companies to change their secret sauce just because they said so…</p>
<p>So what’s a practitioner to do?  Understand the potential sources of discrepancies.  Work with your team (from IT to vendors) to prevent and minimize the root causes when possible.  Educate your team when discrepancies are not remediable.  Ensure you use the different sources of metrics judiciously in the context of your business goals.  Finally, realize that none of the tools are more “correct” than any other.  All of our analytics tools serve different, and sometimes overlapping, business purposes - from counting ads, to influencing media buying, to sizing audiences, to measuring business performance, and to optimizing the site.</p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=l1aIdI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=l1aIdI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=vNllcI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=vNllcI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=BuIQ0i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=BuIQ0i" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=m63B2i"><img src="http://feeds.feedburner.com/~f/judahphillips?i=m63B2i" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=QEe4oI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=QEe4oI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=u4cnNi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=u4cnNi" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=m3YefI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=m3YefI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=kOu3ii"><img src="http://feeds.feedburner.com/~f/judahphillips?i=kOu3ii" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=VfW5nI"><img src="http://feeds.feedburner.com/~f/judahphillips?i=VfW5nI" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=xlaXKi"><img src="http://feeds.feedburner.com/~f/judahphillips?i=xlaXKi" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/306819456" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/06/why-dont-the-numbers-match-web-analytics.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/06/why-dont-the-numbers-match-web-analytics.html</feedburner:origLink></item>
		<item>
		<title>Five Rules for and some Thoughts on Deep Packet Inspection</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/293200491/five-rules-for-and-some-thoughts-on-deep-packet-inspection.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/05/five-rules-for-and-some-thoughts-on-deep-packet-inspection.html#comments</comments>
		<pubDate>Mon, 19 May 2008 01:31:26 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Audience Measurement]]></category>

		<category><![CDATA[Behavioral Targeting]]></category>

		<category><![CDATA[Data Collection]]></category>

		<category><![CDATA[Measurement]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/05/five-rules-for-and-some-thoughts-on-deep-packet-sniffing.html</guid>
		<description><![CDATA[One of the many things on my mind in the online world these days is “deep packet inspection.” 
First, let me digress, packet sniffing isn’t new to web analytics.  From Accrue to Omniture (Visual Discover Sensor?) to AuriQ to Metronome Labs.  Packet sniffers are used to “do web analytics.”  It&#8217;s an uncommon method when compared to javascript page tags.
Web analytics [...]]]></description>
			<content:encoded><![CDATA[<p>One of the many things on my mind in the online world these days is “deep packet inspection.” </p>
<p>First, let me digress, packet sniffing isn’t new to web analytics.  From Accrue to <a href="http://www.omniture.com">Omniture</a> (<strike>Visual</strike> Discover Sensor?) to <a href="http://www.auriq.com">AuriQ</a> to <a href="http://www.metronomelabs.com">Metronome Labs</a>.  Packet sniffers are used to “do web analytics.”  It&#8217;s an uncommon method when compared to javascript page tags.</p>
<p>Web analytics packet sniffers are used to write logs for sessionization (and thus measure) the traffic on behalf of site owners (who don’t want to use tags or logs).  Once you’ve logged and sessionized you know what content people have looked at or downloaded on your site. </p>
<p>“Deep packet inspection,” like WA sniffers looks at the entire <em>payload</em>of packets in real-time across a huge number of simultaneous sessions.  Deep packet inspection, like regular packet sniffing, examines the files downloaded and the content of the pages viewed - the whole ball of wax. </p>
<p>Deep packet inspection is being offered as a hardware/software technology by companies like <a href="http://www.frontporch.com/html/index.html">FrontPorch</a> and <a href="http://www.sandvine.com">Sandvine</a> (in the US) and <a href="http://www.phorm.com">Phorm</a>(in the UK).  These companies are selling the technology to ISP’s (like Charter, Comcast, and Virgin Media) so that they can monitor the sites visited and the keywords used by customers, and then use the data collected for behavioral targeting.  The ISP’s want a slice of the <a href="http://www.news.com/8301-10784_3-9945171-7.html">juicy, lucrative online ad business</a>.</p>
<p>What’s the difference?  Site owners collect data about what you do on ONE site (or a portfolio of their sites).  ISP’s collect data about what you do on EVERY site you visit.  As I understand it, some of these companies create an anonymous profile of your surfing activity by assigning a unique key to your browser.  Then they monitor the site&#8217;s visited by your browser, and use that data so that the ISP, or the companies to which they sell your data, can serve you what they conclude to be relevant, behaviorally targeted ads. </p>
<p>Get it?  Packet sniffing by site owners = knowing about <em>one site you visit</em>.  Deep packet inspection by ISP’s = knowing about <em>every site you visit</em>.</p>
<p>Now to digress&#8230; In web analytics, we know that web analytics data is collected anonymously.  Unless there’s a login, you don’t know exactly who is coming from that IP address.  And in many cases, most companies data warehouses only contain purchase information, not the entire clickstream.  Once the data is collected, if you have the right architectures you can decode cookie values to people, and make that data non-anonymous (i.e PII).  Not difficult to do with some smart BI folks on your side.  </p>
<p>An ISP already knows who you are and can already identify the sites you visit.  Probably not that easily though on individual level.  They can dig through the logs, etc… </p>
<p>So what’s the big deal and all the hoo-hah about  the &#8220;deep packet inspection” Phorm and FrontPorch are doing?   It’s the data they are collecting and the repository they are building containing data about every site you visit and all the content you view and download… Of course, these companies say that it&#8217;s all done anonymously and that your “privacy” is preserved “to the greatest extent possible.” </p>
<p>Now <a href="http://news.bbc.co.uk/1/hi/technology/7299875.stm">let me quote Sir Tim Berners-Lee</a> about the data collected from Phorm’s ISP tracking: &#8220;It&#8217;s mine - you can&#8217;t have it. If you want to use it for something, then you have to negotiate with me. I have to agree, I have to understand what I&#8217;m getting in return.&#8221;</p>
<p>And that’s the point of the blogviation, Tim is correct.  In web analytics, we do this - we try to operate within Tim&#8217;s constraints.  We enable opt-in with P3P statements and disclosures when you register/login.  Privacy policies disclose what we are doing with the data.  It&#8217;s just ethical and smart business practice to do so.</p>
<p>Thus, I think FrontPorch and Phorm and all the ISP’s who want a piece of online advertising should adhere to the following five rules for their services.</p>
<ol>
<li><strong>Move to an obvious “opt-in” model with full disclosure</strong>.  Tracking via “deep packet inspection” should be an all opt-in model.  If you want anonymous data from your browser collected so that you can be behaviorally targeted, then you should opt-in to be.  Right now, it’s seems to be all opt-out.  You probably don’t know if it&#8217;s being done to you.  It’s buried in fine print you’ve probably never read.  Is that your fault you didn&#8217;t read the fine print? Yeah, but the point is it shouldn&#8217;t be buried in the fine print&#8230;</li>
<li><strong>Provide me with access to the data collected</strong>.  If I opt-in, I should be able to see the data collected from my browser.  It’s very simple.  I demand to see what you are collecting about my browser.  If you are building a profile, then I demand to see the data collected in the profile.  If it&#8217;s all anonymous, then explain how it is in detail, and then follow rule #1.</li>
<li><strong>Enable me to edit or prevent the data from being collected.</strong>  If I opt-in, I want to be able to edit or prevent certain types of data from being collected.  If you&#8217;re tracking my browser, alert me before the data is transmitted, so I can decide if I want to share it.  If a profile is built, I want to be able to edit it!</li>
<li><strong>Let me opt-out at any time EASILY.</strong> If I’ve opted in, and I’m unhappy with the service, allow me to opt-out simply.  Having to set an opt-out cookie on my browser is absolutely and completely absurd.  I want to be able to fully opt-out at the ISP level, just once forever, not at the browser level every time cookies are deleted.  Make it easy and permanent, not easily deletable.</li>
<li><strong>Disclose who you sell my data too</strong>.  Like online list rentals, the next step in all this ISP profiling is selling the data to third-parties.  Let me know what you&#8217;re doing with my data-before you do it- so I can opt out or prevent it from being sold to parties to which I don&#8217;t want it being sold.</li>
</ol>
<p>Consumers must be given a choice for preserving their privacy.  Anonymity to the “greatest extent possible” is not enough and neither are short-sighted opt-out cookies.  Companies like Phorm and Front Porch would be wise to apply these rules to regulate themselves.  <a href="http://www.boston.com/news/local/connecticut/articles/2008/05/16/lawmakers_raise_concerns_over_charter_web_tracking_plan/">Otherwise freedom-loving governments will almost certainly regulate them</a>. </p>
<p>And I haven’t even mentioned the issues with <a href="http://www.google.com/search?sourceid=navclient&amp;ie=UTF-8&amp;rls=HPIB,HPIB:2006-22,HPIB:en&amp;q=deep+packet+sniffing+and+net+neutrality">net neutrality and deep packet inspection</a> (i.e. traffic shaping and access restrictions (called &#8220;throttling&#8221; as <a href="http://blog.instantcognition.com">Clint</a> points out in the comment), have I?</p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=nOz4ZH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=nOz4ZH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=3NLLvH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=3NLLvH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=Lf9pkh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=Lf9pkh" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=gHrA1h"><img src="http://feeds.feedburner.com/~f/judahphillips?i=gHrA1h" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=lk4bXH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=lk4bXH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=VOxt3h"><img src="http://feeds.feedburner.com/~f/judahphillips?i=VOxt3h" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=sdPBfH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=sdPBfH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=nGmiPh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=nGmiPh" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=2vRkBH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=2vRkBH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=7e5hIh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=7e5hIh" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/293200491" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/05/five-rules-for-and-some-thoughts-on-deep-packet-inspection.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/05/five-rules-for-and-some-thoughts-on-deep-packet-inspection.html</feedburner:origLink></item>
		<item>
		<title>A Few Thoughts After Another Awesome eMetrics….</title>
		<link>http://feeds.feedburner.com/~r/judahphillips/~3/289152997/a-few-thoughts-after-another-awesome-emetrics.html</link>
		<comments>http://judah.webanalyticsdemystified.com/2008/05/a-few-thoughts-after-another-awesome-emetrics.html#comments</comments>
		<pubDate>Tue, 13 May 2008 03:52:39 +0000</pubDate>
		<dc:creator>Judah</dc:creator>
		
		<category><![CDATA[Appearances]]></category>

		<category><![CDATA[Audience Measurement]]></category>

		<category><![CDATA[Due Diligence]]></category>

		<category><![CDATA[General]]></category>

		<category><![CDATA[Multichannel]]></category>

		<category><![CDATA[Random Thoughts]]></category>

		<category><![CDATA[Web Analytics]]></category>

		<category><![CDATA[Web Analytics Management]]></category>

		<category><![CDATA[Web Analytics Team]]></category>

		<guid isPermaLink="false">http://judah.webanalyticsdemystified.com/2008/05/a-few-thoughts-after-another-awesome-emetrics.html</guid>
		<description><![CDATA[Back from another excellent eMetrics.  I’m a very big fan of the eMetrics Marketing Optimization Summit…  Props go to Jim Sterne for growing this event from a little seed into an incredible, blogworthy blossom.  How involved is Jim in eMetrics?  I’d say he’s completely immersed in every little piece - he even came up to [...]]]></description>
			<content:encoded><![CDATA[<p>Back from another excellent <a href="http://www.emetrics.org">eMetrics</a>.  I’m a very big fan of the eMetrics Marketing Optimization Summit…  Props go to Jim Sterne for growing this event from a little seed into an incredible, blogworthy blossom.  How involved is Jim in eMetrics?  I’d say he’s completely immersed in every little piece - he even came up to me at the <a href="http://blog.webanalyticsdemystified.com/weblog/2008/05/web-analytics-wednesday-san-francisco-metrics-and-kpis.html">SF WAW</a> (way to go <a href="http://june.typepad.com/">June D</a>!) to find out about the <a href="http://twitter.com/jdersh/statuses/804173115">renegade AV work I did</a> in one of the sessions, and to get my take on how it could have been avoided.  He’s that intimately connected to what’s going on.  Macro and micro, micro and macro.  And when you have one of the best Internet Marketers in the world, keeping a tight rein on the Clydesdale of conferences, you know you’re in for one heck of fun ride. </p>
<p>And so it was for about 500+ of the top web analytics in the beautiful Palace hotel.  Props to consummate conference organizers Matt Finlay and his crew at <a href="http://www.risingmedia.com">Rising Media</a> for keeping the road smooth as we all trotted on it as well.  Fanny, you are one helpful polyglot of a marketing manager!  I never knew German keyboards were so wild… Thanks.</p>
<p>The eMetrics sessions were informative and actionable.  The lobby bar and after-hours parties fun and enlightening.  You really can’t ask for more out of a conference.  As I flew home thinking back on it all, there was a lot to blog about, including:</p>
<ul>
<li><strong>It’s all about attitude, dude – as in attitudinal data.</strong>  Like my father says “it’s all about your attitude.”  And so it is on the Internet in 2008.  From <a href="http://www.foreseeresults.com">ForeSeeResults</a>, to <a href="http://www.iperceptions.com">iPerceptions</a>, to <a href="http://www.opinionlab.com">OpinionLab</a>, to <a href="http://www.crmmetrix.com">CRMMetrix</a>, the often missing link in customer analytics is attitudinal data.  I’m talking here about Voice of Customer (VOC) technology that allows you to ask a question set to site visitors and then apply some sort of algorithm or model to express the meaningfulness of the data in quantifiable terms.  From the American Customer Satisfaction Index to 4Q.  VOC technology enables you to participate in a continuous, automated dialog with your customers in order to identify problem points on your web site and enable you to measure purpose and success of your most valuable segments.  Expect to see some of the big players gobble up these smaller companies.  <a href="http://www.omniture.com">Omniture</a>, <a href="http://www.unica.com">Unica</a>, <a href="http://www.webtrends.com">WebTrends</a>, and <a href="http://www.coremetrics.com">CoreMetrics </a>should be thinking about acquisition in this space to round out their offerings.</li>
<li><strong>Testing, 123… as in multivariate, MVT</strong>.  The rage is site optimization technologies beyond the simple A/B, champion challenger, test.  In this category you find folks like <a href="http://www.sitespect.com">SiteSpect</a> (the only non-intrusive multivariate testing solution!).  I’m a big fan of these guys (and was in 2006 long before they ever sponsored a <a href="http://www.webanalyticsdemystified.com/wednesday/">WAW</a>, thanks to a nice demo from Larry at my old job).  Eric Hansen and his crew have specialized software that you install in your data center.  No futzing with damned tags.  Swap out your variations, create different recipes, determine what’s statistically significant in giving you a lift to your macro or micro conversion goal, and you’re off to the races.  The good folks at Google are doing it and doing it well with <a href="https://www.google.com/accounts/ServiceLogin?service=websiteoptimizer&amp;continue=https%3A%2F%2Fwww.google.com%2Fanalytics%2Fsiteopt%2F%3Fet%3Dreset%26hl%3D">Google Site Optimizer</a> (thanks for the t-shirts!).  <a href="http://www.interwoven.com">Interwoven</a> is baking in Optimost to the CMS, and Omniture has their Test and Target integrated with the Business Optimization Suite.  Accenture has <a href="http://www.memetrics.com/about/">Memetrics</a>.  <a href="http://www.kefta.com">Kefta</a> too. And what ever happened to <a href="http://www.vertster.com/">Verster</a>?</li>
</ul>
<blockquote><p>In a nutshell, these technologies enable you to test variations of content themes, colors, creative, calls to action, points of resolution, buttons, navigational elements, &#8211;whatever you want to call the stuff on the screen—to determine what combination performs best against your goals.  But of course, this is all just software, so don’t get too excited.  The tests are about as good as the people creating them…  And complex tests that take a long time to execute may not finish.  Imagine 1-800-Flowers starting a test in January and not finishing until March, missing Valentine’s Day.  Or Intuit running a test beyond April 15th for a tax product.  Go humbly and carefully into this space, my friends, or you may end up optimizing for everyone and appealing to none.</p></blockquote>
<ul>
<li><strong>Tying it all back to the dollar for profit-generating sites and to the mission of non-profit generating sites&#8230;</strong>  It seems like a “no, duh” moment but metrics for the sake of metrics can be a big waste of time.  If you can’t tie metrics or visitor actions back to value on a revenue-producing site or to the betterment of a non-profit site’s core mission, then what’s really the point of the measurement…  That’s why I’m a big fan of the stuff <a href="http://www.zaaz.com">ZaaZ</a> does.  They totally get the fact of how actionable metrics turn the wheel of Internet commerce and ad-based models, and they can model it all to prove it out the ROI.  Folks like newly elected WAA Director Alex Langshur’s company <a href="http://www.publicinsite.com">Public InSite</a> do similar stuff for content driven sites.  That is they know how to use metrics to optimize the channel to goals, not to just puke confusing data, like most web analytics tools do.  Again, it’s all about the people you hire, not the tools you use… My good friend <a href="http://www.kaushik.net">Avinash</a>, right again!</li>
<li><strong>The emergence and rise of deeply psychological and neuro-behavioral methods for automating persuasion and conversion.</strong>   Anyone who knows my good friend <a href="http://www.bizmediascience.com/">Joseph Carrabis</a>, over at <a href="http://www.nextstagevolution.com/">NextStage Evolution</a>, knows that besides being one heck of giant kite flying, music master, he’s also got the models and the patents to help target and respond to human behavior across programmable devices.  We’re already seeing some companies, like <a href="http://www.7bpeople.com/">Seven Billion Joe’s, er People</a>, taking what he’s been saying for years and going to market with it.  The idea here being that if you can identify the affective, behavior, and motivational drivers of site visitors, you can maximize cognition in elements on the site (like pictures, text, informational flow) to appeal to target segments and persuade/provoke desired behavior.  It’s like a higher rung on the optimization ladder.  It’s not test what they see, it’s figure out how they think, then make the site better because of it.  Cool stuff.  Blows my mind.</li>
<li><strong>Integrated, multichannel marketing.</strong>  Just ask my good friend <a href="http://www.multichannelmetrics.com/">Akin Arikan</a>, author of the newly released <a href="http://www.multichannelmetrics.com/?page_id=7">Multichannel Marketing</a>.  (Disclaimer: I was a technical editor on the book.  It’s easy to do when you edit brilliance).  Make sure to check it out!  Marketing in general will become more Internet-centric, but will continue to clutch the roots of broadcast and print.  You will have the database marketer and statistical modelers working with a union of web channel and offline data.  What’s preventing it now?  A unified marketing database.  You see companies like <a href="http://www.salford-systems.com/">Salford Systems</a> circulating in this space.  And take a look at <a href="http://www.unica.com/product/emm.cfm">Unica’s blend of Enterprise Marketing Management</a>…  I’d stay tuned to see what Unica has up their sleeve for bringing together online and offline.  When you can segment and target across online and offline campaigns, if I were pure web channel player only, like Omniture or CoreMetrics, I’d be a bit concerned that people are waking up to open systems, not closed black boxes.  <a href="http://www.webtrends.com">WebTrends</a> is already moving in this direction&#8230;  But they all remain far behind Unica when it comes to multichannel marketing.</li>
</ul>
<p>And that’s just a few of the things the phenomenal eMetrics got me thinking about…  I hope to see you in Washington DC in October! </p>
<hr noshade style="margin:0;height:1px" /><br />
&copy; 2008 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</A>      <br />
<br><br><b>Looking for a new job in web analytics?</b> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</A>            <div class="feedflare">
<a href="http://feeds.feedburner.com/~f/judahphillips?a=48AlbH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=48AlbH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=wX4WYH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=wX4WYH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=INfjDh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=INfjDh" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=Uyh2dh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=Uyh2dh" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=WtwjRH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=WtwjRH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=NvItVh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=NvItVh" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=g3k9NH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=g3k9NH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=Pfeayh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=Pfeayh" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=V1S5gH"><img src="http://feeds.feedburner.com/~f/judahphillips?i=V1S5gH" border="0"></img></a> <a href="http://feeds.feedburner.com/~f/judahphillips?a=Io3ISh"><img src="http://feeds.feedburner.com/~f/judahphillips?i=Io3ISh" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/judahphillips/~4/289152997" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://judah.webanalyticsdemystified.com/2008/05/a-few-thoughts-after-another-awesome-emetrics.html/feed</wfw:commentRss>
		<feedburner:origLink>http://judah.webanalyticsdemystified.com/2008/05/a-few-thoughts-after-another-awesome-emetrics.html</feedburner:origLink></item>
	</channel>
</rss>
