<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Billy's Blog: Multivariate Testing &amp; Optimization</title>
	
	<link>http://testingblog.widemile.com</link>
	<description>My blog about multivariate and a/b testing at Widemile.</description>
	<lastBuildDate>Tue, 23 Jun 2009 23:38:49 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/billyblogwm" type="application/rss+xml" /><item>
		<title>Rules for a successful multivariate test (Billy’s Optimization Guide Part 3)</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/r7-1sZSKw90/</link>
		<comments>http://testingblog.widemile.com/2009/06/16/rules-successful-multivariate-test-billys-optimization-guide-part-3/#comments</comments>
		<pubDate>Wed, 17 Jun 2009 00:15:20 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[Testing Concerns]]></category>
		<category><![CDATA[Testing Techniques]]></category>
		<category><![CDATA[Billy's Optimization Guide]]></category>
		<category><![CDATA[fractional factorial]]></category>
		<category><![CDATA[full factorial]]></category>
		<category><![CDATA[multivariate testing]]></category>
		<category><![CDATA[stabilization]]></category>
		<category><![CDATA[test design]]></category>
		<category><![CDATA[test type]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=365</guid>
		<description><![CDATA[
If you missed it, see Part 1 (A/B Split Testing) and Part 2 (Multivariate Test Basics).
With the basics of Part 2 down, it&#8217;s time to start designing a multivariate test.  Every optimization project has different challenges and goals; luckily, though, there are a few rules that apply to every multivariate test design.  These rules fit [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><img class="size-full wp-image-426 aligncenter" title="Rules of Six Detail" src="http://testingblog.widemile.com/wp-content/uploads/2009/06/rules.jpg" alt="Rules of Six Detail" width="450" height="296" /></p>
<p><em>If you missed it, see <a href="http://testingblog.widemile.com/2009/01/26/pro-a-b-split-test-method/">Part 1 (A/B Split Testing)</a> and <a href="http://testingblog.widemile.com/2009/01/29/simplifying-multivariate-testing-fo-billys-optimization-guide-part-2/">Part 2 (Multivariate Test Basics)</a>.</em></p>
<p>With the basics of Part 2 down, it&#8217;s time to start designing a multivariate test.  Every optimization project has different challenges and goals; luckily, though, there are a few rules that apply to every multivariate test design.  These rules fit into two categories: technical rules and content rules.</p>
<p><strong>Technical rules:</strong></p>
<ol>
<li>Choose the appropriate multivariate test type (<a href="http://testingblog.widemile.com/2008/07/24/primer-full-and-fractional-factorial-test-design/">full or fractional factorial</a>)</li>
<li>Determine the number of factors and levels that can be tested based on estimated conversion traffic (choose a test array)</li>
<li>Stop the test when it has stabilized, not when your earlier estimates said it would</li>
</ol>
<p>These rules ensure statistical significance by constraining the test to the appropriate size at the beginning and then letting the test gather the proper amount of data at the end.</p>
<p>Running a full factorial test, if your traffic supports it, may be a good choice if you&#8217;re testing content that you believe has many interactions or if you only want to test 2 factors with 2 levels each.  (Note: the smallest fractional factorial test size is 3 factors with 2 levels each.)  Typically, though, you&#8217;ll want to run a fractional factorial test to save time and expand the number of factors and levels you can test.</p>
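<p>To make the size difference concrete, here is a minimal Python sketch of that smallest case: 3 factors with 2 levels each.  Coding the levels as -1 and +1 and building the half-fraction from the relation C = A&#215;B are standard textbook conventions that I am assuming for illustration; the arrays your testing tool actually uses may differ.</p>

```python
from itertools import product

# Full factorial for 3 factors at 2 levels each: every combination, 2**3 = 8 experiments.
full = list(product([-1, 1], repeat=3))

# Half-fraction: choose the levels of factors A and B freely, then set factor C
# to the product A*B. This is the textbook construction, used here only as an
# illustration of how a fractional array covers the factors with fewer experiments.
half = [(a, b, a * b) for a, b in product([-1, 1], repeat=2)]

print(len(full), "experiments in the full factorial")
print(len(half), "experiments in the fractional factorial")
for experiment in half:
    print(experiment)
```

<p>Each factor still appears at each of its levels equally often in the smaller array, which is what lets a fractional test estimate factor influence with half the experiments.</p>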
<p>To find out how many factors and levels you can test, you need some idea of your predicted page views and conversions, as well as an estimate of lift.  Lift matters because a large lift will get you more conversions, so your test will stabilize more quickly.  Because of this, I would be conservative with lift estimates to ensure that the test is not designed too large.  At Widemile, we have a large list of arrays available to our tool and have calculated the approximate conversions needed to stabilize each one, allowing me to look at the three criteria I listed and find the arrays that are statistically viable for testing.  You should look for something similar with your tool of choice.</p>
<p>To figure out when a test has stabilized, I prefer to look primarily at level influence stabilization, with experiment conversion rate stabilization for support.  Widemile Optimize shows this using graphs, so I simply look for horizontal trending of lines, meaning winning levels and experiments stay winners and their level of influence or conversion rates stay fairly constant (look horizontal) over 3-5 days.  If you don&#8217;t have graphs available, look at the historical cumulative conversion rate for your experiments and see whether it varies much over the latest few days of your test.</p>
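<p>If you are working from raw numbers rather than graphs, the cumulative conversion rate check can be sketched in a few lines of Python.  The function names, the 5-day window and the 5% tolerance below are all hypothetical choices for illustration, not values from Widemile Optimize, and the daily figures are made up.</p>

```python
def cumulative_rates(daily_visitors, daily_conversions):
    """Return the cumulative conversion rate at the end of each day."""
    rates = []
    visitors = conversions = 0
    for v, c in zip(daily_visitors, daily_conversions):
        visitors += v
        conversions += c
        rates.append(conversions / visitors if visitors else 0.0)
    return rates


def looks_stable(rates, window=5, tolerance=0.05):
    """True when the last `window` cumulative rates stay within a relative
    spread of `tolerance` around their mean. Both defaults are assumptions."""
    recent = rates[-window:]
    if window > len(recent):
        return False  # not enough days of data yet
    mean = sum(recent) / len(recent)
    if mean == 0:
        return False
    return tolerance >= (max(recent) - min(recent)) / mean


# Made-up daily numbers for one experiment.
visitors = [1000] * 8
conversions = [20, 35, 28, 31, 30, 29, 30, 31]
rates = cumulative_rates(visitors, conversions)
print(looks_stable(rates))
```

<p>Run the same check for every experiment in the test.  A flat cumulative line is the numeric equivalent of the horizontal trend you would look for in a graph, but treat it as supporting evidence rather than a replacement for checking that the winners stay winners.</p>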
<p><strong>Content rules:</strong></p>
<ol>
<li>Every item you test should answer an important question</li>
<li>Test variety not quantity</li>
<li>Test opposites first then refine</li>
<li>Remember you can run more than one test</li>
</ol>
<p>The content rules are closely tied together.  In effect, they ensure that the items selected for testing have purpose and that they don&#8217;t needlessly expand the size of your test, reducing its efficiency.  I begin designing tests by creating hypotheses about issues with the page and then choose factors and design levels to address those issues.</p>
<p>An example hypothesis is &#8220;Having a hero shot on the right side of the page causes users to ignore the important value proposition on the left side.&#8221;  To test this, I would choose hero shot position as a factor and then have &#8220;left side hero shot&#8221; as the baseline level and &#8220;right side hero shot&#8221; as the second level.  This example also illustrates that, beyond headlines and images, you can test layout with creative use of CSS and sometimes JavaScript.  As long as you can switch from one level to another and each level works with the other factors and levels, you are at liberty to test anything.</p>
<p>Coming back to the rules, make sure that you are testing as few items as possible to find out what you need.  Before testing a collection of lifestyle hero shots, choose one and test it against an iconic hero shot.  This will save you the time of going down a path of testing something that may not work.</p>
<p>Lastly, you aren&#8217;t going to be able to get the best page on the first run or even second, third, etc.  If you knew what your audience liked 100% of the time then you wouldn&#8217;t need testing.  Remember to think of your overall test plan beyond just the first run, so that you can answer all the questions you need without having to force everything into one test.</p>
<p>In summary, determine what you&#8217;re trying to achieve, select the proper testing method to meet those goals and then make sure to be purposeful and efficient with the content you end up testing in front of your visitors.  Testing and optimization is not difficult, although it can be tough to start.  Follow these rules and you&#8217;ll be on your way to conquering conversion rates, bounce rates, funnel drop-offs and many other metrics.</p>
<p>Photo credit: <a href="http://www.flickr.com/photos/arandalasch/3182768438/">Aranda\Lasch</a> (<a href="http://creativecommons.org/licenses/by-nc-nd/2.0/deed.en">CC</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2009/06/16/rules-successful-multivariate-test-billys-optimization-guide-part-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2009/06/16/rules-successful-multivariate-test-billys-optimization-guide-part-3/</feedburner:origLink></item>
		<item>
		<title>My response to Google’s Lead Designer leaving because of testing culture</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/W0KuOG3f9PQ/</link>
		<comments>http://testingblog.widemile.com/2009/04/02/googles-lead-designer-leaves-data-driven-decision-making/#comments</comments>
		<pubDate>Thu, 02 Apr 2009 22:34:10 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=386</guid>
		<description><![CDATA[
I recently read a blog post by Douglas Bowman, Google&#8217;s former Visual Design Lead, about why he left Google.  In it, he describes how the engineering culture contributed to his decision to leave:
When a company is filled with engineers, it turns to engineering to solve problems. Reduce each decision to a simple logic problem. Remove all subjectivity and [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://testingblog.widemile.com/wp-content/uploads/2009/04/design-is-dead.jpg"><img class="aligncenter size-full wp-image-399" title="design-is-dead" src="http://testingblog.widemile.com/wp-content/uploads/2009/04/design-is-dead.jpg" alt="design is dead" width="450" height="282" /></a></p>
<p>I recently read a blog post by Douglas Bowman, Google&#8217;s former Visual Design Lead, about <a href="http://stopdesign.com/archive/2009/03/20/goodbye-google.html">why he left Google</a>.  In it, he describes how the engineering culture contributed to his decision to leave:</p>
<blockquote><p>When a company is filled with engineers, it turns to engineering to solve problems. Reduce each decision to a simple logic problem. Remove all subjectivity and just look at the data. [...] that data eventually becomes a crutch for every decision, paralyzing the company and preventing it from making any daring design decisions.</p></blockquote>
<p>He then references Google testing 41 shades of blue and a recent debate over &#8220;whether a border should be 3, 4 or 5 pixels wide,&#8221; in which he was asked to provide data to back up his decision.</p>
<p>Bowman&#8217;s post left me disappointed with Google: despite having their own optimization tool, they did not create a culture that encouraged their lead designer to expand his work, and they actually drove him away.  Optimization and testing are still in their early stages, so mistakes will be common; however, I hope news like this doesn&#8217;t scare others away from testing.</p>
<p>Rather, I hope companies can learn from Bowman&#8217;s experience.  Instead of holding designers to every detail, testing should allow them to explore, learn and refine their ideas.  Testing should not prevent &#8220;any daring design decisions&#8221;; I feel it should actually encourage them.  As I said before, <a href="http://testingblog.widemile.com/2008/08/19/gamble-conversions-raise/">gamble with your conversions to raise them</a>.</p>
<p>In the end, it&#8217;s all about understanding how testing should and should not be used.  You can use testing to find the best shade of blue, but that doesn&#8217;t necessarily mean that&#8217;s what you should be testing right now.  Don&#8217;t be afraid to take a step back and try something new rather than fiddling with details; testing tools give you that freedom.  Big risks reap big rewards in optimization.  Not taking risks leads to inefficient testing and, in Google&#8217;s case, a designer&#8217;s resignation.</p>
<p>Photo credit: <a href="http://www.flickr.com/photos/i-marco/244541946/">i-marco</a> (<a href="http://creativecommons.org/licenses/by-sa/2.0/deed.en">CC</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2009/04/02/googles-lead-designer-leaves-data-driven-decision-making/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2009/04/02/googles-lead-designer-leaves-data-driven-decision-making/</feedburner:origLink></item>
		<item>
		<title>How to pick a page to test and optimize</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/DRYDuFu70sE/</link>
		<comments>http://testingblog.widemile.com/2009/02/09/pick-page-test-optimize/#comments</comments>
		<pubDate>Mon, 09 Feb 2009 23:59:36 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[before testing]]></category>
		<category><![CDATA[starting point]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=373</guid>
		<description><![CDATA[

Selecting the right page to test is possibly the most important decision of an optimization campaign.  You can have great ideas, the technology and talent behind you, but if you pick the wrong page you could be doing a lot of work for minimal return.
So how do you get the biggest bang for your buck [...]]]></description>
			<content:encoded><![CDATA[
<p><a href="http://testingblog.widemile.com/wp-content/uploads/2009/02/pick1.jpg"><img class="alignnone size-medium wp-image-377" title="pick1" src="http://testingblog.widemile.com/wp-content/uploads/2009/02/pick1-449x300.jpg" alt="pick1 449x300" width="449" height="300" /></a></p>
<p>Selecting the right page to test is possibly the most important decision of an optimization campaign.  You can have great ideas, the technology and talent behind you, but if you pick the wrong page you could be doing a lot of work for minimal return.</p>
<p>So how do you get the biggest bang for your buck with testing?  Here&#8217;s a quick list of things to look for in a page:</p>
<ul>
<li>A single, specific and easy-to-measure conversion goal</li>
<li>Sizable conversion traffic (at least 200 conversions in a week)</li>
<li>A page that suffers from poor design or an unclear conversion goal</li>
<li>No large technical hurdles to implementing and executing the test</li>
<li>A conversion rate that&#8217;s lower than comparable pages</li>
</ul>
<p>Attacking pages with these attributes will get you some easy wins and help establish testing in your company.  Typically landing pages are the best pages to optimize, especially if they have the end conversion goal on the page, e.g. a form submission, download or click-out.</p>
<p>From there, I would move on to other pages in the funnel, taking a look at bounce rates to help determine where you need to push visitors further into the funnel.  If there are no other pages in the funnel, find other poor or underperforming pages on your site and evaluate them against the list above.</p>
<p>The main idea is that testing is a process: just because you have ideas to improve a page does not mean it is the best page to spend your time improving.</p>
<p><em>Photo credit: <a href="http://flickr.com/photos/ajawin/2650685883/">lepiaf.geo</a> (<a href="http://creativecommons.org/licenses/by/2.0/deed.en">CC</a>)</em></p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2009/02/09/pick-page-test-optimize/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2009/02/09/pick-page-test-optimize/</feedburner:origLink></item>
		<item>
		<title>Breaking down multivariate testing (Billy’s Optimization Guide Part 2)</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/UUa3PtsX7P4/</link>
		<comments>http://testingblog.widemile.com/2009/01/29/simplifying-multivariate-testing-fo-billys-optimization-guide-part-2/#comments</comments>
		<pubDate>Fri, 30 Jan 2009 01:16:22 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[Terminology]]></category>
		<category><![CDATA[Billy's Optimization Guide]]></category>
		<category><![CDATA[multivariate test]]></category>
		<category><![CDATA[multivariate testing]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=355</guid>
		<description><![CDATA[If you missed it, see Part 1 (A/B Split Testing).  Update: Part 3 on Rules for a Successful Multivariate Test is here.

The technical and statistical aspects of multivariate testing can be complicated, but to design successful tests you don&#8217;t need to know everything, just the basics of how it works and some guidelines.  [...]]]></description>
			<content:encoded><![CDATA[<p><em>If you missed it, see <a href="http://testingblog.widemile.com/2009/01/26/pro-a-b-split-test-method/">Part 1 (A/B Split Testing)</a>.  Update: Part 3 on <a href="http://testingblog.widemile.com/2009/06/16/rules-successful-multivariate-test-billys-optimization-guide-part-3/">Rules for a Successful Multivariate Test is here</a>.<br />
</em></p>
<p>The technical and statistical aspects of multivariate testing can be complicated, but to design successful tests you don&#8217;t need to know everything, just the basics of how it works and some guidelines.  I&#8217;m assuming you already have some understanding of multivariate testing; however, I want to cover the basics and make sure we&#8217;re on the same level before going into how to design good multivariate tests.</p>
<p>Check out the wireframe below.  Pretty standard for a landing page, right?  To properly design a multivariate test, we have to look at the page in a certain way.  Using three key terms (factors, levels and experiments), we can break down a test and describe its framework.</p>
<p style="text-align: center;"><img class="aligncenter size-medium wp-image-357" title="Web Page Example" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/page.png" alt="page" width="203" height="151" /></p>
<p><strong>Factor</strong>: An element of the Web page (headline, image, text) being tested.  The element can also be groups of content, e.g. left column, button and hero shot together, or all banner ads on the page.</p>
<p><strong>Level</strong>: Content that is assigned to a specific factor to be tested.  For example, one variation of a hero shot.</p>
<p>Below are 4 factors from our example page (headline, hero shot, offer and button), each shown with 4 levels represented by the different colors.  Note that the levels of one factor do not have to relate in any way to the levels of other factors.</p>
<p><img class="aligncenter size-medium wp-image-359" title="Factors and Levels example" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/factors-and-levels-450x156.png" alt="factors and levels 450x156" width="450" height="156" /></p>
<p>The last term, experiments, makes use of both factors and levels.</p>
<p><strong>Experiment: </strong>A unique combination of levels used during a test.</p>
<p>Here you can see 4 different experiments.  Each experiment holds a different combination of levels.  Note that there are actually many more possible experiments (4&#215;4&#215;4&#215;4 = 256 combinations).</p>
<p><img class="aligncenter size-medium wp-image-362" title="experiments example" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/experiments-example-400x300.png" alt="experiments example 400x300" width="400" height="300" /></p>
<p>Essentially, a multivariate test involves showing these experiments randomly to <em>live traffic</em> while tracking how each experiment performs.  The one that performs the best wins.  Each experiment is shown to many people, but each person only sees one experiment.  (There is some complexity in this.  If you are still confused or want to know more, go to <a href="http://testingblog.widemile.com/2008/07/24/primer-full-and-fractional-factorial-test-design/">my primer on full and fractional factorial testing</a>.)</p>
<p>In my next post, I will use these terms to outline the rules for creating a great multivariate test.</p>
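<p>For readers who think in code, the three terms and the random assignment can be sketched in Python.  The level names (H1, S1 and so on), the visitor ids and the in-memory dictionary are placeholders I made up for illustration; a real testing tool handles assignment and tracking for you, and a fractional factorial test would show only a subset of these combinations.</p>

```python
import random
from itertools import product

# Hypothetical level names; the example page has 4 factors with 4 levels each.
factors = {
    "headline": ["H1", "H2", "H3", "H4"],
    "hero_shot": ["S1", "S2", "S3", "S4"],
    "offer": ["O1", "O2", "O3", "O4"],
    "button": ["B1", "B2", "B3", "B4"],
}

# An experiment is one unique combination of levels, one level per factor.
experiments = [dict(zip(factors, combo)) for combo in product(*factors.values())]
print(len(experiments))  # 256

assignments = {}  # visitor id -> index of the experiment that visitor sees


def experiment_for(visitor_id, rng=random):
    """Pick an experiment at random on the first visit, then keep showing it."""
    if visitor_id not in assignments:
        assignments[visitor_id] = rng.randrange(len(experiments))
    return experiments[assignments[visitor_id]]


print(experiment_for("visitor-1"))
```

<p>Remembering the assignment is what ensures that each person only sees one experiment, even if they come back to the page several times.</p>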
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2009/01/29/simplifying-multivariate-testing-fo-billys-optimization-guide-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2009/01/29/simplifying-multivariate-testing-fo-billys-optimization-guide-part-2/</feedburner:origLink></item>
		<item>
		<title>3 ways to use an a/b split test (Billy’s Optimization Guide Part 1)</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/cVs80CxTFBQ/</link>
		<comments>http://testingblog.widemile.com/2009/01/26/pro-a-b-split-test-method/#comments</comments>
		<pubDate>Tue, 27 Jan 2009 01:17:42 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[About]]></category>
		<category><![CDATA[Site News]]></category>
		<category><![CDATA[Testing Techniques]]></category>
		<category><![CDATA[Why Test?]]></category>
		<category><![CDATA[a/b split test]]></category>
		<category><![CDATA[Billy's Optimization Guide]]></category>
		<category><![CDATA[funnel test]]></category>
		<category><![CDATA[optimization pro]]></category>
		<category><![CDATA[template test]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=298</guid>
		<description><![CDATA[Update: Check out Part 2 on Breaking Down Multivariate Testing and Part 3 on Rules for a Successful Multivariate Test.
Testing is not hard, but there are fundamentals that guarantee a successful optimization campaign.  To help get marketers up to speed with the basics, starting today, I will be writing about one topic per post and [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;"><em>Update: Check out Part 2 on </em><em><a href="http://testingblog.widemile.com/2009/01/29/simplifying-multivariate-testing-fo-billys-optimization-guide-part-2/">Breaking Down Multivariate Testing</a> and Part 3 on <a href="../2009/06/16/rules-successful-multivariate-test-billys-optimization-guide-part-3/">Rules for a Successful Multivariate Test</a>.</em></p>
<p style="text-align: left;">Testing is not hard, but there are fundamentals that guarantee a successful optimization campaign.  To help get marketers up to speed with the basics, starting today, I will be writing about one topic per post and put together what I call Billy&#8217;s Optimization Guide.</p>
<p style="text-align: left;">The natural place to start is with a/b split tests, so let&#8217;s begin there.</p>
<p><img class="size-medium wp-image-324 aligncenter" title="a/b split test" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/ab-split-test-450x141.png" alt="ab split test 450x141" width="450" height="141" /></p>
<p>First, a quick <em>useful</em> definition of an a/b split test: the competition of two <span style="text-decoration: underline;"><strong>distinct</strong></span> pages, where a portion of live traffic, usually 50%, is sent to one page and the rest to the other.  The winner is the page that provides the highest conversion rate, or whatever KPI is appropriate.</p>
<p>I want to emphasize that a good a/b split test requires distinct pages.  If that&#8217;s too vague, a simple rule that we follow at Widemile is:</p>
<blockquote><p><em>You should be able to tell the difference between the 2 pages from 15 feet away. </em></p></blockquote>
<p>Anything else isn&#8217;t a big enough change to be efficient in a split test and likely should go into a multivariate test.</p>
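<p>As an aside for the technically inclined, the traffic split itself can be sketched in a few lines of Python.  The function name, the visitor ids and the hashing approach are my own assumptions for illustration, not how any particular testing tool works; hashing simply keeps a returning visitor on the same page without storing anything.</p>

```python
import hashlib


def assign_page(visitor_id, share_a=0.5):
    """Map a visitor id to page "A" or "B", sending roughly share_a of traffic to A."""
    digest = hashlib.sha256(visitor_id.encode("utf-8")).hexdigest()
    # Turn the first 32 bits of the hash into a number between 0 (inclusive) and 1 (exclusive).
    bucket = int(digest[:8], 16) / 0x100000000
    return "A" if share_a > bucket else "B"


# Hypothetical visitor ids; the same id always lands on the same page.
pages = [assign_page("visitor-%d" % n) for n in range(10000)]
print(pages.count("A"), pages.count("B"))
```

<p>Changing <code>share_a</code> sends a different portion of traffic to the first page, for example 0.2 if you only want to risk a fifth of your visitors on a new design.</p>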
<p>With that definition in mind, here are three essential types of a/b split tests.  These are three of the tools in the testing toolbox that you should consider when putting together your optimization campaign.</p>
<ol>
<li><strong>Template test: </strong> Test the same general content (hero shot, copy, and button color) with a different layout and/or creative treatment.  The goal is to have a new template that better emphasizes the value proposition, improves readability and sets up well for a multivariate test.
<p style="text-align: center;"><img class="alignnone size-medium wp-image-325" title="Template Test" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/template-test.png" alt="template test" width="432" height="116" /></p>
<p><em>Use this when&#8230;</em> you want to make sure you have a solid design, before or after testing messaging.  The majority of the time this should be your first test.</li>
<li><strong>New concept test: </strong> Test a totally new approach.  Don&#8217;t let anything hold you back: test what you think will work best and see if it beats the original.  The approach here is to break out of the box and create a page that&#8217;s holistically designed around a new marketing concept.  Sometimes this involves introducing new functionality, animation, interactivity and other dramatic steps.  However, it can also be on a smaller scale, such as introducing new messaging that requires a complete redesign.
<p style="text-align: center;"><img class="alignnone size-medium wp-image-326" title="New Concept Test" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/new-concept.png" alt="new concept" width="432" height="116" /></p>
<p><em>Use this when&#8230;</em> your current page has already been tested many times and beating it has become difficult or you believe the way to really grab your visitors is through a big change.  This should only be done when the benefits of multivariate testing (knowing individual factor influences) are outweighed by the possible gains.</li>
<li><strong>Funnel test: </strong> Send users to different multi-page experiences, e.g. no registration vs. requiring registration (below) or a one-page form vs. a 3-page form.  A funnel test can also be done as a multivariate test but is simpler as an a/b split test.
<p style="text-align: center;"><img class="aligncenter size-full wp-image-329" title="Funnel Test" src="http://testingblog.widemile.com/wp-content/uploads/2009/01/funnel-test.png" alt="funnel test" width="351" height="189" /></p>
<p><em> Use this when&#8230;</em> you want to test content that extends past one page.  This should be done earlier in the testing process so that you don&#8217;t end up optimizing a page and then find out it&#8217;s a suboptimal experience.  It can be more technically demanding to do this sort of test though.</li>
</ol>
<p>Every optimization campaign is different and so knowing what kinds of tests are available is one of the most important places to start.  For my next post, I will talk about the different ways to use a multivariate test.  Please post in the comments if you have any questions or contact me via <a href="http://twitter.com/billysblog">Twitter @billysblog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2009/01/26/pro-a-b-split-test-method/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2009/01/26/pro-a-b-split-test-method/</feedburner:origLink></item>
		<item>
		<title>Are your visitors telling you if you’re getting hotter or colder?</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/54HUAhfzqG8/</link>
		<comments>http://testingblog.widemile.com/2008/11/13/visitors-telling-hotter-colder/#comments</comments>
		<pubDate>Thu, 13 Nov 2008 18:30:37 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[Testing Techniques]]></category>
		<category><![CDATA[test design]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=271</guid>
		<description><![CDATA[
In elementary school, I played the game Hot or Cold in class.  The rules of the game are simple:

One child is picked as the &#8220;searcher&#8221; and leaves the room
The class collectively chooses an object in the room, like a marker or eraser, for the searcher to find
Once the object is selected, the searcher returns to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://testingblog.widemile.com/wp-content/uploads/2008/11/classroom.jpg"><img class="aligncenter size-full wp-image-290" title="Hot or Cold Classroom" src="http://testingblog.widemile.com/wp-content/uploads/2008/11/classroom.jpg" alt="classroom" width="450" height="300" /></a></p>
<p>In elementary school, I played the game Hot or Cold in class.  The rules of the game are simple:</p>
<ul>
<li>One child is picked as the &#8220;searcher&#8221; and leaves the room</li>
<li>The class collectively chooses an object in the room, like a marker or eraser, for the searcher to find</li>
<li>Once the object is selected, the searcher returns to the room and has to find the mystery object as quickly as possible</li>
</ul>
<p>To help the searcher out, the other kids in the room scream &#8220;hot&#8221; if the searcher gets closer to the object, or &#8220;cold&#8221; if they get farther away.</p>
<p>To make the game more challenging, the searcher might be limited to only one clue, just hot or just cold.  Kids that were told both hot and cold found the objects fairly quickly, but if they were only allowed one type of feedback, it took them much longer.</p>
<p>Just as it is hard to find the object in the game without being told both when you are getting closer and when you are getting farther away, in testing, if you don&#8217;t design your tests with two distinct variations, you might go wandering for a long time trying to find what exactly your customer wants.</p>
<p>My metaphor fails in one way, though.  In the game, the searcher does find the object eventually, even with just one type of hint.  If you don&#8217;t design tests correctly, however, you may never find a page that resonates strongly with the audience.  You might test dozens of testimonials and find the most successful one, but if you never test it against no testimonial or a review, you may be missing out on even bigger gains.</p>
<p>Let your audience tell you hot and cold by designing your tests intelligently and they&#8217;ll help you find the optimal page faster than ever.</p>
<p>Photo credit: <a href="http://flickr.com/photos/airport/6550520/">Night Owl City</a> (<a href="http://creativecommons.org/licenses/by-nc-sa/2.0/deed.en">CC</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2008/11/13/visitors-telling-hotter-colder/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2008/11/13/visitors-telling-hotter-colder/</feedburner:origLink></item>
		<item>
		<title>Have me look over your landing page at ad:tech</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/xJ1mpuJHj1E/</link>
		<comments>http://testingblog.widemile.com/2008/10/28/landing-page-adtech-ny-08/#comments</comments>
		<pubDate>Tue, 28 Oct 2008 22:11:51 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Site News]]></category>
		<category><![CDATA[ad:tech]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[Widemile]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=273</guid>
		<description><![CDATA[
I will be at ad:tech New York on November 3-4 manning the Widemile Booth #922.  If you have any questions on optimization, landing pages or anything else, stop by and we can chat.
Also, I will be giving some mini-clinics on optimization best practices, how creative influences optimization and perhaps a case study or two based [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://testingblog.widemile.com/wp-content/uploads/2008/10/adtech.png"><img class="size-full wp-image-274 alignnone" title="ad:tech new york" src="http://testingblog.widemile.com/wp-content/uploads/2008/10/adtech.png" alt="adtech" width="342" height="56" /></a></p>
<p style="text-align: left;">I will be at <a href="http://www.ad-tech.com/ny/">ad:tech New York</a> on November 3-4 manning the Widemile Booth #922.  If you have any questions on optimization, landing pages or anything else, stop by and we can chat.</p>
<p>Also, I will be giving some mini-clinics on optimization best practices, how creative influences optimization and perhaps a case study or two based around those topics.  In the last half of those clinics, I&#8217;d like to take a look at some live pages and review them.</p>
<p>But to be more effective, I&#8217;d like those live pages to be from volunteers.  If you have a page you&#8217;d like me to look at, send me a URL and a good time for you and I&#8217;ll try to make sure you get included in a clinic around that time.</p>
<p style="text-align: center;"><img class="size-full wp-image-275 aligncenter" title="Billy Shih" src="http://testingblog.widemile.com/wp-content/uploads/2008/10/me2.jpg" alt="me2" width="200" height="267" /><br />
Find me at ad:tech booth #922</p>
<p>If you catch me when I&#8217;m free, I can also take a look at your page and give you some of my thoughts on how to improve and test it.  Send me your URLs beforehand for best results though.</p>
<p>Hope to see some of you there!</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2008/10/28/landing-page-adtech-ny-08/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2008/10/28/landing-page-adtech-ny-08/</feedburner:origLink></item>
		<item>
		<title>Gamble with your conversions to raise them</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/nYih5SdjWHY/</link>
		<comments>http://testingblog.widemile.com/2008/08/19/gamble-conversions-raise/#comments</comments>
		<pubDate>Tue, 19 Aug 2008 16:57:20 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[Testing Concerns]]></category>
		<category><![CDATA[Testing Techniques]]></category>
		<category><![CDATA[Why Test?]]></category>
		<category><![CDATA[always be testing]]></category>
		<category><![CDATA[campaign optimization]]></category>
		<category><![CDATA[conversion rates]]></category>
		<category><![CDATA[optimization results]]></category>
		<category><![CDATA[reasons to test]]></category>
		<category><![CDATA[success]]></category>
		<category><![CDATA[testing culture]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=243</guid>
		<description><![CDATA[
You and your competitors all have the same landing pages.  You have a hero shot of the product, a big call to action button and short, punchy copy.  Or maybe you&#8217;re already ahead of your competitors and have run a few tests on your page, picking up more conversions on the way.  In either situation, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://testingblog.widemile.com/wp-content/uploads/2008/08/pokerchips.jpg"><img class="aligncenter size-full wp-image-252" title="Gambling with poker chips, dice and cards" src="http://testingblog.widemile.com/wp-content/uploads/2008/08/pokerchips.jpg" alt="pokerchips" width="450" height="302" /></a></p>
<p>You and your competitors all have the same landing pages.  You have a hero shot of the product, a big call to action button and short, punchy copy.  Or maybe you&#8217;re already ahead of your competitors and have run a few tests on your page, picking up more conversions on the way.  In either situation, you&#8217;ll eventually hit a wall and struggle to get additional lift.  So how do you continue to improve?</p>
<p>Go for broke.  Try something you&#8217;ve never tried before.  It might end up being a total failure, but it also might give you the lift you want.</p>
<p>The gamble you make with optimization can end in 2 ways:</p>
<ul>
<li>You lose X amount of conversions over the week or two that the test is running</li>
<li>You gain X amount of conversions for the effective lifetime of the page</li>
</ul>
<p>The possible upside dwarfs the downside by a large margin and, either way, you learn something new and can optimize the next test more successfully based on what you learned.</p>
<p>Luckily, skill and experience minimize the risks of testing.  Still, beating a strong page is never easy or guaranteed.  But when you do find something new that works, or see that your current page is still a champ, you can rest assured that you&#8217;re doing all you can to drive conversions.</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2008/08/19/gamble-conversions-raise/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2008/08/19/gamble-conversions-raise/</feedburner:origLink></item>
		<item>
		<title>SES San Jose: Landing Page Optimization Roundtable</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/nBjNMdQmlzE/</link>
		<comments>http://testingblog.widemile.com/2008/08/14/landing-page-optimization-roundtable-ses-san-jose/#comments</comments>
		<pubDate>Thu, 14 Aug 2008 15:57:53 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Industry News]]></category>
		<category><![CDATA[event]]></category>
		<category><![CDATA[roundtable]]></category>
		<category><![CDATA[SES]]></category>
		<category><![CDATA[speaker]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=244</guid>
		<description><![CDATA[
If you&#8217;re going to SES San Jose and want to really learn about optimization, check out the Landing Page Utopia: Expert Roundtable.  My boss and Director of Optimization, Frans Keylard, will be on the panel. Everything I know was taught to me by Frans, so if you&#8217;re serious about talking to an optimization expert [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://testingblog.widemile.com/wp-content/uploads/2008/08/logotop.gif"><img class="size-full wp-image-247 aligncenter" title="SES San Jose" src="http://testingblog.widemile.com/wp-content/uploads/2008/08/logotop.gif" alt="logotop" width="313" height="80" /></a></p>
<p>If you&#8217;re going to SES San Jose and want to really learn about optimization, check out the <a title="Landing Page Optimization roundtable" href="http://www.searchenginestrategies.com/sanjose/agenda2.html#landing">Landing Page Utopia: Expert Roundtable</a>.  My boss and Director of Optimization, Frans Keylard, will be on the panel. Frans taught me everything I know, so if you&#8217;re serious about talking to an optimization expert, here&#8217;s your chance.  He has a wealth of testing experience and is a fun guy in general.</p>
<p>In addition, Jonathan Mendez will be on the same panel.  If you don&#8217;t know him, he used to run OTTO Digital, a former division of Offermatica.  He is on top of the optimization game as well.  It should be a great panel!</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2008/08/14/landing-page-optimization-roundtable-ses-san-jose/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2008/08/14/landing-page-optimization-roundtable-ses-san-jose/</feedburner:origLink></item>
		<item>
		<title>An Essential Primer on Full and Fractional Factorial Test Design</title>
		<link>http://feedproxy.google.com/~r/billyblogwm/~3/4x7eowX6qeI/</link>
		<comments>http://testingblog.widemile.com/2008/07/24/primer-full-and-fractional-factorial-test-design/#comments</comments>
		<pubDate>Thu, 24 Jul 2008 17:14:11 +0000</pubDate>
		<dc:creator>Billy</dc:creator>
				<category><![CDATA[Methodology]]></category>
		<category><![CDATA[Terminology]]></category>
		<category><![CDATA[Testing Techniques]]></category>
		<category><![CDATA[design of experiments]]></category>
		<category><![CDATA[fractional factorial]]></category>
		<category><![CDATA[full factorial]]></category>
		<category><![CDATA[interactions]]></category>
		<category><![CDATA[partial factorial]]></category>
		<category><![CDATA[test design]]></category>

		<guid isPermaLink="false">http://testingblog.widemile.com/?p=194</guid>
		<description><![CDATA[
What are full and fractional factorial test designs? How do they relate to optimization and what about interactions?
Once you get down and dirty with testing, these questions matter. Whether selecting an optimization platform or trying to thoroughly understand the tests you are building, grasping these concepts will put you in greater control and allow you [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><img class="alignnone size-full wp-image-202 aligncenter" title="laptop keys" src="http://testingblog.widemile.com/wp-content/uploads/2008/05/keys.png" alt="keys" width="271" height="192" /></p>
<p>What are full and fractional factorial test designs? How do they relate to optimization and what about interactions?</p>
<p>Once you get down and dirty with testing, these questions matter. Whether selecting an optimization platform or trying to thoroughly understand the tests you are building, grasping these concepts will put you in greater control and allow you to design and analyze your tests more effectively.</p>
<p>As simply as possible, I hope to educate you and other marketers about full and fractional factorial test designs and why <strong>fractional factorial is the best choice </strong>for multivariate testing of online campaigns.</p>
<p><em>Note: &#8220;Partial factorial” and “fractional factorial” are the same. Also, if you don&#8217;t have a thorough understanding of </em><a href="http://testingblog.widemile.com/optimization-glossary/experiment/"><em>experiments</em></a> <em>and </em><a href="http://testingblog.widemile.com/optimization-glossary/interaction/"><em>interactions</em></a><em>, please read those first.<br />
</em></p>
<p>The tests used in optimization are from the design of experiments field. (From <a href="http://en.wikipedia.org/wiki/Design_of_experiments">Wikipedia</a>: “<em>Design of experiments is the design of all information-gathering exercises where variation is present, whether under the full control of the experimenter or not.</em>”) The two types of tests I will focus on are fractional factorial and full factorial.</p>
<p>Here is an example I will use to explain these concepts.   Below is a test matrix outlining a test for a landing page with 5 factors with 2 levels each. Don&#8217;t let the vocabulary scare you away; this just means that there are 5 parts of the page being tested and 2 variations of each.</p>
<p style="text-align: center;"><a href="http://testingblog.widemile.com/wp-content/uploads/2008/05/matrix.png"><img class="size-full wp-image-201 aligncenter" title="Recipe Matrix for a full and fractional factorial test" src="http://testingblog.widemile.com/wp-content/uploads/2008/05/matrix.png" alt="matrix" width="456" height="121" /></a></p>
<p style="text-align: center;"><strong>Recipe Matrix:</strong> 5 factors = 5 parts (hero shot, headline, etc.) and 2 levels = 2 variations</p>
<p>These factors and their respective levels make up the possible combinations for a landing page. The combinations displayed are called <a href="http://testingblog.widemile.com/optimization-glossary/experiment/">experiments</a>.</p>
<p>Let&#8217;s calculate the total number of experiments possible (even if you know how to do this already, this is important to understanding the distinction between fractional and full factorial). There are 2 levels for each factor, so you can have 2&#215;2&#215;2&#215;2&#215;2 (2 to the 5th power) = 32 possible experiments. This means there are exactly 32 combinations of hero shots, headlines, sub headlines, button text and main copy from our matrix outlined above. Note that if we add another factor, it becomes 2 to the 6th power or 64 possible experiments. Likewise, if you add 2 more levels to any one of the existing 5 factors, the count also increases from 32 to 4&#215;2&#215;2&#215;2&#215;2 = 64 experiments.</p>
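<p>If you would rather check this arithmetic than take my word for it, here is a minimal sketch in Python. The factor names come from the matrix above, but the dictionary, the level labels 1 and 2 and the helper <code>count_experiments</code> are my own illustration, not part of any testing platform.</p>

```python
from itertools import product
from math import prod

# Factor names from the recipe matrix; the level labels 1 and 2 are placeholders.
factors = {
    "hero shot": [1, 2],
    "headline": [1, 2],
    "sub headline": [1, 2],
    "button text": [1, 2],
    "main copy": [1, 2],
}


def count_experiments(levels_per_factor):
    """Multiply together the number of levels of every factor."""
    return prod(levels_per_factor)


# Enumerate every experiment (one level picked per factor) of the full design.
experiments = list(product(*factors.values()))

print(len(experiments))                    # 5 factors, 2 levels each
print(count_experiments([2] * 6))          # one more 2-level factor
print(count_experiments([4, 2, 2, 2, 2]))  # 2 more levels on one factor
```

<p>The three printed counts correspond to the 32, 64 and 64 experiments worked out above.</p>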
<p>In testing, each experiment must get a minimum amount of measurable conversions, known as the sample size per experiment. This ensures that there is enough data for a solid statistical analysis. Therefore the more experiments you have, the more conversions you need. You can think of conversion data as time also, since the longer you leave your web page up, the more data you get.</p>
<p>Now we&#8217;re ready to go back to the difference between the two test designs. Full factorial testing requires that every possible experiment combination is shown, so our 5-factor test would need to display all 32 experiments. This means that if the sample size is 100 conversions per experiment, 32 &#215; 100 = 3,200 conversions will be required. Fractional factorial works differently: it displays a much smaller number of experiments, about 8 in this case, so it would need about 800 conversions.</p>
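<p>To make the comparison concrete, here is a rough sketch of how an 8-experiment fraction of the 32-experiment design could be built, and what each design costs in conversions. The two generators used (fourth factor = first times second, fifth factor = first times third) are a common textbook choice that I am assuming purely for illustration; a real platform may well pick its fraction differently.</p>

```python
from itertools import product

SAMPLE_SIZE = 100  # conversions per experiment, as in the example above


def conversions_needed(num_experiments, sample_size=SAMPLE_SIZE):
    """Total conversions when every experiment must reach the sample size."""
    return num_experiments * sample_size


# Levels are coded -1 and +1. Full factorial: every combination of 5 factors.
full = list(product((-1, 1), repeat=5))

# Fraction: the first three factors take every combination, and the last two
# are derived from them (assumed generators: D = A*B and E = A*C).
fraction = [(a, b, c, a * b, a * c) for a, b, c in product((-1, 1), repeat=3)]

print(len(full), conversions_needed(len(full)))
print(len(fraction), conversions_needed(len(fraction)))

# Each level of each factor still appears equally often in the fraction.
for column in zip(*fraction):
    print(column.count(-1), column.count(1))
```

<p>Every one of the 8 experiments is also one of the 32 full factorial experiments, and each variation of each factor is still shown in half of them, which is why main effects remain measurable with a quarter of the conversions.</p>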
<p>Since full factorial gathers additional data, it reveals all possible <a title="Extended definitions for interactions" href="http://testingblog.widemile.com/optimization-glossary/interaction/">interactions</a>, but as seen by the numbers above, there is a trade-off. <strong>More data equals more information but more data also equals a longer test duration.</strong> The minimum data requirements for full factorial are very high since you are showing every experiment.</p>
<p>Even if you are using full factorial to get the same amount of information as a fractional factorial test, it will take more time since you need more data to see statistically relevant differences between the many experiments.</p>
<p>You might be wondering how fractional factorial can be accurate if interactions are possible.</p>
<p>Random interactions of high relevance are very rare, especially when looking for interactions of more than 2 factors. You really need to design tests where you look for meaningful interactions that are based on true business requirements rather than hoping for a random and low influence interaction between a red button, a hero shot and a headline.</p>
<p>Whatever the interaction is, you need to be able to understand your audience and infer why there was an interaction in the first place. Only then are you ready to start designing for interactions.</p>
<p>Tests should not be filled with random levels; they should be carefully designed for success by focusing on testable hypotheses around the audience.  Could a 1 pixel drop shadow on a button interacting with the copyright statement ever be truly significant, and not a victim of random error? Is it worth sacrificing thousands of conversions to learn a lesson that won’t result in any relevant increase of real world conversions?</p>
<p>Some interactions might make sense to measure, while others are not worth measuring because of the amount of testing time they add.</p>
<p>This brings me to fractional factorial.  <strong>It is possible for <span>fractional factorial tests to detect interactions</span></strong>. How so? Using our example of a 5-factor test, fractional factorial can include everything from only main-effects all the way to 4-factor interaction effects. Full factorial’s only difference is that it is the full extension and includes the 5-factor interaction effects.</p>
<p>Fractional factorial is not a one-trick pony; it is a continuum ranging from testing for no interactions (only main effects) to testing interactions of one order less than full factorial. It is exactly what the name fractional implies; even one less is a &#8220;fraction&#8221; of full factorial. It gives you the power to trade off between testing only main effects and testing for interactions, based on intelligent test design.</p>
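<p>One way to picture this continuum is to count how many effects a 5-factor test would have to estimate at each step. The short Python sketch below is my own illustration (the function name <code>effects_up_to</code> is made up); it uses the standard count of ways to choose k factors out of 5.</p>

```python
from math import comb

NUM_FACTORS = 5


def effects_up_to(order, num_factors=NUM_FACTORS):
    """Count main effects plus all interactions up to the given order."""
    return sum(comb(num_factors, k) for k in range(1, order + 1))


# order 1 = main effects, order 2 = 2-factor interactions, and so on.
for order in range(1, NUM_FACTORS + 1):
    print(order, comb(NUM_FACTORS, order), effects_up_to(order))
```

<p>Main effects alone are 5 effects, while going all the way to the single 5-factor interaction means 31 effects, which together with the overall average matches the 32 experiments of full factorial.</p>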
<p>Once you decide to test for all possible interactions, you are committing to a full-factorial test and incur the associated traffic requirements. I’d love to see a test design that is designed for full interactions and still makes sense! Not having the ability to reduce the number of interactions is a huge detriment rather than a benefit of solutions limited to full-factorial testing.</p>
<p>Radically shorter test times allow for many more smart marketing ideas to be tested and <strong>adapted</strong> based on what you learn from each test run. You, the marketer, have the ability to analyze your results and tweak follow-on tests to capitalize on what you learn. This common-sense approach is what hypothesis-based testing is all about and is very powerful. Focus on testing smart ideas to increase your conversion rate – that’s what matters most.</p>
<p>The graph below illustrates how much information is gained and the amount of testing needed, based on the number of interactions tested.</p>
<p style="text-align: center;"><a href="http://testingblog.widemile.com/wp-content/uploads/2008/05/effects-graph.png"><img class="alignnone size-full wp-image-203 aligncenter" title="Fractional factorial versus full factorial data comparison" src="http://testingblog.widemile.com/wp-content/uploads/2008/05/effects-graph.png" alt="effects graph" width="476" height="427" /></a></p>
<p>In my experience, the red area shows how valuable the data is based on which effects are being tested, while the blue area shows the amount of data (or time) needed to gather the data to confirm those effects. The x-axis goes from left to right, from main effects to full factorial (5-factor effects).</p>
<p>At Widemile, we believe it is more effective to perform quick, successive tests detecting only main effects rather than randomly hoping for interactions. While interactions might give you small or even large gains, they will likely never trump the gains from additional testing, nor make up for the time and money lost looking for random interactions. The additional time required for full factorial tests is large and not many marketers want to wait more than a month for a test to complete.</p>
<p>Fractional factorial is preferred by a few camps, including <a href="http://www.widemile.com/">Widemile</a>, Omniture&#8217;s <a href="http://www.omniture.com/en/products/conversion/testandtarget">Test&amp;Target</a> (formerly Offermatica) and Interwoven&#8217;s <a href="http://www.optimost.com/">Optimost</a>. Full factorial is used in Google&#8217;s free <a href="http://www.google.com/websiteoptimizer/">Website Optimizer</a> and some tools offered by smaller providers.</p>
<p>Testing for all interactions sacrifices a lot of time. With the speed that audiences, marketing campaigns and seasons can change, it is important to get the most testing done in the least amount of time without sacrificing the quality of the data. Fractional factorial allows you to do just that, making it the wisest choice for multivariate testing.</p>
]]></content:encoded>
			<wfw:commentRss>http://testingblog.widemile.com/2008/07/24/primer-full-and-fractional-factorial-test-design/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://testingblog.widemile.com/2008/07/24/primer-full-and-fractional-factorial-test-design/</feedburner:origLink></item>
	</channel>
</rss>
