<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
<channel>
<title>Nick Kewney's Blog</title>
<link>http://kewney.com</link>
<description>Blog RSS feed for Nick Kewney's Technology Blog</description>
<lastBuildDate>Thu, 23 Feb 2012 16:14:15 GMT</lastBuildDate>
<language>en-us</language>

<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/kewney" /><feedburner:info uri="kewney" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
<title>ASP.NET MVC 4 - Bundling and Minification of JavaScript and CSS</title>
<link>http://feedproxy.google.com/~r/kewney/~3/EvvpBOivDDU/aspnet-mvc-4-bundling-and-minification-of-javascript-and-css</link>
<guid isPermaLink="false">http://kewney.com/posts/software-development/aspnet-mvc-4-bundling-and-minification-of-javascript-and-css</guid>
<pubDate>Thu, 23 Feb 2012 16:14:15 GMT</pubDate>
<description>&lt;p&gt;I usually use a combination of &lt;a href="http://www.crockford.com/javascript/jsmin.html"&gt;JSMin&lt;/a&gt; and the &lt;a href="http://www.telerik.com/products/aspnet-mvc.aspx"&gt;Telerik&lt;/a&gt; script manager to minify my JavaScript and CSS. However, the &lt;a href="http://www.asp.net/mvc/mvc4"&gt;ASP.NET MVC 4&lt;/a&gt; beta has built-in bundling and &lt;a href="http://en.wikipedia.org/wiki/Minification_(programming)"&gt;minification&lt;/a&gt;. Bundling reduces the number of requests and minification reduces the payload size, resulting in faster, better-performing applications.&lt;/p&gt;

&lt;p&gt;This feature lets you create custom bundles in which you specify the resource name and the order of your stylesheets and JavaScript files.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Introducing ASP.NET MVC 4 BundleTables&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Upon creating a new project in ASP.NET MVC 4, you'll notice that your default &lt;em&gt;_Layout.cshtml&lt;/em&gt; file will reference System.Web.Optimization and BundleTable.Bundles in its head. &lt;/p&gt;&lt;p&gt;According to Microsoft, these references are for the beta and will be accessed via a helper in the RTM release, expected sometime next quarter.&lt;/p&gt;

&lt;pre&gt;&amp;lt;link href="@System.Web.Optimization.BundleTable.Bundles.ResolveBundleUrl("~/Content/css")" rel="stylesheet" type="text/css" /&amp;gt;&lt;br&gt;&amp;lt;link href="@System.Web.Optimization.BundleTable.Bundles.ResolveBundleUrl("~/Content/themes/base/css")" rel="stylesheet" type="text/css" /&amp;gt;&lt;br&gt;&amp;lt;script src="@System.Web.Optimization.BundleTable.Bundles.ResolveBundleUrl("~/Scripts/js")"&amp;gt;&amp;lt;/script&amp;gt;&lt;/pre&gt;

&lt;p&gt;If you look at the actual source code in the browser, you will see these get rendered as:&lt;/p&gt;

&lt;pre&gt;&amp;lt;link href="/Content/css?v=x" rel="stylesheet" type="text/css" /&amp;gt;&lt;br&gt;&amp;lt;link href="/Content/themes/base/css?v=y" rel="stylesheet" type="text/css" /&amp;gt;&lt;br&gt;&amp;lt;script src="/Scripts/js?v=z"&amp;gt;&amp;lt;/script&amp;gt;&lt;/pre&gt;

&lt;p&gt;The CSS and JavaScript files in this project are minified and compressed and the querystring parameter, v, is a hash of the current files being served.  This will change when you alter your .js and .css files.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/EvvpBOivDDU" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/software-development/aspnet-mvc-4-bundling-and-minification-of-javascript-and-css</feedburner:origLink></item>

<item>
<title>Thoughts on QR codes</title>
<link>http://feedproxy.google.com/~r/kewney/~3/OGyCLd-9xU8/thoughts-on-qr-codes</link>
<guid isPermaLink="false">http://kewney.com/posts/technology/thoughts-on-qr-codes</guid>
<pubDate>Thu, 26 Jan 2012 15:02:15 GMT</pubDate>
<description>&lt;p&gt;For those not in the know, QR codes are the black and white squares you see plastered over marketing materials.&lt;/p&gt;&lt;p&gt;If you have a compatible smartphone, just locate an app for your mobile OS, open it, scan the code, wait a while and you'll be taken to a website. Hang on... was that really worth it?&lt;/p&gt;&lt;p&gt;QR codes get quite a lot of bad press, but negativity around the technology itself is definitely undeserved. The codes are simply modern-day barcodes, nothing more.&amp;nbsp; The bad press should be aimed at the growing number of awful implementations of QR codes.&lt;/p&gt;&lt;p&gt;I wouldn't stick a barcode on my marketing material to advertise a promotion or product, so why would I use a QR code?&amp;nbsp; From a consumer point of view, I think the barrier to entry is still too high because QR recognition isn't native on many popular devices, and by the time you open the app, the moment has gone.&lt;/p&gt;&lt;p&gt;A few years ago, QR codes were a marketeer's dream.&amp;nbsp; Phones had keyboards and people hated predictive text. Now that we have smartphones designed for web browsing, is scanning a QR code any easier than typing in a URL or searching?&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/OGyCLd-9xU8" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/technology/thoughts-on-qr-codes</feedburner:origLink></item>

<item>
<title>Scotland for sale on eBay</title>
<link>http://feedproxy.google.com/~r/kewney/~3/r-FL76m3utw/scotland-for-sale-on-ebay</link>
<guid isPermaLink="false">http://kewney.com/posts/off-topic/scotland-for-sale-on-ebay</guid>
<pubDate>Thu, 19 Jan 2012 15:35:12 GMT</pubDate>
<description>&lt;p&gt;For anybody who missed it...&lt;/p&gt;&lt;img border="0" hspace="0" alt="Scotland for sale on eBay" align="baseline" src="/files/media/image/scotland-for-sale.gif"&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/r-FL76m3utw" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/off-topic/scotland-for-sale-on-ebay</feedburner:origLink></item>

<item>
<title>Getting started with github</title>
<link>http://feedproxy.google.com/~r/kewney/~3/7xUdD8M8mMI/getting-started-with-github</link>
<guid isPermaLink="false">http://kewney.com/posts/software-development/getting-started-with-github</guid>
<pubDate>Fri, 02 Dec 2011 22:37:56 GMT</pubDate>
<description>&lt;p&gt;I've just started using git/&lt;a href="http://github.com"&gt;github&lt;/a&gt; and have summarised below (for my own reference) a few of the commonly used git commands. I haven't included&amp;nbsp;setting up your git client as this is well documented on &lt;a href="http://help.github.com/win-set-up-git/"&gt;github's help portal&lt;/a&gt;.&lt;/p&gt;






&lt;h3 style="font-weight: bold;"&gt;Setting up a git repository for an existing project&lt;/h3&gt;
&lt;p&gt;To initialise the git repository in the current directory&lt;/p&gt;
&lt;pre&gt;$ git init&lt;/pre&gt;
&lt;p&gt;Create a new file named README&lt;/p&gt;
&lt;pre&gt;$ touch README&lt;/pre&gt;
&lt;p&gt;Add README file to the repository&lt;/p&gt;
&lt;pre&gt;$ git add README&lt;/pre&gt;
&lt;p&gt;Add all files in the directory to the repository&lt;/p&gt;
&lt;pre&gt;$ git add .&lt;/pre&gt;
&lt;p&gt;Commit to your &lt;strong&gt;local&lt;/strong&gt; repository&lt;/p&gt;
&lt;pre&gt;$ git commit -m 'first commit'&lt;/pre&gt;


&lt;h3 style="font-weight: bold;"&gt;Committing changes locally&lt;/h3&gt;
&lt;p&gt;Show the unstaged changes in your working directory&lt;/p&gt;
&lt;pre&gt;$ git diff&lt;/pre&gt;

&lt;p&gt;Commit your changes to your local git repository&lt;/p&gt;
&lt;pre&gt;$ git commit -a -m "whatever"&lt;/pre&gt;&lt;p&gt;or, to discard uncommitted changes to tracked files in your working directory&lt;/p&gt;&lt;pre&gt;$ git checkout -f&lt;/pre&gt;


&lt;h3 style="font-weight: bold;"&gt;Committing to github (First time)&lt;/h3&gt;

&lt;p&gt;Add the remote origin to the local git repository. Do this once per repository.&lt;/p&gt;
&lt;pre&gt;$ git remote add origin git@github.com:username/repo.git&lt;/pre&gt;


&lt;h3 style="font-weight: bold;"&gt;Committing to github (Subsequent Commits)&lt;/h3&gt;
&lt;p&gt;Send your commit to GitHub&lt;/p&gt;

&lt;pre&gt;$ git push origin master&lt;/pre&gt;

&lt;h3 style="font-weight: bold;"&gt;New files to add?&lt;/h3&gt;
&lt;p&gt;Add any new files since your last update&lt;/p&gt;
&lt;pre&gt;$ git add .&lt;/pre&gt;

&lt;h3 style="font-weight: bold;"&gt;Getting changes from your remote repository&lt;/h3&gt;
&lt;p&gt;To pull changes from the server into your local repository (for example, if you are working from more than one machine)&lt;/p&gt;
&lt;pre&gt;$ git pull&lt;/pre&gt;



&lt;h3 style="font-weight: bold;"&gt;Deleting&lt;/h3&gt;
&lt;p&gt;To delete a local branch&lt;/p&gt;
&lt;pre&gt;$ git branch -d 2012.1-stable&lt;/pre&gt;
&lt;p&gt;To delete a remote branch&lt;/p&gt;
&lt;pre&gt;$ git push origin :branch&lt;/pre&gt;



&lt;h3 style="font-weight: bold;"&gt;Checking out a branch&lt;/h3&gt;
&lt;p&gt;To check out a branch&lt;/p&gt;
&lt;pre&gt;$ git checkout branch&lt;/pre&gt;



&lt;h3 style="font-weight: bold;"&gt;Cloning&lt;/h3&gt;
&lt;p&gt;Checking out a project when you don't have a local copy&lt;/p&gt;
&lt;pre&gt;$ git clone http://github.com/user/repo.git
$ git checkout -b 2012.1-stable origin/2012.1-stable &lt;/pre&gt;




&lt;p&gt;Useful references are the &lt;a href="http://git.or.cz/course/svn.html"&gt;Git - SVN Crash Course&lt;/a&gt; (or &lt;a href="http://gitref.org/basic/#commit"&gt;an alternative&lt;/a&gt;) and the &lt;a href="http://help.github.com/"&gt;github help resources&lt;/a&gt;. I'm using Git for Windows.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/7xUdD8M8mMI" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/software-development/getting-started-with-github</feedburner:origLink></item>

<item>
<title>Design pattern for ASP.NET MVC 3 application using repository pattern</title>
<link>http://feedproxy.google.com/~r/kewney/~3/b6BsKDGgzf0/aspnet-mvc-3-using-repository-pattern</link>
<guid isPermaLink="false">http://kewney.com/posts/software-development/aspnet-mvc-3-using-repository-pattern</guid>
<pubDate>Wed, 30 Nov 2011 14:33:43 GMT</pubDate>
<description>&lt;p&gt;Below is an example of how I typically implement the repository pattern in an ASP.NET MVC 3 project when using Entity Framework.&lt;/p&gt;
&lt;h2&gt;Model (Address.cs)&lt;/h2&gt;
&lt;pre&gt;  [Bind(Include = "Id,Address")]
  [MetadataType(typeof(Address_Validation))]
  public partial class Address {}
  public class Address_Validation&lt;br&gt;  {
    //validation here
  }&lt;/pre&gt;
&lt;h2&gt;Repository Interface (IAddressRepository.cs)&lt;/h2&gt;
&lt;pre&gt;public interface IAddressRepository
  {&lt;br&gt;   void Delete(Address u);&lt;br&gt;   void Save();&lt;br&gt;   void Add(Address b);&lt;br&gt;   Address GetAddressById(int intAddressId);&lt;br&gt;  }&lt;/pre&gt;
&lt;h2&gt;Repository class (AddressRepository.cs)&lt;/h2&gt;
&lt;pre&gt;public class AddressRepository : IAddressRepository &lt;br&gt;  {&lt;br&gt;  private myEntities db = new myEntities();&lt;br&gt;  public Address GetAddressById(int intAddressId)&lt;br&gt;  {&lt;br&gt;    return db.Addresses.FirstOrDefault(d =&amp;gt; d.Address_ID == intAddressId);&lt;br&gt;  }&lt;br&gt;  public void Delete(Address u)&lt;br&gt;  {&lt;br&gt;    db.Addresses.DeleteObject(u);&lt;br&gt;  }&lt;br&gt;  public void Save()&lt;br&gt;  {&lt;br&gt;    db.SaveChanges();&lt;br&gt;  }&lt;br&gt;  public void Add(Address b)&lt;br&gt;  {&lt;br&gt;    db.AddToAddresses(b);&lt;br&gt;    db.SaveChanges();&lt;br&gt;  }&lt;br&gt;  }&lt;/pre&gt;
&lt;h2&gt;Controller (AddressController.cs)&lt;/h2&gt;
&lt;pre&gt;  public class AddressController : Controller&lt;br&gt;  {
  private IAddressRepository _addressRepository { get; set; }
  protected override void Initialize(RequestContext requestContext) {&lt;br&gt;  &lt;br&gt;  if (_addressRepository == null)&lt;br&gt;  {&lt;br&gt;  _addressRepository = new AddressRepository();&lt;br&gt;  }&lt;br&gt;    base.Initialize(requestContext);&lt;br&gt;  }&lt;br&gt;
  [AcceptVerbs(HttpVerbs.Post)] // Example..
  public ActionResult _AddressInsert(AddressPartial objItem)
  {
    try
    {
      var objAddress = new Address
      {
        AddressTypeID = objItem.AddressTypeID,
        ...
      };
      _addressRepository.Add(objAddress);
      _addressRepository.Save();
    }
    catch
    {
      // handle or log the failure here
    }
    return View(); // placeholder; return whatever result suits your application
  }
  }&lt;/pre&gt;
&lt;p&gt;Generalising IAddressRepository into an IRepository&amp;lt;T&amp;gt; interface would avoid code repetition and allow you to share the interface between all your repositories.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/b6BsKDGgzf0" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/software-development/aspnet-mvc-3-using-repository-pattern</feedburner:origLink></item>

<item>
<title>Apple iPhone 4S Siri with a Scottish accent</title>
<link>http://feedproxy.google.com/~r/kewney/~3/gbUsBDPucn0/apple-iphone-4s-siri-with-a-scottish-accent-</link>
<guid isPermaLink="false">http://kewney.com/posts/mobile/apple-iphone-4s-siri-with-a-scottish-accent-</guid>
<pubDate>Mon, 31 Oct 2011 10:53:58 GMT</pubDate>
<description>&lt;p&gt;Bad news for tech loving Scots—the &lt;a href="http://www.jhlcomms.com/products/apple/iphone-4s"&gt;iPhone 4S&lt;/a&gt; doesn't always understand what you're saying. Siri, Apple's personal assistant, can recognise accents everywhere from Australia to London.  But live in Glasgow and you're out of luck!&lt;/p&gt;
&lt;iframe height="315" src="http://www.youtube.com/embed/278DtTXfJUU" frameBorder="0" width="560" allowfullscreen=""&gt;&lt;/iframe&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/gbUsBDPucn0" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/mobile/apple-iphone-4s-siri-with-a-scottish-accent-</feedburner:origLink></item>

<item>
<title>Pre-generating views to improve query performance in Entity Framework</title>
<link>http://feedproxy.google.com/~r/kewney/~3/66dTETtkd-g/pre-generating-views-to-improve-query-performance-in-aspnet-mvc-3-entity-framework</link>
<guid isPermaLink="false">http://kewney.com/posts/software-development/pre-generating-views-to-improve-query-performance-in-aspnet-mvc-3-entity-framework</guid>
<pubDate>Fri, 28 Oct 2011 16:11:20 GMT</pubDate>
<description>&lt;p&gt;If you have used &lt;a href="http://en.wikipedia.org/wiki/ADO.NET_Entity_Framework"&gt;.NET Entity Framework&lt;/a&gt;, there’s a chance you will have run into performance issues when first loading your application.&amp;nbsp; One way to reduce this overhead is by pre-generating your view metadata as per this &lt;a href="http://msdn.microsoft.com/en-us/library/bb896240.aspx"&gt;Microsoft guide&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;I had a few problems using this resource so have prepared this step-by-step guide on how to pre-generate your views in EF 4.0.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step 1: Change processing type for your .edmx model&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Locate your model and open it in the designer.&amp;nbsp; Select “Copy to Output Directory” for the “Metadata Artifact Processing” option. This will result in .ssdl, .csdl and .msl files appearing in your output directory.&amp;nbsp; During debug, this will be bin/debug.
&lt;br&gt;&lt;img border="0" hspace="0" alt="" align="baseline" src="/files/media/image/entity-1.png"&gt;&lt;/p&gt;


&lt;p&gt;&lt;strong&gt;Step 2: Set up a pre-build event&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Set up a pre-build event in your project by right clicking the project name, clicking properties and navigating to Build Events.&amp;nbsp; In the “Pre Build” box, include the following all on one line, replacing DataModel with the name of your model.&lt;/p&gt;

&lt;pre&gt;"%windir%\Microsoft.NET\Framework\v4.0.30319\EdmGen.exe" /mode:ViewGeneration "/inssdl:$(TargetDir)DataModel.ssdl" "/incsdl:$(TargetDir)DataModel.csdl" "/inmsl:$(TargetDir)DataModel.msl" "/outviews:$(ProjectDir)DataModel.Views.cs" /nologo /language:CSharp
&lt;/pre&gt;

&lt;p&gt;&lt;strong&gt;Step 3: Set up post-build event&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt;Next, build your project and you’ll see three files in your output directory (bin/debug).&amp;nbsp; We can add these to the project by creating a post-build event and using XCOPY to copy these to the root of your project.&lt;/p&gt;

&lt;pre&gt;XCOPY "$(TargetDir)DataModel.csdl" "$(ProjectDir)" /R /Y 
XCOPY "$(TargetDir)DataModel.ssdl" "$(ProjectDir)" /R /Y 
XCOPY "$(TargetDir)DataModel.msl" "$(ProjectDir)" /R /Y&lt;/pre&gt;

&lt;p&gt;&lt;strong&gt;Step 4: Include files in your project&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;
Build the project that contains your model and you should see the following files in the root directory of your project:&lt;/p&gt;
&lt;div class="list-holder"&gt;&lt;ul&gt;
&lt;li&gt;DataModel.Views.cs&lt;/li&gt;
&lt;li&gt;DataModel.csdl&lt;/li&gt;
&lt;li&gt;DataModel.ssdl&lt;/li&gt;
&lt;li&gt;DataModel.msl&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;p&gt;Include all these files in your project via Show All Files &amp;gt; Right Click &amp;gt; Include In Project&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step 5: Set as embedded resources&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Include DataModel.csdl, DataModel.ssdl and DataModel.msl as embedded resources in your project by selecting each file and choosing "Embedded Resource" as its build action in the properties window.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt;Step 6: Update your web.config/app.config connection string&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Update your connection string and you’re ready to go. Mine looks like this.&amp;nbsp; Be sure to use fully qualified names for your resources.&lt;/p&gt;

&lt;pre&gt;&amp;lt;add name="projectEntities" connectionString="metadata=res://*/Project.EF.DataModel.csdl|res://*/Project.EF.DataModel.ssdl|res://*/Project.EF.DataModel.msl;provider=System.Data.SqlClient;provider connection string=&amp;amp;quot;Data Source=xxx.xxx.xxx.xxx;Initial Catalog=xxx;Persist Security Info=True;User ID=xxx;Password=xxx;MultipleActiveResultSets=True&amp;amp;quot;" providerName="System.Data.EntityClient"/&amp;gt;&lt;/pre&gt;
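&lt;p&gt;If you are unsure what the fully qualified resource names are, one way to check is to list the resources embedded in the compiled assembly. The following is a minimal sketch rather than part of the original setup: ResourceLister is a hypothetical helper, and it assumes it is compiled into the same assembly as the model. Each name it prints is what should follow res://*/ in the metadata section of the connection string.&lt;/p&gt;

```csharp
using System;
using System.Reflection;

// Hypothetical diagnostic helper (not part of the original project).
public static class ResourceLister
{
    public static void Main()
    {
        // Assumption: this code lives in the assembly that embeds the
        // .csdl, .ssdl and .msl files. If the model is in another assembly,
        // obtain that assembly instead, e.g. typeof(SomeEntity).Assembly.
        Assembly modelAssembly = Assembly.GetExecutingAssembly();

        // Print every embedded resource name, one per line.
        foreach (string resourceName in modelAssembly.GetManifestResourceNames())
        {
            Console.WriteLine(resourceName);
        }
    }
}
```

&lt;p&gt;If the three metadata files do not appear in the output, their build action has probably not been set to "Embedded Resource".&lt;/p&gt;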

&lt;p&gt;If you receive, “Unable to load the specified metadata resource”, the path to your model in your configuration file is probably wrong.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/66dTETtkd-g" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/software-development/pre-generating-views-to-improve-query-performance-in-aspnet-mvc-3-entity-framework</feedburner:origLink></item>

<item>
<title>Using regular expressions to get initials from a string in C#</title>
<link>http://feedproxy.google.com/~r/kewney/~3/N7qCu9xnxHQ/using-regular-expressions-to-get-initials-from-a-string-in-c-sharp</link>
<guid isPermaLink="false">http://kewney.com/posts/software-development/using-regular-expressions-to-get-initials-from-a-string-in-c-sharp</guid>
<pubDate>Thu, 20 Oct 2011 11:13:07 GMT</pubDate>
<description>&lt;p&gt;Here is a really simple string extension to help you get initials from a string in C#&lt;/p&gt;

&lt;pre&gt;    public static string ToInitials(this string str)
    {
      return Regex.Replace(str, @"^(?'b'\w)\w*,\s*(?'a'\w)\w*$|^(?'a'\w)\w*\s*(?'b'\w)\w*$", "${a}${b}", RegexOptions.Singleline);
    }&lt;/pre&gt;
&lt;p&gt;Here's a breakdown on what the expression is doing:&lt;/p&gt;
&lt;div class="list-holder"&gt;&lt;ul&gt;
&lt;li&gt;^ matches beginning of the string&lt;/li&gt;
 
&lt;li&gt;(?'b'\w) captures the first character of a word and stores it in 'b'&lt;/li&gt;
 
&lt;li&gt;\w* matches the rest of the name&lt;/li&gt;
 
&lt;li&gt;, matches a comma&lt;/li&gt;
 
&lt;li&gt;\s* matches zero or more whitespace characters&lt;/li&gt;
 
&lt;li&gt;(?'a'\w)\w* matches the second part of the name, capturing its first letter into 'a'&lt;/li&gt;
 
&lt;li&gt;$ matches end of string&lt;/li&gt;
 
&lt;li&gt;| or the alternate pattern without the comma&lt;/li&gt;
 
&lt;li&gt;Notice the 'a' and 'b' are swapped&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;
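&lt;p&gt;To see both alternatives at work, here is a minimal, self-contained sketch. The StringExtensions and Program class names are assumptions added for illustration (an extension method has to live in a static class); the pattern is the one shown above.&lt;/p&gt;

```csharp
using System;
using System.Text.RegularExpressions;

// Extension methods must be declared in a static class; this name is an assumption.
public static class StringExtensions
{
    public static string ToInitials(this string str)
    {
        // First alternative: "Surname, Forename". Second: "Forename Surname".
        // Group 'a' always holds the forename initial, 'b' the surname initial.
        return Regex.Replace(
            str,
            @"^(?'b'\w)\w*,\s*(?'a'\w)\w*$|^(?'a'\w)\w*\s*(?'b'\w)\w*$",
            "${a}${b}",
            RegexOptions.Singleline);
    }
}

public static class Program
{
    public static void Main()
    {
        Console.WriteLine("John Smith".ToInitials());   // forename then surname
        Console.WriteLine("Smith, John".ToInitials());  // surname, comma, forename
    }
}
```

&lt;p&gt;Both calls should print JS. Input that matches neither alternative, such as a name with three words, is returned unchanged because Regex.Replace only rewrites matches.&lt;/p&gt;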

&lt;p&gt;There are more handy string extensions on '&lt;a href="http://www.kewney.com/posts/software-development/string-extension-collection-for-c-"&gt;String Extension Collection for C#&lt;/a&gt;'&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/N7qCu9xnxHQ" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/software-development/using-regular-expressions-to-get-initials-from-a-string-in-c-sharp</feedburner:origLink></item>

<item>
<title>Father's anti-terror row over Braehead phone camera snap</title>
<link>http://feedproxy.google.com/~r/kewney/~3/9ekfJHn-Kk8/fathers-anti-terror-row-over-braehead-phone-camera-snap</link>
<guid isPermaLink="false">http://kewney.com/posts/off-topic/fathers-anti-terror-row-over-braehead-phone-camera-snap</guid>
<pubDate>Mon, 10 Oct 2011 20:35:42 GMT</pubDate>
<description>&lt;p&gt;A Glasgow man &lt;a href="http://www.bbc.co.uk/news/uk-scotland-glasgow-west-15250846"&gt;received a warning yesterday&lt;/a&gt; after being caught blatantly taking a photograph of his own daughter innocently eating an ice cream at &lt;a href="http://www.braehead.co.uk/"&gt;Braehead Shopping Centre&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;After an alert from centre security, the Strathclyde Police anti-terrorism unit was quick to arrive at the scene to investigate the incident. An 'Operation Whippy' spokesman told me:&lt;/p&gt;&lt;blockquote&gt;"By taking &lt;a href="http://www.bbc.co.uk/news/uk-scotland-15238676"&gt;the action we did&lt;/a&gt;, my officers acted with restraint.&amp;nbsp; Under normal circumstances, we'd have carried out a controlled explosion of every ice cream cone in the vicinity of the shopping centre."&lt;/blockquote&gt;&lt;p&gt;It is reassuring to know that, in these days of heightened security, &lt;a href="http://www.braehead.co.uk/"&gt;Braehead&lt;/a&gt; security guards and the police can work together so swiftly to ensure that no more fathers can photograph their children in shopping centres.&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/9ekfJHn-Kk8" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/off-topic/fathers-anti-terror-row-over-braehead-phone-camera-snap</feedburner:origLink></item>

<item>
<title>Using the web as a knowledgebase for question answering</title>
<link>http://feedproxy.google.com/~r/kewney/~3/T-F0x_QlJlI/question-answering-systems-using-the-web-as-a-knowledgebase</link>
<guid isPermaLink="false">http://kewney.com/posts/technology/question-answering-systems-using-the-web-as-a-knowledgebase</guid>
<pubDate>Mon, 10 Oct 2011 06:48:59 GMT</pubDate>
<description>&lt;p&gt;&lt;span&gt;Interest in question answering
  systems has been revived since the birth of the World Wide Web in 1989
  (Cailliau, 2000) and the launch of &lt;a href="http://www.apple.com/iphone/features/siri.html"&gt;Apple's Siri&lt;/a&gt; in 2011. Using the web as a knowledgebase for such a system
  allows answers to be retrieved from a potentially limitless number of sources. However,
  the organisation and scale of the web make this an extremely difficult task,
  which is why comparing existing search engine algorithms' suitability for this
  purpose is a worthwhile area of research. Although question answering systems
  have been around for some time, utilising the web as the knowledgebase is a
  relatively new concept that raises many considerations.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;The aim of this post is to establish the most effective way for an
  Internet-based question answering system to use the World Wide Web as its
  knowledgebase. Three major factors are investigated: &lt;strong&gt;the search
  algorithm&lt;/strong&gt; used to obtain candidate documents for answer extraction, the &lt;strong&gt;number
  of candidate documents&lt;/strong&gt; used and the importance of &lt;strong&gt;classifying answer types&lt;/strong&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;Some key conclusions from my research are that 1) increasing the number of candidate
    documents undoubtedly improves the accuracy of potential answers and 2) Google's
    PageRank has a positive effect on obtaining candidate documents for a web-based
    question answering system (compared to other algorithms).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;In addition to this, further primary research was conducted on the
  Google Application Programming Interface (API) due to reliability issues which
  led to rogue results and anomalies appearing throughout the data acquisition
  phase. Although these issues have been resolved, the problems with the Google
  API are fully documented in Chapter 5, section 5.3.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoToc1"&gt;&lt;a name="_Toc135413397"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;1.1 &lt;/span&gt;&lt;/b&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Introduction&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;div&gt;
&lt;span lang="EN-US"&gt;This chapter provides an overview of the study and the reasons
for undertaking research in this field. It also briefly introduces the main
concepts explored, sets out the research aims and objectives, and outlines
the structure of the study.&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;span&gt;Using the World Wide Web as the
knowledgebase for a question answering system extends beyond information
retrieval. It results in the entire world participating in creating a
repository of information ready to be interfaced with question answering
technology. In a webcast by Sun Microsystems and eBay, it was said that the web
is "… switching from a market where the value is in access, to a market where
the value is in participation." (Schwartz, 2005). Instead of the traditional
approach of obtaining pre-keyed answers from one location, the public nature of
the web results in potential answers coming from any individual or business
that has published information online. Although the accuracy of information from
unverified sources (factoids) is a concern (Economist Article, 2004),
incorporating user participation greatly extends the scope of question
answering systems.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413398"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;1.2 &lt;/span&gt;&lt;/b&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Rationale
  for study&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;blockquote&gt;"The greatest problem of today
  is how to teach people to ignore the irrelevant, how to refuse to know things
  before they are suffocated. Too many facts are as bad as none at all." &lt;span&gt;(W.H. Auden, 2000)&lt;/span&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span&gt;Google's search engine is one of the
largest and most widely used resources on the web. Its index has grown from just
fifty-five million pages in 1999 (SEO Journal 2004) to over &lt;a href="http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html"&gt;one trillion in 2008&lt;/a&gt;, a more than
ten-thousand-fold increase in nine years. The evident popularity of the Internet as a medium for
research makes it an extremely attractive resource for seeking quick answers to
simple, fact-based questions such as 'What is the tallest mountain in
Scotland?' For many users "inexperienced with the art of web research"
(Brin &amp;amp; Page, 1998), getting answers to questions can sometimes be very
frustrating. Instead of receiving a direct answer to a question, the user is given a list of
websites from which the answer must be sourced manually. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;Despite having indexed a large
  proportion of the World Wide Web, the major search providers have not yet found
  solutions to obtaining answers to questions posed in natural language form. Furthermore,
  Google has been described as the "default command-line interface for
  the Web" (The Linux Journal), an accurate analogy as it is an
  extremely powerful resource, yet requires some knowledge to utilise it
  effectively, especially when seeking specific answers to questions.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In one experiment (Toms et al.
  2001), ninety of two hundred participants (45%) asked to answer a question
  using the Google search engine entered it in natural language form, while the remainder
  entered only keywords to locate their desired page. The former resulted in the query
  string containing more irrelevant stop words, such as 'it', 'the' and 'how'.
  In addition to this, when questions are posed in natural language form, words
  carry suffixes such as 'ing', 's' and 'ed'. Both factors
  undoubtedly affect the quality of results returned to the end user. This is
  just one example of such a study, yet it gives a valuable insight into the scope
  for research in this area, and how traditional question answering systems could
  be adapted and modified to utilise the World Wide Web as a knowledgebase. This
  document will investigate areas of previous research and comment on their
  outcomes.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413399"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;1.3 &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Research aims and
  objectives&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;The overall aim of the study is to investigate the factors that
influence the accuracy of answers for question answering systems using the
World Wide Web as a knowledgebase.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;In order to accomplish this aim, the following research objectives
  were formulated:&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 18pt;"&gt;&lt;span lang="EN-US"&gt;1.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To review and analyse question answering systems and components. This
  is answered through the review of literature.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 18pt;"&gt;&lt;span lang="EN-US"&gt;2.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To establish the most effective way to present a question to a document
retriever to obtain optimum results. This is answered through the review of
literature.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 18pt;"&gt;&lt;span lang="EN-US"&gt;3.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To determine if the search algorithm used affects the quality of
results retrieved. This is answered through primary research.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 18pt;"&gt;&lt;span lang="EN-US"&gt;4.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To determine whether increasing the number of candidate documents increases
the likelihood of obtaining potential answers through repetition. This is
answered through primary research.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 18pt;"&gt;&lt;span lang="EN-US"&gt;5.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To identify the extent of problems that exist with the Google API.
This is answered through primary research.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 18pt;"&gt;&lt;span lang="EN-US"&gt;6.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To identify and recommend the ways in which future web question
answering systems can be improved. This is addressed in the conclusion.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413400"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;1.4&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Research methodology&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;Various methods of research were reviewed and considered for the
undertaking of this post, and both secondary and primary research were
deemed necessary in order to address all research objectives. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;Secondary research entailed looking into how existing systems
  operate, their strengths and weaknesses, and why there is not currently a
  commercial-scale question answering system. It was important to understand each
  area of search engine technology, and to determine what strategies are already in
  place with regard to question answering.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;The primary research was conducted through the development of a
question keyword analysis system in order to gather quantifiable results about the
optimum method of utilising the web as a knowledgebase for a question answering
system.&lt;/span&gt;&lt;/p&gt;
&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413401"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;1.5&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Points to prove!&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span style="line-height: 150%; font-size: 1pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;Some areas of interest in this document are:&lt;/span&gt;&lt;/p&gt;
&lt;p style="line-height: 150%; text-indent: -18pt; margin-left: 27pt;"&gt;1.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
  &lt;/span&gt;Google's search algorithm has a negative impact on obtaining accurate
  candidate documents for a web based question answering system.&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 27pt;"&gt;2.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;span lang="EN-US"&gt;Increasing the number of candidate documents increases
the likelihood of obtaining potential answers through pattern repetition.&lt;/span&gt;&lt;/p&gt;&lt;p style="line-height: 150%; text-indent: -18pt; margin-left: 27pt;"&gt;3.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;The Google application programming interface is not currently suitable
for commercial use.&lt;/p&gt;&lt;p style="line-height: 150%; text-indent: -18pt; margin-left: 27pt;"&gt;4.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;Stemming is the optimum &lt;span lang="EN-US"&gt;way to prepare a question for
presentation to the document retrieval component of a question answering system.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;span style="line-height: 150%; font-size: 1pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;a name="_Toc135413409"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.1&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Introduction&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;This chapter contains the literature
  analysis and information on key technologies that are relevant to the project.
  It aims to critically analyse various question answering techniques, and will
  ultimately result in areas of further research becoming clearer. Moreover, it
  should help define the objectives of the project from the perspective of the
  literature. Literature has been sourced from journal archives, books and the
  Internet. Additional resources have also been provided by project supervisor
  Katrin Hartmann. In addition, experimental requirements will be established by
  analysing data obtained from other researchers, resulting in the development of
  a prototype keyword analysis system to aid in addressing the primary research topics.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413410"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.2&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Question answering
background&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span&gt;Obtaining the answer to a question
using a search engine can sometimes be very frustrating. Instead of getting a
direct response to a question, a list of websites is provided for the user to
view and locate relevant information manually. Question answering is a task
that "aims beyond document retrieval and towards natural language
understanding." (Aunimo &amp;amp; Kuuskoski, 2001). Systems which use the World
Wide Web as a knowledgebase aim to parse documents retrieved by a search engine
for scrapes of relevant information, and identify and return correct answers
directly to the user, effectively removing an entire step from the search
process.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span&gt;A pre-requisite to the success of a question
answering system is a solid understanding of the English language by the
researcher. It was said that "…ambiguity is an essential part of language; and
it is often an obstacle ignored" (Quiroga-Clare, 2002). It is clear that it
would be a mistake not to take core values of the English language such as this
into consideration when creating this type of system; however, many tools and
databases exist to assist developers with refining questions and establishing
the answer type expected.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Examples of resources available to
developers of question answering systems include algorithms to break down words
to their simplest form (Porter, M.F, 2002), as well as thesaurus-style
databases of words with their alternatives (Wordnet, 2005). This
work analyses existing techniques such as these in addition to conducting
primary research in the field to attempt to improve the quality of answers
obtained for fact based questions using the World Wide Web as a knowledgebase.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;A notable resource ranked highly
  among developers is the Text REtrieval Conference (TREC), which was set up as a
  forum to support research in all areas of Information Retrieval. &amp;nbsp;TREC provides
  the infrastructure and funding necessary for the development of "large-scale
  evaluation of text retrieval methodologies" (NIST, 2004) and in addition to its
  core purposes, acts as an open forum for the international community of
  information retrieval academia. &amp;nbsp;TREC is extremely relevant to question
  answering developers and encourages development in the field by setting
  challenges and tasks. Organisers of the conference provide test data for
  members to analyse and test their systems (Wikipedia, 2006). Question
  answering tests take the form of questions; however, other areas of
  Information Retrieval can be tested using topics, or other features (TIPSTER,
  2002). A scoring system is implemented to enable participants' systems to be
  evaluated fairly. After evaluation of the results, a workshop provides a place
  for participants to collect together thoughts and ideas and present current and
  future research work. The practical aspect of this work makes use of the TREC
  sample data by running many of the stemmed questions through the keyword
  analysis system developed for this project.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413411"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.3&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;The history of question
  answering&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;Variants of question answering
systems date back to the early days of computing (Witten, 1994), and despite
many advances in the field of online information retrieval, major search engine
companies such as Google and Microsoft (MSN) have yet to unveil a publicly
available, automated question answering system. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The earliest evidence of such
  research can be dated back as far as the 1950s, when Turing posed the
  question of whether or not machines were capable of rational thought (Turing, A;
  1957). He proposed a task he called 'The Turing Test' which originated from a
  previous task called 'The Imitation Game' in which a user must identify the
  difference between a man and a woman via an instant messaging style interface. The
  Turing Test, however, involved a human candidate communicating via an instant messaging
  style interface with another human and a machine (Copeland, 2004). The test was
  said to be passed if the human was unable to reliably identify which was the human
  candidate and which was the machine. This system can be related to question
  answering as the computer relied on identifying patterns in the user's dialogue
  and tried to match it to pre-programmed data.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;As interest in mainstream computing
heightened in the mid-60s, a system called Baseball was implemented on top of a
database (Green, BF; 1963).&lt;/span&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt; &lt;/span&gt;&lt;span&gt;The
system was able to answer questions about baseball scores recorded in the USA and used parsing to identify the teams and statistics. This system was able to produce
more accurate results than Turing's system as it relied on NLP rather than
pattern matching. It was also able to handle more complex queries that involved
locating multiple answers from different tables within the database. From the
1970s onwards, attempts were made to create systems capable of understanding
and learning language in the same way a human being does. One system, Margie (Schank
et al.), could read a document and answer simple questions on it. It worked by
parsing the text and organising it in the same way the human brain organises
data. This was the first attempt to emulate what a human does when reading a
document.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The first implementation of such a
utility in mainstream computing emerged in expert systems developed in the 1970s
such as Lunar and Baseball (Bert, F et al 1963). The 1990s saw the first
online encyclopedia, named Murax, and the year 2000 saw the launch of BrainBoost.com,
the first fully fledged question answering system to utilise the World Wide Web
as its knowledgebase. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413412"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.4&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Components of a typical web
  based question answering system&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span&gt;The majority of web based question
answering systems can be broken down into four components: question analysis,
document retrieval, passage retrieval and answer extraction (Hirschman and
Gaizauskas, 2001). These techniques are illustrated below and analysed in depth
in the following paragraphs:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image002.gif" width="576" height="346"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Figure 1: Typical Question Answering System Components
(Aunimo &amp;amp; Kuuskoski, 2002)&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The first component, the question
  analyser&lt;i&gt;, &lt;/i&gt;identifies the type of response expected from the question.
  For example, "Where is Kilmarnock?" would be expected to yield a location.
  Questions can be classed into categories then further refined to determine the
  answer type expected. Detailed information on question types can be found in
  the next section (Section 2.5).&lt;b&gt; &lt;/b&gt;Once the question has been parsed by
  either stemming or query expansion, defined later in this section, the document
  retrieval&lt;i&gt; &lt;/i&gt;component prepares a list of candidate documents.
  Subsequently, the passage retrieval component selects passages of text which
  may be relevant and indexes them according to relevancy. The final component,
  the answer extractor, searches and ranks the passages in more detail and
  produces a list of candidate results to the question. The scope of this project
  extends to altering the traditional four-step model and introducing another
  stage to further refine the document retrieval process. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;One factor which affects the quality
  of results retrieved by a question answering system is the keyword preparation technique.
  Stemming&lt;b&gt; &lt;/b&gt;(Porter, M.F., 1980)&lt;/span&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt; &lt;/span&gt;&lt;span&gt;is
    a process in which words are broken down into their simplest form with all
    suffixes stripped. The Porter stemming algorithm removes the common endings
    from words by applying a fixed sequence of suffix-stripping rules; it does not
    consult a dictionary. Conversely, query expansion, as the name would indicate, does the
    opposite. Its aim is to increase the accuracy of results by expanding the query
  using words or phrases with a similar meaning. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In a series of recent experiments,
  these two types of technique were compared (Bilotti and Katz 2004). The first of these
  tests indexed variations of words during the 'document retrieval' process using
  query expansion, and the other broke down words into their simplest forms at
  retrieval time using stemming. The results of this experiment were contrary to the
  researchers' initial assumptions, as it transpired that Porter's stemming algorithm
  positively affected the accuracy of results whilst generating a full spectrum of
  words at indexing time by query expansion decreased the accuracy of the
  documents returned. The results of Bilotti and Katz's experiment led to
  'stemming' being utilised as part of the keyword analyser component of the
  question answering system developed for this project. As the Chapter 5 results
  indicate, phrasing questions as one would expect them to be answered and
  truncating the stems dramatically improves the quality of candidate documents
  returned by the search engine.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Many systems, however, utilise both
  query expansion and stemming in two separate queries to maintain the maximum
  possible number of candidate documents. One such system, Brainboost
  (Brainboost.com 2006), uses this method with some success. Although results
  appear relatively accurate, the system is let down by extremely slow execution
  times. Speed is a major consideration when designing a question answering
  system, and research indicates that users will only wait a maximum of ten
  seconds for a page to load (Webmasterworld Article 2004). As a result,
  only one technique, stemming, will be used for the practical implementation of
  the prototype system.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; text-align: justify; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413413"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.5&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
  &lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Determining
  question types&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; margin-right: -18pt;"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 1pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;It is logical to assume that if the
type of answer expected is determined, it will be easier to search for that
answer within a set of documents. Unfortunately, however, knowing the type of
answer expected is not enough to assist in locating a suitable answer
(Moldovan et al. 2000), which is why attempts must be made to further refine
questions into particular groups (Figure 2.5a).&lt;/span&gt;&lt;/p&gt;&lt;p style="line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image003.gif" width="576" height="542"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 7pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Figure
2: Types of questions and corresponding answer types (Data from Lampert 2004)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;The above table provides a simplified graphical analysis
of how answer types can be determined from the input question (&lt;span style="color: black;"&gt;Lampert 2004)&lt;/span&gt;. For categories that are more
structured, such as people's names and place names, it is possible to further
refine the list of scrapes for analysis using systems such as Wordnet, a
lexical database of word associations (Section 2.7). For instance, if the
question 'Who was the first UK prime minister?' is posed to a question
answering system, the answer type according to Lampert, is 'Person /
Organisation'. Now that the type of answer expected has been established, a database
of names can be cross checked, and dictionary words and items which have been
established as non-matching can be removed from the candidate answer list,
further narrowing down the list of potential answers to the question (Lampert
2003).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;It has also been established that the focus of the question,
"a word or number that indicates what is being asked" (Moldovan et al., 2002),
is another important factor in reaching the correct answer. For example, the
question "Who was the first prime minister of Great Britain" has the focus
"first prime minister". If the question type and focus are both known, the
system can more easily reach a conclusion.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413414"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.5&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Analysis of existing
  systems&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 8pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;As discussed, many systems have been
developed to address the problem of question answering for the web, many of which
are publicly available via the Internet. The best known public question
answering system is Ask Jeeves (Roussinov, Chau &amp;amp; Filatova). Contrary to
public belief, this is not a fully automated question answering system and
instead attempts to improve the quality of search results obtained for
questions formed in "natural language style" (AskJeeves, Last Accessed 2006). &amp;nbsp;The
system no longer operates the same way it did when it was launched, and has
been recently rebranded as Ask.com in an attempt to follow the success of the
Google empire. Although it was recently said that "…AskJeeves is still a
place for questions and answers." (Information Week Article, 2005), it
appears to have lost many of the qualities for which it was originally admired.
AskJeeves, therefore, cannot be classed as a full question answering system as
it still returns a set of documents for users to peruse, instead of direct
answers, or even fragments of potential answers. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;An example of an accurate and
well-structured system for fact based questions is START (START 2006),
which claims to be the first system of its kind. It takes the same approach as
most question answering systems by utilising the World Wide Web as its
knowledgebase, and attempts to analyse passages retrieved from candidate
documents and parse them into meaningful sentences. It was developed at the
Massachusetts Institute of Technology, and claims to "supply users with
"just the right information," instead of "merely providing a list of
hits" (MIT, 2001).&lt;/span&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt; &lt;/span&gt;&lt;span&gt;A technique
called "natural language annotation" is integral to the success of
START. This technique utilises 'natural language' phrases as descriptions of
content that are associated with "information segments". An information segment
is retrieved when its phrase matches an input question. This method allows the system
to deal with a wide range of question types, in addition to being able to display
media such as graphics and sounds. This results in the system being
extremely accurate for fact based questions, but ineffective in answering a
broader range of questions.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Mulder, another legacy system, is
  believed to be the first question answering system made publicly available on
  the World Wide Web. The system works similarly to the model described
  previously. &amp;nbsp;The user enters a question on a web based form. The system then
  constructs a table of the structure and classification of the question. Next,
  the query formulator prepares a list of search engine queries which are issued
  to the likes of Google. The answer extractor then obtains fragments of these
  documents, which are then scored and ranked. The list of fragments obtained is
  then displayed to the user. An experiment performed in 2000 showed that
  each component of the Mulder system contributed equally to the success of the
  system, and that user effort is reduced by a factor of 6.6 compared to Google (Kwok
  and Etzioni, 2000). Additionally, the system performed three
  times better than AskJeeves according to the same study.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;A more recent question answering
  system which uses the web as its knowledgebase is AnswerBus, developed in 2001 (Zheng
  et al. 2003). It is very similar in terms of structure to START, but
  incorporates a multi-lingual element into the equation. Zheng states that the
  "system is only designed for short, fact based questions" (Zheng 2001), and appears
  to enjoy relatively high accuracy rates. It answered over two thirds of a
  sample of TREC-8 questions accurately (Zheng et al. 2003) using extremely low
  resources. The primary difference between AnswerBus and other question
  answering systems is in its document retrieval component. Instead of just
  interfacing with one search engine, it uses multiple sources to obtain its candidate
  documents. In addition, Zheng's algorithm takes into account the search engine's
  ranking of the page, which helps with several of the hypotheses in this project.
  It can be concluded that AnswerBus is one of the most accurate question
  answering systems that uses only the web as its knowledgebase.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413415"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.6&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Issues with current
  systems&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;Several problems have been
identified which are common among existing question answering systems (Kwok et
al, 2000). Firstly, it is imperative that the document retrieval module is
given the correct queries to ensure the most relevant results will be obtained
from the search engine. Establishing the most effective way to prepare the
query is crucial to the success of the document retrieval module, and it has
been established that stemming words yields a higher accuracy rate than query expansion
in terms of keyword preparation. Noise is a major obstacle for systems which
use the web as the knowledgebase. This refers to irrelevant pages, which may
affect the overall answer if a low number of candidate documents are used in
the system. For this reason, research has been undertaken in this project to
determine whether increasing the number of candidate documents reduces the
amount of noise compared to results retrieved with lower numbers of candidate
documents. In addition, false information, or factoids, are present in all aspects
of life, especially on the World Wide Web, and search engines are no exception.
A factoid is a statement that is untrue but commonly believed to be fact. This rogue
information appears all over the World Wide Web due to the simplicity of
creating and publishing a website. Essentially, answer reliability could be compromised
if unreliable sources are used in the document retrieval process. Finally,
the question answering system relies heavily on network resources, and dealing
with and processing this amount of information can be a challenging task. Research
shows that the user will not wait very long for the system to return the
results. Reliability is another major issue associated with such a system;
however, there is very little research on the reliability of the World Wide Web
at this time.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Additional issues with question
answering systems are present in a system developed by Kontos and Malagardi in
1999. The system had the aim of accepting queries in natural language and
generating answers from various texts. Although I found the article to be fairly
vague (it didn't go into great detail about the actual implementation), the
description of the system was well-formed and from it, the following issues
were clear:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The system was written to analyse
  Greek documents. Unlike English, the Greek language has relatively relaxed word
  order rules, therefore adapting the algorithms for other languages would
  probably be very difficult because the system doesn't concentrate as much
  on word order, something that is very important in the English language. In addition
  to this, only simple sentences could be parsed as it only looks at one verb. In
  experiments, this often made the system ignore the subject area completely,
  providing inaccurate, irrelevant results. Finally, the system also suffered
  from grammar limitations. Although a grammar parser is present, it appears
  incomplete and poorly explained in the documentation. In summary, this system
  could not deal effectively with more complex questions, and is not easily
  portable to other languages.&lt;/span&gt;&lt;b&gt;&lt;span style='color: black; line-height: 150%; font-family: "Times New Roman","serif"; font-size: 16pt;' lang="EN-US"&gt;  &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413416"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.7&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Challenges for developers&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;The following table is a summary of system wide issues with current
systems:&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;table style="border: currentColor; border-collapse: collapse;" class="MsoTableGrid" border="1" cellSpacing="0" cellPadding="0"&gt;
  &lt;tbody&gt;&lt;tr&gt;
  &lt;td style="padding: 0cm 5.4pt; border: 1pt solid windowtext; width: 176.4pt;" vAlign="top" width="235"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Query Formation Issues&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 266.4pt;" vAlign="top" width="355"&gt;
  &lt;p&gt;&lt;span lang="EN-US"&gt;It is important to form queries that will return the most relevant
  results from a search engine.&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 176.4pt;" vAlign="top" width="235"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Noise&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 266.4pt;" vAlign="top" width="355"&gt;
  &lt;p&gt;&lt;span lang="EN-US"&gt;Regardless of how good the query is, it may still return
  irrelevant pages that discuss something totally different.&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 176.4pt;" vAlign="top" width="235"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Factoids&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 266.4pt;" vAlign="top" width="355"&gt;
  &lt;p&gt;&lt;span lang="EN-US"&gt;This refers to the traditional sense of the word factoid, that is
  to say a fact that is not true, but is commonly believed to be.&lt;/span&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 176.4pt;" vAlign="top" width="235"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Resource limitations&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 266.4pt;" vAlign="top" width="355"&gt;
  &lt;p&gt;&lt;span lang="EN-US"&gt;Although search engines and computers are getting faster all the
  time, question answering can be a demanding task and the user will not wait
  long for the system to return an answer.&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Table 1: Issues with
current systems&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Table 1 illustrates that common problems are encountered by
  developers of question answering systems, many of which can be addressed by reusing
  existing data or code. An example of the former is WordNet, a &lt;span style="color: black;"&gt;"machine readable lexical database which can be interfaced
    by any application." (Miller 1997). WordNet is a&lt;/span&gt;n open-source database
  of associated words which is useful for addressing questions relating to
  specific subjects such as computing or animals (Chen, Ji, Jiang, 2004). The
  noun in a sentence is identified and compared to entries in the WordNet
  database in order to find an association. For the question "What colour is the
  sky?", WordNet can be used to find out if colour can be associated with sky,
  and if so the answer type is established as 'colour', and the answer may then be
  retrieved from a database of colours.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Other questions, such as "In which year did Tony Blair become prime
  minister?" can be answered if a database of names is available to the system. As
  discussed in the previous paragraph, if the answer type can be established
  as a name, high-scoring candidate answers can then be compared against names
  stored in this database. Many other resources can be used in the same way, such as titles of
  books, the cast of a film (START already uses IMDB for this) and location and
  place names. The purpose of this research is to attempt to steer away from
  pre-keyed information and provide research which will assist developers in utilising,
  as far as possible, the web alone as the knowledgebase.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In addition to techniques described
  in the previous paragraphs, there are many other resources which are beyond the
  scope of this project, but are also important to the success of many question
  answering systems.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413417"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.8&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Emerging methods&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;As discussed, none of the major
search engine companies has released a full-scale question answering system.
Recently, however, Yahoo! adopted a slightly different approach to the problem
which, although not automated, is proving popular with web surfers. Yahoo! has
developed a system which is community based and enables users to "Ask a
question on any topic and get answers from real people." (Yahoo, 2006)&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In addition to this, Microsoft is
  pushing forward the boundaries of research into question answering systems with
  its 'Ask Microsoft Research' engine (Economist 2004). This system is currently
  in development and is based on the four component answering model described in
  the next chapter.&amp;nbsp; Its purpose is to answer fact based questions from a
  knowledgebase with a single or multiple word answer. AskMSR extracts
  the verb from a query phrase and uses the WordNet library (as discussed) to
  find alternatives to broaden the query. It also focuses on the order of the
  words in the query and tests every possible combination to achieve the most
  accurate result. The system has proven to be fairly reliable with a 61% average
  success rate&lt;/span&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt;18&lt;/span&gt;&lt;span&gt;. However, the
  largest number of errors comes from not knowing what units are
likely to be in an answer given a question (e.g. How fast can a Renault Clio go
in xxx mph?). It is reported that 34% of their 40% error rate was a result of
answers being returned correctly, but in the wrong format. This type of problem
exists because a search engine cannot be queried for an empty value; a
value must be submitted for document retrieval. For example, a variation of the
query 'How many islands does Fiji have' would be 'Fiji has x islands', but this
query cannot be sent to the search engines. 12% of queries submitted to the
AskMSR system fell into this category. This is an area which
warrants a great deal of further research but is outwith the scope of this
particular study. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In addition to this, researchers on
  MIT's START system (as outlined in the previous chapter) are currently working
  on a revised version of their question answering model which will broaden the
  scope of its knowledgebase.&lt;/span&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt;19 &lt;/span&gt;&lt;span&gt;As
    discussed in the previous sections, START's knowledgebase is limited to certain
    domains, such as movies, weather, music etc. I believe these advancements would
    greatly improve the system, but it is difficult to see how the researchers
    could deal with all types of domains. Answerbus and Brainboost are two other
  systems which are constantly evolving and utilizing current research findings.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413418"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.9&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Factors affecting accuracy
  within question answering systems&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;Although Google, or any other major
search company, has not yet implemented a commercial-scale question answering
system, Google plays a part in acting as a document retrieval component
for many of the smaller question answering systems developed by students and
researchers. 'The Anatomy of a Large-Scale Hypertextual Web Search Engine'
(Brin &amp;amp; Page, 1998) aims to give the reader an insight into PageRank – an algorithm
developed by Google in order to improve the overall quality of search results.
Instead of ranking sites using traditional methods such as frequency of
keywords and depth of content, the PageRank algorithm uses a metric of inbound
and outbound links, as well as anchor text. For example, if site A linked to
site B with the anchor 'Free Downloads', site B's ranking for the keyword 'Free
Downloads' would increase, even though site B contains no reference to the
query phrase 'Free Downloads'. It is within the scope of this project to determine
what effect, if any, different search algorithms have on the accuracy of
candidate documents retrieved. Very little research exists at this time to
prove or disprove the theory that PageRank adversely affects the abundance of
the query terms in candidate documents. For this reason, this has been selected
as a primary research area and will be discussed in detail in the methods
section&lt;/span&gt;&lt;span style="font-size: 6.5pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413419"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;2.10&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Conclusions&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span style="line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Current question answering
systems which use the web as a knowledgebase, such as Brainboost and Answerbus,
all appear to utilise the Google API or equivalent to fetch candidate documents
for analysis. To date, nobody has challenged whether Google's PageRank
algorithm has a negative impact on obtaining candidate documents for answer
extraction. Current systems seem focused on creating algorithms for specific
types of questions, whereas many believe the focus should be on relying further on
the web and perhaps writing a new ranking algorithm dedicated to question
answering. Consideration of the above has led to the chosen research topics.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413421"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.1&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Method Analysis&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;This chapter presents the research methodology that was used for
this study. It begins with a summary of the research problem and breakdown of
methods used by other researchers obtained from the literature review. It
follows with a summary of the chosen research topics and the primary methods
used to answer them. Furthermore, the chapter discusses time and resource
limitations and follows with a section outlining the prototype system developed
for this project.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;There are various frameworks that can be used in developing a
research methodology; however, I have chosen a variation of the approach suggested
by Wilson (2003), reduced to the following five steps. The original approach had
seven steps, two of which were not relevant to this work:&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 72pt;"&gt;&lt;span lang="EN-US"&gt;1.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
  &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;Identification of problems and opportunities; Chapter
  1&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 72pt;"&gt;&lt;span lang="EN-US"&gt;2.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;Collection of secondary data; Chapter 2&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 72pt;"&gt;&lt;span lang="EN-US"&gt;3.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;Creation of primary data through experiment;
Chapters 3 &amp;amp; 4&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 72pt;"&gt;&lt;span lang="EN-US"&gt;4.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;Collection &amp;amp; analysis of primary data; Chapter
5&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 72pt;"&gt;&lt;span lang="EN-US"&gt;5.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;Analysis of methods and findings; Chapter 6&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413422"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.2&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Research Problem&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;As discussed in the literature review, the variable
quality of results returned by existing question answering systems leaves scope
for a considerable amount of research. The scope of this project extends to isolating
factors which affect the quality of question answering systems that use the
World Wide Web as a knowledgebase. The two main factors addressed by this
document are the search engine algorithm used and the number of candidate documents
used.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;A recent article defined the role of
  a search engine algorithm as "… mathematical formula that takes a problem as
  input and returns a solution." (J. Cassidy, 2003)&lt;/span&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt;. &lt;/span&gt;&lt;span&gt;Each search engine uses a different algorithm to calculate
    its results, the best known of these being Google's PageRank technology, which,
  as discussed, uses more factors than the content of the page to rank results.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The software described in Chapter 4
  aims to compare the quality of results received from two different search
  engines, Microsoft Network (MSN) and Google, in order to find out if the
  aforementioned PageRank has any effect on the results when keywords are posed
  to its engine. In addition to this, the software also aims to establish if
  there is a direct link between the number of candidate documents used for
  analysis and the quality of keywords returned which could be potential answers.
  The rationale for this and the other research questions is detailed in the results analysis
  chapter (Chapter 5).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The scope of this project only
  extends to testing short, fact based questions such as "What is the currency in
  Cuba?"; however, potential future improvements may include expanding the
  system to cope with a wider range of more complex and detailed questions. &lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413423"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.3&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Research aims and objectives&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;As shown in Wilson, 2003, identification of problems and
opportunities is the first stage of any research process. It is evident that "without
a fixed, overt objective, coordination and direction of purpose are very
difficult, if not impossible" (Webb, 2002).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Aim&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;: The overall
  aim of the methodology chapter is to determine which factors affect the quality
  of results obtained by a question answering system; specifically the role of
  keyword frequency within candidate documents of a system which uses the World
  Wide Web (WWW) as the knowledgebase.&lt;/span&gt;&lt;b&gt;&lt;i&gt;&lt;span style='line-height: 150%; font-family: "Times New Roman","serif"; font-size: 12pt;' lang="EN-US"&gt;  &lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Objectives&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;: &lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;The specific objectives of the primary research for this study are
  as follows:&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span lang="EN-US"&gt;1&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
  &lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To determine if the search algorithm used
    affects the quality of results retrieved.&lt;br&gt;
Information needs: To determine whether keyword frequency on a set of pages
obtained by one search engine differs from that of another and, furthermore,
to identify the most effective type of search algorithm for obtaining candidate
documents for web-based question answering systems.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span lang="EN-US"&gt;2&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To determine whether increasing the number of
candidate documents improves the quality of results obtained. Information
needs: To determine whether analysing keywords from a large number of candidate
documents reduces the accuracy of the keywords, or potential answers, returned,
ultimately determining whether less relevant pages have enough influence to skew
accurate results.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span lang="EN-US"&gt;3&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;To identify the extent of problems with the
Google API. Information needs: To determine whether the Google API is ready for
use in a production environment through the use of data indicating the
consistency of results returned.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Secondary Research Used&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;: &lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;From the review of literature, it was
established that stemming was more effective than query expansion for obtaining
results for a question answering system. For this reason all questions were
posed to the keyword analysis system in stemmed form.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413424"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.4&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Research Methods&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;As discussed, secondary data has already been gathered from
literature in order to analyse existing question answering methods with the aim
of identifying strengths and weaknesses. This has resulted in clarification of
the subject area from the perspective of the researcher. The literature review
was essential to fully understand the complex area of information retrieval and
question answering and has identified issues which need to be addressed and
refined. Chapter 5 outlines the rationale behind the methods described in this
chapter, details the experiments and provides concise results for all research objectives.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;A basic question answering system
  has been developed in order to manipulate and record search engine output. The
  software makes use of two major search engine algorithms, which can be switched
  simply by editing the document retrieval module. It utilises the developed
  logic to gather potential keywords which will be used to answer two of the
  research questions. A full breakdown and comprehensive documentation of this
  question answering system is available (Chapter 4 – The Prototype System).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;For each objective outlined above, a
  common method was used to obtain results.&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span&gt;1&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span&gt;Coding the program
to return the expected results; documented in Chapter 4&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span&gt;2&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span&gt;Execution of the program
with test data; detailed in Chapter 5&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span&gt;3&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span&gt;Data Summarization;
Available in the Appendix&lt;/span&gt;&lt;/p&gt;&lt;p style="text-align: justify; line-height: 150%; text-indent: -18pt; margin-left: 36pt;"&gt;&lt;span&gt;4&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;span&gt;Analysis of the results;
published in Chapter 5&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;It should be noted that methods are
refined in further detail within the analysis chapter (Chapter 5). However, this
four-step process is an overview of the tasks carried out for both primary research
questions; each task is explained in more detail in the following sections:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.4.1
Designing and editing analysis software&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In order to answer several research
  questions, a prototype question answering system was developed over a three
  month period. In Chapter 4, the logic behind this answering system is
  explained, as well as its key purposes. In Chapter 5, research questions are
  addressed and answered using this system. The program had to be modified slightly
  for each objective. For example, question one attempts to
  establish the link between search engine algorithm and the accuracy of
  candidate documents, so the document retrieval component has to be
  changed. Question two attempts to establish the link between the number of
  documents used and accuracy, and requires the number of documents analysed
  to be changed in order to obtain useful results.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.4.2
  Running experiments with test data&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;"To sample something is to examine a
small portion of it, usually for the purpose of judging the nature or quality
of the whole." (Proctor, 2003 p.100)&amp;nbsp; A sample of test questions was sourced
from the TREC database (TREC Website, 2006). Since the sample size for each
experiment is fifteen questions, the questions were
chosen carefully, and Section 6.1.4 explains the
logic behind the selected test data. To determine the appropriate sample size
it was important to consider factors such as time-constraints and limited resources
&lt;/span&gt;&lt;span lang="EN-US"&gt;(Wilson, 2003)&lt;span style="color: black;"&gt;. Each question
posed to the analysis system took up to twenty minutes to return a response.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.4.3
  Result summarisation&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Upon completing each test, a result document
was created which contained the raw results of the experiment. Once all results
were available, a 'results summary table' was created which assisted the
researcher in formatting the data in a way which eased the analysis process
(Appendix E). &amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.4.4
  Analysis of Results&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Coding is &lt;i&gt;"the process of grouping and assigning
numeric codes to the various responses to a particular question" &lt;/i&gt;(McDaniel
&amp;amp; Gates, 2001). &lt;span style="color: black;"&gt;No manual coding was required as
the software was designed to output results in an order which eased the task of
analysis for the researcher. Keyword frequency was ranked and compared for
both research questions, and is fully documented (Appendix E). Results were
analysed for keyword frequency within a set number of documents, and were
ranked accordingly with the intention of identifying patterns. This has been
achieved and is fully documented (Chapter 5) &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413425"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.5&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Limitations&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Despite best efforts to ensure results obtained were
precise and appropriate, there were several limitations which should be taken
into consideration. Firstly, the researcher lacked experience in
conducting experiments of this nature. Although a great deal was learned during
the process, future studies would be conducted with greater knowledge and
focus.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Secondly, many of the processes required to be run by
  the application took in excess of twenty minutes. This resulted in the sample
  question set being reduced from twenty to fifteen to ensure the researcher had
  ample time to analyse the results effectively.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;No hardware or software limitations were present as a
  server has been purchased which is capable of running the required software
  comfortably. In addition, licenses are already owned by the researcher for
  Microsoft Windows 2003 Server and Microsoft SQL Server Express Edition 2005.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413426"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;3.6&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Conclusions&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;In summary, this chapter has examined and determined the research
methods used for this study. Secondary research of the literature established
that questions should be stemmed before being processed by
the system. The subsequent chapters, the prototype system and analysis of
results, will continue by discussing rationale behind each of the research
questions and the findings of the primary research in detail.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413428"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;4.1&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Development of prototype system&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;span&gt;In order to address several research  aims and objectives, a prototype system was developed over a three month  period which aims to summarise keyword abundance within a set of documents.  In this section, the logic behind this answering system is explained, as well  as its key purposes. Research questions are addressed using this system in  the next chapter (Section 5 – Results Analysis).&lt;/span&gt;&lt;/div&gt;&lt;p&gt;&lt;a name="_Toc135413429"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;4.2&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Hardware and software
requirements&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;In order to conduct these experiments, several items of
  hardware and software were required. An Intel Pentium 4 running at 3GHz was
  selected as the specification for the web server. The system had 1GB DDR2 RAM
  and 2x80GB hard disks set up in a RAID 1 mirror configuration to guard against the unlikely
  event of hard drive failure. The generous amount of RAM and relatively high
  processor speed allowed for some headroom, as many of the scripts and routines
  were fairly processor intensive. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Microsoft Windows Server 2003 and the latest version of
  Internet Information Services (IIS 6) were selected as the preferred operating
  system and server software. In addition to this, Microsoft SQL Server was
  installed following initial trials which proved that Microsoft Access was
  inadequate for indexing more than 1,000 records over a short space of time. The
  system in turn was connected via Gigabit Ethernet to a Cisco 1900 series switch
  and issued with a static IP address and domain name.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413430"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;4.3&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Prototype in detail&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The application itself follows these
steps to produce a table containing keyword frequency within a set number of web
documents:&lt;/span&gt;&lt;/p&gt;&lt;ol style="margin-top: 0cm;" type="1" start="1"&gt;
  &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;gather&lt;/b&gt; a varying number of
     Internet addresses (URLs) based on stemmed question keywords issued by the
     user&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;obtain&lt;/b&gt; the full raw source
     of the URL through the GetFullHTML() method&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;strip&lt;/b&gt; all HTML tags from
     pages leaving behind raw text through StripHTML()&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;remove&lt;/b&gt; 'stop words' from
     the text, which involved limited manual interaction&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;store&lt;/b&gt; each word from the
     page in an array slot through SplitStoreURL()&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;count&lt;/b&gt; the frequency of
     repetition of each word within the set number of documents via DisplaySearchResults()&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;To &lt;b&gt;purge&lt;/b&gt; the database for
     the next session through the CleanUpTheMess() method&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;&lt;span&gt;The application was developed in
  Microsoft Visual Studio .NET 2003 and is written in the Visual Basic .NET
  programming language. It takes advantage of the Microsoft .NET Framework
  (version 1.1) and utilises 'Microsoft SQL Server Express Edition 2005' as its
  database engine. Familiarity with Visual Basic was the main reason for its
  selection as a development language, although in hindsight C#, a language
  developed specifically for the .NET platform, may have been a suitable
  alternative. The purpose of the application is to send a stemmed keyphrase to
  either &lt;i&gt;Google's&lt;/i&gt; or &lt;i&gt;MSN's&lt;/i&gt; search system to obtain a list of
  candidate URLs which can be parsed to obtain keyword frequency. Stemming is
  done manually and not by the software for the purpose of the experiment.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Future improvements to the software
  may include automatically stemming the question and implementing the 'expected
  answer type' algorithm which was written but not implemented in the prototype
  version. The purpose of this algorithm is to assist the engine in establishing
  what type of result is required (e.g. Person / Organisation). The algorithm was only
  utilised to determine the type of question for the keyword analysis phase of
  the project in order to rank results according to type.&lt;/span&gt;&lt;/p&gt;
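&lt;p&gt;&lt;span lang="EN-US"&gt;The strip, stop-word, split and count steps listed above can be sketched as follows. This is a minimal illustration in Python, not the prototype's Visual Basic .NET code. It makes several assumptions: the names strip_html, tokenize and count_terms are hypothetical, the stop-word list is a placeholder, tags are removed with the standard library's HTML parser rather than with the prototype's own routine, and counts are held in memory instead of being written to the SQL database.&lt;/span&gt;&lt;/p&gt;
&lt;pre&gt;
```python
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical stop-word list; the prototype's actual list is not given.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "was", "in", "to"}


class TextExtractor(HTMLParser):
    """Collects the text between tags, discarding the tags themselves."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_html(source):
    """Return the plain text of an HTML page (stands in for StripHTML())."""
    extractor = TextExtractor()
    extractor.feed(source)
    extractor.close()
    return " ".join(extractor.parts)


def tokenize(text):
    """Lower-case the text, drop punctuation and remove stop words."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return [w for w in words if w not in STOP_WORDS]


def count_terms(pages):
    """Count single words and adjacent word pairs across all pages."""
    counts = Counter()
    for page in pages:
        words = tokenize(strip_html(page))
        counts.update(words)
        # Pair each word with the one directly after it, e.g. "john major".
        counts.update(" ".join(pair) for pair in zip(words, words[1:]))
    return counts
```
&lt;/pre&gt;
&lt;p&gt;&lt;span lang="EN-US"&gt;Given a list of page sources, count_terms(pages).most_common(30) would return the thirty most frequent words and word pairs. Note that in this sketch stop words are removed before words are paired, so a pair may join two words that were separated by a stop word in the original text.&lt;/span&gt;&lt;/p&gt;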
&lt;p style="text-align: center; line-height: 150%;" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image005.gif" width="487" height="487"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Figure 4: Keyword analysis application flow&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413431"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;4.3.1 Components in detail&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The following is a breakdown of the
components of the keyword analysis system. Each method is documented
individually.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413432"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;4.3.1.1 Session creation and initialisation&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Upon submitting the search term, a unique user session
ID is created to ensure the system, if required, can perform multiple searches
simultaneously. The system clears out the keyword database with an SQL query
issued by &lt;i&gt;CleanUpTheMess()&lt;/i&gt;. At present, only one user session can
be maintained at a time, as the database is fully purged each time a new query
is submitted to the engine. Future versions of the program will support
multiple simultaneous users. The following methods are then run in order to
achieve the desired result:&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image006.gif" width="576" height="312"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Figure 5: Keyword analysis application components&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413433"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;4.3.1.2 Gathering candidate documents&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Searches are conducted for each keyphrase passed to the
  script by two separate search algorithms resulting in an output of candidate
  documents from each. This is achieved through altering the document retrieval
  method to utilise the appropriate API. Instead of replacing the code, two
  different versions of the application were created. Both API modules return
  information using the SOAP protocol and standard XML output, which makes it easy
  to consume in this application, as XML is a widely supported format.
  The variables sent to the APIs are the &lt;i&gt;search string&lt;/i&gt; (which has been stemmed for
  optimum results) and the &lt;i&gt;number of documents&lt;/i&gt; to return, which initially
  is set to the default value (10). This can be altered for subsequent tests
  simply by changing a variable.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413434"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;4.3.1.3 Obtaining full source of page&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;In order to manipulate the contents of these pages, it was
deemed necessary to obtain the full source of each document. After
comparing the various types of tools available to achieve this, ASPTear was
selected as the most appropriate solution. ASPTear is a freeware ASP component
which, when given the URL of a website and the type of content expected (in
this case – text/html), extracts the full source code. Due to a high number of
errors experienced using this software, an exception handler was implemented
which set the contents of a page to 'null' should an error occur. These errors
were primarily caused by the software attempting to obtain PDF, Microsoft Word
and other unsupported document types. Support for such formats is recommended for future
releases of the software.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;On completion of the procedure, the full source of the
  HTML page is returned to the main method as a string which is further refined
  in the next stage of the process.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413435"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;4.3.1.4 Stripping HTML tags from pages&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;The purpose of the software is to analyse each word in a
predetermined number of documents for frequency. HTML pages contain tags which
let the client browser know what to do with the content stored between them.
Removing the tags accurately was an extremely important task which required
extensive testing with a number of different web pages.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Originally, split and join was used to identify angle
  brackets and remove the text in between them. The algorithm would search the
  document for '&amp;lt;' and would iterate one character at a time to the right until it found
  either '&amp;gt;' or a space; a space meant that the '&amp;lt;' was treated as an ordinary (i.e. not an HTML
  tag) character. If a closing '&amp;gt;' was found before a space, all text between
  the opening and closing brackets was stripped, thus removing the tag in its
  entirety.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;After identifying a few major flaws with the algorithm,
  the method was re-developed using regular expressions. This had several
  advantages over the original 'split and join' routine. It was much more
  manageable: for instance, a defined set of HTML tags could be specified and
  removed, as opposed to 'anything containing &amp;lt;&amp;gt;'. In addition, processing
  time was reduced noticeably using this technique. Regular expressions also
  provided the opportunity to remove unwanted line breaks and tabs which caused
  problems at various stages of the testing process.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413436"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;4.3.1.5 Indexing word frequency&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Counting and indexing the frequency of words on a given
number of pages required each word to be stored in the database and for
frequency to be increased on repetitions. Prerequisites for this step involved
removing unusual characters such as asterisks, question marks and ampersand
symbols. The &lt;i&gt;split&lt;/i&gt; function was then used on the entire content of the
page using the 'space' character as the split point. This resulted in each word
being placed in its own array slot. The intention of the application is to
answer questions, the answers to which are often more than one word in
length. For this reason, the word directly after each
indexed word was appended to it and stored as an additional entry. For example, 'John' is stored in array
position one, and 'John Major' is stored in array position two. Conclusions can
then be drawn from the matching frequency entry of the individual words
compared to the combined word.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;After a page has been indexed and the content inserted
  into the array, it is then processed and copied over to the SQL database. A
  stored procedure determines whether the word has already appeared in previous
  pages or in the array itself and, if so, increments the frequency count field in
  the database by one. If the word is new, it is inserted into the database. The
  stored procedure is highlighted below:&lt;/span&gt;&lt;/p&gt;&lt;div style="padding: 1pt 4pt; border: 1pt solid windowtext;"&gt;
  
  &lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;IF EXISTS(SELECT 'True' FROM [session] WHERE session_pattern =
keyword)&amp;nbsp; &lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;BEGIN&lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;UPDATE [session] SET [session_frequency] =
(session_frequency+1) &lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;WHERE [session_pattern] = keyword&lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;END ELSE &lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;BEGIN&lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;INSERT INTO [session] ([session_uid], [session_pattern],
[session_frequency]) VALUES ('" &amp;amp; Session.SessionID &amp;amp;
"','" &amp;amp; keyword &amp;amp; "' , 1)&lt;/span&gt;&lt;/p&gt;

&lt;p style="padding: 0cm; border: currentColor; text-align: justify; line-height: 150%;"&gt;&lt;span style="line-height: 150%; font-size: 9pt;" lang="EN-US"&gt;END&lt;/span&gt;&lt;/p&gt;

&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Figure 6: SQL
stored procedure (truncated)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413437"&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;4.3.1.6 Obtaining answer type&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;This method is not utilised by the main application, and
answer types were established using this system in a separate search from that
used for recording results. The results of the experiments show an ordered
listing of keyword frequency, and also keyword frequency with the answer type.
The purpose of this is to attempt to further refine the question answering
process&lt;/span&gt;&lt;span style="line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc134351091"&gt;&lt;/a&gt;&lt;a name="_Toc135413438"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;4.4&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Summary&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;This chapter has documented the components of the keyword analysis
system used to calculate the frequency of word occurrences within a specified
number of documents. As discussed, the system can utilise either the Google or
MSN API as the document retrieval component, enabling research aims to be
addressed. Chapter 5 will address the research questions in detail using this
system&lt;span style="font-size: 11pt;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413440"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;5.1&amp;nbsp;&amp;nbsp;&amp;nbsp;Does '&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;PageRank' negatively impact
  candidate document quality?&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413441"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.1.1 Introduction&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;This phase of research aims to
  establish what effect, if any, different search engine algorithms have on
  the quality of candidate documents retrieved for question answering. Many
  current web based QA systems such as 'Brainboost' and 'Answerbus' employ the
  Google Application Programming Interface (API) as part of their document
  retrieval component. However, the company's PageRank algorithm relies on a lot
  more than the content of the page to match keywords to results, including the
  abundance of pages linking to it with anchor text containing the searched keyword(s);
  such inbound links are also known as backlinks. This phase aims to establish whether obtaining
  candidate documents using the Microsoft Network's (MSN) search algorithm rather than
  Google's would have a positive impact on the quality of documents obtained for
  keyword analysis. The MSN search API relies more heavily on the content of the
  page than on the number of pages linking to it, but also employs a back
  linking system to a lesser extent to assist with expunging rogue results and
spammers.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413442"&gt;&lt;/a&gt;&lt;a name="_Toc134351092"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.1.2 &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Rationale for research&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Logic behind this research question
stems from Google's algorithm for ranking pages. As discussed, Google ranks
pages with the most sites linking to them for the particular keyword at the top
of its listings, resulting in the page's content playing less of a role in the
order in which results are displayed. For instance, issuing the keyphrase 'click here'
to Google's search engine returns a popular PDF reader as its first result
(Adobe Acrobat Reader), despite there being no occurrences of the keyphrase 'click
here' within the body of the page. This result is based purely on the number of
inbound links with anchor text 'click here' which other websites use to allow
users to download this popular program to view documents in Adobe's proprietary
PDF format. Question answering at its core is looking for relevant text, not
related or linked material. This was the catalyst for my research question: "does
this type of ranking algorithm have an adverse effect on the quality of
documents retrieved?"&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413443"&gt;&lt;/a&gt;&lt;a name="_Toc134351093"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.1.3 &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Breakdown of experiment&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Testing this theory required
preparation of the aforementioned prototype system (section 3.2.4) which indexes
the frequency of keywords within a given number of documents. The purpose of
this experiment is to compare two well-known search algorithms for the quality of the candidate
documents retrieved. The two engines selected are Google and
Microsoft's proprietary MSN Search. As discussed, the basic keyword analysis
system has been developed in ASP.NET with one variable: the search algorithm used. A
set of ten documents returned by each engine for a query will be used to obtain
answers to forty questions sourced from the TREC database. For the purpose of
the experiment, questions posed to the engine will have answers which are
between one and two words long. The experiment will be carried out by obtaining
the URL of each page from both APIs and stripping the HTML tags, resulting in
pure content which can be analysed for answers with our stemmed query. The ten
candidate documents will then be indexed by keyword frequency and the top 30
occurring keywords will be displayed in a table on the results page. The
position of the expected answer (1-30) will be recorded when run with the MSN
API in place and with the Google API in place.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="OLE_LINK6"&gt;&lt;/a&gt;&lt;a name="OLE_LINK5"&gt;&lt;span&gt;The expected result is for expected answers to rank lower
  on the keyword table with the Google API than answers obtained using the MSN API. Raw results will
  be published in the following format and are included in the appendix of this
  document (see example below). Additionally, source code and screen shots are
  also available. &lt;/span&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Percentage
  change in keyword quality when comparing the algorithms (full results available in
  the appendix) is calculated using the number of &lt;b&gt;iterations&lt;/b&gt; with a
  weighting of 0.25 and the answer &lt;b&gt;ranking&lt;/b&gt; with a weighting of 0.75 for each engine, and noting
  the change.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;&lt;img src="/files/media/image/question-answering/image007.jpg" width="575" height="160"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Figure
7: Sample format for raw data output&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413444"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.1.4 Selecting sample questions&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;"To sample something is to examine a
small portion of it, usually for the purpose of judging the nature or quality
of the whole." (Proctor, 2003 p.100) Proctor states that samples can be
chosen in many ways, ranging from throwing a dice to picking random questions
from a computer database. However, the test questions taken from the TREC
database were carefully selected to cover different types of question.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;Questions can be subdivided into
  various categories, then further refined in order to determine the answer
  type expected. For the purpose of analysis, code has been developed to obtain
  the type of result expected. This code takes a question as its input and
  returns an expected answer type as its output. For example, issuing 'What was
  the name of the first UK Prime Minister?' returns Person or Organisation as
the expected answer type.&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 7pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span&gt;To determine the appropriate sample
size, it was important to consider factors such as question type, answer type
and focus. The table below illustrates a breakdown of the fifteen questions to
be posed to the system.&lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;div&gt;&lt;table style="border: currentColor; border-collapse: collapse;" class="MsoTableGrid" border="1" cellSpacing="0" cellPadding="0"&gt;
 &lt;tbody&gt;&lt;tr&gt;
  &lt;td style="padding: 0cm 5.4pt; border: 1pt solid windowtext; width: 28.7pt;" vAlign="top" width="38"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Qty&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63.95pt;" vAlign="top" width="85"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Class&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.05pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Subclass&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 65.35pt;" vAlign="top" width="87"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Expected&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(230, 230, 230); border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 13.35pt;" vAlign="top" width="18"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 30.25pt;" vAlign="top" width="40"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Qty&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.45pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Class&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.3pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Subclass&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.7pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span&gt;Expected&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 28.7pt;" vAlign="top" width="38"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63.95pt;" vAlign="top" width="85"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;What&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.05pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Basic&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 65.35pt;" vAlign="top" width="87"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Undefined&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(230, 230, 230); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 13.35pt;" vAlign="top" width="18"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 30.25pt;" vAlign="top" width="40"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.45pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;When&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.3pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;When
  did&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.7pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Date
  / Time&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 28.7pt;" vAlign="top" width="38"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63.95pt;" vAlign="top" width="85"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Who&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.05pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Who
  is&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 65.35pt;" vAlign="top" width="87"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Person
  / Corp&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(230, 230, 230); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 13.35pt;" vAlign="top" width="18"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 30.25pt;" vAlign="top" width="40"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.45pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Which&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.3pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;In
  Which&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.7pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Undefined&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 28.7pt;" vAlign="top" width="38"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63.95pt;" vAlign="top" width="85"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Where&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.05pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Where
  did&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 65.35pt;" vAlign="top" width="87"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Location&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(230, 230, 230); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 13.35pt;" vAlign="top" width="18"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 30.25pt;" vAlign="top" width="40"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.45pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;How&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 56.3pt;" vAlign="top" width="75"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;How
  did&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 60.7pt;" vAlign="top" width="81"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 9.5pt;" lang="EN-US"&gt;Manner&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Table
2: Questions to be presented to keyword analysis system&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The table above was compiled from a
table of questions and corresponding answer types (Figure 2) and covers the
majority of questions which expect one specific keyword or phrase as the
answer. Due to the limited scope of this project, only fact-based questions
will be assessed by the system, which is why the above sample of question types
has been identified. Fifteen questions from the categories above will be
selected from the TREC sample data (NIST, 2005) and submitted to the engine.
The analysis will run twice: once with the Google API as the document retrieval
component and once with the MSN API. The aim is to determine which is more
effective for obtaining potential answers to questions when stemmed keywords
are posed to the system. This will be assessed by noting the increase or decrease
in performance of two factors: the ranking and the frequency. Results will be
compiled in a table such as this, and illustrated graphically. The answer type
expected will be used to&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Ultimately, this part of the project
  aims to establish whether there is a direct link between the search engine algorithm and
  the relevance of the information retrieved from documents using the search engines'
  APIs. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413445"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.1.5 Limitations&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;One limitation is that only one keyword is ranked,
rather than several. Answers consisting of multiple words are given an
average frequency or ranking. In addition, problems removing certain types of
line break from the HTML code resulted in discontinuous arrays, from which empty
values are removed manually. These factors have no impact on the analysis of
results; they merely require more manual intervention than originally expected.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In addition to this, performance of
  the Google API was variable at best, which resulted in many results having to
  be verified manually. A full mini-analysis of the Google API is available in
  Chapter 4 – Post Results Analysis.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413446"&gt;&lt;/a&gt;&lt;a name="_Toc134351094"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.1.6 Results &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;&amp;amp; conclusions&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;For most questions posed to the
system, the &lt;i&gt;question text&lt;/i&gt; appears more frequently in the MSN search
results than in Google's. This suggests that MSN relies more heavily on pattern
matching than on back-linking, a theory put forward earlier in this document. In
each case, however, the number of repetitions of potential answers is greater
in documents returned by the Google search API. This indicates that the Google
PageRank algorithm has a positive effect on retrieving candidate
documents for question answering systems. The following data and paragraphs
substantiate this conclusion:&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;table style="border: currentColor; border-collapse: collapse;" class="MsoTableGrid" border="1" cellSpacing="0" cellPadding="0"&gt;
  &lt;tbody&gt;&lt;tr&gt;
  &lt;td style="padding: 1.4pt 5.4pt; border: 1pt solid windowtext; width: 129.4pt;" vAlign="top" width="173"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image009.jpg" width="158" height="51"&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 313.7pt;" vAlign="top" width="418" colSpan="6"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Ranking of Correct Answer within Candidates&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 14.95pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 129.4pt; height: 14.95pt;" vAlign="top" width="173"&gt;
  &lt;p style="line-height: 150%;"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 6pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p style="line-height: 150%;"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;&amp;nbsp;&amp;nbsp; RANKING:&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 55.35pt; height: 14.95pt;" vAlign="top" width="74"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;1&lt;sup&gt;st&lt;/sup&gt; – 5&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 53.2pt; height: 14.95pt;" vAlign="top" width="71"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;6&lt;sup&gt;th&lt;/sup&gt; – 10&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 62.05pt; height: 14.95pt;" vAlign="top" width="83"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;11&lt;sup&gt;th&lt;/sup&gt; – 15&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 53.2pt; height: 14.95pt;" vAlign="top" width="71"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;16 – 20&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 39.4pt; height: 14.95pt;" vAlign="top" width="53"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;21 – 25&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 50.5pt; height: 14.95pt;" vAlign="top" width="67"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;TOTAL&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 129.4pt;" vAlign="top" width="173"&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style="font-size: 14pt;" lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image010.gif" width="156" height="42"&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 8pt;" lang="EN-US"&gt;MSN API VIA FROO&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 55.35pt;" vAlign="top" width="74"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;5&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;17%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 53.2pt;" vAlign="top" width="71"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 62.05pt;" vAlign="top" width="83"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;3%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 53.2pt;" vAlign="top" width="71"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 39.4pt;" vAlign="top" width="53"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;3%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 50.5pt;" vAlign="top" width="67"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;11&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;37%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 129.4pt;" vAlign="top" width="173"&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style="font-size: 14pt;" lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image011.gif" width="151" height="51"&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="font-size: 8pt;" lang="EN-US"&gt;GOOGLE API VIA FROO&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 55.35pt;" vAlign="top" width="74"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;23%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 53.2pt;" vAlign="top" width="71"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 62.05pt;" vAlign="top" width="83"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;3%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 53.2pt;" vAlign="top" width="71"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 39.4pt;" vAlign="top" width="53"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 50.5pt;" vAlign="top" width="67"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;14&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;47%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 392.6pt;" vAlign="top" width="523" colSpan="6"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 50.5pt;" vAlign="top" width="67"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="color: black; font-size: 14pt;" lang="EN-US"&gt;30&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;i&gt;&lt;span style="color: black; font-size: 14pt;" lang="EN-US"&gt;100%*&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 10pt;" lang="EN-US"&gt;*The
remainder of results were outside our useful sample range of 1-25&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span&gt;Table 3: Summary of ranking of
correct answers within candidate documents &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The above table summarises the data
  output from the &lt;i&gt;'froo'&lt;/i&gt; application when the fifteen questions were posed
  to the system. Answers ranked between positions one and twenty-five were
  chosen as our useful sample range. The totals for each engine fall short of
  fifteen because a few results were outside this range. &lt;i&gt;73%&lt;/i&gt;
  (11/15) of correct answers were ranked highly (positions 1-25) on MSN's search
  engine, whereas &lt;i&gt;93%&lt;/i&gt; (14/15) appeared in this category when the Google
  search engine was used. The remaining answers, not used for this study,
  were ranked between positions 25-50. Overall, Google supplies more 'useful' or
  top 25 ranking answers than MSN.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The proportion of top ranking
  answers appearing between positions one and five is greater when the Google API
  is in use, but the proportions are identical for the subsequent two ranking classifications
  (6-10, 11-15). Two possible explanations can be offered for this outcome. First,
  it is possible that the sample was too small, and that increasing the
  number of questions posed to the system would set the two search engines apart
  from each other. However, this is unlikely, and a more plausible explanation may
  be that the search engine algorithms are more similar than first thought. For
  this reason, the number of occurrences of words is also taken
  into consideration, in an attempt to determine whether Google's PageRank does in
  fact have a detrimental effect on obtaining results. So far, this does not
  appear to be the case, as our summary table above shows that the search engines
  have very similar occurrences of the answers, which may imply that many of the
  same documents are being used for comparison.&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;table style="width: 450pt; margin-left: 5.4pt; border-collapse: collapse;" class="MsoNormalTable" border="0" cellSpacing="0" cellPadding="0" width="600"&gt;
  &lt;tbody&gt;&lt;tr style="height: 15pt;"&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 193.15pt; height: 15pt;" vAlign="bottom" width="258" noWrap=""&gt;&lt;a name="_Toc134351096"&gt;&lt;/a&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt medium medium 1pt; border-style: solid none none solid; border-color: windowtext currentColor currentColor windowtext; padding: 0cm 5.4pt; width: 94.85pt; height: 15pt;" vAlign="bottom" width="126" colSpan="2" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Answer
  Iterations&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: white; border-width: 1pt 1pt medium; border-style: solid solid none; border-color: windowtext black currentColor windowtext; padding: 0cm 5.4pt; width: 99pt; height: 15pt;" vAlign="bottom" width="132" colSpan="2" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Answer
  Ranking&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 63pt; height: 15pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15.75pt;"&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 193.15pt; height: 15.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium medium 1pt 1pt; border-style: none none solid solid; border-color: currentColor currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 94.85pt; height: 15.75pt;" vAlign="bottom" width="126" colSpan="2" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Weighting
  0.25&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor black windowtext windowtext; padding: 0cm 5.4pt; width: 99pt; height: 15.75pt;" vAlign="bottom" width="132" colSpan="2" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Weighting
  0.75&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 63pt; height: 15.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15.75pt;"&gt;
  &lt;td style="background: black; border-width: 1pt medium 1pt 1pt; border-style: solid none solid solid; border-color: windowtext currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 15.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 11pt;' lang="EN-US"&gt;TREC8 QUESTION&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: medium medium 1pt; border-style: none none solid; border-color: currentColor currentColor windowtext; padding: 0cm 5.4pt; width: 49.85pt; height: 15.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 11pt;' lang="EN-US"&gt;Google&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: medium medium 1pt; border-style: none none solid; border-color: currentColor currentColor windowtext; padding: 0cm 5.4pt; width: 45pt; height: 15.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 11pt;' lang="EN-US"&gt;MSN&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: medium medium 1pt; border-style: none none solid; border-color: currentColor currentColor windowtext; padding: 0cm 5.4pt; width: 48.7pt; height: 15.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 11pt;' lang="EN-US"&gt;Google&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: medium medium 1pt; border-style: none none solid; border-color: currentColor currentColor windowtext; padding: 0cm 5.4pt; width: 50.3pt; height: 15.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 11pt;' lang="EN-US"&gt;MSN&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 15.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 9pt;' lang="EN-US"&gt;% Improved&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Who
  is the voice of Miss Piggy?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;84&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;75&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;2.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;6.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-47.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;When
  did the Jurassic Period end?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;127&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;34&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;14.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;31.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;27.25%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;What
  does the Peugeot &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;158&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;124&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;6.85%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Who
  was the first American in space?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;106&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;78&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;16.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;32.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-28.53%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Who
  wrote "Hamlet"?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;149&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;133&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;3.01%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Where
  was George Washington born?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;123&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;97&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;2.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;22.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-61.48%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;What
  is the name of the condition … brain?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;46&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;8&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;29.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;126.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;61.01%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Where
  can I buy a Big Mac?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;122&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;110&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;22.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;19.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;14.57%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;What
  is the capital of Kosovo?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;132&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;133&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;5.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;3.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;49.81%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;In
  which year was Queen Victoria born?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;74&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;70&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;17.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;19.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-6.47%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;In
  which state is Houston?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;168&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;92&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;12.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-48.10%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Which
  company created the &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;98&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;57&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;5.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-42.02%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;What
  is the length of border &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='color: black; font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;59&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='color: black; font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;32&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;25.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;194.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;-44.24%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 12.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 12.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;Who
  is the managing director of Fasthosts?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 12.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;51&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 12.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;76&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 12.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;9.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 12.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;4.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 12.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;85.53%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 13.5pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 193.15pt; height: 13.5pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial Unicode MS","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;How
  did Mary Queen of Scots die?&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 49.85pt; height: 13.5pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;127&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 45pt; height: 13.5pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;77&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 48.7pt; height: 13.5pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;6.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 50.3pt; height: 13.5pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;7.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: rgb(243, 243, 243); border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 63pt; height: 13.5pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8.5pt;' lang="EN-US"&gt;5.52%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 18.75pt;"&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 193.15pt; height: 18.75pt;" vAlign="bottom" width="258" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;Averages:&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 49.85pt; height: 18.75pt;" vAlign="bottom" width="66" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;108.27 &lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 45pt; height: 18.75pt;" vAlign="bottom" width="60" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;79.73 &lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 48.7pt; height: 18.75pt;" vAlign="bottom" width="65" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;10.07 &lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 50.3pt; height: 18.75pt;" vAlign="bottom" width="67" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;i&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;32.13 &lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0cm 5.4pt; width: 63pt; height: 18.75pt;" vAlign="bottom" width="84" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;i&gt;&lt;span style='background: black; color: white; font-family: "Arial","sans-serif";' lang="EN-US"&gt;&amp;nbsp;-68.67%&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 5pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;span&gt;Table 4: Summary of ranking
correct answers and iterations within candidates &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The above table is a summary of raw data
  obtained using the keyword analysis application. Its purpose is to combine
  several factors into a weighted average, in an attempt to prove that
  PageRank has a negative impact on supplying candidate documents with an
  abundance of correct answers.&amp;nbsp; By taking into account both the answer's ranking
  within our list of keywords &lt;i&gt;and &lt;/i&gt;the number of occurrences in our set of
  documents, we can establish in more detail which search algorithm is more
  suitable for obtaining candidate documents.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Using a weighting of 0.75 (more
  important) for the ranking of the answer in terms of frequency of iterations
  within the sample of documents, and 0.25 (less important) for the total number of
  iterations of the keyword on the page, it can be deduced that in most
  cases the MSN API ranks potential answers lower than Google. The sample data
  obtained from these tests also suggests that MSN's search algorithm focuses
  more heavily on the page content: in nearly all outputs, the &lt;i&gt;original question&lt;/i&gt;
  was ranked highly in the keyword frequency column. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;There is a direct correlation
  between the answer ranking and the number of iterations. In the majority of cases,
  as the number of iterations increases, the ranking position of the correct
  answer becomes lower. From our sample, it is evident that potential answers to
  questions appear much more frequently in documents obtained using the Google
  API. An average of &lt;i&gt;108&lt;/i&gt; repetitions of the desired answer was found in
  our sample of candidate documents generated by Google, as opposed to just &lt;i&gt;80&lt;/i&gt;
  using the MSN API. Once again, this is most likely due to the MSN API placing
  more emphasis on matching the pages' content to the keywords passed to the
  engine.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;This ultimately means that Google's
  documents appear to be more relevant and include a higher abundance of the
  desired data than MSN's, despite not having as many occurrences of the
  question's keywords in the document. Using this data, it can be concluded that,
  contrary to the hypothesis, the Google PageRank algorithm actually has a &lt;i&gt;positive&lt;/i&gt;
  impact on gathering candidate documents.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413447"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;5.2&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Increasing candidate
  documents boosts the position of keywords&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;a name="_Toc135413448"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.2.1 Introduction&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;This phase of research extends to
  improving the accuracy of answers retrieved by increasing the number of candidate
  documents used for answer extraction and introducing a scoring system. In
  theory, increasing the number of documents should give the answer selector a
  wider choice; the purpose of this experiment is to
  determine whether or not this is the case. Results will be recorded from the
  system created for the previous experiment. This time, fifteen questions will
  be posed to our question answering system. Program one will select the answer
  from the first five documents retrieved from the search algorithm used (Google,
  which proved to be more accurate in the previous study). Program two will
  select the answer from the first ten documents. It has been decided that a range
  of between 5 and 10 candidate documents (a 100% increase) will suffice to reveal
  any noticeable difference in the quality of results. The purpose of the
  experiment is to establish whether using a higher number of candidate documents
  from which to obtain answers increases the likelihood of finding a
  relevant answer, or whether combining lower-ranked results with top ranking
  pages adversely affects the question answering process. &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413449"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.2.2 Breakdown of Experiment&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span&gt;This experiment utilises a variation
of the keyword analysis application as outlined in the previous chapter.
Several modifications have been made to my original software for the purpose of
this experiment, including:&lt;/span&gt;&lt;/p&gt;&lt;ul style="margin-top: 0cm;" type="disc"&gt;
  &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;Using the Google API to obtain all
     candidate documents&lt;/span&gt;&lt;/li&gt;
 &lt;li style="text-align: justify; color: black; line-height: 150%;"&gt;&lt;span lang="EN-US"&gt;Allowing the sample size of
     candidate documents to be modified&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 10pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span&gt;A keyword analysis will be executed
on a set of fifteen stemmed questions twice over, with one variable: the number of
candidate documents used for analysis. The first time, the frequency of keywords
within the page will be analysed using a set of five candidate documents; then
the test will be repeated with ten candidate documents. For each run, the
position of the expected answer will be recorded, and if the answer is a
combination of words, an average position will be calculated from the mean of
the combined iterations of the keyword(s). It should be noted that this system
is designed to handle questions with answers of one or two words.
Future improvements may include modifying the system to cope with longer answers.&lt;/span&gt;&lt;/p&gt;
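&lt;p&gt;&lt;span&gt;The procedure above can be sketched in a few lines of Python. This is a minimal illustration, not the application used for the experiment: the function names and the sample documents are hypothetical, it counts raw word frequencies without stemming or stop-word removal, and it assumes that the position of a two-word answer is the mean of the ranks of its individual words.&lt;/span&gt;&lt;/p&gt;

```python
from collections import Counter
import re


def keyword_ranking(documents):
    """Rank every word by its total frequency across the given documents."""
    counts = Counter()
    for text in documents:
        counts.update(re.findall(r"[a-z]+", text.lower()))
    # most_common() sorts by descending frequency; ties keep first-seen order.
    return [word for word, _ in counts.most_common()]


def answer_position(documents, answer, sample_size):
    """Mean 1-based rank of the answer's words in the first sample_size documents.

    Returns None if any word of the answer never occurs in the sample.
    """
    ranking = keyword_ranking(documents[:sample_size])
    words = answer.lower().split()
    if any(word not in ranking for word in words):
        return None
    positions = [ranking.index(word) + 1 for word in words]
    return sum(positions) / len(positions)


# Hypothetical stand-ins for candidate documents returned by a search API.
sample_documents = [
    "mary was executed",
    "mary queen executed executed",
    "queen mary",
]

# Same answer, two sample sizes: the single variable of the experiment.
for size in (2, 3):
    print(size, answer_position(sample_documents, "queen mary", size))
```

&lt;p&gt;&lt;span&gt;Running the same call with sample sizes of five and ten against a real list of candidate documents would mirror the two passes described above, with a lower returned value meaning a higher position in the keyword table.&lt;/span&gt;&lt;/p&gt;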
&lt;p&gt;&lt;span&gt;The expected result is for answers
  to rank lower on the keyword table when processed with five candidate
  documents rather than ten. The rationale is that, given a wider scope
  of material, there will be more repetitions of the expected answer, minimizing
the margin for error.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413450"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.2.3 Results &amp;amp; Conclusions&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;div&gt;&lt;table style="border: currentColor; width: 455.4pt; border-collapse: collapse;" class="MsoTableGrid" border="1" cellSpacing="0" cellPadding="0" width="607"&gt;
 &lt;tbody&gt;&lt;tr&gt;
  &lt;td style="padding: 1.4pt 5.4pt; border: 1pt solid windowtext; width: 129.35pt;" vAlign="top" width="172"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;&lt;img src="/files/media/image/question-answering/image012.jpg" width="158" height="51"&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 326.05pt;" vAlign="top" width="435" colSpan="6"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 8pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Ranking of Correct Answer within Candidates&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 14.95pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 129.35pt; height: 14.95pt;" vAlign="top" width="172"&gt;
  &lt;p style="line-height: 150%;"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Position&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 51.6pt; height: 14.95pt;" vAlign="top" width="69"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;1&lt;sup&gt;st&lt;/sup&gt; – 5&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 58.45pt; height: 14.95pt;" vAlign="top" width="78"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;6&lt;sup&gt;th&lt;/sup&gt; – 10&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 63pt; height: 14.95pt;" vAlign="top" width="84"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;11&lt;sup&gt;th&lt;/sup&gt; – 15&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 63pt; height: 14.95pt;" vAlign="top" width="84"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;16&lt;sup&gt;th&lt;/sup&gt; – 20&lt;sup&gt;th&lt;/sup&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt; height: 14.95pt;" vAlign="top" width="60"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;21&lt;sup&gt;st&lt;/sup&gt; – 25&lt;sup&gt;th&lt;/sup&gt; &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt; height: 14.95pt;" vAlign="top" width="60"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Total&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 129.35pt;" vAlign="top" width="172"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 14pt;" lang="EN-US"&gt;&amp;nbsp;5 Candidates: &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 51.6pt;" vAlign="top" width="69"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;5&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;17%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 58.45pt;" vAlign="top" width="78"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 63pt;" vAlign="top" width="84"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;23%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 63pt;" vAlign="top" width="84"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;3%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;0&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;0%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;15&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;50%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 129.35pt;" vAlign="top" width="172"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 14pt;" lang="EN-US"&gt;10 Candidates: &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 51.6pt;" vAlign="top" width="69"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;9&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;30%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 58.45pt;" vAlign="top" width="78"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;3%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 63pt;" vAlign="top" width="84"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;3%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 63pt;" vAlign="top" width="84"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;7%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;15&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;50%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 1.4pt 5.4pt; width: 410.4pt;" vAlign="top" width="547" colSpan="6"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 1.4pt 5.4pt; width: 45pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: justify;"&gt;&lt;b&gt;&lt;span style="color: black; font-size: 11pt;" lang="EN-US"&gt;30&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span&gt;Table 5: Ranking of relevant
keywords based on number of candidates used &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;A sample size of 15 was selected to
ensure at least two questions of each type were tested with the system to
improve accuracy. Execution times of over 20 minutes for lengthy 10-document
parses seriously impaired progress during peak server usage periods. Nevertheless,
the table above has been compiled from raw data gained from the keyword
analysis system and suggests several things.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;When comparing the results on a
  larger scale, the top ten most frequently used keywords during our experiment
  do appear to relate directly to the correct answer. &lt;b&gt;33%&lt;/b&gt; of correct
  answers were ranked in positions 1-10 when using ten candidate documents, as
  opposed to just &lt;b&gt;24%&lt;/b&gt; when using five candidate documents. This clearly
  shows that many of the suspected answers have gained a higher ranking with the
  increase in candidate documents.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Although increasing candidate
  documents for our sample appears to increase the number of correct answers
  ranking highly on our system (positions 1-5), several other factors seem to
  degrade performance at the lower end of the scale when the number of candidate
  documents is increased. One reason for this degradation may be the decreasing
  relevance of documents as the application proceeds through later search
  results. Results 1 to 5 may be accurate and contain a higher abundance of
  iterations of the correct answer, whereas results 6 to 10 may become
  less relevant for certain questions. However, the increased ranking of the
  correct answer does appear to have more of an impact than this degradation.&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;table style="width: 432pt; margin-left: 5.4pt; border-collapse: collapse;" class="MsoNormalTable" border="0" cellSpacing="0" cellPadding="0" width="576"&gt;
  &lt;tbody&gt;&lt;tr style="height: 15.75pt;"&gt;
  &lt;td style="background: black; border-width: 1pt medium; border-style: solid none; border-color: windowtext currentColor; padding: 0.55pt 5.4pt; width: 45pt; height: 15.75pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Q. &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;No.&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: 1pt medium; border-style: solid none; border-color: windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15.75pt;" vAlign="top" width="158"&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Question&lt;br&gt;
  Type*&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: 1pt medium; border-style: solid none; border-color: windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15.75pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;5 Candidate&lt;br&gt;
  Documents&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: 1pt medium; border-style: solid none; border-color: windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15.75pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;10 Candidate Documents&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="background: black; border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15.75pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Improvement&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p style="text-align: center;" align="center"&gt;&lt;b&gt;&lt;span style='color: white; font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;(%)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Who&lt;/span&gt;&lt;/b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;&lt;br&gt;
  &lt;/span&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Person
  / Organisation&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;14.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;2.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;600.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;2&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;When&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Date
  / Time&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;13.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;14.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;-7.14%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;3&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;What (Does)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Money
  / Definition / Title&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;0.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;4&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Who&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Person
  / Organisation&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;8.50 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;16.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;-46.88%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;5&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Who&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Person
  / Organisation&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;0.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;6&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Where&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Location&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;13.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;2.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;550.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;7&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;What (Did)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Money
  / Definition / Title&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;15.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;29.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;-48.28%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;8&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Where (Does)&lt;/span&gt;&lt;/b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;&lt;br&gt;
  &lt;/span&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Location&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;19.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;22.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;-13.64%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;9&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;What (Is)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Definition&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;11.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;5.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;120.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;10&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Which&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Which&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;15.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;17.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;-11.76%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;11&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Which&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Which&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;3.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;200.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;12&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;What (Year)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Year&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;0.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;13&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;Who&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Person
  / Organisation&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;5.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;1.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;400.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15pt;"&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;14&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;How (Did)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Manner&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;14.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;5.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;180.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 15.75pt;"&gt;
  &lt;td style="border-width: medium 1pt 2.25pt; border-style: none solid double; border-color: currentColor windowtext windowtext; padding: 0.55pt 5.4pt; width: 45pt; height: 15.75pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;15&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 118.3pt; height: 15.75pt;" vAlign="top" width="158"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;How (Many)&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;p&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 8pt;' lang="EN-US"&gt;Number&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 79.7pt; height: 15.75pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;9.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 81pt; height: 15.75pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;6.00 &lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 2.25pt medium; border-style: none solid double none; border-color: currentColor windowtext windowtext currentColor; padding: 0.55pt 5.4pt; width: 108pt; height: 15.75pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;span style='font-family: "Arial","sans-serif"; font-size: 10pt;' lang="EN-US"&gt;50.00%&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr style="height: 6.75pt;"&gt;
  &lt;td style="padding: 0.55pt 5.4pt; width: 45pt; height: 6.75pt;" vAlign="top" width="60"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0.55pt 5.4pt; width: 118.3pt; height: 6.75pt;" vAlign="top" width="158"&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;&lt;/span&gt;&lt;/b&gt;&amp;nbsp;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0.55pt 5.4pt; width: 79.7pt; height: 6.75pt;" vAlign="bottom" width="106" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;&amp;nbsp;Avg:&amp;nbsp; 9.50 &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0.55pt 5.4pt; width: 81pt; height: 6.75pt;" vAlign="bottom" width="108" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;8.20 &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="padding: 0.55pt 5.4pt; width: 108pt; height: 6.75pt;" vAlign="bottom" width="144" noWrap=""&gt;
  &lt;p style="text-align: right;" align="right"&gt;&lt;b&gt;&lt;span style='font-family: "Arial","sans-serif";' lang="EN-US"&gt;15.85%&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span&gt;Table 6: Increase / decrease in ranking
by doubling candidate documents&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The above sample indicates that out
  of fifteen questions posted to the system, ten answers obtained a higher or
  the same ranking when the number of candidate documents was doubled. The
  original assumption was that increased candidate documents would result in an
  increased occurrence of the desired answer and, judging by the results of this
  sample, this would appear to be the case. The largest (negative) difference in
  results appears to be for What Did / What does style questions which have an
  'undefined' answer type. The question "Who was the first American in Space?"
  resulted in a top ten ranking when five candidates were used, but dropped to
  sixteenth when ten were used. This may simply be an anomaly where irrelevant or
  unlucky information was obtained from the search engine, or it may be due to
  the fact that the keywords 'First', 'American' and 'Space' could return
  different types of answer not necessarily related to the question.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In conclusion, increasing the number
  of candidate documents does appear to improve the ability to retrieve an
  accurate answer. It was suspected that this activity might provide more
  irrelevant documents; however, the abundance of correct answers appears to
  override this.&lt;/span&gt;&lt;span style='line-height: 150%; font-family: "Times New Roman","serif"; font-size: 12pt;' lang="EN-US"&gt;  &lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413451"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;5.3&amp;nbsp;&amp;nbsp;&amp;nbsp; Does the Google
  API suffer from consistency issues?&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;a name="_Toc135413452"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.3.1 Introduction&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;This research aims to address many
  of the issues encountered during the development of the system, and to explain why
  many queries issued to &lt;i&gt;'froo'&lt;/i&gt; for the first time returned
  unusual results. This includes a full analysis of the Google API and the reasoning
  behind these results. The analysis includes issuing a test question to the Google API
  and comparing the results against the regular Google search engine for accuracy. Results
  obtained are slightly concerning considering how widely used this API is.&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&amp;nbsp;&lt;/div&gt;&lt;p&gt;&lt;a name="_Toc135413453"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.3.2 Rationale for research&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 1pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span&gt;&amp;nbsp;"Our search API is way better
than their search API" (Gates W, 2005)&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;Throughout
  the development of the practical side of the project, there have been several
  setbacks, the majority of which were down to the reliability of the Google API.
  It appears that the 'Google Web Service' &lt;/span&gt;&lt;span&gt;is not even close to being ready for use in a production environment. It
    must be noted that this API is currently in use by hundreds of developers, many
    of whom are using it to obtain data for question answering systems, and although Google
    implicitly state that the system is not suitable for a production environment
    and is still in beta mode, many existing question answering systems such as &lt;i&gt;'Brainboost'&lt;/i&gt;
  already utilise this component.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413454"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.3.3 The experiment&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In this short experiment, the
results from Google's search engine were compared against the results from
Google's API.&amp;nbsp; The same keyword was searched for ten times in a row using the
Google API and subsequently Google's regular search engine. The URLs were then
recorded and compared to ensure accurate results were being obtained during the
document retrieval process of our keyword analysis system. The experiment was
carried out several times to ensure that propagation or updates weren't taking
place at the time of data capture. Results were gathered between the hours of
09:00 and 10:00 GMT to ensure optimal resources were available.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413455"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.3.4 Results analysis&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;The tests outlined were run on the
Google Search Engine and on our local server using the Google Application
Programming Interface (API). Although the scope of this project did not extend
to analysing more than 10 results at a time, the following table shows a
sample search using the Google API.&lt;/span&gt;&lt;/p&gt;&lt;div&gt;&lt;table style="border: currentColor; border-collapse: collapse;" class="MsoTableGrid" border="1" cellSpacing="0" cellPadding="0"&gt;
  &lt;tbody&gt;&lt;tr&gt;
  &lt;td style="padding: 0cm 5.4pt; border: 1pt solid windowtext; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Page&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Results&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: 1pt 1pt 1pt medium; border-style: solid solid solid none; border-color: windowtext windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;Total Pages Found&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;1&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;1-10&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;273000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;2&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;11-20&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;141000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;3&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;21-30&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;141000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;4&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;31-40&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;273000 (502 - Bad Gateway)&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;5&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;41-50&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;273000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;6&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;51-60&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;273000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;7&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;61-70&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;141000 (502 - Bad Gateway)&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;8&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;71-80&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;273000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;9&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;81-90&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;141000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
  &lt;td style="border-width: medium 1pt 1pt; border-style: none solid solid; border-color: currentColor windowtext windowtext; padding: 0cm 5.4pt; width: 136.25pt;" vAlign="top" width="182"&gt;
  &lt;p&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;10&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.6pt;" vAlign="top" width="205"&gt;
  &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;91-100&lt;/span&gt;&lt;/p&gt;
  &lt;/td&gt;
  &lt;td style="border-width: medium 1pt 1pt medium; border-style: none solid solid none; border-color: currentColor windowtext windowtext currentColor; padding: 0cm 5.4pt; width: 153.25pt;" vAlign="top" width="204"&gt;
  &lt;p&gt;&lt;i&gt;&lt;span style="color: black; line-height: 150%; font-size: 11pt;" lang="EN-US"&gt;273000&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;
  &lt;/td&gt;
 &lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;&lt;/div&gt;&lt;p&gt;&lt;b&gt;&lt;span&gt;Table 7: Anomalies in the Google
API &lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;This table shows a real
  inconsistency in the level of results obtained by the API.&amp;nbsp; Page one returns
  2,730,000 results, whereas page two (11-20) displays 1,410,000 results. When
  the same search term is applied to the regular Google website, a steady
  2,730,000 results are displayed (for each iteration) which is to be expected.
  What is more, the first and third results from the Google website search do not
  appear at all in the first 100 results from the API for this query. This issue
  raises major concerns as there is a possibility that the most important
  candidate documents are being purged from the search results altogether, which
  may have had a negative impact on many of the keyword analyses run for this
  project.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In addition to this, two out of the
  ten pages returned 'Bad Gateway' errors, a problem encountered several times
  during the development and testing of the prototype keyword analysis system. A
  description of this error is "This server received an invalid response from an
  upstream server it accessed to fulfill the request" (CheckUpDown.com 2006),
  which would imply an invalid response from the developer's end. However&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413456"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 14pt;" lang="EN-US"&gt;5.3.5 Conclusions&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;Although inconclusive, the most
likely explanation for the erratic behavior of this API relates to broken views
of Google's production index. It is possible that Google has a separate server
farm set up for the API, with several machines each having a different version
of their production index. It appears there is nothing that can be done to
resolve these issues and, judging by multiple posts to discussion boards (Google
Groups) and newsgroups (Usenet), it looks unlikely that Google will release a
commercial-grade component of this nature in the near future.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span&gt;In summary, it has been proven that
  the Google API does suffer from some fairly serious consistency issues. Had
  this information been available at development time, it would have been taken
  into consideration when selecting a document retrieval component for a question
  answering system. This issue, had it not been identified, would have been a
  major limitation of this project and may have rendered some of the results of
  the previous two research objectives null and void. However, when skewed
  results were identified, the test was re-run, and in every case the system was able to
  connect to a server in the Google API farm with an up-to-date version of the production
  index.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="_Toc135413458"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;6.1&amp;nbsp;&amp;nbsp;&amp;nbsp; Conclusions&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;This final chapter presents the conclusions reached from the
  analysis in the previous chapters. Conclusions and recommendations are made in
  relation to each of the specific research objectives, and towards the overall
  aim of this study.&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413459"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;6.2&amp;nbsp;&amp;nbsp;&amp;nbsp; Conclusions of the
  study&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;h3&gt;&lt;a name="_Toc135413460"&gt;&lt;/a&gt;&lt;a name="_Toc127335500"&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;6.2.1. &lt;/span&gt;&lt;/a&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;Effect of PageRank on
  Candidate Document Quality&lt;/span&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;
&lt;/span&gt;&lt;/h3&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;From the study, it appears that PageRank, Google's search algorithm,
has a positive effect on candidate document quality and it is possible to
obtain answers using a keyword ranking system. It was originally thought that
utilising Google as part of the document retrieval component of a question
answering system would have a negative impact on retrieving a large number of
candidate answers within the pages. &lt;span style="color: black;"&gt;Google's
documents appear to be more relevant and include a higher abundance of the
desired data than MSN's, despite not having as many occurrences of the
question's keywords in the document. Using this data, it can be concluded that,
contrary to the hypothesis, the Google PageRank algorithm actually has a &lt;i&gt;positive&lt;/i&gt;
impact on gathering candidate documents.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;h3&gt;&lt;a name="_Toc135413461"&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;6.2.2.
  Candidate document quantity in relation to results received&lt;/span&gt;&lt;/a&gt;&lt;/h3&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;From the study, it appears that this question has been
fully answered; however, a larger sample of questions may be
required to verify the accuracy of these results. &lt;b&gt;&lt;i&gt;&lt;span style="color: black;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;span style="color: black;"&gt;Increasing the number of
candidate documents does appear to improve the ability to retrieve an accurate
answer. It was suspected that this activity might provide more irrelevant
documents; however, the abundance of correct answers appears to override this.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;h3&gt;&lt;a name="_Toc135413462"&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;6.2.3.
Google API consistency issues&lt;/span&gt;&lt;/a&gt;&lt;/h3&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;It appears there is nothing that can be done to resolve
these issues and it looks unlikely that Google will release a commercial-grade
component of this nature in the near future. In summary, it has been proven
that the Google API does suffer from consistency issues. This should be taken
into consideration when selecting a document retrieval component for a question
answering system.&lt;/span&gt;&lt;/p&gt;&lt;h3 style="text-indent: -36pt; margin-left: 36pt;"&gt;&lt;a name="_Toc135413463"&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;6.2.4.&lt;span style='font: 7pt/normal "Times New Roman"; font-size-adjust: none; font-stretch: normal;'&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;span style='font-family: "Times New Roman","serif";' lang="EN-US"&gt;Literature conclusions&lt;/span&gt;&lt;/a&gt;&lt;/h3&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;The most significant finding from the literature review
  was that stemming had a more positive impact on the quality of candidate
  documents returned than query expansion. The original intention was to explore
  these in detail through experiment; however, the results of the tests conducted
  by &lt;span style="color: black;"&gt;Bilotti and Katz in 2004 (Section 2.5) led to
  research efforts being focused on further refining the document retrieval
  process by addressing the three main research areas above.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="margin: 0cm -18pt 0pt 36pt; line-height: 150%; text-indent: -36pt;"&gt;&lt;a name="_Toc135413464"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;6.3&amp;nbsp;&amp;nbsp;&amp;nbsp; Further Research&lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;The system returns a list of keywords on a specific number of pages
ordered by frequency. Despite many results yielding high accuracy rates, it may
be possible to further refine the system by establishing the type of answer
expected from the question. Answer type clarification could be easily
implemented on this system, further refining the answer selection process for
the answer selector module. As an example, consider the question
'Where was George Washington born?' The answer to the question is 'Virginia',
so if the answer type 'Location' were established, and a database of locations
were compiled and used on the server, any results not matching records in the
'location' table could be removed from the list of candidate keywords,
resulting in the correct answer, 'Virginia', having a higher ranking in the
keyword database.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Although stemming was statistically the most effective way to supply
  queries to the search engine for document selection, query expansion had a
  positive effect on many queries. Further research could re-test this system
  with a combination of query expansion and stemming to establish
  whether this has any impact on the results obtained. This was omitted
  from the scope of this project mainly because of limited time and
  resources: sending two queries to the search engines would have doubled the
  ~12 minute script execution time for each query. In addition,
  the system should handle questions phrased in non-standard formats.
  For example, a variation of the query 'How many islands does Fiji have?' would be 'Fiji has x islands'. A solid, well-built question answering system should
  have the ability to deal with such requests as well as standard questions.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;Based on the results of the Google API analysis (Section 5.3) it was
  decided to cap the number of candidate documents at ten. As a result, the
  analysis compared between five and ten documents, although ideally more
  documents would have been taken into consideration. As discussed, the reliability
  of the Google API appears to decrease after the first page (10) of results;
  therefore, to ensure data integrity, it was decided not to exceed the ten
  results obtained from the API on the first page. Future research may involve
  developing a system to spot anomalies reported by the API, ultimately allowing
  more candidate documents to be used for keyword analysis by the prototype
  system.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span lang="EN-US"&gt;The final recommendation takes into consideration the scale of data
  collected by the prototype system. The theories discussed could be tested on a
  wider scale over a longer period of time to analyse a broader set of results.
  Although these results are accurate to an extent, an increase in the number of
  questions posed to the system would be a major improvement and may produce more
  reliable results for the system. &lt;/span&gt;&lt;/p&gt;&lt;p style="line-height: 150%; margin-right: -18pt;"&gt;&lt;a name="_Toc135413465"&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;6.4&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/b&gt;&lt;/a&gt;&lt;b&gt;&lt;span style="color: black; line-height: 150%; font-size: 16pt;" lang="EN-US"&gt;Conclusion&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;&lt;div&gt;

    &lt;p&gt;&lt;span style="color: black; line-height: 150%; font-size: 3pt;" lang="EN-US"&gt;&amp;nbsp;&lt;/span&gt;&lt;span&gt;The document has addressed all
      research questions through an extensive literature review and through primary
      research in the form of a keyword frequency analysis system. &lt;/span&gt;  &lt;/p&gt;
  &lt;/div&gt;
  &lt;div&gt;
&lt;span lang="EN-US"&gt;Chapter 3, the methodology of the research, examines in
detail how the research has been carried out and justifies the primary research
for this study. Ultimately, &lt;span style="color: black;"&gt;utilising several search
engines to collect candidates for the document retrieval component and
selecting the most relevant of each is theoretically the most effective way to
improve question answering systems using the web as a knowledgebase. However, as
the application execution time indicates (Appendix C), an extremely powerful
back-end would be required to effectively interface with several external
sources simultaneously.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/kewney/~4/T-F0x_QlJlI" height="1" width="1"/&gt;</description>
<feedburner:origLink>http://kewney.com/posts/technology/question-answering-systems-using-the-web-as-a-knowledgebase</feedburner:origLink></item>

</channel>
</rss>

