<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:blogger='http://schemas.google.com/blogger/2008' xmlns:georss='http://www.georss.org/georss' xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-15177538</id><updated>2023-06-07T03:14:01.111-07:00</updated><title type='text'>Web Spider Research Blogs</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default?alt=atom'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>David Dennis</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>24</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-15177538.post-114608324810505994</id><published>2006-04-26T13:17:00.000-07:00</published><updated>2006-04-26T13:27:31.806-07:00</updated><title type='text'>David &amp; Miranda</title><content type='html'>The paper so far includes all of the goals and purpose of the project.  We also include a summary of the process used to complete different projects.  Next week we&#39;ll edit the whole thing so its presentable to the CREU.&lt;br /&gt;&lt;br /&gt;As far as the programming goes, we added a button to the site that fetches all of the new MD5 signatures.  This makes the site much more efficient by allowing one to go to the page without being required to wait for every single page in the database to be examined.  We did this mostly because we need to change our program to work with the Webguide database instead of our small test database.  Having the page take a signature of every site in the official database would take a very long time.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114608324810505994/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114608324810505994' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114608324810505994'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114608324810505994'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/04/david-miranda.html' title='David &amp; Miranda'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114547711711916676</id><published>2006-04-19T13:01:00.000-07:00</published><updated>2006-04-19T13:42:59.630-07:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Today we finally got our Approve buttons to work!  We&#39;ve added in a new test page on the local server on order to easily play around with this (adding changes).  One simply presses &#39;approve&#39; in order to input the new md5 signature into the database.&lt;br /&gt;&lt;br /&gt;Other than our direct site, we have begun working on our final report for this project.  Right now we have an outline of all the work we&#39;ve done during the year--it has turned out to be a lot more than we remembered.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114547711711916676/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114547711711916676' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114547711711916676'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114547711711916676'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/04/miranda-david_19.html' title='Miranda &amp; David'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114487251747197941</id><published>2006-04-12T13:06:00.000-07:00</published><updated>2006-04-12T13:08:48.596-07:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>These past two weeks, we&#39;d been having a lot of issues getting our buttons to somehow call a function that will properly correspond to the row we desire.  We&#39;ve switched between normal buttons, to a button template, to a button column within our DataGrid and still had issues.  We finally got a related example to work on our machines, so next week we&#39;ll be working on adjusting this to our needs.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114487251747197941/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114487251747197941' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114487251747197941'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114487251747197941'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/04/miranda-david.html' title='Miranda &amp; David'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114366735792017691</id><published>2006-03-29T13:17:00.000-08:00</published><updated>2006-03-29T13:22:46.213-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>This week we managed to squeeze &quot;approve&quot; buttons into our table (now a datagrid instead of generic table).  The code behind the datagrid buttons do not execute any real functions yet but that should be easy to add next week.&lt;br /&gt;&lt;br /&gt;We originally ordered our urls using a &quot;link ID&quot; as an index.  We now use the actual url instead of the link ID with an understanding that no two sites can possibly have the same url.&lt;br /&gt;&lt;br /&gt;Our goal for next week is to get the approve button working so that a person can visit the website given with a different signature and &quot;Approve&quot; that change by inputting the new signature in the table.  From then on the site will be checking the database for MD5 comparisons with the newly approved page instead of the old page.&lt;br /&gt;&lt;br /&gt;Of course we still need INSERT() and DELETE().</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114366735792017691/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114366735792017691' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114366735792017691'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114366735792017691'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/03/miranda-david_29.html' title='Miranda &amp; David'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114245668431094008</id><published>2006-03-15T13:01:00.000-08:00</published><updated>2006-03-15T13:06:20.633-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Today we finally finally got an MD5 signature to be recorded for websites listed in our database.  New columns include spaces for MD5 signatures and boolean values indicating changes from previous signatures.  The Object Oriented aspects of ASP.NET have made it easier for us to organize the disparate functions necessary to get new signatures and connect to the database.&lt;br /&gt;&lt;br /&gt;Spring Break next week.&lt;br /&gt;&lt;br /&gt;In two weeks time we plan to import the functions from our old ASP page to the new .NET page (insert/delete entries).  Additionally we are hoping to allow the user to approve the page before updating an entry&#39;s signature.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114245668431094008/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114245668431094008' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114245668431094008'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114245668431094008'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/03/miranda-david_15.html' title='Miranda &amp; David'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114185173857124355</id><published>2006-03-08T12:57:00.000-08:00</published><updated>2006-03-08T13:02:25.326-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Our goals for the day were to get the website to fetch HTML from a separate site and MD5() the text.  We did get MD5() to finally work with ASP.NET however getting the HTML was a lot harder.  We spent the bulk of our time trying to figure out how to use the internet transfer control to get HTML from a given URL.  We were confused as to how one would declare the variable used in many examples called &quot;inet&quot; that would store and retrieve site information.  Most of the online examples do not show which namespaces were included in order to showcase their abilities.&lt;br /&gt;&lt;br /&gt;We have sent Dr. Couch an email and plan on grilling him about this issue tomorrow.  The examples he found for us last week do not work with ASP.NET</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114185173857124355/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114185173857124355' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114185173857124355'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114185173857124355'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/03/miranda-david_08.html' title='Miranda &amp; David'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114124693283152473</id><published>2006-03-01T12:59:00.000-08:00</published><updated>2006-03-01T13:02:23.700-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Today we were able to get our connection to the server database back up using ASP.NET&lt;br /&gt;&lt;br /&gt;This was a lot harder than we thought it would be.  It seemed as if there are many different ways to connect to a database and only when we copied an example line by line from our book did it work.&lt;br /&gt;&lt;br /&gt;Next week we hope to get HTML and use MD5.&lt;br /&gt;&lt;br /&gt;love,&lt;br /&gt;&lt;br /&gt;M&amp;D</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114124693283152473/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114124693283152473' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114124693283152473'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114124693283152473'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/03/miranda-david.html' title='Miranda &amp; David'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114072403420926807</id><published>2006-02-23T11:43:00.000-08:00</published><updated>2006-02-23T11:47:22.506-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Today we realized we were writing all of our code in ASP and not ASP.NET.  We found this out by trying to run various snippets of code from example sites.  In an attempt to get MD5() up and running we were unable to get the &quot;System&quot; namespace to work correctly.&lt;br /&gt;&lt;br /&gt;We have subsequently printed out our previous database manipulation code and next meeting we plan to translate this from VBScript to Visual Basic.&lt;br /&gt;&lt;br /&gt;Tufts runs on a Monday Schedule this week so we will not be able to meet with Dr. Couch until next week.  This gives us more time to catch up with the new language requirements.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114072403420926807/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114072403420926807' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114072403420926807'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114072403420926807'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/02/miranda-david_23.html' title='Miranda &amp; David'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-114003716185385121</id><published>2006-02-15T12:51:00.000-08:00</published><updated>2006-02-15T12:59:34.853-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Today we added the option of adding new data entries to the database.  A person can type in new data values at the top of the page and the database will log and refresh the interface to show the new entries added.  This proved difficult due to still learning VBScript and finding an index into our database id list which did not exist already.  We had to rewrite a simple search 3 times to fix logical errors.&lt;br /&gt;&lt;br /&gt;Tomorrow we meet with Dr. Couch to discuss ways of going forward.  We have a good idea of what we want to do next.  Upon hitting a &quot;refresh&quot; button every entry&#39;s web address in the database should be checked for broken links and changes from previous refreshes.&lt;br /&gt;&lt;br /&gt;Return 0;</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/114003716185385121/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=114003716185385121' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114003716185385121'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/114003716185385121'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/02/miranda-david_15.html' title='Miranda &amp; David'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-113986431410033547</id><published>2006-02-13T12:54:00.000-08:00</published><updated>2006-02-13T12:59:16.520-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Today we worked on an interface for the database in which we can delete entries.  This is the first step to being able to fully manipulate the cfw database.  Next time we plan on adding insertion and updates to the database that show up on the interface.  Right now we are using a temporary database so that we do not mess with our copy of the real cfw system.&lt;br /&gt;&lt;br /&gt;Once our schedule permits we will implement an md5 change detection system which will be reported on the interface.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/113986431410033547/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=113986431410033547' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113986431410033547'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113986431410033547'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/02/miranda-david.html' title='Miranda &amp; David'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-113822149299437599</id><published>2006-01-25T12:31:00.000-08:00</published><updated>2006-01-25T12:39:00.946-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>We met for the first time after the break.  Our web page is now listed as a link on the CREU website which at the very least ranks us higher on google.&lt;br /&gt;&lt;br /&gt;We turned in our midway reports and received positive responses from the program directors.  David tried to set up the CFW webguide database system on his computer but for some reason SQL would not install correctly.&lt;br /&gt;&lt;br /&gt;Today, we created a simple database and tried to pull values off the server as a test.  We keep running into a problem where the server tells us we are using the file even though we closed all programs that edit the file.  We set up a new weekly meeting schedule with Dr. Couch and we look forward to meeting with him next week to strategize .&lt;br /&gt;&lt;br /&gt;We updated the website.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/113822149299437599/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=113822149299437599' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113822149299437599'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113822149299437599'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2006/01/miranda-david.html' title='Miranda &amp; David'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-113451618368747691</id><published>2005-12-13T15:17:00.000-08:00</published><updated>2005-12-13T15:23:03.710-08:00</updated><title type='text'>David &amp; Miranda</title><content type='html'>Recently we have been trying to get our ASP code to work with our server and have started work on the interface for the content management system.&lt;br /&gt;&lt;br /&gt;Our ASP code has had a hard time viewing and manipulating the fake database we have constructed.  Dr. Couch is still working on getting the webguide database into a usable format.&lt;br /&gt;&lt;br /&gt;We expect to write our end of the semester report soon.  It will include many of the problems we have encountered thus far.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/113451618368747691/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=113451618368747691' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113451618368747691'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113451618368747691'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/12/david-miranda.html' title='David &amp; Miranda'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-113133994083948124</id><published>2005-11-06T20:58:00.000-08:00</published><updated>2005-11-06T21:05:50.786-08:00</updated><title type='text'>Miranda &amp; David</title><content type='html'>Hello, world.&lt;br /&gt;&lt;br /&gt;We finally were able to connect to the server but we found that our usernames do not have administrative acccess.  Therefore we are unable to test ASP code and continue our research as planned.  Even though we did spend much time on trying to find a way to test ASP code on our own computer and other websites we had access to, we were able to learn a few more concepts about ASP but only in an abstract sense.&lt;br /&gt;&lt;br /&gt;Dr. Couch has been very busy lately and it is understandable that we are unable to make much progress when updates on our server access come weeks at a time.  We will continue to try to learn ASP strictly from the books we&#39;ve been given as well as online tutorials including the Comp20 website. Learning PERL and ASP while trying to build a content management system from scratch is an arduous task.  However we are confident that we will be able to make a lot more progress once we have a server up and running.  We should have an interesting report ready by the end of the semester.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/113133994083948124/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=113133994083948124' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113133994083948124'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113133994083948124'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/11/miranda-david.html' title='Miranda &amp; David'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-113000934122665706</id><published>2005-10-22T12:23:00.000-07:00</published><updated>2005-10-22T12:29:06.996-07:00</updated><title type='text'>David Dennis</title><content type='html'>On Oct 20th we met again with Dr. Couch.  We spent a little time before the meeting reading a bit from each book and informing each other about some interesting aspects.&lt;br /&gt;&lt;br /&gt;Dr. Couch set up a server in his office that we could access from within the building using a remote desktop connection.  We now have log-in names so we can start to use the database on there as a practice for our ASP programs.  Dr. Couch emphasized that it would behoove the both of us to learn ASP as soon as possible.&lt;br /&gt;&lt;br /&gt;The Offline Checker can be done in PERL so its now up to Miranda or I if we wish to start working on that in ASP or PERL.  This is the easiest program of the three so we might as well work on this first.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/113000934122665706/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=113000934122665706' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113000934122665706'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113000934122665706'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/10/david-dennis_113000934122665706.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-113000883580866962</id><published>2005-10-22T10:53:00.000-07:00</published><updated>2005-10-22T12:20:35.830-07:00</updated><title type='text'>David Dennis</title><content type='html'>On Oct. 13th Miranda and I met with Dr. Couch to get some new guidance as well as set up a regular meeting schedule.  We agreed that at 4:30 PM every Thursday we would meet in Dr. Couch&#39;s office.  Dr. Couch literally sketched out for us the complete design of the system (images upcoming) and gave us two books.  Miranda took the one on content mamnagement while I took the ASP tutorial book. The sketch showed us that the design for managing a website could be illustrated as a state machine where the document is either being submitted for listing, accepted, rejected, reviewed again, or non-existent (its master link leads nowhere).&lt;br /&gt;&lt;br /&gt;Several Questions were also outlined:&lt;br /&gt;&lt;br /&gt;1.  What do we store about each page?&lt;br /&gt;     - How much?&lt;br /&gt;     - How long?&lt;br /&gt;2.  What constitutes a substantative change (When do we flag the reviewer?) ?&lt;br /&gt;     - Avoid false positives&lt;br /&gt;     - Make small changes non-problematic&lt;br /&gt;3.  How do we best present information to a user?&lt;br /&gt;     - A cached copy?&lt;br /&gt;     - Only changes highlighted?&lt;br /&gt;&lt;br /&gt;Miranda and I now know the system contains only 3 programs:&lt;br /&gt;&lt;br /&gt;1.  Content Management &quot;core&quot;&lt;br /&gt;     - Present sites to database for re-review&lt;br /&gt;2.  Offline checker&lt;br /&gt;     - Given a list of links, the checker will log for the database what links&lt;br /&gt;       are traversable and what links do not go anywhere.&lt;br /&gt;3.  Display program&lt;br /&gt;     - List for the website all the links that are approved.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/113000883580866962/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=113000883580866962' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113000883580866962'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/113000883580866962'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/10/david-dennis_22.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112890493651426144</id><published>2005-10-09T17:30:00.000-07:00</published><updated>2005-10-09T17:42:16.563-07:00</updated><title type='text'>David Dennis</title><content type='html'>After working hard on this preparation project, Miranda and I felt that it was time to go directly after our project goals.  On 10/6 we quit trying to copy the html of the webguide after realizing the complexity of the project was starting to outweigh the benefit of the exercise.&lt;br /&gt;&lt;br /&gt;Our next project shall be to create a web interface for our project where a list of sites can be added and their md5 signatures monitored every night.  From this model we can start to see how to handle the specific change cases outlined in our proposal.&lt;br /&gt;&lt;br /&gt;We will start our interface in PERL.  We have plans to change to ASP.NET next semester.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112890493651426144/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112890493651426144' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112890493651426144'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112890493651426144'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/10/david-dennis.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112828429065571889</id><published>2005-10-02T13:04:00.000-07:00</published><updated>2005-10-02T13:18:10.663-07:00</updated><title type='text'>David &amp; Miranda</title><content type='html'>Today we worked three hours.  At the beginning we tried to eliminate repeated links on the first page being examined by the website copying program.&lt;br /&gt;&lt;br /&gt;We succeeded at making the program list all important first-depth links.  We continued by opening a mother folder for the entire website in our directory called &quot;guide&quot; and then made the program recurse through all links and create folders where folders exist in the url extension and create files where files exist in the extenstion.  We did not yet handle first level files or implied files (such as those urls only listing folders at the end and relying on automatically finding index.html), but this should be easy.&lt;br /&gt;&lt;br /&gt;We have not yet put the HTML content into their respective files.  Dr. Couch has been very busy and has not gotten back to us regarding the database access we requested.&lt;br /&gt;&lt;br /&gt;Miranda&#39;s serverspace quota has been increased to 500MB and David&#39;s has been increased to 200MB.  This is in preparation for all of the application code needed to run the webguide website programs.&lt;br /&gt;&lt;br /&gt;We are getting a better feel for PERL with every meeting.&lt;br /&gt;&lt;br /&gt;Our next meeting should consist of finishing the copying program by starting link recursion and incorporating the MD5() algorithm to look for changes.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112828429065571889/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112828429065571889' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112828429065571889'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112828429065571889'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/10/david-miranda.html' title='David &amp; Miranda'/><author><name>Anonymous</name><uri>http://www.blogger.com/profile/14925803156237533662</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112699359138663210</id><published>2005-09-17T14:41:00.000-07:00</published><updated>2005-09-17T14:46:46.840-07:00</updated><title type='text'>David &amp; Miranda</title><content type='html'>Today we looked at spider.pl and tried to adapt the program to only retrieve links with a starting url that we define.  We had a little success but need to look into how to avoid duplicate urls being listed.  We also need to make this program recursive.  In order to fully &quot;steal&quot; the CFW website we need to be able to open new directories and copy new files into them.  We started studying this by learning how to open and close files and directories in PERL.&lt;br /&gt;&lt;br /&gt;Dr. Couch still had not gotten back to us regarding the CFW database access we requested.  We are still learning ASP.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112699359138663210/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112699359138663210' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112699359138663210'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112699359138663210'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/09/david-miranda.html' title='David &amp; Miranda'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112649530377457387</id><published>2005-09-11T20:03:00.000-07:00</published><updated>2005-09-11T20:28:27.823-07:00</updated><title type='text'>David Dennis</title><content type='html'>Miranda and I met at 4pm on Thursday the 8th to really see where each of us were in terms of programming in PERL. We acknowledged that we still had not touched the assignment so that was a good place to start our meeting. We managed to make a PERL program that was able to go the Webguide website and use the MD5 algorithm to generate a 128-bit number based on the HTML content. This was very impressive to both us seeing as we thought our PERL skills were still extremely basic. After this hard work we met with Dr. Couch in his office to discuss where to go from here. Dr. Couch praised our progress and decided that now was the time to get all the Webguide website&#39;s content copied to our accounts. We concluded that we would need this website and all of its programming in order to understand how the entire system works currently, and how we can take it to the next level. Dr. Couch informed us that now that we had conquered PERL (somewhat) we needed to start learning how to use ASP and its abilities to query databases. Miranda has had experience with databases as she has taken the CS@Tufts databases course. I have has no experience save for hearing &quot;MySQL&quot; every once in a while. The Comp20 website does have a lot of documentation on these subjects so I see no reason why I should not be able to learn all that there is to know about table generation and ASP soon. Dr. Couch is currently contacting those in charge of the CS databases and trying to obtain a user account for Miranda and I. Until he contacts us and informs us that we have both an account and a copied version of the Webguide, Miranda and I will try to learn all that there is to know about playing with this system.&lt;br /&gt;&lt;br /&gt;I am very excited about our progress. A sample of the program&#39;s output so far:&lt;br /&gt;&lt;blockquote&gt;&lt;p&gt;&lt;strong&gt;sunfire13{ddenni01}56: perl my_program&lt;br /&gt;Received the website&#39;s content&lt;br /&gt;Digest is &quot;0e755daf497d6bfdd649d36d7855cdb9&quot;&lt;/strong&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;The output states that a response was heard from the Website&#39;s server and that it&#39;s unique 128-bit number (Shown in Hexidecimal) is as seen.&lt;/p&gt;</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112649530377457387/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112649530377457387' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112649530377457387'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112649530377457387'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/09/david-dennis.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112508279534757983</id><published>2005-08-26T11:56:00.000-07:00</published><updated>2005-08-26T11:59:55.353-07:00</updated><title type='text'>Miranda Steed</title><content type='html'>So I flew into Boston on a red-eye Tuesday night.  I read DryDock on the plane and now have a bit more of an understanding of how this project will work, though I still think I&#39;m lacking comprehension in a few areas.&lt;br /&gt;&lt;br /&gt;Turns out my house doesn&#39;t have internet yet, so I spent this afternoon in Halligan (the computer science and electrical engineering building) and spent some time going through the advanced Perl lectures on the Comp 20 website.  I didn&#39;t get to a lot of it, but I printed it all out, so that way I can do some learning in my internet-less home, without trekking the 15 minutes to Halligan.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112508279534757983/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112508279534757983' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112508279534757983'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112508279534757983'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/08/miranda-steed_26.html' title='Miranda Steed'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112365550714550236</id><published>2005-08-09T23:25:00.000-07:00</published><updated>2005-08-09T23:31:47.150-07:00</updated><title type='text'>David Dennis</title><content type='html'>I read DryDock tonight.  I can see why Dr. Couch wanted us to read this.  It&#39;s a good example of a program like ours designed to deal with a lot of documents.  And while the DryDock program was designed to let managers approve documents before being pushed out to the world, our program will have to approve documents every day to amke sure they can stay inside the webguide.&lt;br /&gt;&lt;br /&gt;I messed with some PERL today.  I thought about converting our research page to a cgi that would manually grab our posts from blogger instead of going through feedburner.  That way we could easily customize text styles to make this look a little nicer.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112365550714550236/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112365550714550236' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112365550714550236'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112365550714550236'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/08/david-dennis_09.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112353965666603028</id><published>2005-08-08T15:09:00.000-07:00</published><updated>2005-08-08T15:20:56.673-07:00</updated><title type='text'>Miranda Steed</title><content type='html'>Finished going through all of Prof. Couch&#39;s basic Perl lectures from the Comp20 course website.  I&#39;m hoping to finish all the advanced Perl lectures before I head back to Massachusetts on the 23rd.  Maybe even get to some of the readings/research Prof. Couch recommended.&lt;br /&gt;&lt;br /&gt;My life at the moment, however, is mostly occupied by studying for the GRE general test, which I&#39;m taking on the 22nd.  I set off this afternoon for Spider-related activities, but I think I&#39;ll have to start studying some new vocabulary words for the rest of the evening.  I have so much work still to go!&lt;br /&gt;&lt;br /&gt;When I get back to campus, I&#39;ll be rehearsing with the Jackson Jills most of the time for our Freshmen Orientation shows, but will be spending most of my free time working on Spider-related things.  Luckily, without homework/classes yet and the GRE general test over (though I still have the CS subject test in November), I won&#39;t have much else to do during that time other than eat and sleep.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112353965666603028/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112353965666603028' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112353965666603028'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112353965666603028'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/08/miranda-steed.html' title='Miranda Steed'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112348174432278809</id><published>2005-08-07T23:05:00.000-07:00</published><updated>2005-08-07T23:27:05.493-07:00</updated><title type='text'>David Dennis</title><content type='html'>We started reading Drydock today. The document is a whole research article about a program called Drydock that was created to force system administrators to review all inside documents before they are displayed to the world. Our project is similar but different in that we are designing a program that reviews all &lt;em&gt;outside&lt;/em&gt; documents before they are displayed on the webguide website.&lt;br /&gt;&lt;br /&gt;I made this really nice site art today. I proboably should have been working on the program instead. I have one more week of work before I can really devote myself to getting started.&lt;br /&gt;&lt;br /&gt;I gave Dr. Couch and Miranda the username and password needed to start posting on the site.&lt;br /&gt;&lt;br /&gt;I&#39;m trying to make the feed simply say &quot;Web Spider Research Blogs&quot; instead of &quot;David&#39;s&quot;.&lt;br /&gt;&lt;br /&gt;More to come.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112348174432278809/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112348174432278809' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112348174432278809'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112348174432278809'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/08/david-dennis_07.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-15177538.post-112337427922207919</id><published>2005-08-06T17:17:00.000-07:00</published><updated>2005-08-06T17:44:08.980-07:00</updated><title type='text'>David Dennis</title><content type='html'>I took a look at DryDock today and found out that only the abstract is available online without a membership to the USENIX site.&lt;br /&gt;&lt;br /&gt;I uploaded the new website and created an interface where our research updates can occur simply by posting a new blog on blogger.com&lt;br /&gt;&lt;br /&gt;I downloaded spider.pl but didn&#39;t really look at it.&lt;br /&gt;&lt;br /&gt;I&#39;m not sure I totally understand how the site feed works or if I&#39;m even doing it right but it works now so I&#39;ll just leave it at that.&lt;br /&gt;&lt;br /&gt;I found out frontpage is 1000 times better to use when compared to Dreamweaver.&lt;br /&gt;&lt;br /&gt;I hope to format the text soon to make it a little more beautified.</content><link rel='replies' type='application/atom+xml' href='http://ddspiderresearch.blogspot.com/feeds/112337427922207919/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=15177538&amp;postID=112337427922207919' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112337427922207919'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/15177538/posts/default/112337427922207919'/><link rel='alternate' type='text/html' href='http://ddspiderresearch.blogspot.com/2005/08/david-dennis.html' title='David Dennis'/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/blank.gif'/></author><thr:total>0</thr:total></entry></feed>