<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:blogger='http://schemas.google.com/blogger/2008' xmlns:georss='http://www.georss.org/georss' xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-619441053309172159</id><updated>2024-11-01T03:05:57.170-04:00</updated><category term="BalisageConference08"/><category term="XML"/><category term="XSL-FO"/><category term="XQuery"/><category term="XSL"/><category term="XSLT"/><category term="introduction"/><category term="APA"/><category term="DTD"/><category term="SGML"/><category term="USAir"/><category term="XML Tags"/><category term="airline"/><category term="college"/><category term="dita"/><category term="docbook"/><category term="document_analysis"/><category term="font"/><category term="free"/><category term="lyrics"/><category term="markup"/><category term="ncbi"/><category term="projects ocean_city new_jersey"/><category term="tag_set"/><category term="twitter jobs"/><category term="xml conference xml2008 twitter"/><title type='text'>Manorfield Consulting, LLC</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default?redirect=false'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default?start-index=26&amp;max-results=25&amp;redirect=false'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>36</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-7336040610958386437</id><published>2009-01-23T11:14:00.003-05:00</published><updated>2009-01-23T11:51:18.752-05:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="projects ocean_city new_jersey"/><title type='text'>New Project</title><content type='html'>I just uploaded the first iteration of a new project to the web.  &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href=&quot;http://cli.gs/ocbenchplaques&quot;&gt;Ocean City, New Jersey Bench Plaques&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The site is a bit rough and all the content is not yet available.  There are ~300 bench plaques on the boardwalk and town of OC, NJ and I am manually entering all the content.  So I do about 5-15 photos a day and upload new pics every couple days.  I&#39;m also playing with a redesign offline, perhaps rounded corners (!?), so even that might change soon.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Maybe it would be good to document my reasons for the site.  I do explain this a little in the &lt;a href=&quot;http://cli.gs/OCBenchPlaquesFAQ&quot;&gt;FAQ&lt;/a&gt; at the site.  In summary, I&#39;ve been heading to OC since I was 6 weeks old.  My grandparents have had a house at the north end of the boardwalk since the late 60&#39;s.  I really consider it a second home.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The plaques started appearing on the boardwalk in the early-1990s.  The program has been stopped and started several times since then.  There have been some controversy as well about how long the plaques are meant to stay on the benches.  When I discovered that I wondered if anyone had chroniceled the existing plaques.  Having graduated from college as the internet was exploding on the world, my first thought was to create a website.  And it is a natural for backend storage in an XML/XSLT application.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;So I spent an early morning in mid-July 2008 walking the length of the boardwalk.  Along the way I snapped photos with my low-end Kodak camera of each plaque.  That explains the fuzzy qualities of many of the photos.  As I walked I made mental notes (first mistake) of where each plaque was located.  I also didn&#39;t bring a monopod to stabilize the camera (second mistake).  The entire walk took about 3.5 hours.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I expected to have more people ask me what I was doing.  In fact only 3 people stopped me at all.  The first was a couple near the north end Kohr Brother&#39;s Ice Cream stand.  I had to ask them to slide over so I could photograph the plaque.  Obviously they asked what I was up to.  I called it an &#39;art project&#39;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The second was a woman who stopped me near the Sindia pavillion.  I was taking a shot of the bench across from the pavillion and she asked if I wanted my picture taken with the plaque. Naturally she thought I was taking it due to personal signficance.  I wasn&#39;t expecting that question and blurted out a too quick, &#39;no&#39;...which I quickly apoligized for.  She took off immediately so I didn&#39;t get a chance to explain myself.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The third was a group of Leigh University alumni (circa 1940s) who I ran into at the south end turnaround.  They were a wonderful source of information about the plaques and the city&#39;s program.  And they told me of a book that had been printed with photos of the plaques.  I&#39;ve since bought the book and misplaced it.  However, it wasn&#39;t available through any online retailer and I only found it by asking at the Atlantic Book Warehouse on the boardwalk itself.  I like to think I am taking the next step of that author&#39;s work and putting the plaques out for a wider audience.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Going forward I have several things to accomplish:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;finish inputting all photos I have on hand&lt;/li&gt;&lt;li&gt;re-walk the boardwalk to retake fuzzy photos, check locations, add new plaques&lt;/li&gt;&lt;li&gt;site redesign&lt;/li&gt;&lt;li&gt;accumulate more information about the people and events mentioned on the plaques&lt;/li&gt;&lt;li&gt;accumulate more information about the program and the benches themselves&lt;/li&gt;&lt;li&gt;publicize the site more&lt;/li&gt;&lt;/ul&gt;Overall it has been a learning experience and looks to teach me a lot about all aspects of web design, maintenence, XSLT development, and hopefully, community building.&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/7336040610958386437/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/7336040610958386437' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/7336040610958386437'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/7336040610958386437'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2009/01/new-project.html' title='New Project'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-5281445134977496371</id><published>2009-01-22T14:58:00.003-05:00</published><updated>2009-01-23T11:50:55.583-05:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="twitter jobs"/><title type='text'>Twitter and Position Postings</title><content type='html'>I enjoy using Twitter.  I have my &lt;a href=&quot;http://twitter.com/mshellenberger&quot;&gt;professional account&lt;/a&gt; and my private one.  I&#39;m not prolific on the professional one since I mostly use it to follow others in the industry but I do post a bit on the private one.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Today I was checking into &lt;a href=&quot;http://www.linkedin.com/pub/3/a33/6a6&quot;&gt;LinkedIn.com&lt;/a&gt; for the first time in a while and noticed that one of the members of the &lt;a href=&quot;http://www.linkedin.com/groups?home=&amp;amp;gid=45330&amp;amp;trk=anet_ug_hm&amp;amp;goback=.hom.anh_45330&quot;&gt;Philadelphia Technology Group&lt;/a&gt; was experimenting with Twitter.  &lt;a href=&quot;http://www.linkedin.com/in/jasonrmartin&quot;&gt;Jason Martin&lt;/a&gt; is a recruiter at Apex Systems, Inc.  He&#39;s in my network at LinkedIn but I&#39;ve only spoken with him via email once.  I&#39;ve never worked with him.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Jason has been Twittering his position opportunities for the past month.  &lt;a href=&quot;http://twitter.com/jasonrmartin&quot;&gt;So far I see 16 postings&lt;/a&gt;.  Perhaps this is standard practice in the recruiting industry but it was new to me.  But I didn&#39;t begin to follow Jason on Twitter.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;As I said before I have a couple Twitter accounts.  What I&#39;ve found in both of them is that the tweets pass by pretty quickly and I only check my Twitter accounts once or twice a day.  So I probably miss a lot of what is being said on the Twitter stream.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The great thing about the internet today is that information is available in many forms.  So instead of following Jason I grabbed his Twitter feed.  Now I can check Jason&#39;s new job postings whenever I please without wondering if I have the most recent ones.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This isn&#39;t rocket science by any means but it might provide a reminder to people who just &#39;don&#39;t get&#39; this Twitter stuff but understand an RSS feed.  I think it is time to look around Twitter some more and see what else is being offered.  Maybe that next great opportunity is just a Tweet away!&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/5281445134977496371/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/5281445134977496371' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5281445134977496371'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5281445134977496371'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2009/01/twitter-and-position-postings.html' title='Twitter and Position Postings'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-2115374290604069958</id><published>2008-12-09T08:23:00.004-05:00</published><updated>2008-12-09T08:32:35.804-05:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="xml conference xml2008 twitter"/><title type='text'>Of Twitter and Conferences</title><content type='html'>I am attending a couple days of the XML-In-Practice 2008 conference.  And one thing I have noticed is a decided lack of laptops.  Sure people have them but a lot of them are closed during the sessions.  That is in stark contrast to Balisage where almost every attendee has a laptop and is taking notes or searching for a rebuttal to the speaker.&lt;br /&gt;&lt;br /&gt;However, I did see that a few people are twittering, &lt;a href=&quot;http://twitter.com/mshellenberger&quot;&gt;myself&lt;/a&gt; included.  &lt;a href=&quot;http://twitter.com/bobdc&quot;&gt;Bob DuCharme&lt;/a&gt;,  &lt;a href=&quot;http://twitter.com/scottabel&quot;&gt;Scott Abel&lt;/a&gt;, and &lt;a href=&quot;http://twitter.com/bobdc&quot;&gt;Adam Hill&lt;/a&gt; have put up several posts each...in Scott&#39;s case it appears to be something of a torrent of tweets.  The links above will take you to their Twitter home pages but, as Twitter is designed to do, these will have moved on from conference tweets by tomorrow.&lt;br /&gt;&lt;br /&gt;If you&#39;d like to get a view of the tweets from the conference try these links:&lt;br /&gt;&lt;br /&gt;&lt;a href=&quot;http://search.twitter.com/search?q=xml+in+practice&quot;&gt;XML In Practice&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href=&quot;http://search.twitter.com/search?q=%23xml2008&quot;&gt;#xml2008&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The first is mostly used by Scott Abel, the second was coined by Bob DuCharme and picked up by Adam and I.&lt;br /&gt;&lt;br /&gt;As usual this is a stream of consciousness stuff like Twitter is prone to be.  But I think the intrepid reader might find useful nuggets in there.&lt;br /&gt;&lt;br /&gt;I have notes from several of the sessions that I hope to write up on Wednesday after I get back from the conference.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/2115374290604069958/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/2115374290604069958' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/2115374290604069958'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/2115374290604069958'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/12/of-twitter-and-conferences.html' title='Of Twitter and Conferences'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-8537774984347434597</id><published>2008-09-16T21:10:00.004-04:00</published><updated>2008-09-16T21:37:28.069-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="XML Tags"/><title type='text'>To &quot;tag&quot; Or To &quot;mark up&quot;</title><content type='html'>There was a &lt;a href=&quot;http://lists.xml.org/archives/xml-dev/200809/msg00037.html&quot;&gt;lively discussion&lt;/a&gt; recently on the XML-Dev listserv regarding the use of the terms &#39;tag&#39; and &#39;mark up&#39;.  A synopsis by one of the participants is available &lt;a href=&quot;http://www.xfront.com/is-markup-a-noun-or-verb/index.html&quot;&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;I didn&#39;t want to address the verb vs. noun question that was the root of the discussion.  I was more interested in the use of tag vs. markup (or more correctly, mark up).&lt;br /&gt;&lt;br /&gt;The question, to me comes down to professional terminology.  To be seen as &#39;XML-ish&#39; it is probably best to use the term &#39;mark up&#39;, as in &quot;I marked up the document&quot;.  Since XML comes out of the publishing sector and the term &#39;mark up&#39; has a long history in that realm the it carries over.&lt;br /&gt;&lt;br /&gt;At the same time most non-XML professionals I&#39;ve met use the sentence &quot;I tagged the document&quot; interchangeably with the previous one.  Are they wrong?  I don&#39;t think so but that comes down to my impression of how they think they are working with the document.  For example:&lt;br /&gt;&lt;br /&gt;What is this? &amp;lt;table&amp;gt;&lt;br /&gt;&lt;br /&gt;What is this? &amp;lt;table&amp;gt;Ikea Round&amp;lt;/table&amp;gt;&lt;br /&gt;&lt;br /&gt;The first is a tag (a start tag) and the second is an element.  So, if you have a document without any pointy-bracket tags in it and you start putting them around existing words, aren&#39;t you applying tags (to form elements)?  Thus tagging the document?&lt;br /&gt;&lt;br /&gt;In the end I have to agree with Debbie Lapeyre when &lt;a href=&quot;http://lists.xml.org/archives/xml-dev/200809/msg00065.html&quot;&gt;she said&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;&quot;You may state which the pros find is best practice, but I think you should also admit there is a popular culture alternative. Those have a habit of taking over the language over the long term. (Contact IS a verb now, as well as a noun, for all that purists may howl.)&quot;&lt;/blockquote&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/8537774984347434597/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/8537774984347434597' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/8537774984347434597'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/8537774984347434597'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/09/to-tag-or-to-mark-up.html' title='To &quot;tag&quot; Or To &quot;mark up&quot;'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-2698355334568034801</id><published>2008-09-12T14:26:00.003-04:00</published><updated>2008-09-12T14:41:43.721-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="document_analysis"/><category scheme="http://www.blogger.com/atom/ns#" term="DTD"/><category scheme="http://www.blogger.com/atom/ns#" term="markup"/><category scheme="http://www.blogger.com/atom/ns#" term="XML"/><title type='text'>Tagging Data - OrgNames</title><content type='html'>A few years ago I was working a project where we were converting journal articles from an SGML DTD to an XML DTD.  At that time we thought it might be useful to remove or add tagging as the case may be, based on business needs.  One of the elements that was deemed to be too complex was the address.&lt;br /&gt;&lt;br /&gt;Addresses are one of the first things that we all learn to tag when taking that initial XML class.  It&#39;s mostly because everyone has one (or three) and thinks they know all the pieces of it; Name, Street, City, Zipcode, etc.  Of course, once you do something in the real world that simplicity falls away.&lt;br /&gt;&lt;br /&gt;I&#39;ll just focus on the piece that is most salient in my mind, the Organizational Name or OrgName.  This is the university, business, etc. where the author is worked when (s)he wrote the paper.  It normally included several levels of Organizational Division elements or OrgDiv.  These were the department, college, divisions where the author specifically worked within the OrgName.  Of course there were times when there were several of these and where defining which was OrgName or OrgDiv was complicated.&lt;br /&gt;&lt;br /&gt;For simplicity, and since nothing had ever been done with the different elements, the difference between OrgDiv and OrgName was done away with and replaced with commas.  It simplified initial markup of the documents and didn&#39;t impact the current (or foreseeable) ouputs.&lt;br /&gt;&lt;br /&gt;Why is this important?  Because, when developing, choosing, or modifying a DTD you must take into account more than just the &quot;we might need it in the future&quot; ideas.  Those have to be weighed against costs of marking up the document, handling the elements for output, and increased quality control time for staff.&lt;br /&gt;&lt;br /&gt;It seems like tagging everything is the direction we should be going but checking for a return on investment, even when it comes to markup is always a good idea.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/2698355334568034801/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/2698355334568034801' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/2698355334568034801'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/2698355334568034801'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/09/tagging-data-orgnames.html' title='Tagging Data - OrgNames'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-4682167630790830151</id><published>2008-08-20T13:21:00.002-04:00</published><updated>2008-08-20T13:21:00.581-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="font"/><category scheme="http://www.blogger.com/atom/ns#" term="free"/><category scheme="http://www.blogger.com/atom/ns#" term="XSL-FO"/><title type='text'>Fonts and Circumstance</title><content type='html'>If you are a single employee company like me you know the desire to keep costs down.  Since I work with XSL-FO for a good bit of my projects, I often get requests to use fonts outside the normally installed sets.&lt;br /&gt;&lt;br /&gt;Obviously for XSL-FO the client is going to need the font installed on site.  But for many of my projects I am doing work remotely and it is useful to have the font available locally.  Like a recent project where the client was using the &lt;a href=&quot;http://en.wikipedia.org/wiki/Frutiger&quot;&gt;Frutiger font family (condensed type)&lt;/a&gt;.  I searched around a little bit and found the whole family for &lt;a href=&quot;http://www.fonts.com/findfonts/detail.htm?pid=425865&amp;amp;gclid=CM-50LjMnJUCFQIWFQodaVy9HQ&quot;&gt;$339.00&lt;/a&gt;.  That&#39;s a little out of my price range for a single project.&lt;br /&gt;&lt;br /&gt;A little more google-fu (or if I had just read the Wikipedia article) and I discovered that &lt;a href=&quot;http://www.microsoft.com/reader/downloads/pc.mspx&quot;&gt;Microsoft&#39;s MSReader application&lt;/a&gt; installs the &lt;a href=&quot;http://en.wikipedia.org/wiki/Frutiger#Frutiger_Linotype&quot;&gt;Frutiger Linotype&lt;/a&gt; font family by default.  So three minutes later I was up and running with a reasonable option for Frutiger Condensed.&lt;br /&gt;&lt;br /&gt;Just thought I&#39;d share that sometimes you can find a solid option for free.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/4682167630790830151/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/4682167630790830151' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/4682167630790830151'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/4682167630790830151'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/fonts-and-circumstance.html' title='Fonts and Circumstance'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-1570757298216883179</id><published>2008-08-18T19:20:00.004-04:00</published><updated>2008-08-26T08:05:34.890-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Heard In The Halls</title><content type='html'>Just a link this time...in case people aren&#39;t finding it via Google (or your favorite search engine)&lt;br /&gt;&lt;br /&gt;&lt;a href=&quot;http://www.concretesyntax.com/heard_in_the_halls&quot;&gt;Heard In The Halls&lt;/a&gt; is a Markup Conference in August in Montreal tradition that I find particularly fun.  It shines a light on one of the interesting points of Balisage.&lt;br /&gt;&lt;br /&gt;(no, not the stupid one-liners, you cynic)&lt;br /&gt;&lt;br /&gt;A blank couple sheets of paper where people write down things they overhear while attending the conference.  It always ends up with strange statements that can be taken WAAAAY out of context.  In fact, one of the heards this year was, &quot;You can&#39;t say anything without it being taken out of context.&quot;.&lt;br /&gt;&lt;br /&gt;Take a look at the link above and perhaps get a giggle or two.  It is a VERY in-crowd kind of thing.  But even some of them make sense to the mildly technologically literate.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/1570757298216883179/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/1570757298216883179' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/1570757298216883179'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/1570757298216883179'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/hear-in-halls.html' title='Heard In The Halls'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-3512166241285414117</id><published>2008-08-15T12:43:00.005-04:00</published><updated>2008-08-15T13:30:06.395-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>But Wait There&#39;s More...</title><content type='html'>&lt;i&gt;&lt;a class=&quot;biolink&quot; href=&quot;http://www.balisage.org/Bios.html#Sperberg-McQueen&quot;&gt;C. M. Sperberg-McQueen - Closing speech&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;/i&gt;XML 10 is a good time to have a look back and where we&#39;ve been, where we wanted to go, where we actually went, and where we want to go from here.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;There are two kinds of projects in the world&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Barn raisings - gather materials and people, do the thing, and then everybody goes home.&lt;/li&gt;&lt;li&gt;community farming - gather materials and people, work together, but no one EVER goes home.&lt;/li&gt;&lt;/ol&gt;Standards development is more of the later than the former.  Revolutions are almost never barn raisings...success means that you are in charge...and then the work is never done.&lt;br /&gt;&lt;br /&gt;The biggest problem is Semantics.&lt;br /&gt;It is all those things that we don&#39;t know how to do really well.  Goal should be to isolate substructures within Semantics to fix them and thus make them no longer part of Semantics.&lt;br /&gt;&lt;br /&gt;&quot;the road to hell is paved with compact syntax&quot;&lt;br /&gt;&lt;br /&gt;Next Year in Montreal.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/3512166241285414117/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/3512166241285414117' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3512166241285414117'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3512166241285414117'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/but-wait-theres-more.html' title='But Wait There&#39;s More...'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-3799551335455112619</id><published>2008-08-15T11:57:00.006-04:00</published><updated>2008-08-15T12:43:11.259-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Text retrieval of XML-encoded corpora: A lexical approach</title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Quin&quot;&gt;Liam Quinn&lt;/a&gt; - W3C - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Quin01/Balisage2008-Quin01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Text retrieval - building a persistent index that makes finding documents for things they contain more simple.&lt;br /&gt;&lt;br /&gt;lq-test design- a text retrieval system he developed a long time ago and is seeing if it will work in the XML world.&lt;br /&gt;&lt;br /&gt;He&#39;s doing a little demonstration of the index using &quot;75 or 300 MB of data, I don&#39;t really recall&quot;.&lt;br /&gt;&lt;br /&gt;I hope his paper has some of this code in it...might be interesting to examine...it appears he&#39;s making his own xml-database that is accessed without the need of XQuery...but only because the needs are so basic.&lt;br /&gt;&lt;br /&gt;It is useful? For him it is...&lt;br /&gt;&lt;br /&gt;Is it as useful as XQuery - XQ can&#39;t do match highlighting.  XQ can&#39;t mix with broken HTML and text.  But overall XQuery is more useful...but perhaps not for concordincies like he is using here.&lt;br /&gt;&lt;br /&gt;How much to update to XML?  Probably better to rebuild than retrofit.&lt;br /&gt;&lt;br /&gt;Practical question - won&#39;t run under windows (pipelines are implemented by running a program competly to a temporary file and then giving that to another program....in Unix the programs alternate)</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/3799551335455112619/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/3799551335455112619' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3799551335455112619'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3799551335455112619'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/text-retrieval-of-xml-encoded-corpora.html' title='Text retrieval of XML-encoded corpora: A lexical approach'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-4539925626935339563</id><published>2008-08-15T11:04:00.006-04:00</published><updated>2008-08-15T11:57:10.123-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Freedom to constrain: Where does attribute constraint come from, Mommy?</title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Bauman&quot;&gt;Syd Bauman&lt;/a&gt; - &lt;span class=&quot;orgname&quot;&gt;Brown University, Women Writers Project&lt;/span&gt; - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Bauman01/Balisage2008-Bauman01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&quot;This paper does not represent an enormous amount of original research.&quot;  Syd in a rare self-deprecating moment.  (&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;I kid, I had a great dinner with Syd last night...I would love to chat with him for hours, but David Durbin and the German guys(TM) had some great games to play instead&lt;/span&gt;&lt;/span&gt;)&lt;br /&gt;&lt;br /&gt;EOA - Explanation of Acronyms&lt;br /&gt;&lt;br /&gt;Freedom to Constrain:&lt;br /&gt;Whose freedom am I talking about?  The typical humanities project doing text encoding.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;subject matter expert&lt;/li&gt;&lt;li&gt;XML expert&lt;/li&gt;&lt;li&gt;encoders (newkeying, post OCR, vendor)&lt;/li&gt;&lt;li&gt;proofreaders, web designers, managers, Research Assistants&lt;/li&gt;&lt;/ul&gt;Constrain what?&lt;br /&gt;For this talk he is narrowing down to an enumerated list of possible vales of an attribute.&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Syd is an interesting speaker.  He drives directly through topics and talks in a way that makes them accessible to me, which means they are likely easier for the rest of the room to understand.  But then he throws in strange things like referencing a his asthma (due to cats) and then throws a cat image on the screen.  Which both works and doesn&#39;t work at the same time.  For me it is a good presentation style...it gives my brain time to catch up with the topic while he branches off to some strange digression.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Schema languages are used for constraint....duh?  Oh he was comparing it to spam filters....I think I get it.&lt;br /&gt;&lt;br /&gt;Literate programming....one source file contains the program code and the end-user documentation.  Single file advantages include that you are forced to think about documentation as you write the code...&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;that is a GREAT idea&lt;/span&gt;&lt;/span&gt;...&lt;br /&gt;&lt;br /&gt;TEI uses this concept in the ODD file, which uses the concept of declarative constraint with formal documentation language.&lt;br /&gt;&lt;br /&gt;Comments/Questions:&lt;br /&gt;Matt Johnson - Lexus-Nexis:&lt;br /&gt;How do we get this information exposed to the user without having to have an XML expert insert this into the file?  Basically a nice user interface is required.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/4539925626935339563/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/4539925626935339563' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/4539925626935339563'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/4539925626935339563'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/freedom-to-constrain-where-does.html' title='Freedom to constrain: Where does attribute constraint come from, Mommy?'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-1620011895571473228</id><published>2008-08-15T09:50:00.002-04:00</published><updated>2008-08-15T11:03:45.929-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Translation between RDF and Topic Maps: Divide and Translate</title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Dichev&quot;&gt;Christo Dichev&lt;/a&gt; - Winston-Salem State University - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Dichev01/Balisage2008-Dichev01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Creation of a tool to reuse and integrate between existing Topic Maps and RDFs.&lt;br /&gt;&lt;br /&gt;This is a good discussion of of translation between different XML tagsets.  What do you drop?  What matches?  What is duplication in the original/combination?</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/1620011895571473228/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/1620011895571473228' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/1620011895571473228'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/1620011895571473228'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/translation-between-rdf-and-topic-maps.html' title='Translation between RDF and Topic Maps: Divide and Translate'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-6255563849903285869</id><published>2008-08-15T09:18:00.005-04:00</published><updated>2008-08-15T09:50:04.478-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'></title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Brown&quot;&gt;Peter Brown&lt;/a&gt; - Pensive - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Brown01/Balisage2008-Brown01.html&quot;&gt;Paper&lt;/a&gt;&lt;span style=&quot;display: block;&quot; id=&quot;formatbar_Buttons&quot;&gt;&lt;span class=&quot;down&quot; style=&quot;display: block;&quot; id=&quot;formatbar_CreateLink&quot; title=&quot;Link&quot; onmouseover=&quot;ButtonHoverOn(this);&quot; onmouseout=&quot;ButtonHoverOff(this);&quot; onmouseup=&quot;&quot; onmousedown=&quot;CheckFormatting(event);FormatbarButton(&#39;richeditorframe&#39;, this, 8);ButtonMouseDown(this);&quot;&gt;&lt;/span&gt;&lt;br /&gt;He has been speaking very interestingly on the situations and contexts of context.  But my system would not find a wireless connection...Hotel Europa really needs to invest in 802.11g routers/access points...so I don&#39;t have much in the way of notes.  But read his paper because it made some interesting points.&lt;br /&gt;&lt;br /&gt;And because I have enjoyed listening to Peter talk during much of the week. &lt;br /&gt;&lt;br /&gt;Slides will be available at &lt;a href=&quot;http://www.pensive.eu/uid/0203&quot;&gt;www.pensive.eu/uid/0203&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/6255563849903285869/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/6255563849903285869' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/6255563849903285869'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/6255563849903285869'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/peter-brown-pensive-paper-he-has-been.html' title=''/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-3353860264683039091</id><published>2008-08-14T13:34:00.005-04:00</published><updated>2008-08-14T14:39:59.289-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Parser possibilities: Why write a markup parser?</title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Smith&quot;&gt;Norman Smith&lt;/a&gt; - Science Applications International Corporation - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Smith01/Balisage2008-Smith01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Markup Parser: A parser that handles SGML, ML, WordML, etc transparently.&lt;br /&gt;&lt;br /&gt;It is a non-validating parser&lt;br /&gt;&lt;br /&gt;Reasons not to write a parser: it is a lot of work, validating parsers are even more work. &lt;br /&gt;&lt;br /&gt;Reasons to write one: use multiple meta-languages.  Learning experience both of the markup language and maybe a programming language.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/3353860264683039091/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/3353860264683039091' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3353860264683039091'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3353860264683039091'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/parser-possibilities-why-write-markup.html' title='Parser possibilities: Why write a markup parser?'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-7896430466389579933</id><published>2008-08-14T11:36:00.006-04:00</published><updated>2008-08-14T12:31:25.842-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Structural metadata and the social limitation of interoperability: A sociotechnical view of XML and digital library standards development</title><content type='html'>&lt;span style=&quot;font-size:100%;&quot;&gt;&lt;a style=&quot;font-weight: bold;&quot; href=&quot;http://www.balisage.org/Bios.html#McDonough&quot;&gt;&lt;em style=&quot;&quot;&gt;&lt;/em&gt;&lt;/a&gt;&lt;em style=&quot;&quot;&gt;&lt;a class=&quot;biolink&quot;&gt;Jerome McDonough&lt;/a&gt;&lt;/em&gt;&lt;span style=&quot;font-weight: bold;&quot;&gt;&lt;em style=&quot;&quot;&gt;&lt;a class=&quot;biolink&quot;&gt; - &lt;/a&gt;&lt;/em&gt;&lt;/span&gt;&lt;/span&gt;&lt;a class=&quot;biolink&quot;&gt;University of Illinois at Urbana-Champaign -&lt;/a&gt;&lt;a href=&quot;http://www.balisage.org/Bios.html#McDonough&quot;&gt;&lt;/a&gt;&lt;a class=&quot;biolink&quot;&gt; Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&quot;What I&#39;m really going to try and convince you of that we have a tremendous problem in the XML world...we simply do not have enough standards in XML community.&quot;&lt;br /&gt;&lt;br /&gt;There is a considerable number of different tagsets that need to be interconnected...but there is no standard of content (&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;dates are the common example and the one he used here&lt;/span&gt;&lt;/span&gt;).&lt;br /&gt;&lt;br /&gt;Socio-technical system theory - States that technology and social context are not separable. &lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;This is kind of like a porthole view into the way librarians handle (and have handled) markup technologies.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Why is XML like a rope?  - give them enough flexibility and they&#39;ll hang themselves.&lt;br /&gt;&lt;br /&gt;Solutions?&lt;br /&gt;Argues for treating translations as standards in the same way we treat tagsets as standards.&lt;br /&gt;&lt;br /&gt;Also that education has to look into communication vs control.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/7896430466389579933/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/7896430466389579933' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/7896430466389579933'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/7896430466389579933'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/structural-metadata-and-social.html' title='Structural metadata and the social limitation of interoperability: A sociotechnical view of XML and digital library standards development'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-1044083784340558668</id><published>2008-08-14T11:36:00.005-04:00</published><updated>2008-08-14T12:16:26.990-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Wordle Image</title><content type='html'>Just a little graphical display of the past few posts.  I noticed the NCBI folks messing with this website between sessions and thought I&#39;d do the same.&lt;br /&gt;&lt;br /&gt;&lt;a href=&quot;http://dwarfurl.com/20d7a&quot;&gt;Kind of cool&lt;/a&gt;.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/1044083784340558668/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/1044083784340558668' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/1044083784340558668'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/1044083784340558668'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/wordle-image.html' title='Wordle Image'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-5909610924981539571</id><published>2008-08-14T11:06:00.005-04:00</published><updated>2008-08-14T11:36:02.050-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>An Onion of Documents and Metadata</title><content type='html'>&lt;h3 class=&quot;author&quot;&gt;&lt;span style=&quot;font-weight: normal; font-family: arial;&quot;&gt;&lt;a href=&quot;http://www.balisage.org/Proceedings/authors.html#KelleherDMatthew&quot;&gt;D. Matthew Kelleher&lt;/a&gt; - Y-12 - &lt;/span&gt;&lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Mason01/Balisage2008-Mason01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;/h3&gt;Must create an electronic inspection form using XML....But it must be usable by floor shop managers, look the same as the current forms, and be able to run out to HTML,  back into XML, and then out again to PDF.&lt;br /&gt;&lt;br /&gt;But it is not just the XML file, but it is also things like image, HTML, transform files, etc.  So all this had to be packaged together.&lt;br /&gt;&lt;br /&gt;They used a Arbortext due to the interface ease of use for non-XML users.&lt;br /&gt;&lt;br /&gt;Workflow- Online Form collects data into a XML data file which is merged into an XML file and then output to the final document.&lt;br /&gt;&lt;br /&gt;Next Steps: need to handle form management and automatic generation of HTML.  Data validation and entry need to be improved.  And of course, train the users....which is ALWAYS a challenge.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/5909610924981539571/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/5909610924981539571' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5909610924981539571'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5909610924981539571'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/onion-of-documents-and-metadata.html' title='An Onion of Documents and Metadata'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-974091038269439137</id><published>2008-08-14T09:46:00.003-04:00</published><updated>2008-08-14T10:33:04.640-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Linking Page Images To Transcriptions With SVG</title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Cayless&quot;&gt;Hugh A. Cayless&lt;/a&gt; - Carolina Digital Library and Archives - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Cayless01/Balisage2008-Cayless01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Preface:  &quot;There is very little practice or theory in this talk.&quot; &lt;br /&gt;&lt;br /&gt;There is a lot of discussion in certain areas of the blogsphere regarding the problems of text-image linking. Hugh has an upcoming huge manuscript digitization projects and might be able to use this theory/practice.&lt;br /&gt;&lt;br /&gt;Goals:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Create an SVG overlay of the manuscript page image&lt;/li&gt;&lt;li&gt;Analyse the structure of the SVG document to detect lines, etc.&lt;/li&gt;&lt;li&gt;Link the groups so produced to structures in a TEI transcription&lt;/li&gt;&lt;li&gt;Display the results in a usable GUI&lt;/li&gt;&lt;/ul&gt;&lt;a href=&quot;http://en.wikipedia.org/wiki/Inkscape&quot;&gt;Inkscape&lt;/a&gt; (the SVG drawing tool) has a tracing tool...trace the bmp (or jpg, etc) and then plug the resulting SVG output into your XML application (freaking cool...didin&#39;t know Inkscape could do that).&lt;br /&gt;&lt;br /&gt;He gathered tools from the open source commnity rather than writing his own monolithic thing.&lt;br /&gt;&lt;br /&gt;potrace - takes bitmap and converts to a vector graphic image - the image has to be bitmap (using imageMagick)  The output from potrace was not what he wanted exactly so he used Inkscape from the command line to convert to absolute coordinates.  Then he used XSLT to do a little cleanup (adding specifically named IDs)&lt;br /&gt;&lt;br /&gt;He then used lxml - ElementTree and numpy - the script reads in the SVG produced by potrace and filtered through Inkscape, does some filtering,detects the lines, then serializes the results back to SVG and Javascript.&lt;br /&gt;&lt;br /&gt;Problems:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;How much can be automated?  Now not much is.&lt;/li&gt;&lt;li&gt;How deeply can this be analysed?&lt;/li&gt;&lt;li&gt;What is the best testing mechanism?&lt;/li&gt;&lt;/ul&gt;How to tell potrace the black/white cutoff is the major sticking point right now.  Also image pre-processing isn&#39;t clearly defined and automation of linking/path disposal is pretty important.&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;His process is very preliminary but really sounds fascinating to me.  I have tons of documents from m grandmothers&#39; homes that I am in fear of losing....to either the elements or the trashcan.  Not everyone in the family finds them useful.  But if I could digitize them, at least I could maintain the content, if not the source material.  I will have to look into aspects of this process.&lt;/span&gt;&lt;/span&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/974091038269439137/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/974091038269439137' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/974091038269439137'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/974091038269439137'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/linking-page-images-to-transcriptions.html' title='Linking Page Images To Transcriptions With SVG'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-5391070635654335025</id><published>2008-08-14T09:04:00.003-04:00</published><updated>2008-08-14T09:46:13.498-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Reconsidering Conventional Markup for Knowledge Representation</title><content type='html'>&lt;a href=&quot;http://www.balisage.org/Bios.html#Dubin&quot;&gt;David Dubin&lt;/a&gt; - University of Illinois Graduate School of Library and Information Science in Champaign, Illinois - &lt;a href=&quot;http://www.balisage.org/Proceedings/html/2008/Dubin01/Balisage2008-Dubin01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Looks like a theoretical paper....but it&#39;s a Dubin presentation so who knows?&lt;br /&gt;&lt;br /&gt;Ok, he is discussing finding the youngest (closest to the leaves in the tree) ancestor of node X and node Y using a type of formal logic programming.&lt;br /&gt;&lt;br /&gt;He&#39;s using formal logic statements to setup a more difficult finding experience.  Unfortunately for you, dear reader, I have not used logic operators in a VERY long time, so I won&#39;t be able to provide anything beyond his paper.&lt;br /&gt;&lt;br /&gt;If you are interested in using formal logic in your Prolog demo, then you know more about this than I ever will and should just go read the paper.  ;-)</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/5391070635654335025/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/5391070635654335025' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5391070635654335025'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5391070635654335025'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/reconsidering-conventional-markup-for.html' title='Reconsidering Conventional Markup for Knowledge Representation'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-166498867597690220</id><published>2008-08-13T13:50:00.006-04:00</published><updated>2008-08-13T15:44:20.061-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Dirty Laundry : Comittee Disasters, What Happened,What We Learned</title><content type='html'>&lt;span style=&quot;font-family: georgia;font-size:130%;&quot; &gt;&lt;span style=&quot;font-style: italic; font-weight: bold;&quot;&gt;Cavet: I am typing this as quickly as I can...I am probably getting things wrong (names, dates, concepts, whatever).  If it doesn&#39;t have quotes around it is it completely a paraphrase and not their actual words.  Think of this as just getting the atmosphere of the session, NOT some kind of historical record.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;David Orchard - Current Available for Employment:&lt;br /&gt;One of the creators of Xinclude  Was the BEA standards lead.&lt;br /&gt;Dirty Laundry - URIs and non-use of URIs in web services&lt;br /&gt;&lt;br /&gt;His story was about how the SOAP group would not embrace URIs.  The lesson learned is that if a group doesn&#39;t buy into the feedback then there really isn&#39;t any point to give the feedback at all.  &lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Sounds to me that you can&#39;t force anything on a Working Group if they don&#39;t want it...that doesn&#39;t sound like a great way to get solid specifications.   &lt;/span&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Maybe I don&#39;t want to know how the sausage is made.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;James David Mason - Y-12 National Security Complex&lt;br /&gt;In standards since 1981 when he attended the organizational meeting of V1(?).  Charles Goldfarb was the chief evangelist behind SGML...and we wouldn&#39;t have SGML/XML without him...but...&lt;br /&gt;&lt;br /&gt;James tells of seeing Goldfarb standing at a blackboard lecturing while two people were arguing loudly about something else, and someone else is yelling at Goldfarb. The only person with the correct response in the room was the person curled in a fetal position in the corner.  Steve Newcomb is sitting next to me and states &quot;have you ever been to a standards meeting?&quot; me: &quot;no&quot; Steve: &quot;He isn&#39;t exageratting.&quot;&lt;br /&gt;&lt;br /&gt;What did he learn?  Committee chairmen have to be stuborrn as a mule.  Editors are not allowed to lock up a text and show it to no one else...in electronic form.&lt;br /&gt;&lt;br /&gt;Lauren Wood -&lt;br /&gt;Chair DOM working group&lt;br /&gt;Her husband is Tim Bray, who was editor of the XML spec, Charles Goldfarb tried to get him to do something by working on her....obviously misunderstanding the professionalism of Lauren Wood.&lt;br /&gt;&lt;br /&gt;The hardest thing to deal with is when you can&#39;t get companies to care about the topic of the committee.  But sometimes when companies care too much it becomes a backroom kind of political games thing.  Where people would call around to committee members and their bossses to put pressure for their particular spec change or desire.&lt;br /&gt;&lt;br /&gt;Microsoft wanted Tim Bray to drop as XML spec Editor because he was consulting with Netscape.  But Softquad, where Lauren Wood was working, took the stand that their suggestion for a replacement Eve Mahler who worked for Arbortext.  What is comes down to is that people can do good technical work even if they are working for your competitor.&lt;br /&gt;&lt;br /&gt;Mavis Cournane&lt;br /&gt;&lt;br /&gt;In 2002 a European standards committee through Oasis where the after-market participants were non-technical and the automotive participants had no interest changing their processes.  So the process went nowhere and the process died.  No spec was created, it was a failed committee spec.  So in 2007 a European Union legislative requirement came through that states automotive manufactures have to hold to the Oasis spec (that doesn&#39;t exist and no one agreed to at the time).  So this is as a mess.&lt;br /&gt;&lt;br /&gt;TO fix the mess...Oasis created a new type called an Oasis format...and the auto makers are basically screwed.&lt;br /&gt;&lt;br /&gt;What she learned:  Curb the powers of the EU.  A sucessful standard will have to have a win-win for the key implementors of the standard.  There should be some way to stop failed standards from becoming law.&lt;br /&gt;&lt;br /&gt;Jon Bosak -&lt;br /&gt;Started with formal standards on DSSL.&lt;br /&gt;&lt;br /&gt;XML namespaces is his committe disaster.  Namespaces were in play from almost the beginning...but not the forefront like we have now.  Until the WWW6 conference in Santa Clara.  Tim Burners Lee wanted to have a way to combine multiple repositories (?) in order to support something he called &quot;the semantic web&quot;. &lt;br /&gt;&lt;br /&gt;By 1997 the group tried to put the whole namespace thing behind them...but Tim wouldn&#39;t let it.  By July of 1998 they had something but nothing that they were happy about.  It arrived at W3C but Tim wouldn&#39;t approve it.  Because they were using public identifiers for namespace implementation.  In the end they were given one week to base namespaces on attributes based on an existing project of Microsoft.  That&#39;s what we have here.&lt;br /&gt;&lt;br /&gt;Jon doesn&#39;t blame Tim for this...although he wasn&#39;t happy with him at the time.  The way W3C was setup at the time made everything roll through Tim which caused these problems.  He also said that there is no way of knowing if the public identifiers would have worked better.  But pushing it through too quickly is a real problem.&lt;br /&gt;&lt;br /&gt;And governance matters.  We need to setup committees and then wait awhile to see how things work out.  Pushing things isn&#39;t working.&lt;br /&gt;&lt;br /&gt;Also be concerned about the transparancy of the process.  A lot of the problems came from hidding things.&lt;br /&gt;&lt;br /&gt;Patrick Durusau -&lt;br /&gt;&lt;br /&gt;A group of vendors wrote part of a copright protection standard that they wanted to push through Oasis.  It was a DRM standard...a horrible piece of work.  But the opposition wasn&#39;t uniform.  There were people who wanted to just say &quot;hell no&quot; and hold the process hostage.  It eventually goes on until there is a motion to disolve the TC.  One of the very few TCs that just died.  So the original group took part of the standard, went to ISO, and got it as a standard.  But Sony turned around said,we aren&#39;t taking this standard...it sucks.&lt;br /&gt;&lt;br /&gt;He compares this to OOXML.  &lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;I don&#39;t understand the problem there.  First time I heard about it was yesterday.&lt;/span&gt;&lt;/span&gt; &lt;span style=&quot;font-size:100%;&quot;&gt;&lt;br /&gt;&lt;br /&gt;Microphone Comments (&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;I didn&#39;t get them all)&lt;/span&gt;&lt;/span&gt;:&lt;br /&gt;David Lee - How does one get involved with a standards body?&lt;br /&gt;Response: Jon Bosak and Lauren Wood - $$$ is an issue.  But figure out what you are interested in, take a look and see if they will let you in, and if you really want to be in.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Simon St. Laurent -&lt;br /&gt;He states that no one seems to be able to enforce their standards or even their own process or procedures?  Is there any cause for hope in standards organizations. &lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt; Simon seems very disturbed by the entire standards process.  I wonder if he&#39;s written about it anywhere?&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;Response: Patrick says there have been informal conversation about reform but that&#39;s been the way it has been before. &lt;br /&gt;&lt;br /&gt;Kurt Cagle - ISO has taken a significant hit in credibility due to the OOXML standard problem (&lt;span style=&quot;font-size:85%;&quot;&gt;what the hell happened?&lt;/span&gt;)  How do you reestablish authority again after that kind of failing?&lt;br /&gt;Response:&lt;br /&gt;Jon Bosak - JTC1 is actually responsible but everyone seems to be blaming ISO.  JTC1 is a combination of ISO and IEEC (?).  So it is hard to get credibiilty when the actual culprit isn&#39;t identified correctly?&lt;br /&gt;Response:&lt;br /&gt;James Mason - ISO does a lot of standards that have nothing to do with software or protocalls or anything like that.  So the processes and procedures may not always line up and they might be spread a bit thin at times.&lt;br /&gt;&lt;br /&gt;Murray Altheim-&lt;br /&gt;The motivation involved in becoming a part of a standard committee.  Altruism is often a part of being a member.  If you feel as a person that your skills can contribute to a standard that impacts millions of people then you have reached the highest part of charity.&lt;br /&gt;&lt;br /&gt;Response: Jon Bosak- &quot;Those of us who weren&#39;t put there as attack dogs for our company are there for the glory. Pure and simple.&quot; (&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;I sure hope I got that out of context cause that doesn&#39;t seem to be the point Murray was making&lt;/span&gt;&lt;/span&gt;) - He clarified that by glory he means &quot;between you and your maker&quot;, not between you and the rest of humanity.&lt;br /&gt;&lt;br /&gt;Steve DeRose -&lt;br /&gt;There is a human cost to doing what they are doing.  I think that eludes back to what James was talking about earlier.  Part of it is human nature (people get in a bad mood,just happens).  Almost everyone on the original XML committee threatened to walk out, but no body did.  Somehow they got past that and made it work.  The way the committees are setup makes it difficult.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;There is a cost in $$ and time&lt;/li&gt;&lt;li&gt;Experts may want to serve...but you serve at the will of the corporate interests&lt;/li&gt;&lt;li&gt;Lack of respect from co-workers and academic institutions.&lt;/li&gt;&lt;/ol&gt;(&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Seriously?  The people working on these things are geniuses.  How can they be considered less than by others for taking on the challenge and REALLY understanding how this all works?  I don&#39;t get it.&lt;/span&gt;&lt;/span&gt;)&lt;br /&gt;&lt;br /&gt;How might we be able to change that bias? &lt;br /&gt;&lt;em style=&quot;&quot;&gt;&lt;a class=&quot;biolink&quot; href=&quot;http://www.balisage.net/Bios.html#Cournane&quot;&gt;&lt;/a&gt;&lt;/em&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/166498867597690220/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/166498867597690220' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/166498867597690220'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/166498867597690220'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/dirty-laundry-comittee-disasters-what.html' title='Dirty Laundry : Comittee Disasters, What Happened,What We Learned'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-3523630428231693215</id><published>2008-08-13T11:44:00.003-04:00</published><updated>2008-08-13T12:31:28.166-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>State of the Art of Streaming: Why W3C Xproc, W3C XSLT WGs and ISO SC34 WG 1 are looking closely at streaming</title><content type='html'>&lt;a href=&quot;http://www.balisage.net/Bios.html#Zergaoui&quot;&gt;Mohamed Zergaoui&lt;/a&gt; - Xproc &amp;amp; XSLT 2.0 WG - &lt;a style=&quot;font-size: 24px; line-height: 29.7px;&quot; href=&quot;http://balisage.org/Proceedings/html/2008/Zergaoui01/Balisage2008-Zergaoui01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Define  Streaming:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;related to memory usage&lt;/li&gt;&lt;li&gt;related to input size&lt;/li&gt;&lt;li&gt;related to latency of process&lt;/li&gt;&lt;/ul&gt;It appears that the document size and the requirement to keep it in memory greatly increases the time of processing and thus the difficulty of constant streaming.&lt;br /&gt;&lt;br /&gt;Murray Altheim: How to deal with petabytes of data in streaming?  Isn&#39;t the stream being returned in an event model?  Why couldn&#39;t there be a model where you take every single hit on the query and not wait until the end?&lt;br /&gt;Response: Not in scope of the XML but having an entry should trigger a usage...but might be a problem of defining what WG should be handling this.&lt;br /&gt;&lt;br /&gt;Kurt Cagle: What&#39;s happening with DSDL?  The site seems out of date on a lot of points.&lt;br /&gt;Response:  ODF and OOXML is taking up most of the time and perhaps they can get back to work now that 2 years have been lost.  &lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;He seemed tentative about saying that.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Alex Milowski: Has anybody done the work to test XPath 2 within the concept of streaming?&lt;br /&gt;Response:  XPath has many different subsets of stream and it is going to be a problem.  But there are papers out there talking about that very subject.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/3523630428231693215/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/3523630428231693215' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3523630428231693215'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/3523630428231693215'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/state-of-art-of-streaming-why-w3c-xproc.html' title='State of the Art of Streaming: Why W3C Xproc, W3C XSLT WGs and ISO SC34 WG 1 are looking closely at streaming'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-7035142436952019677</id><published>2008-08-13T10:50:00.003-04:00</published><updated>2008-08-13T11:44:06.807-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Xmlsh - a command language (shell) based on the philospohy of Unix Shells designed for XML</title><content type='html'>&lt;a style=&quot;font-size: 24px; line-height: 29.4px;&quot; href=&quot;http://www.balisage.net/Bios.html#Lee&quot;&gt;David A. Lee&lt;/a&gt; - Epocrates - &lt;a style=&quot;font-size: 24px; line-height: 29.4px;&quot; href=&quot;http://balisage.org/Proceedings/html/2008/Lee01/Balisage2008-Lee01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Why? - Unix was a radical paradigm shift and 40 years of &#39;progress&#39; have eroded the core design fundamentals.  Data types are not byte/line streams,tools have not evolved with the data (XML), and working with XML is WAY too complicated.&lt;br /&gt;&lt;br /&gt;Open Source/closed development - test it out, he needs the feedback but no coding assistance.&lt;br /&gt;&lt;a style=&quot;font-size: 24px; line-height: 29.4px;&quot; href=&quot;http://www.xmlsh.org&quot;&gt;www.xmlsh.org&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Pure Java, Saxon 9, Log4J, Optional external OS commands (not reinventing the wheel)&lt;br /&gt;&lt;br /&gt;Why not write new commands? - The shell is aging....focus is on streams and byte streams...he wants to use XML natively.&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size: 19.5px; line-height: 22.05px;font-size:85%;&quot; &gt;&lt;span style=&quot;font-style: italic; font-size: 19.5px; line-height: 22.05px;&quot;&gt;He does a bunch of live demo...can&#39;t show that, but the link above does take you to the code.  If you were here you could have gotten it on a tiny 512 USB stick.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Some annoying things but the one that seems to upset him the most is that console IO is limited in Java...no pure java implementation of clear screen...&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size: 19.5px; line-height: 22.05px;font-size:85%;&quot; &gt;&lt;span style=&quot;font-style: italic; font-size: 19.5px; line-height: 22.05px;&quot;&gt;Really what this is a personal project that everyone can try out.  He has it working..but he&#39;d love to see how it works for other people.  Check it out.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Questions/Comments:&lt;br /&gt;Murry Altheim: Has he looked into Groovey Scripting? &lt;br /&gt;Response: more interested in a scripting environment...implementing a Groovy interface seems trivial.  Something missing right now is the lack of documentation for calling it from another environment...but that is something for later.&lt;br /&gt;&lt;br /&gt;It isn&#39;t ready to scale...and David is aware of that.  He&#39;s hoping to learn something from other speakers about non-serialized XML.&lt;br /&gt;&lt;br /&gt;Alex Milowski: Have you looked at XProc?&lt;br /&gt;Response: He is looking for command line interface and XProc seems to be something that requires building entire documents for the XProc.  But XProc to xmlsh seems doable.&lt;br /&gt;&lt;br /&gt;Kurt Cagle:&lt;br /&gt;Have you thought of integrating with eXist?&lt;br /&gt;Response:  I hadn&#39;t even heard of eXist until yesterday.  but I definitively am going to look into it.&lt;br /&gt;&lt;br /&gt;Another Attendee: Unicode support?&lt;br /&gt;Response: It is a goal...since it is java it is native, but the parser is utf-8 but testing hasn&#39;t occurred on the extended sets.  No fundelmental reason why it can&#39;t be end-to-end and I want it to be.&lt;br /&gt;&lt;br /&gt;Same Attendee: Memory management?&lt;br /&gt;Response: Xquery serialized to a native XML dbase.  eXist might assist this problem.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/7035142436952019677/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/7035142436952019677' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/7035142436952019677'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/7035142436952019677'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/xmlsh-command-language-shell-based-on.html' title='Xmlsh - a command language (shell) based on the philospohy of Unix Shells designed for XML'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-5791998692128151757</id><published>2008-08-13T09:57:00.008-04:00</published><updated>2008-08-26T08:15:30.384-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Topic Maps In Near Real-Time</title><content type='html'>Sam Hunting - Universal Pantograph - &lt;a href=&quot;http://balisage.org/Proceedings/html/2008/Hunting01/Balisage2008-Hunting01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Update: &lt;a href=&quot;http://news.oreilly.com/2008/08/web-meet-semantic-web.html&quot;&gt;Simon St. Laurent&lt;/a&gt; has more well thought out discussions of the proceedings. &lt;br /&gt;Rejects the search paradigm...he only finds &#39;finding&#39; interesting.&lt;br /&gt;&lt;br /&gt;States that his blog is part of the top 5k in the world.  &lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;I must figure out if I already have that blog in my reader.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;He is presenting with a working Drupal application not a set of slides.&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Just in case you ever meet Sam, be aware that his presentation here was a topic map of &#39;The Criminal Bush Administration&#39;.  So, I doubt he&#39;s much of a Republican.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;I apologize in advance....I just don&#39;t get this Topic Map stuff.  But he is discussing some &#39;markup&#39; related things so I&#39;ll focus on that.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Looks like Sam also attended another conference this year.  Eschacon happened March 28-30, 2008...and this appears to have been the official T-Shirt.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;a onblur=&quot;try {parent.deselectBloggerImageGracefully();} catch(e) {}&quot; href=&quot;http://eschacon08.blogspot.com/2008/03/keep-yer-shoits-on.html&quot;&gt;&lt;img style=&quot;margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;&quot; src=&quot;https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhMFZTR6I_cnUPm3arCRd_oQIIWN6YLOHN8MXouVYnem9RjwK__Z_gQsnvERE27Cn1XMPzM7xrI9gkDvgC6i3spRjocdlEzETYPvj9gHp1p0oa6TQ9a4kxZgxoMsxiO8e5eu7DK4REeB-8/s200/DFH_Cardinal_Template.jpg&quot; alt=&quot;&quot; id=&quot;BLOGGER_PHOTO_ID_5234005114835828082&quot; border=&quot;0&quot; /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Sam was wearing it during his presentation.  You&#39;d think DFH means something in XML land where everything has an abbreviation...but you&#39;d be wrong.  DFH is Dirty F-ing Hippy&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;His Drupal application has validation of the wiki markup (his assertions)...which it seems that wiki-media does not have.  Which seems like a serious deficiency if true.&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;Also something to know about Sam...he just said &quot;I LOVE administering Drupal sites&quot;.  I think that is a pretty useful thing to know about a programmer of any stripe.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;He shows how much of the pieces of his Drupal application function.  Basically it is a smooth application for creating a real-time (just hit submit) website.&lt;br /&gt;&lt;br /&gt;He had a couple hours of material but Steve Newcomb keeps badgering him about wrapping up...and now we are overtime.&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;12 minutes ago Steve asked for 10 minutes for questions...this is funny the old guard is haranguing him into submission...I think they are going to pull the video feed any moment.&lt;/span&gt;&lt;/span&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/5791998692128151757/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/5791998692128151757' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5791998692128151757'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/5791998692128151757'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/topic-maps-in-near-real-time.html' title='Topic Maps In Near Real-Time'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhMFZTR6I_cnUPm3arCRd_oQIIWN6YLOHN8MXouVYnem9RjwK__Z_gQsnvERE27Cn1XMPzM7xrI9gkDvgC6i3spRjocdlEzETYPvj9gHp1p0oa6TQ9a4kxZgxoMsxiO8e5eu7DK4REeB-8/s72-c/DFH_Cardinal_Template.jpg" height="72" width="72"/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-6970558498923202086</id><published>2008-08-12T19:17:00.007-04:00</published><updated>2008-08-12T23:55:03.751-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>XQuery Tips and Tricks - Kurt Cagle - Nocturne</title><content type='html'>Discussion on oXygen vs. Stylus Studio.  Appears to come down to price and responsiveness.  Kurt also mentioned that if you are comfortable with the Stylus Studio interface then you are fine, but he likes the oXygen interface.&lt;br /&gt;&lt;br /&gt;He&#39;s using eXist as the database.  Reminds people that images and binaries are accessible from the database as well.&lt;br /&gt;&lt;br /&gt;&amp;lt;Pizza Break - Thank You Anonymous Donor&amp;gt;&lt;br /&gt;&lt;br /&gt;One of the coolest features: &quot;You can store the XQueries themselves within the database.&quot;&lt;br /&gt;&lt;br /&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;GEEK OUT!&lt;/span&gt;  He is using creation of characters for a D&amp;amp;D like game as the example....Chaotic Neutral Cleric for me!&lt;br /&gt;&lt;br /&gt;Tip:  &lt;span style=&quot;font-style: italic;font-size:85%;&quot; &gt;declare function local:randchoose($ctermset as node()) as xs:string{&lt;/span&gt;&lt;br /&gt;     Defining the function as local allows you to keep that namespace private and have another namespace for the public output.&lt;br /&gt;&lt;br /&gt;Tip:  &lt;span style=&quot;font-size:85%;&quot;&gt;&lt;span style=&quot;font-style: italic;&quot;&gt;&lt;gender&gt;{$gender}&lt;/gender&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;     Embed Xquery in XML by using the {} operator.&lt;br /&gt;&lt;br /&gt;Tip:  &lt;span style=&quot;font-size:85%;&quot;&gt;string{$character/gender}&lt;br /&gt;&lt;/span&gt;      Remember that XPath returns element() and attribute() (and others) not text, unless specifically requested.  Could easily cause problems that aren&#39;t immediately clear in a browser.&lt;br /&gt;&lt;br /&gt;conf.xml in eXist:&lt;br /&gt;setup -  serielizer, scheduler, automatic XSL generation, indentation, XSLT transformer, validation mode, Xquery modules section...optional modules (turn them on -uncomment them)&lt;br /&gt;then run the build routine.&lt;br /&gt;&lt;br /&gt;Tip: in eXist -&lt;br /&gt;&lt;ul&gt;&lt;li&gt;util, and request modules are good to get to know.&lt;/li&gt;&lt;li&gt;transform allows the use of XSLT on XML in the XQuery.  So use XQuery to collect the data and XSLT to transform it...easier recursion.&lt;/li&gt;&lt;li&gt;sql module allows you to get connections from just about anywhere with a URI&lt;/li&gt;&lt;/ul&gt;I&#39;m not sure there was a lot of information for people using a dbase other than eXist or people who are pretty knowledgeable about XQuery.  But it was pretty cool to watch him code in front of us.  I like seeing people &lt;a href=&quot;http://www.amazon.com/exec/obidos/ASIN/0672324792/manorfieldconsulting-20&quot;&gt;who do their jobs well, do their jobs well&lt;/a&gt;.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/6970558498923202086/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/6970558498923202086' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/6970558498923202086'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/6970558498923202086'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/xquery-tips-and-tricks-kurt-cagle.html' title='XQuery Tips and Tricks - Kurt Cagle - Nocturne'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-4397921365050949067</id><published>2008-08-12T17:43:00.003-04:00</published><updated>2008-08-12T18:03:54.995-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Evening Break</title><content type='html'>Well that day at Balisage went by quickly.  I wonder why?  The speakers were interesting and the topics were intriguing...perhaps time really does fly when one is having fun.&lt;br /&gt;&lt;br /&gt;I had the chance to sit at lunch with several people who had bad trips to Montreal and several who had uneventful ones...but it appears that there is always someone with a worse flying story than I have...I guess I need to travel more.&lt;br /&gt;&lt;br /&gt;I&#39;ve just finished a Skype call with my wife and daughters and I will head out in search of a meal soon.  It&#39;ll have to be a quick one because 7pm is both Kurt Cagle presenting on XQuery tips and tricks and the W3Québec user group.  I was looking forward to the user group discussion but I will have to choose..and since XQuery is more likely to help pay the bills...&lt;br /&gt;&lt;br /&gt;But I do wish I could hear about using XML in cool web sites.  Hopefully someone else blogs about that session.&lt;br /&gt;&lt;br /&gt;BTW, I noticed Simon St. Laurent taking pictures today.  I hope he posts them soon to flickr.</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/4397921365050949067/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/4397921365050949067' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/4397921365050949067'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/4397921365050949067'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/evening-break.html' title='Evening Break'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-619441053309172159.post-2405794145558338319</id><published>2008-08-12T15:56:00.006-04:00</published><updated>2008-08-12T17:08:53.080-04:00</updated><category scheme="http://www.blogger.com/atom/ns#" term="BalisageConference08"/><title type='text'>Office Suite Markup: Is It Worthwhile</title><content type='html'>&lt;a href=&quot;http://balisage.org/Bios.html#Durusau&quot;&gt;Patrick Durusau&lt;/a&gt; - &lt;a href=&quot;http://balisage.org/Proceedings/html/2008/Durusau01/Balisage2008-Durusau01.html&quot;&gt;Paper&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;He is the Editor of ODF....so I imagine he thinks it is worthwhile.&lt;br /&gt;&lt;br /&gt;Open Office Allows:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;We are interested in explicit semantics.  But users just want to get things done.&lt;/li&gt;&lt;li&gt;Interchange&lt;/li&gt;&lt;li&gt;Application Independence&lt;/li&gt;&lt;li&gt;Long term preservation of content&lt;/li&gt;&lt;/ul&gt;Server pack 2 for Office 2007 will have native ODF support&lt;br /&gt;&lt;br /&gt;Lots of argument with his conclusions that everyone making their own XML is a good thing.  My computer crashed (damn Windows) so I am behind on the conversation.  Liam and Tommie had some great back-n-forth with Patrick.&lt;br /&gt;&lt;br /&gt;ODF tags are presentational?  Patrick says they aren&#39;t all presentational but isn&#39;t specific about it.  Sounds like most of the tags are presentational...which is not how I recall XML being used to its full potential.&lt;br /&gt;&lt;br /&gt;Wendell:  &quot;What you left out is that these formats are REALLY ugly.&quot;&lt;br /&gt;&lt;br /&gt;This talk brought up a lot of discussion but I think it can be mostly narrowed down to:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;How does this help me as an XML developer?&lt;/li&gt;&lt;li&gt;Microsoft Office is pervasive...it is going to be difficult to get through that?&lt;/li&gt;&lt;/ol&gt;</content><link rel='replies' type='application/atom+xml' href='http://manorfieldconsulting.blogspot.com/feeds/2405794145558338319/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment/fullpage/post/619441053309172159/2405794145558338319' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/2405794145558338319'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/619441053309172159/posts/default/2405794145558338319'/><link rel='alternate' type='text/html' href='http://manorfieldconsulting.blogspot.com/2008/08/office-suite-markup-is-it-worthwhile.html' title='Office Suite Markup: Is It Worthwhile'/><author><name>Mark</name><uri>http://www.blogger.com/profile/15847827266240786646</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='https://img1.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>