<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>ChemSpider Blog</title>
	
	<link>http://www.chemspider.com/blog</link>
	<description>Helping to Build a Structure Centric Community for Chemists</description>
	<lastBuildDate>Tue, 10 Nov 2009 21:56:15 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/ChemspiderBlog" type="application/rss+xml" /><feedburner:emailServiceId>ChemspiderBlog</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
		<title>Our RSC Lab on a Chip Article is Free Access immediately</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/G5Pgcp4KNr8/our-rsc-lab-on-a-chip-article-is-free-access-immediately.html</link>
		<comments>http://www.chemspider.com/blog/our-rsc-lab-on-a-chip-article-is-free-access-immediately.html#comments</comments>
		<pubDate>Tue, 10 Nov 2009 21:48:39 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[ADME/Tox data]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Drug Discovery]]></category>
		<category><![CDATA[Lab on a Chip]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1496</guid>
		<description><![CDATA[My friend and colleague Sean Ekins and I wrote a perspective for the RSC&#8217;s Lab on a Chip journal and it was released as an Advance Article, as Free Access, this evening.
The perspective is entitled &#8220;Precompetitive preclinical ADME/Tox data: set it free on the web to facilitate computational model building and assist drug development&#8220; . The [...]]]></description>
			<content:encoded><![CDATA[<p>My friend and colleague Sean Ekins and I wrote a perspective for the RSC&#8217;s Lab on a Chip journal and it was released as an Advance Article, as Free Access, this evening.</p>
<p>The perspective is entitled &#8220;<span style="color: #800000;"><strong>Precompetitive preclinical ADME/Tox data: set it free on the web to facilitate computational model building and assist drug development</strong></span>&#8220; . The title is self-explanatory in terms of what we are trying to communicate. <a href="http://www.rsc.org/delivery/_ArticleLinking/DisplayHTMLArticleforfree.cfm?JournalCode=LC&amp;Year=2010&amp;ManuscriptID=b917760b&amp;Iss=Advance_Article">The paper is online now and available here</a>.</p>
<p><img class="aligncenter size-full wp-image-1499" title="ADME" src="http://www.chemspider.com/blog/wp-content/uploads/2009/11/ADME.png" alt="ADME" width="349" height="193" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Four-rsc-lab-on-a-chip-article-is-free-access-immediately.html';
  addthis_title  = 'Our+RSC+Lab+on+a+Chip+Article+is+Free+Access+immediately';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Our RSC Lab on a Chip Article is Free Access immediately" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Our+RSC+Lab+on+a+Chip+Article+is+Free+Access+immediately" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/our-rsc-lab-on-a-chip-article-is-free-access-immediately.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/our-rsc-lab-on-a-chip-article-is-free-access-immediately.html</feedburner:origLink></item>
		<item>
		<title>Presentation at the German Conference for Cheminformatics</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/0qJeulEF1Fc/presentation-at-the-german-conference-for-cheminformatics.html</link>
		<comments>http://www.chemspider.com/blog/presentation-at-the-german-conference-for-cheminformatics.html#comments</comments>
		<pubDate>Tue, 10 Nov 2009 21:36:25 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Chemspider Journal]]></category>
		<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[Drugbank]]></category>
		<category><![CDATA[German Conference of Cheminformatics]]></category>
		<category><![CDATA[Goslar]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Open notebook Science]]></category>
		<category><![CDATA[Project Prospect]]></category>
		<category><![CDATA[PubChem]]></category>
		<category><![CDATA[RSC]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1494</guid>
		<description><![CDATA[I have been at the German Conference for Cheminformatics for the past three days. The conference is in Goslar. I twittered the conference using #goslarcheminf and it appears that there was little interest in twittering here&#8230;seems like it&#8217;s an &#8220;American&#8221; thing to do. I gave a presentation entitled &#8220;ChemSpider &#8211; Building a Foundation for the [...]]]></description>
			<content:encoded><![CDATA[<p>I have been at the <a href="http://www.gdch.de/vas/tagungen/tg/5358/program__e.htm">German Conference for Cheminformatics </a>for the past three days. The conference is in Goslar. I twittered the conference using <a href="http://twitter.com/#search?q=%23goslarcheminf">#goslarcheminf </a>and it appears that there was little interest in twittering here&#8230;seems like it&#8217;s an &#8220;American&#8221; thing to do. I gave a presentation entitled &#8220;ChemSpider &#8211; Building a Foundation for the Semantic Web by Hosting a Crowd Sourced Databasing Platform for Chemistry&#8221; and have put it on SlideShare <a href="http://www.slideshare.net/AntonyWilliams/chemspider-building-a-foundation-for-the-semantic-web-by-hosting-a-crowd-sourced-databasing-platform-for-chemistry">here</a>. The abstract for the talk is below as well as the embedded Slideshare widget for the talk. This talk was a lot less rushed than usual&#8230;not just 20 minutes and I personally <em>enjoyed</em> giving this talk to the audience. Commonly I feel that the talks I give are very rished and I only get to scratch the surface of what we are up to with ChemSpider. It&#8217;s amazing how an additional 15 minutes allowed me to expand on the issues and the work. The presentation drew a lot of questions and attention after the session and I&#8217;m hoping that many of the discussions regarding collaboration and depositions of new data come to fruition.</p>
<p><strong>Abstract</strong></p>
<p>There is an increasing availability of free and open access resources for chemists to use on the internet. Coupled with the increasing availability of Open Source software tools we are in the middle of a revolution in data availability and tools to manipulate these data. ChemSpider is a free access website for chemists built with the intention of providing a structure centric community for chemists. It was developed with the intention of aggregating and indexing available sources of chemical structures and their associated information into a single searchable repository and making it available to everybody, at no charge.</p>
<p>There are tens if not hundreds of chemical structure databases such as literature data, chemical vendor catalogs, molecular properties, environmental data, toxicity data, analytical data etc. and no single way to search across them.  Despite the fact that there were a large number of databases containing chemical compounds and data available online their inherent quality, accuracy and completeness was lacking in many regards. The intention with ChemSpider was to provide a platform whereby the chemistry community could contribute to cleaning up the data, improving the quality of data online and expanding the information available to include data such as reaction syntheses, analytical data, experimental properties and linking to other valuable resources. It has grown into a resource containing over 21 million unique chemical structures from over 200 data sources.</p>
<p>ChemSpider has enabled real time curation of the data, association of analytical data with chemical structures, real-time deposition of single or batch chemical structures (including with activity data) and transaction-based predictions of physicochemical data. The social community aspects of the system demonstrate the potential of this approach. Curation of the data continues daily and thousands of edits and depositions by members of the community have dramatically improved the quality of the data relative to other public resources for chemistry.</p>
<p>This presentation will provide an overview of the history of ChemSpider, the present capabilities of the platform and how it can become one of the primary foundations of the semantic web for chemistry. It will also discuss some of the present projects underway since the acquisition of ChemSpider by the Royal Society of Chemistry.</p>
<div id="__ss_2464828" style="text-align: left; width: 425px;"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" title="ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd Sourced Databasing Platform for Chemistry" href="http://www.slideshare.net/AntonyWilliams/chemspider-building-a-foundation-for-the-semantic-web-by-hosting-a-crowd-sourced-databasing-platform-for-chemistry">ChemSpider &#8211; Building a Foundation for the Semantic Web by Hosting a Crowd Sourced Databasing Platform for Chemistry</a><object style="margin:0px" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=cfakepathbuildingafoundationforthesemanticwebbyhostingacrowdsourceddatabasingplatformforchemistry-091110052123-phpapp01&amp;stripped_title=chemspider-building-a-foundation-for-the-semantic-web-by-hosting-a-crowd-sourced-databasing-platform-for-chemistry" /><param name="allowfullscreen" value="true" /><embed style="margin:0px" type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=cfakepathbuildingafoundationforthesemanticwebbyhostingacrowdsourceddatabasingplatformforchemistry-091110052123-phpapp01&amp;stripped_title=chemspider-building-a-foundation-for-the-semantic-web-by-hosting-a-crowd-sourced-databasing-platform-for-chemistry" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<div style="font-family: tahoma,arial; height: 26px; font-size: 11px; padding-top: 2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/AntonyWilliams">Antony Williams, ChemSpiderman</a>.</div>
</div>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fpresentation-at-the-german-conference-for-cheminformatics.html';
  addthis_title  = 'Presentation+at+the+German+Conference+for+Cheminformatics';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Presentation at the German Conference for Cheminformatics" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Presentation+at+the+German+Conference+for+Cheminformatics" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/presentation-at-the-german-conference-for-cheminformatics.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/presentation-at-the-german-conference-for-cheminformatics.html</feedburner:origLink></item>
		<item>
		<title>Building Small Curated Databases versus Large Data Collections: ChEBI vs ChEMBL</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/rM_sj94Mybc/building-small-curated-databases-versus-large-data-collections-chebi-vs-chembl.html</link>
		<comments>http://www.chemspider.com/blog/building-small-curated-databases-versus-large-data-collections-chebi-vs-chembl.html#comments</comments>
		<pubDate>Fri, 06 Nov 2009 04:30:49 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ChEBI]]></category>
		<category><![CDATA[ChemBL]]></category>
		<category><![CDATA[Curated chemistry data]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1492</guid>
		<description><![CDATA[For those of you who read this blog you will be aware that it can take a lot of time just to get a single chemical curated against its correct associations of chemical names and synonyms. I&#8217;ve shown this for vancomycin, Taxol (1,2,3), Ginkgolide B and it is presently underway with Digitonin, though not yet [...]]]></description>
			<content:encoded><![CDATA[<p>For those of you who read this blog you will be aware that it can take a lot of time just to get a single chemical curated against its correct associations of chemical names and synonyms. I&#8217;ve shown this for vancomycin, Taxol (1,2,3), Ginkgolide B and it is presently underway with Digitonin, though not yet complete. Working on one structure is hard enough. Building a database of a few thousand curated structures is difficult work yet the EBI did it, and did it well when they built ChEBI. ChEBI is also not perfect as we discovered working on vancomycin and I still find occasional small issues.</p>
<p>The EBI recently released the ChEMBL database. This is a much bigger resource as described at the <a href="http://www.ebi.ac.uk/chembldb/index.php">home page for the resource here</a>. The site states &#8220;ChEMBL is a database of ca. 500,000 bioactive compounds, their quantitative properties and bioactivities (binding constants, pharmacology and ADMET, etc). The data is abstracted and curated from the primary scientific literature and the data made available due to funding by the <a href="http://www.wellcome.ac.uk/" target="_blank">Wellcome Trust</a>.&#8221; It is MUCH harder to curate larger databases and 1/2 a million records is a challenge.</p>
<p>I downloaded the data from the <a href="ftp://ftp.ebi.ac.uk/pub/databases/chembl/">FTP site</a> and took a browse of the data. There are definitely structures in the data file that we don&#8217;t have in ChemSpider but I found an issue with charge balance for many hundreds of records where the counterions were charged (for example, chloride or bromide) but the primary component was neutral. <a href="http://www.ebi.ac.uk/chembldb/index.php/compound/inspect/1435">An example is here</a> where the compound is named as a hydrochloride but the compound has the chloride anion. I think this likely arises from treatment with some type of standardizer so it should be a matter of changing the standardizer settings and regenerating. We deal with over 23 million compounds and have been through such issues ourselves when it comes to generation of structure images.</p>
<p>For an example of a rich record in ChEMBL take a look at this record showing the target, assay, activity type, value and reference all listed. ChEMBL is sure to be an invaluable reference for the Life Sciences.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fbuilding-small-curated-databases-versus-large-data-collections-chebi-vs-chembl.html';
  addthis_title  = 'Building+Small+Curated+Databases+versus+Large+Data+Collections%3A+ChEBI+vs+ChEMBL';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Building Small Curated Databases versus Large Data Collections: ChEBI vs ChEMBL" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Building+Small+Curated+Databases+versus+Large+Data+Collections:+ChEBI+vs+ChEMBL" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/building-small-curated-databases-versus-large-data-collections-chebi-vs-chembl.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/building-small-curated-databases-versus-large-data-collections-chebi-vs-chembl.html</feedburner:origLink></item>
		<item>
		<title>In Memoriam – Warren DeLano</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/9S2xd86yJAI/in-memoriam-warren-delano.html</link>
		<comments>http://www.chemspider.com/blog/in-memoriam-warren-delano.html#comments</comments>
		<pubDate>Fri, 06 Nov 2009 01:01:38 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[PyMOL]]></category>
		<category><![CDATA[Warren DeLano]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1489</guid>
		<description><![CDATA[I have never met Warren DeLano. But, I have respected him from afar for a long time. Warren is the developer of PyMol, an Open Source molecular visualization system that has made enormous contributions to the community and can produce stunning visualizations of Proteins. His impact on the field of protein visualization has been recognized [...]]]></description>
			<content:encoded><![CDATA[<p>I have never met Warren DeLano. But, I have respected him from afar for a long time. Warren is the developer of <a href="http://en.wikipedia.org/wiki/PyMOL">PyMol</a>, an Open Source molecular visualization system that has made enormous contributions to the community and can produce <a href="http://www.youtube.com/watch?v=h-2fQCIsBnk">stunning visualizations of Proteins</a>. His impact on the field of protein visualization has been recognized many times by the community and his tools are used in labs all over the world. He has garnered respect across our community.</p>
<p>A few months ago I had the opportunity to spend an hour on the phone with him after he had made <a href="http://www.chemspider.com/blog/recognition-from-warren-delano-father-of-pymol.html">such positive comments</a> when the RSC acquired ChemSpider. We talked about Open Science, Open Source and models of business. We talked about the adventure of trying to change the world one step at a time by making our humble contributions to the world of science. By the end of our conversation I knew that when I met Warren we would be able to talk for many more hours as we shared many common views and, primarily, a want to make a difference.</p>
<p>Today I learned of the sad news that Warren had passed away. Despite the fact that I hadn&#8217;t yet managed to sit with Warren face to face I was immediately  saddened. My truth is that there is a specific type of shock I feel when someone younger than myself passes away. Warren and I talked about the impact of our chosen career paths on our relationships with our wives and the hours spent in front of a screen instead of spending them with those we share our lives with. We both reflected on the fact that we have given too much to the keyboard over the years driven by our need to make a difference. Warren&#8217;s hard work and superior programming skills and are paralleled by the fact that he was clearly a charitable contributor to science by giving his code away to the world and was, even based on only one phone call, a kind man.</p>
<p>My thoughts go out to his wife and family for his loss.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fin-memoriam-warren-delano.html';
  addthis_title  = 'In+Memoriam+%26%238211%3B+Warren+DeLano';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for In Memoriam - Warren DeLano" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+In+Memoriam+-+Warren+DeLano" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/in-memoriam-warren-delano.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/in-memoriam-warren-delano.html</feedburner:origLink></item>
		<item>
		<title>Jean-Claude Bradley and His Cheminfo Retrieval Seminar</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/1hYHZR07h5A/jean-claude-bradley-and-his-cheminfo-retrieval-seminar.html</link>
		<comments>http://www.chemspider.com/blog/jean-claude-bradley-and-his-cheminfo-retrieval-seminar.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 20:22:52 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[ChemSpider Chemistry]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[Cheminfo Retrieval]]></category>
		<category><![CDATA[ChemSpider coursework]]></category>
		<category><![CDATA[Jean-Claude Bradley]]></category>
		<category><![CDATA[UsefulChem]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1487</guid>
		<description><![CDATA[JC has given a great overview of how students might want to use ChemSpider for the purpose of chemical information retrieval on the internet. JC&#8217;s course lecture thoroughly exercises ChemSpider, in real time, to do searches across the internet. He posted his seminar to Scivee here and I have embedded the lecture below. It&#8217;s a [...]]]></description>
			<content:encoded><![CDATA[<p>JC has given a great overview of <a href="http://usefulchem.blogspot.com/2009/10/fourth-cheminfo-retrieval-class.html">how students might want to use ChemSpider for the purpose of chemical information retrieval</a> on the internet. JC&#8217;s course lecture thoroughly exercises ChemSpider, in real time, to do searches across the internet. He posted his seminar to Scivee here and I have embedded the lecture below. It&#8217;s a good talk for students and I encourage you to share it and review how ChemSpider can be used in your classwork and in your laboratories.</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="480" height="400" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowfullscreen" value="true" /><param name="flashvars" value="id=13330&amp;type=3" /><param name="src" value="http://www.scivee.tv/flash/embedCast.swf" /><embed type="application/x-shockwave-flash" width="480" height="400" src="http://www.scivee.tv/flash/embedCast.swf" flashvars="id=13330&amp;type=3" allowfullscreen="true"></embed></object></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fjean-claude-bradley-and-his-cheminfo-retrieval-seminar.html';
  addthis_title  = 'Jean-Claude+Bradley+and+His+Cheminfo+Retrieval+Seminar';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Jean-Claude Bradley and His Cheminfo Retrieval Seminar" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Jean-Claude+Bradley+and+His+Cheminfo+Retrieval+Seminar" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/jean-claude-bradley-and-his-cheminfo-retrieval-seminar.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/jean-claude-bradley-and-his-cheminfo-retrieval-seminar.html</feedburner:origLink></item>
		<item>
		<title>Integration to QSAR World and Aggregating Data Linking</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/oEjGYBhMl4E/integration-to-qsar-world-and-aggregating-data-linking.html</link>
		<comments>http://www.chemspider.com/blog/integration-to-qsar-world-and-aggregating-data-linking.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 19:18:33 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[QSAR data]]></category>
		<category><![CDATA[QSAR World]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1483</guid>
		<description><![CDATA[Following on from my earlier post regarding our interest in aggregating physicochemical data for other groups to use in building their models and algorithms we announce that we are now depositing the data from QSAR world into ChemSpider and pointing back to the original sources on QSAR World. We harvest the SDF files, deposit onto [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1484" title="QSAR world" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/QSAR-world.png" alt="QSAR world" width="173" height="78" />Following on from my earlier post regarding our interest in aggregating physicochemical data for other groups to use in building their models and algorithms we announce that we are now depositing the data from QSAR world into ChemSpider and pointing back to the original sources on QSAR World. We harvest the SDF files, deposit onto ChemSpider and provide direct links into the original SDF file, with the appropriate titles, so that our users can proceed to gather the data for re-analysis if they find it of interest. An example record is here for <a href="http://www.chemspider.com/Chemical-Structure.10482034.html">Atovaquone</a> where we list the links to data residing on QSAR world for download. The links can be seen under the supplemental information section as shown below where you can see links to seven different types of data. We have chosen, for the time being, to not deposit the values associated with these data onto ChemSpider as the data are very heterogeneous in representation even though they are all delivered as SDF files.</p>
<p><img class="aligncenter size-full wp-image-1485" title="supplemental information" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/supplemental-information.png" alt="supplemental information" width="357" height="229" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fintegration-to-qsar-world-and-aggregating-data-linking.html';
  addthis_title  = 'Integration+to+QSAR+World+and+Aggregating+Data+Linking';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Integration to QSAR World and Aggregating Data Linking " /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Integration+to+QSAR+World+and+Aggregating+Data+Linking+" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/integration-to-qsar-world-and-aggregating-data-linking.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/integration-to-qsar-world-and-aggregating-data-linking.html</feedburner:origLink></item>
		<item>
		<title>How would you draw Mercury (II) Acetate</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/AtnNf16DcDI/how-would-you-draw-mercury-ii-acetate.html</link>
		<comments>http://www.chemspider.com/blog/how-would-you-draw-mercury-ii-acetate.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 18:29:36 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[ChemSpider Chemistry]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1479</guid>
		<description><![CDATA[What&#8217;s your favorite flavor of mercury acetate..on Wikipedia here? on CAS Common Chemistry here or on ChemSpider here?
How would you represent this structure if you were to draw it as a 2D diagram?


  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fhow-would-you-draw-mercury-ii-acetate.html';
  addthis_title  = 'How+would+you+draw+Mercury+%28II%29+Acetate';
  addthis_pub    = '';

Buy me a Coffee]]></description>
			<content:encoded><![CDATA[<p>What&#8217;s your favorite flavor of mercury acetate..on Wikipedia <a href="http://en.wikipedia.org/wiki/Mercury%28II%29_acetate">here</a>? on CAS Common Chemistry <a href="http://www.commonchemistry.org/ChemicalDetail.aspx?ref=1600-27-7">here</a> or on ChemSpider <a href="http://www.chemspider.com/Chemical-Structure.14599.html">here</a>?</p>
<p>How would you represent this structure if you were to draw it as a 2D diagram?</p>
<p><img class="alignleft size-full wp-image-1480" title="mercury acetate" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/mercury-acetate.png" alt="mercury acetate" width="201" height="541" /></p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fhow-would-you-draw-mercury-ii-acetate.html';
  addthis_title  = 'How+would+you+draw+Mercury+%28II%29+Acetate';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for How would you draw Mercury (II) Acetate" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+How+would+you+draw+Mercury+(II)+Acetate" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/how-would-you-draw-mercury-ii-acetate.html/feed</wfw:commentRss>
		<slash:comments>9</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/how-would-you-draw-mercury-ii-acetate.html</feedburner:origLink></item>
		<item>
		<title>Enormous Jump in Number of Integrated Links from Wikipedia to ChemSpider</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/Done52LFqBE/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html</link>
		<comments>http://www.chemspider.com/blog/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 17:12:47 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[ChemSpider Chemistry]]></category>
		<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[WiChempedia]]></category>
		<category><![CDATA[Wikipedia Services]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[Linked data]]></category>
		<category><![CDATA[Quality in Chemistry]]></category>
		<category><![CDATA[Semantic Chemistry]]></category>
		<category><![CDATA[Wikipedia Chemistry]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1474</guid>
		<description><![CDATA[As an active member of the Wikipedia Chemistry team I continue to be impressed with the dedication and commitment that the members have to improving the quality AND quantity of information available on Wikipedia for chemists. The number of lost hours of sleep freely given to the benefit of Wikipedia, and in this specific case [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1475" title="roadrunner" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/roadrunner.png" alt="roadrunner" width="180" height="136" />As an active member of the Wikipedia Chemistry team I continue to be impressed with the dedication and commitment that the members have to improving the quality AND quantity of information available on Wikipedia for chemists. The number of lost hours of sleep freely given to the benefit of Wikipedia, and in this specific case to the chemistry community, is immense. The number of &#8220;Compound Pages&#8221; on Wikipedia dedicated to drugs/chemicals has continued to grow and, despite a sincere effort on our part to keep everything linked up from ChemSpider to Wikipedia it&#8217;s a little like chasing the Road Runner&#8230;.we&#8217;re always behind!</p>
<p>We have been working with the WikiChem team of late to embed links from Wikipedia back to ChemSpider. I am humbled to know that our hard work to establish ChemSpider as a source of quality information has reached a level of trust such that Wikipedia now links from the ChemBoxes out to ChemSpider. The links are being updated on an on going basis at present with hundreds of new links already established and more being generated on an ongoing basis. <a href="http://en.wikipedia.org/wiki/User:Beetstra">Wikipedia User: Beetstra</a> has written a &#8216;bot that is inserting ChemSpiderIDs across the database (see below) and we ARE doing rigorous checking of all of the links.This was using a file that we generated on our side showing links to Wikipedia from ChemSpider.</p>
<p><img class="aligncenter size-full wp-image-1476" title="beetstra" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/beetstra.png" alt="beetstra" width="484" height="258" /></p>
<p>We will then be able to generate a list of all ChemBoxes/DrugBoxes without links from Wikipedia to ChemSpider and we will then make the links on our side, manually curating the structures, and then hand back a file to finish all linking. At this point we will have the backfile under control and we can perform ongoing updates as new compound pages are created on ChemSpider and, if we curate and find errors on Wikipedia or ChemSpider making a few manual edits is easy.</p>
<p>There are very dedicated teams on Wikipedia and ChemSpider carefully poring over data with their robots and eyeballs to create a linked data set of quality chemistry. It&#8217;s long, tedious AND important work. When its done we will have an expanded set of data to semantically link from RSC articles when we do markup.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fenormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html';
  addthis_title  = 'Enormous+Jump+in+Number+of+Integrated+Links+from+Wikipedia+to+ChemSpider';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Enormous Jump in Number of Integrated Links from Wikipedia to ChemSpider" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Enormous+Jump+in+Number+of+Integrated+Links+from+Wikipedia+to+ChemSpider" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/enormous-jump-in-number-of-integrated-links-from-wikipedia-to-chemspider.html</feedburner:origLink></item>
		<item>
		<title>Gathering Physicochemical Data Onto ChemSpider</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/WjEKy9JFhfc/gathering-physicochemical-data-onto-chemspider.html</link>
		<comments>http://www.chemspider.com/blog/gathering-physicochemical-data-onto-chemspider.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 03:12:16 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[ChemSpider Services]]></category>
		<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ACD/Labs LogP]]></category>
		<category><![CDATA[CDK]]></category>
		<category><![CDATA[ChemSpider]]></category>
		<category><![CDATA[logP prediction]]></category>
		<category><![CDATA[ONS Challenge]]></category>
		<category><![CDATA[Open notebook Science]]></category>
		<category><![CDATA[Physicochemical data]]></category>
		<category><![CDATA[VCCLab]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1468</guid>
		<description><![CDATA[I&#8217;ve been in discussions with JC Bradley and Andy Lang about the Open Notebook Science Solubility Data project. Specifically we&#8217;ve been comparing  logP predictions from the CDK versus those listed on ChemSpider. We actually have six values of logP listed for some records. For example, for toluene we have 4 predicted values, 1 experimental value from a [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been in discussions with JC Bradley and Andy Lang about the <a href="http://onschallenge.wikispaces.com/">Open Notebook Science Solubility Data project</a>. Specifically we&#8217;ve been comparing  logP predictions from the CDK versus those listed on ChemSpider. We actually have six values of logP listed for some records. For example, for toluene we have 4 predicted values, 1 experimental value from a database and 1 experimental value from a publication. These are shown below:</p>
<p><img class="aligncenter size-full wp-image-1469" title="toluene4 logp" src="http://www.chemspider.com/blog/wp-content/uploads/2009/10/toluene4-logp.png" alt="toluene4 logp" width="507" height="437" />There are three predicted logP values from three different algorithms (ACD/LogP, XlogP and AlogPs) as shown at the top of the figure. There is a predicted value and a database value from the EPISuite from the EPA (middle of the figure) and there is a LogP value from a <a href="http://dx.doi.org/10.1021/ci050282s">publication</a> with the link out indicated by the arrow (this datum was deposited by Egon Willighagen when he deposited the data from his publication). If you examine the list of data, both experimental and predicted, you will see a general value of  around 2.65+/- error. This should be compared with the CDK value listed in the <a href="http://spreadsheets.google.com/ccc?key=0Ak1R8T6wt4YQdHdIcEEwNjc4eUtMdXptQXpkWElra2c&amp;hl=en">ONS spreadsheet </a>that gives a predicted value of 0.64. This was the primary reason that we were discussing the comparison&#8230;the values of predicted logP from CDK were different from the predicted values listed on ChemSpider for a number of examples in the spreadsheet.</p>
<p>Egon and I exchanged a couple of emails discussing the fact that logP predictions could be generated by a number of parties if there was a good Open Data training set available. A recent publication entitled &#8220;Calculation of Molecular Lipophilicity:State of the Art and Comparison of Log P Methods on More Than 96000 Compounds&#8221; performed a thorough analysis of different logP methods on a very large dataset. The publication is available online <a href="http://www.vcclab.org/articles/96000.pdf">here</a>. They compared &#8220;the predictive power of representative methods for one public (N = 266) and two in house datasets from Nycomed(N = 882) and Pfizer (N = 95 809). A total of 30 and 18 methods were tested for public and industrial datasets, respectively.&#8221; During the work they <span style="font-family: TimesNewRomanPSMT;">derived a simple equation based on the number of carbon atoms, NC, and the number of hetero atoms, NHET: log P = 1.46(±0.02) + 0.11(±0.001) NC &#8211; 0.11(±0.001) NHET. This equation was shown to outperform a large number of programs benchmarked in this study.</span><span style="font-family: TimesNewRomanPS-ItalicMT;"><em> </em>This would certainly be easy to implement on ChemSpider and, just out of interest, applying this equation to toluene gives us a value of 2.23. Compare this with the values listed above.</span>&#8221;</p>
<p>Unfortunately there doesn&#8217;t appear to be too many Open logP datasets available around for people to use as training sets. Also, with the thorough work reported in the publication above is it necessary to build yet another logP prediction algorithm? ACD/Labs have made their logP prediction software free for download (<a href="http://www.acdlabs.com/download/logp.html">http://www.acdlabs.com/download/logp.html</a>), the VCCLab software is available for free (<a href="http://www.vcclab.org/lab/alogps/">http://www.vcclab.org/lab/alogps/</a>), the EPISuite software is available for free (<a href="http://www.epa.gov/oppt/exposure/pubs/episuite.htm">http://www.epa.gov/oppt/exposure/pubs/episuite.htm</a>) and if you just want to predict a value for a compound not on ChemSpider then you can use the services here: <a href="http://www.chemspider.com/Services.aspx">http://www.chemspider.com/Services.aspx</a>.</p>
<p>However, even though there are a lot of predictors available it still makes sense to gather data and provide it as an experimental dataset, made available as Open Data for the developers of such algorithms to ake the benefits of structural diversity and fresh data to potentially improve their models. If you have any logP data available please point me to the data to download or contact me offline to discuss. We are presently working on enhancing our data model to provide improved access to experimental data on ChemSpider as well as access to the predicted data via web services. More to follow&#8230;</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fgathering-physicochemical-data-onto-chemspider.html';
  addthis_title  = 'Gathering+Physicochemical+Data+Onto+ChemSpider';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Gathering Physicochemical Data Onto ChemSpider" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Gathering+Physicochemical+Data+Onto+ChemSpider" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/gathering-physicochemical-data-onto-chemspider.html/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/gathering-physicochemical-data-onto-chemspider.html</feedburner:origLink></item>
		<item>
		<title>Thieme Announce Access to DOI’ed Primary Analytical Data Available</title>
		<link>http://feedproxy.google.com/~r/ChemspiderBlog/~3/S_8InkD4n_o/thieme-announce-access-to-doied-primary-analytical-data-available.html</link>
		<comments>http://www.chemspider.com/blog/thieme-announce-access-to-doied-primary-analytical-data-available.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 01:26:37 +0000</pubDate>
		<dc:creator>Antony Williams</dc:creator>
				<category><![CDATA[Community Building]]></category>
		<category><![CDATA[Quality and Content]]></category>
		<category><![CDATA[Analytical Data]]></category>
		<category><![CDATA[Open Data]]></category>
		<category><![CDATA[Thieme]]></category>

		<guid isPermaLink="false">http://www.chemspider.com/blog/?p=1466</guid>
		<description><![CDATA[Today I received an email via the CHMINF list server pointing to the following Press Release. Part of the press release is shown here:
&#8220;In collaboration with the German National Library of Science and Technology (TIB) Thieme is the first publisher to make primary chemistry data accessible worldwide. Analytical data, from various experiments, is the foundation [...]]]></description>
			<content:encoded><![CDATA[<p>Today I received an email via the CHMINF list server pointing to the following <a href="http://www.thieme.de/connect/en/services/press-release/tc_oct_06_09.html">Press Release</a>. Part of the press release is shown here:</p>
<p>&#8220;In collaboration with the German National Library of Science and Technology (TIB) Thieme is the first publisher to make primary chemistry data accessible worldwide. Analytical data, from various experiments, is the foundation of research work and scientific papers. From now on, primary data will be registered and made available online via the Thieme eJournals website (<a href="http://www.thieme-connect.com/ejournals">www.thieme-connect.com/ejournals</a>) using digital object recognition in the form of Digital Object Identifiers (DOI). This will enable scientists to easily locate research articles, including accompanying data, and make enhanced use of the scientific content.&#8221;</p>
<p>There has been a lot of discussion over the years regarding making available &#8220;primary data&#8221;. We offered to do this on the ChemSpider Journal of Chemistry : if people wanted to submit analytical data with an article that we published then we would post them as spectra associated with the article. Unfortunately the general consensus based on a few conversations that I had is that it is a lot of work to prepare data and deposit it. This is one of the reasons that, until now, publishers have generally made the spectral data available as plots and printouts of the data. These data are generally made available as electronic supplementary data. These data ARE valuable even in that form but, and I believe that the majority of scientists would agree, they would be of more valuable if they were available in a format that would allow display in online applets, downloadable for processing and expansions etc. The RSC would certainly welcome the availability of spectral data associated with publications especially since they can now be hosted on ChemSpider.</p>
<p>Thieme have actually managed to pull off quite a coup and I commend them for their efforts. The first example datasets <a href="http://www.thieme-connect.de/DOI/DOI?10.1055/s-2008-1067226">are available here</a>. The listing includes &#8220;FIDs and associated files for the <sup>1</sup>H, <sup>13</sup>C and DEPT NMR spectra for compounds <em>14</em>, (<em>S</em><sub>S</sub>)-<em>23</em>, (<em>S</em><sub>S</sub>)-<em>25</em>, (<em>R</em><sub>S</sub>)-<em>26</em>, <em>27</em>, (<em>S</em><sub>S</sub>)-<em>28</em>, (<em>R</em><sub>S</sub>,<em>S</em><sub>S</sub>)-<em>29</em>, <em>30</em>, (<em>R</em><sub>S</sub>)-<em>36</em>, (<em>S</em><sub>S</sub>)-<em>36</em>, (<em>S</em><sub>S</sub>)-<em>37</em>, <em>38</em>, (<em>R</em><sub>S</sub>)-<em>39</em>, (<em>S</em><sub>S</sub>)-<em>39</em>, (<em>S</em><sub>S</sub>)-<em>44</em>, (<em>R</em><sub>S</sub>)-<em>46</em>, (<em>S</em><sub>S</sub>)-<em>46</em>, (<em>R</em><sub>S</sub>)-<em>48</em>, (<em>S</em><sub>S</sub>)-<em>48</em>, (<em>S</em><sub>S</sub>)-<em>49</em>, <em>52</em>, (<em>R</em><sub>S</sub>)-<em>53</em>, (<em>R</em><sub>S</sub>)-<em>55</em>, (<em>R</em><sub>S</sub>)-<em>57</em>, (<em>S</em><sub>S</sub>)-<em>57</em>, (<em>S</em><sub>S</sub>)-<em>58</em>, (<em>R</em><sub>S</sub>)-<em>61</em>, (<em>S</em><sub>S</sub>)-<em>61</em>, (<em>R</em><sub>S</sub>)-<em>62</em>, (<em>S</em><sub>S</sub>)-<em>62</em>, (<em>R</em><sub>S</sub>)-<em>65</em> and (<em>S</em><sub>S</sub>)-<em>65</em> are summarized.&#8221; That&#8217;s a lot of data.</p>
<p>Since these are primary data they cannot be copyrighted so I chose to download the data, take a look and insert a couple into ChemSpider as an example of what can be done with these data. The associated PDF for the data says &#8220;The files can be processed using the following programs: MestReC, Bruker’s WINNMR and XWINNMR.&#8221; The files came as binary Bruker files so needed to be reprocessed and, in order to be deposited, had to be converted to JCAMP-DX format, the format supported by the JSpecView applet used on ChemSpider to display spectra. In order to this I am fortunate to have access to <a href="http://www.acdlabs.com/products/spec_lab/exp_spectra/nmr_proc/">ACD/NMR Processor</a>, a product I managed for a few years while working at ACD/Labs. This product also supports the Bruker format so I imported the data, processed and exported as JCAMP and imported to ChemSpider.  For compound <strong>14</strong> I have attached the H1 and C13 spectra and <a href="http://www.chemspider.com/Chemical-Structure.23076230.html#spectra">they can be seen here</a>. I didn&#8217;t attach the &#8220;DEPT spectrum&#8221; yet. In order for me to download the spectra, redraw the structure, process the spectra, export as JCAMP and deposit to ChemSpider took about 15 minutes. However, there are a lot of spectra and it will take me a while. There are 32 compounds, I assume 3 spectra per compound (HNMR, CNMR and DEPT) so that&#8217;s a total of 96 spectra. It&#8217;ll take me about 10-12 hours just to deposit this collection so that&#8217;s a lot of work to do in my spare time. If anyone wants to help out and can process the spectra to deposit please do!</p>
<p>One of the spectra are shown below using the <a href="http://www.chemspider.com/blog/why-are-spectra-like-youtube-videos.html">Spectral Embed</a> function we introduced previously:<br />
<script src="http://www.chemspider.com/csjsapi.ashx?op=spec&amp;tk=7283856c-a2d2-4634-8a11-1ea873f84244&amp;bid=2147&amp;w=750&amp;h=400" type="text/javascript"></script></p>
<p>This is a rich collection of data&#8230;it can feed <a href="http://www.jcheminf.com/content/pdf/1758-2946-1-9.pdf">the Spectral Game described in this article</a>. I look forward to getting the data onto ChemSpider and will be following up with Thieme to see if we can work together to host the data in a more generic format for the future. It&#8217;s a shame that the data are locked into a binary file format that needs reprocessing to view and I believe display through the JSpecView applet is advantageous for all. I encourage Thieme to consider also making the structure collection available in molfile, SMILES, InChI and InChIKey format &#8211; the InChIs will make the article discoverable via internet searches and through the InChI Resolver while the download of molfiles will speed up the loading process to ChemSpider and other systems.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fthieme-announce-access-to-doied-primary-analytical-data-available.html';
  addthis_title  = 'Thieme+Announce+Access+to+DOI%26%238217%3Bed+Primary+Analytical+Data+Available';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
<p class="buymebeer"><form action="https://www.paypal.com/cgi-bin/webscr" target="paypal" method="post"><input type="hidden" name="cmd" value="_xclick" /><input type="hidden" name="business" value="tony27587@gmail.com" /><input type="hidden" name="return" value="Thanks...I'll use the caffeine to keep me awake and keep blogging..." /><input type="hidden" name="item_name" value="Buy me a Coffee for Thieme Announce Access to DOI'ed Primary Analytical Data Available" /><input type="hidden" name="currency_code" value="USD" /><input type="hidden" name="amount" value="" /><input type="image" src="http://www.chemspider.com/blog/wp-content/plugins/buy-me-beer/icon_cafe.gif" align="left" alt="Buy me a Coffee" title="Buy me a Coffee" hspace="3" /></form><a href="https://www.paypal.com/cgi-bin/webscr?cmd=_xclick&amp;business=tony27587@gmail.com&amp;currency_code=USD&amp;amount=&amp;return=Thanks...I'll use the caffeine to keep me awake and keep blogging...&amp;item_name=Buy+me+a+Coffee+for+Thieme+Announce+Access+to+DOI'ed+Primary+Analytical+Data+Available" target="paypal">Buy me a Coffee</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.chemspider.com/blog/thieme-announce-access-to-doied-primary-analytical-data-available.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.chemspider.com/blog/thieme-announce-access-to-doied-primary-analytical-data-available.html</feedburner:origLink></item>
	</channel>
</rss>
