<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0"> <channel><title>Campagne Laboratory</title> <link>http://campagnelab.org</link> <description>News about research from the Campagne laboratory.</description> <lastBuildDate>Wed, 22 May 2013 20:35:57 +0000</lastBuildDate> <language>en-US</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.5.1</generator> <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/campagnelab" /><feedburner:info uri="campagnelab" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><geo:lat>40.76842</geo:lat><geo:long>-73.96045</geo:long><feedburner:emailServiceId>campagnelab</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><item><title>Goby 2.3 released</title><link>http://feedproxy.google.com/~r/campagnelab/~3/P-qUUGq0dbE/</link> <comments>http://campagnelab.org/goby-2-3-released/#comments</comments> <pubDate>Fri, 26 Apr 2013 15:12:16 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=4407</guid> <description><![CDATA[We have released version 2.3 of Goby. New in this release is the ability to restrict output to a genomic slice when concatenating multiple sorted alignments. The release also has features to help generate BAM alignments with read groups from Goby alignments. The Goby somatic output format is now quite stable as far as we [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>We have released version 2.3 of Goby. New in this release is the ability to restrict output to a genomic slice when concatenating multiple sorted alignments. The release also has features to help generate BAM alignments with read groups from Goby alignments. The <a
href="http://campagnelab.org/software/goby/tutorials/detecting-somatic-variations-with-goby-2-1-3/">Goby somatic output format</a> is now quite stable as far as we can tell from our internal benchmarks (comparing performance and somatic calls with Mutect 1.1.4).  You can download this release <a
href="http://campagnelab.org/software/goby/download-goby/">here</a> or browse the more detailed <a
href="http://campagnelab.org/software/goby/change-log/">change log</a>.</p></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/P-qUUGq0dbE" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/goby-2-3-released/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/goby-2-3-released/</feedburner:origLink></item> <item><title>Goby 2.2.1 released</title><link>http://feedproxy.google.com/~r/campagnelab/~3/P5tI7O6WW_8/</link> <comments>http://campagnelab.org/goby-2-2-1-released/#comments</comments> <pubDate>Mon, 01 Apr 2013 21:12:31 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=4256</guid> <description><![CDATA[We have released version 2.2.1 of Goby. This release includes several bug fixes for the new somatic variation output. New features include the ability of the FDR mode to detect columns to adjust automatically when the column is marked with the &#8220;p-value&#8221; group. You can download this release here or browse the more detailed change [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>We have released version 2.2.1 of Goby. This release includes several bug fixes for the new somatic variation output. New features include the ability of the FDR mode to detect columns to adjust automatically when the column is marked with the &#8220;p-value&#8221; group. You can download this release <a
href="http://campagnelab.org/software/goby/download-goby/">here</a> or browse the more detailed <a
href="http://campagnelab.org/software/goby/change-log/">change log</a>.</p></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/P5tI7O6WW_8" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/goby-2-2-1-released/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/goby-2-2-1-released/</feedburner:origLink></item> <item><title>Goby 2.2 released</title><link>http://feedproxy.google.com/~r/campagnelab/~3/__WeBgddtXM/</link> <comments>http://campagnelab.org/goby-2-2-released/#comments</comments> <pubDate>Wed, 20 Mar 2013 01:38:27 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=4181</guid> <description><![CDATA[We have released Goby 2.2. Briefly, this version of Goby improves genotype calling and provides a tool designed to detect somatic mutations. We also provide a new option to suggest-position-slices to split alignment files by amount of data to process.  You can download this new version here or see a detailed change log. Feedback welcome on the user forum [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>We have released Goby 2.2. Briefly, this version of Goby improves genotype calling and provides a tool designed to detect somatic mutations. We also provide a new option to suggest-position-slices to split alignment files by amount of data to process.  You can download this new version <a
href="http://campagnelab.org/software/goby/download-goby/" target="_blank">here</a> or see a detailed <a
href="http://campagnelab.org/software/goby/change-log/">change log</a>. Feedback welcome on the <a
href="https://groups.google.com/forum/?fromgroups=#!forum/goby-framework">user forum</a> or <a
href="https://github.com/CampagneLaboratory/goby/issues">GitHub issue tracker</a>.</p></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/__WeBgddtXM" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/goby-2-2-released/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/goby-2-2-released/</feedburner:origLink></item> <item><title>In case you’ve been wondering</title><link>http://feedproxy.google.com/~r/campagnelab/~3/KsyLcEUClO0/</link> <comments>http://campagnelab.org/in-case-youve-been-wondering/#comments</comments> <pubDate>Thu, 07 Mar 2013 13:32:36 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=4160</guid> <description><![CDATA[We have been a bit silent lately as we were working on new major releases of GobyWeb. We set ambitious goals for GobyWeb2 last fall and have been moving towards these goals steadily.  Here&#8217;s a summary of features we are now testing internally. Configuration and maintenance: We have been cleaning up the source code and [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p><img
class="alignleft  wp-image-1861" alt="gobyweb_logo" src="http://campagnelab.org/wp-content/uploads/2010/03/gobyweb_logo.png" /> We have been a bit silent lately as we were working on new major releases of GobyWeb. We set ambitious goals for GobyWeb2 last fall and have been moving towards these goals steadily.  Here&#8217;s a summary of features we are now testing internally.</p><p><strong>Configuration and maintenance:</strong> We have been cleaning up the source code and completely redesigned the configuration of the system to make it easier to install and maintain. This was becoming urgent because we maintain several instances of GobyWeb internally for different projects and groups and sometimes had trouble remembering where to change an item of configuration. Manuele Simi has completely reorganized the configuration files and documented the new structure. GobyWeb 2.0 is currently under test with this new config and will make it much easier to setup a new instance of GobyWeb.</p><p><strong>Artifacts:</strong> The ability for plugins to download and install their own software or data artifacts. Such plugins don&#8217;t include pre-compiled binaries or big tarballs, they now use a mechanism to grab the source code or data archive from the internet or local repository and install on the node used for computation. Artifact support is expected to be released in GobyWeb 2.1. Because we need to adapt a number of existing plugins to use the new artifact mechanism, we need to test these changes thoroughly before we can release 2.1. For users of GobyWeb outside of Weill, these changes mean that GobyWeb 2.1 will be much easier to install. We are essentially removing the need to prepare the /scratchLocal/gobyweb directory described in the current installation instructions and obtaining and compiling third party software. Importantly, artifacts are now responsible for creating indices of genomes for specific organisms needed in an analysis (adding a new organism is as simple as adding a few lines in the organism configuration file). Artifacts will also help to produce perfectly reproducible analysis runs that can be traced back to specific versions of analysis software. Artifacts also make it easier to extend the list of plugins with new tools and functionality (e.g., we will use the Variant Effect Predictor to annotate the effect of genotype changes on transcripts and proteins).</p><p>There is a lot more in the work, but I thought I would give you an idea of some of the key developments you can expect in GobyWeb in the following months.</p></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/KsyLcEUClO0" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/in-case-youve-been-wondering/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/in-case-youve-been-wondering/</feedburner:origLink></item> <item><title>Rocky’s bioinformatics 2012 Prezi</title><link>http://feedproxy.google.com/~r/campagnelab/~3/wTtA33tqt9g/</link> <comments>http://campagnelab.org/rockys-bioinformatics-2012-prezi/#comments</comments> <pubDate>Sat, 08 Dec 2012 15:36:55 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=4079</guid> <description><![CDATA[Here&#8217;s the prezi I presented at this year&#8217;s Rocky&#8217;s bioinformatics conference. The talk is mostly about GobyWeb. For more details about GobyWeb, see the pre-print. Goby/GobyWeb Rocky&#8217;s bioinformatics Dec 2012 on Prezi]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>Here&#8217;s the prezi I presented at <a
href="http://www.iscb.org/rocky2012">this year&#8217;s Rocky&#8217;s bioinformatics conference</a>. The talk is mostly about GobyWeb. For more details about GobyWeb, see the <a
href="http://arxiv.org/abs/1211.6666">pre-print</a>.</p><div
class="prezi-player"><p><object
id="prezi_zbqk9zol8npg" width="550" height="400" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0" name="prezi_zbqk9zol8npg" bgcolor="#ffffff"><param
name="allowfullscreen" value="true" /><param
name="allowFullScreenInteractive" value="true" /><param
name="allowscriptaccess" value="always" /><param
name="wmode" value="direct" /><param
name="flashvars" value="prezi_id=zbqk9zol8npg&amp;lock_to_path=0&amp;color=ffffff&amp;autoplay=no&amp;autohide_ctrls=0" /><param
name="src" value="http://prezi.com/bin/preziloader.swf" /><param
name="allowfullscreeninteractive" value="true" /><embed
id="prezi_zbqk9zol8npg" width="550" height="400" type="application/x-shockwave-flash" src="http://prezi.com/bin/preziloader.swf" allowfullscreen="true" allowFullScreenInteractive="true" allowscriptaccess="always" wmode="direct" flashvars="prezi_id=zbqk9zol8npg&amp;lock_to_path=0&amp;color=ffffff&amp;autoplay=no&amp;autohide_ctrls=0" allowfullscreeninteractive="true" name="prezi_zbqk9zol8npg" bgcolor="#ffffff" /></object></p><div
class="prezi-player-links"><p><a
title="Goby/GobyWeb Rocky's bioinformatics Dec 2012" href="http://prezi.com/zbqk9zol8npg/gobygobyweb-rockys-bioinformatics-dec-2012/">Goby/GobyWeb Rocky&#8217;s bioinformatics Dec 2012</a> on <a
href="http://prezi.com">Prezi</a></p></div></div></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/wTtA33tqt9g" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/rockys-bioinformatics-2012-prezi/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/rockys-bioinformatics-2012-prezi/</feedburner:origLink></item> <item><title>GobyWeb release 1.9.1 (binary distribution and open-source plugins)</title><link>http://feedproxy.google.com/~r/campagnelab/~3/-667op9g-AU/</link> <comments>http://campagnelab.org/gobyweb-release-1-9-1-binary-distribution-and-open-source-plugins/#comments</comments> <pubDate>Tue, 20 Nov 2012 22:28:00 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=3967</guid> <description><![CDATA[After internal testing of GobyWeb 1.9, we pushed a binary distribution of GobyWeb 1.9.1. This version offers a new feature: A mechanism to periodically calculate data usage for sets of users. A configuration parameter (intervalForStatisticsInHours, set in webapp/conf/Config.groovy) allows to configure the interval in between two statistics&#8217; calculations. Usage data can be viewed for all users [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p><img
class="alignleft" title="GobyWeb logo" src="http://campagnelab.org/wp-content/uploads/2010/03/gobyweb_logo.png" alt="GobyWeb logo" width="181" height="116" />After internal testing of GobyWeb 1.9, we pushed a binary distribution of GobyWeb 1.9.1. This version offers a new feature:</p><ul><li>A mechanism to periodically calculate data usage for sets of users. A configuration parameter (intervalForStatisticsInHours, set in webapp/conf/Config.groovy) allows to configure the interval in between two statistics&#8217; calculations. Usage data can be viewed for all users by administror accounts. Individual users can see how much data their account holds (break-down by samples, alignments and group comparison analyses).</li></ul><div>And bug fixes/small improvements:</div><ul><li>Lucene index view of the results for EdgeR plugin.</li><li> Increased JVM memory for Bismark plugin (we found one dataset where one chunk ran out of memory with the previous configuration).</li><li>Changed Bismark plugin to support mapping reads longer than 60bp (bismark requires all bases on one line in the fastq file so we changed the way fastq files are formatted before calling Bismark).</li></ul><p>Please note that we now license GobyWeb plugins under the LGPL3 license. This change makes it possible for others to contribute new GobyWeb plugins, and facilitates the inspection, reuse and extension the initial set of plugins distributed with GobyWeb.</p><p>You can find GobyWeb plugins at GitHub: <a
href="https://github.com/CampagneLaboratory/gobyweb-plugins">CampagneLaboratory/gobyweb-plugins</a>.</p></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/-667op9g-AU" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/gobyweb-release-1-9-1-binary-distribution-and-open-source-plugins/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/gobyweb-release-1-9-1-binary-distribution-and-open-source-plugins/</feedburner:origLink></item> <item><title>Goby 2.1.1</title><link>http://feedproxy.google.com/~r/campagnelab/~3/jfT1EiMrrtk/</link> <comments>http://campagnelab.org/goby-2-1-1/#comments</comments> <pubDate>Mon, 19 Nov 2012 18:55:42 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=3901</guid> <description><![CDATA[We have released Goby 2.1.1. This is a maintenance release with one new feature:   &#8211; Add extract-splicing-events mode. This mode is used by GobyWeb 1.9 to extract splicing events from spliced Goby alignments (generated either by GSNAP or STAR at this time). and a few bug fixes. See project Change Log for details.]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>We have released Goby 2.1.1. This is a maintenance release with one new feature:</p><div>  &#8211; Add extract-splicing-events mode. This mode is used by GobyWeb 1.9 to extract splicing events from spliced Goby alignments (generated either by GSNAP or STAR at this time).</div><div><div></div><div>and a few bug fixes. See project <a
href="http://campagnelab.org/software/goby/change-log/">Change Log</a> for details.</div></div></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/jfT1EiMrrtk" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/goby-2-1-1/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/goby-2-1-1/</feedburner:origLink></item> <item><title>GobyWeb 1.9</title><link>http://feedproxy.google.com/~r/campagnelab/~3/fMCdeOuvOMI/</link> <comments>http://campagnelab.org/gobyweb-1-9/#comments</comments> <pubDate>Fri, 16 Nov 2012 14:35:39 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=3888</guid> <description><![CDATA[We have releasedGobyWeb 1.9 to our local CTSC users last week. Internal testing is ongoing and so far identified only minor issues. The project now has its own user forum, where anybody can subscribe, get announcements and discuss features or ask questions of other users or the development team. We are preparing a binary distribution of [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p><img
class="alignleft size-full wp-image-1861" title="gobyweb_logo" src="http://campagnelab.org/wp-content/uploads/2010/03/gobyweb_logo.png" alt="" width="181" height="116" /></p><p>We have releasedGobyWeb 1.9 to our local CTSC users last week. Internal testing is ongoing and so far identified only minor issues. The project now has its own <a
href="https://groups.google.com/forum/?fromgroups#!forum/gobyweb">user forum</a>, where anybody can subscribe, get announcements and discuss features or ask questions of other users or the development team. We are preparing a binary distribution of 1.9.1 for others to download and install GobyWeb at their site.</p><p>To summarize the most prominent new features, GobyWeb version 1.9 offers:</p><div>1. The EdgeR statistical method for differential expression (see Robinson MD et al. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 2011)</div><div>2. The STAR alignment tool (version 2.4.1, see Dobin, Alex et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 2012).</div><div>3. A method to call differential splicing usage at observed splice junctions (using DESeq or EdgeR statistics)</div><div>4. The Bismark aligner for bisulfite converted reads (see Krueger F, Andrews SR. Bioinformatics 2012)</div><div>5. A tool to detect pathogens in human or other sequences</div><div>6. Improved performance for large table of results, and a redesigned table viewer to help work with tables with hundreds of columns and tens of millions of rows.</div><p/><div>For details about these features, see the <a
href="http://campagnelab.org/software/gobyweb/change-log/">GobyWeb Change Log</a>.</div></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/fMCdeOuvOMI" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/gobyweb-1-9/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/gobyweb-1-9/</feedburner:origLink></item> <item><title>Goby SAM differences</title><link>http://feedproxy.google.com/~r/campagnelab/~3/XQLHsKrx1cM/</link> <comments>http://campagnelab.org/goby-sam-differences/#comments</comments> <pubDate>Thu, 05 Jul 2012 19:23:20 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=3698</guid> <description><![CDATA[Since we posted an announcement for Goby 2.0 on the samtools developer mailling list, I received a number of questions about differences between Goby alignments and SAM alignments. Many people start by assuming that Goby stores alignments in a way somewhat similar to SAM, but there are key differences, which I attempt to summarize here: [...]]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>Since we posted an announcement for Goby 2.0 on the samtools developer mailling list, I received a number of questions about differences between Goby alignments and SAM alignments. Many people start by assuming that Goby stores alignments in a way somewhat similar to SAM, but there are key differences, which I attempt to summarize here:</p><ol><li>Goby supports 5 minute schema changes (i.e., data schema flexibility)</li><li>Alignments that span a splice site are stored differently</li><li>Sequence variations are stored explicitly in Goby</li></ol><h3>Goby supports 5 minute schema changes</h3><p>Goby stores data with protocol buffers, according to well-defined data schemas. A key feature of protocol buffers is that it supports &#8220;5 minute schema changes&#8221;. You can add a new field (e.g., a collection of new data structures) to a schema and recompile the code (typically takes about 5 minutes). The recompiled program will seamlessly read old data files, and old programs will be able to read data generated by the recompiled program. This functionality is quite useful when you are developing new programs, or when a community of developers work on various tools that need a common schema, but often also require tool specific extensions. Goby supports 5 minutes schema changes. SAM is limited to extensions via new tags. You can encode structured data in tags, but you are left to write a serialization mechanism between data structures and strings (this will likely take more than 5 minutes).</p><h3>Alignments that span a splice site are stored differently</h3><p>Consider this alignment in SAM format (one alignment block spanning a splice site, from a paired-end file):</p><pre>61PKHAAXX_HWUSI-EAS627_0007:5:38:17885:13714    97      chr1    1231595 255     66M144N9M       =       1231903 0 \
      AGCAGCTCAGGCTCCCACGAGTCCAGCGTCAGGGACCGCACCTTGGAGCAGTGGACACCCAGGCTCCTGTGGATG GGGGGFGGGGGDFGGGGGEGGEGGCGGFCGGGFG?GGEEEEEEGEGCEEEBBDDBBDBCED=CACDC@C9@A:&lt;A     NM:i:0  XS:A:-  NH:i:1  MD:Z:75</pre><p>The same alignment is stored by Goby in data structures that contain the following (the field names are shown for clarity, but are not stored). Please note that both Goby entries are indexed independently. Storing two entries make it possible to represent gene fusions easily. Imaging the fusion is between two chromosomes, Goby will store the fusion as usual, with splice links pointing at positions on different chromosomes.</p><pre>{   // first part of the alignment block, before the splice site:
    query_index: 0
    fragment_index: 0  // first part/fragment of the read.
    target_index: 0
    position: 1231594
    query_position: 0
    matching_reverse_strand: false
    multiplicity: 1
    query_length: 75
    query_aligned_length: 66 // this alignment matches the first 66 bases
    target_aligned_length: 66
    mapping_quality: 255
    pair_flags: 97
    pair_alignment_link { // indicates this block is part of a pair,
                          // found at this location:
      target_index: 0
      position: 1231902
      fragment_index: 2 // the pair is another fragment of the same read.
    }
    spliced_forward_alignment_link { // indicates this block is part of a
                                     // spliced alignment, points to second
                                     // entry
      target_index: 0   // indicates the target/chromosome of the block
                        // after the splice
      position: 1231804 // the position of the alignment block
                        // after the slice
      fragment_index: 1 // which part of the read we link to
                        // after the splice
    }
    bam_attributes: "NH:i:1"
    bam_attributes: "NM:i:0"
    bam_attributes: "XS:A:-"
}
{   // second part of the alignment block, after the splice site:
    query_index: 0    // query index match with first alignment, this is the same read.
    fragment_index: 1 // second part of a read
    target_index: 0
    position: 1231804
    query_position: 66
    matching_reverse_strand: false
    multiplicity: 1
    query_length: 75
    query_aligned_length: 9  // this alignment block matches only the 9 last bases
    target_aligned_length: 9
    mapping_quality: 255
    pair_flags: 97
    pair_alignment_link { // indicates this block is part of a pair,
                          // found at this location
      target_index: 0
      position: 1231902
      fragment_index: 2
    }
    spliced_backward_alignment_link { // indicates this block is part of a spliced alignment,
                                      // points back to first entry:
      target_index: 0
      position: 1231594
      fragment_index: 0
    }
    bam_attributes: "NH:i:1"
    bam_attributes: "NM:i:0"
    bam_attributes: "XS:A:-"
}</pre><h3>Sequence variations are stored explicitly in Goby</h3><p>SAM/BAM rely on CIGAR strings and  MD tags to represent sequence variations. A complicated CIGAR can look like this: 6S23M1I6M1D16M1I47M while the corresponding MD tag will look like this: MD:Z:3G4A7C4C1A2T2^T2C4T3A0G5C44. Goby replaces these codes with explicit data structures.The SAM alignment:</p><pre>509.6.68.19057.157284	83	chr1	45881869	29	6S23M1I6M1D16M1I47M	=	45881519	-443	TTACCCGCTTTCCTTGCCCAAATTTTAAGTTTCNGGAAAAGGGGAGGGAAATGGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTGACAGAGTGTCAC	#######################################################ECGGGGGGGGGGGGGGGGGGGGGGGGHHHHHHHHHHHHHHHHHHH	MD:Z:3G4A7C4C1A2T2^T2C4T3A0G5C44	RG:Z:1	XG:i:3	AM:i:29	NM:i:14	SM:i:29	XM:i:10	XO:i:3	XT:A:M</pre><p>Will be stored as shown below. Please note that while MD tags are optional in SAM, the reference base is typically currently stored in Goby alignments. The read_index field indicates the cycle in the read where the variation was observed.</p><pre>{
        query_index: 0
        target_index: 0
        position: 45881868
        query_position: 6
        matching_reverse_strand: true
        multiplicity: 1
        query_length: 100
        query_aligned_length: 94
        target_aligned_length: 93
        mapping_quality: 29
        pair_flags: 83
        fragment_index: 0
        insert_size: -443
        read_origin_index: 0
        softClippedBasesLeft: "TTACCC"
        softClippedQualityLeft: "\002\002\002\002\002\002"
        bam_attributes: "XG:i:3"
        bam_attributes: "AM:i:29"
        bam_attributes: "NM:i:14"
        bam_attributes: "SM:i:29"
        bam_attributes: "XM:i:10"
        bam_attributes: "XO:i:3"
        bam_attributes: "XT:A:M"
        sequence_variations {
          to: "T"     // the read contained a T
          from: "G"   // the reference had a G
          position: 4 // at position +4 from the start of the alignment
          to_quality: "\002" // quality of the base was Phred=2
          read_index: 91     // the variation was observed at cycle 91 (the read matches the reverse strand, only a fraction of 100bp aligned).
        }
        sequence_variations {
          to: "T"
          from: "A"
          position: 9
          to_quality: "\002"
          read_index: 86
        }
        sequence_variations {
          to: "T"
          from: "C"
          position: 17
          to_quality: "\002"
          read_index: 78
        }
        sequence_variations {
          to: "A"
          from: "C"
          position: 22
          to_quality: "\002"
          read_index: 73
        }
        sequence_variations {
          to: "T"
          from: "-"
          position: 23
          to_quality: "\002"
          read_index: 71
        }
        sequence_variations {
          to: "T"
          from: "A"
          position: 24
          to_quality: "\002"
          read_index: 70
        }
        sequence_variations {
          to: "N"
          from: "T"
          position: 27
          to_quality: "\002"
          read_index: 67
        }
        sequence_variations {
          to: "-"
          from: "T"
          position: 30
          read_index: 64
        }
        sequence_variations {
          to: "A"
          from: "C"
          position: 33
          to_quality: "\002"
          read_index: 62
        }
        sequence_variations {
          to: "G"
          from: "T"
          position: 38
          to_quality: "\002"
          read_index: 57
        }
        sequence_variations {
          to: "GA"
          from: "AG"
          position: 42
          to_quality: "\002\002"
          read_index: 53
        }
        sequence_variations {
          to: "G"
          from: "-"
          position: 46
          to_quality: "\002"
          read_index: 48
        }
        sequence_variations {
          to: "T"
          from: "C"
          position: 49
          to_quality: "$"
          read_index: 45
        }
}</pre></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/XQLHsKrx1cM" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/goby-sam-differences/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/goby-sam-differences/</feedburner:origLink></item> <item><title>Goby feature comparison matrix</title><link>http://feedproxy.google.com/~r/campagnelab/~3/rUVh2_gOUfw/</link> <comments>http://campagnelab.org/goby-feature-comparison-matrix/#comments</comments> <pubDate>Wed, 13 Jun 2012 16:42:16 +0000</pubDate> <dc:creator>Fabien Campagne</dc:creator> <category><![CDATA[Uncategorized]]></category> <guid isPermaLink="false">http://campagnelab.org/?p=3669</guid> <description><![CDATA[We&#8217;ve posted a feature matrix that compares Goby 1.x, 2.0, BAM, CRAM 0.7 and FASTQ file formats along about 20 dimensions. We welcome suggestions for additions or corrections to this matrix. What other functionality would you like to include in such a comparison?]]></description> <content:encoded><![CDATA[<div
class="page-restrict-output"><p>We&#8217;ve posted a <a
href="http://campagnelab.org/software/goby/feature-matrix/">feature matrix</a> that compares Goby 1.x, 2.0, BAM, CRAM 0.7 and FASTQ file formats along about 20 dimensions. We welcome suggestions for additions or corrections to this matrix. What other functionality would you like to include in such a comparison?</p><p><a
href="http://campagnelab.org/software/goby/feature-matrix/"><img
class="alignnone size-full wp-image-3671" title="feature comparison brief-1" src="http://campagnelab.org/wp-content/uploads/2012/06/feature-comparison-brief-1.png" alt="" width="698" height="153" /></a></p></div><img src="http://feeds.feedburner.com/~r/campagnelab/~4/rUVh2_gOUfw" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://campagnelab.org/goby-feature-comparison-matrix/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://campagnelab.org/goby-feature-comparison-matrix/</feedburner:origLink></item> </channel> </rss>
