<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; 融合基因</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/%e8%9e%8d%e5%90%88%e5%9f%ba%e5%9b%a0/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>Bioconductor包chimeraviz嵌合RNA可视化</title>
		<link>http://www.bio-info-trainee.com/2955.html</link>
		<comments>http://www.bio-info-trainee.com/2955.html#comments</comments>
		<pubDate>Sat, 06 Jan 2018 09:41:26 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[转录组软件]]></category>
		<category><![CDATA[可视化]]></category>
		<category><![CDATA[融合基因]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=2955</guid>
		<description><![CDATA[Bioconductor包chimeraviz嵌合RNA可视化 高通量RNA测序 &#8230; <a href="http://www.bio-info-trainee.com/2955.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<h1 class="md-end-block md-heading">Bioconductor包chimeraviz嵌合RNA可视化</h1>
<p><span class="md-line md-end-block"><span class="">高通量RNA测序已经能够更高效地检测融合转录本，但是融合检测的技术和相关软件通常产生高错误发现率。而一个自动整合RNA数据和已知基因组特征的可视化框架对于结果的检验是有帮助的。2017年发布的一个bioconductor包，chimeraviz就可以做到自动创建嵌合RNA可视化。 </span></span></p>
<p><span class="md-line md-end-block">支持来自9种不同融合发现工具（<span class=""><a spellcheck="false" href="http://www.bioinformatics.com.cn/?/article/601">deFuse</a></span>、<span class=""><a spellcheck="false" href="http://www.bioinformatics.com.cn/?/article/497">EricScript</a></span>、InFusion、<span class=""><a spellcheck="false" href="http://www.bioinformatics.com.cn/?/article/367">JAFFA</a></span>、FusionCatcher、FusionMap、PRADA、SOAPfuse和STAR-FUSION）的输入。</span><span id="more-2955"></span></p>
<h2 class="md-end-block md-heading">官网教程</h2>
<p><span class="md-line md-end-block">直接在bioconductor可以看到详细说明：<span spellcheck="false"><a href="https://bioconductor.org/packages/release/bioc/html/chimeraviz.html">https://bioconductor.org/packages/release/bioc/html/chimeraviz.html</a></span> | <span class=""><a spellcheck="false" href="https://bioconductor.org/packages/release/bioc/vignettes/chimeraviz/inst/doc/chimeraviz-vignette.html">HTML</a></span> | <span class=""><a spellcheck="false" href="https://bioconductor.org/packages/release/bioc/vignettes/chimeraviz/inst/doc/chimeraviz-vignette.R">R Script</a></span> |</span></p>
<p><span class="md-line md-end-block">下载安装好该R包后，自带一系列的融合基因可视化的测试数据，文件如下：</span></p>
<pre class="md-fences md-end-block" lang="" contenteditable="false">  1.1K Oct 16 22:36 5267readsAligned.bam
   96B Oct 16 22:36 5267readsAligned.bam.bai
   22K Oct 16 22:36 FusionMap_01_TestDataset_InputFastq.FusionReport.txt
   37K Oct 16 22:36 Homo_sapiens.GRCh37.74.sqlite
   68K Oct 16 22:36 Homo_sapiens.GRCh37.74_subset.gtf
  1.9K Oct 16 22:36 PRADA.acc.fusion.fq.TAF.tsv
   32K Oct 16 22:36 UCSC.HG19.Human.CytoBandIdeogram.txt
   32K Oct 16 22:36 UCSC.HG38.Human.CytoBandIdeogram.txt
   16K Oct 16 22:36 defuse_833ke_results.filtered.tsv
  4.6K Oct 16 22:36 ericscript_SRR1657556.results.total.tsv
  1.7M Oct 16 22:36 fusion5267and11759reads.bam
   57K Oct 16 22:36 fusion5267and11759reads.bam.bai
  4.1K Oct 16 22:36 fusioncatcher_833ke_final-list-candidate-fusion-genes.txt
  2.1K Oct 16 22:36 infusion_fusions.txt
  4.3K Oct 16 22:36 jaffa_results.csv
  2.6K Oct 16 22:36 reads.1.fq
  2.6K Oct 16 22:36 reads.2.fq
  1.0K Oct 16 22:36 reads_supporting_defuse_fusion_5267.1.fq
  1.0K Oct 16 22:36 reads_supporting_defuse_fusion_5267.2.fq
  3.3K Oct 16 22:36 soapfuse_833ke_final.Fusion.specific.for.genes
  2.0K Oct 16 22:36 star-fusion.fusion_candidates.final.abridged.txt</pre>
<p><span class="md-line md-end-block">可以看到，所支持的9种融合基因检测工具的示例结果都在这里了，比如我最喜欢的star-fusion的结果节选如下：</span></p>
<pre class="md-fences md-end-block" lang="" contenteditable="false">#FusionName JunctionReadCount   SpanningFragCount   SpliceType  LeftGene    LeftBreakpoint  RightGene   RightBreakpoint
THRA--AC090627.1    27  93  ONLY_REF_SPLICE THRA^ENSG00000126351.8  chr17:38243106:+    AC090627.1^ENSG00000235300.3    chr17:46371709:+
THRA--AC090627.1    5   93  ONLY_REF_SPLICE THRA^ENSG00000126351.8  chr17:38243106:+    AC090627.1^ENSG00000235300.3    chr17:46384693:+
ACACA--STAC2    12  51  ONLY_REF_SPLICE ACACA^ENSG00000132142.15    chr17:35479453:-    STAC2^ENSG00000141750.6 chr17:37374426:-
RPS6KB1--SNF8   10  43  ONLY_REF_SPLICE RPS6KB1^ENSG00000108443.9   chr17:57970686:+    SNF8^ENSG00000159210.5  chr17:47021337:-
TOB1--SYNRG 8   30  ONLY_REF_SPLICE TOB1^ENSG00000141232.4  chr17:48943419:-    SYNRG^ENSG00000006114.11    chr17:35880751:-
VAPB--IKZF3 4   46  ONLY_REF_SPLICE VAPB^ENSG00000124164.11 chr20:56964573:+    IKZF3^ENSG00000161405.12    chr17:37934020:-
ZMYND8--CEP250  2   44  ONLY_REF_SPLICE ZMYND8^ENSG00000101040.15   chr20:45852970:-    CEP250^ENSG00000126001.11   chr20:34078463:+
AHCTF1--NAAA    3   38  ONLY_REF_SPLICE AHCTF1^ENSG00000153207.10   chr1:247094880:-    NAAA^ENSG00000138744.10 chr4:76846964:-
VAPB--IKZF3 1   46  ONLY_REF_SPLICE VAPB^ENSG00000124164.11 chr20:56964573:+    IKZF3^ENSG00000161405.12    chr17:37944627:-
VAPB--IKZF3 1   46  ONLY_REF_SPLICE VAPB^ENSG00000124164.11 chr20:56964573:+    IKZF3^ENSG00000161405.12    chr17:37922746:-
STX16--RAE1 4   33  ONLY_REF_SPLICE STX16^ENSG00000124222.17    chr20:57227143:+    RAE1^ENSG00000101146.8  chr20:55929088:+</pre>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">这些结果文件导入R里面统一用import系列函数，比如：</span></span></p>
<pre class="md-fences md-end-block" lang="R" contenteditable="false"><span class="cm-variable">library</span>(<span class="cm-variable">chimeraviz</span>)
​
<span class="cm-comment"># Get reference to results file from deFuse</span>
<span class="cm-variable">defuse833ke</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">system.file</span>(
  <span class="cm-string">"extdata"</span>,
  <span class="cm-string">"defuse_833ke_results.filtered.tsv"</span>,
  <span class="cm-variable">package</span><span class="cm-arg-is">=</span><span class="cm-string">"chimeraviz"</span>)
​
<span class="cm-comment"># Load the results file into a list of fusion objects</span>
<span class="cm-variable">fusions</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">importDefuse</span>(<span class="cm-variable">defuse833ke</span>, <span class="cm-string">"hg19"</span>)
​
<span class="cm-comment">## ---- message = FALSE------------------------------------------------------</span>
<span class="cm-variable">length</span>(<span class="cm-variable">fusions</span>)</pre>
<h2 class="md-end-block md-heading">基因组全局可视化</h2>
<pre class="md-fences md-end-block" lang="R" contenteditable="false"><span class="cm-variable">soapfuse833ke</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">system.file</span>(
  <span class="cm-string">"extdata"</span>,
  <span class="cm-string">"soapfuse_833ke_final.Fusion.specific.for.genes"</span>,
  <span class="cm-variable">package</span> <span class="cm-arg-is">=</span> <span class="cm-string">"chimeraviz"</span>)
<span class="cm-variable">fusions</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">importSoapfuse</span>(<span class="cm-variable">soapfuse833ke</span>, <span class="cm-string">"hg38"</span>, <span class="cm-number">10</span>)
<span class="cm-comment"># Plot!</span>
<span class="cm-variable">plotCircle</span>(<span class="cm-variable">fusions</span>)</pre>
<p><span class="md-line md-end-block">主要是一个环形图，如下：</span></p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2018/01/chimeraviz-fusion-circle-plot.png"><img class="alignnone size-full wp-image-2957" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/01/chimeraviz-fusion-circle-plot.png" alt="chimeraviz-fusion-circle-plot" width="1094" height="998" /></a></p>
<p><span class="">红色条带-</span><span class=""><strong>染色体内融合</strong></span>，蓝色条带-<span class=""><strong>染色体间融合。</strong></span></p>
<h3 class="md-end-block md-heading">单独可视化某个融合事件</h3>
<pre class="md-fences md-end-block" lang="R" contenteditable="false">​
<span class="cm-keyword">if</span>(<span class="cm-operator">!</span><span class="cm-variable">exists</span>(<span class="cm-string">"defuse833ke"</span>))
  <span class="cm-variable">defuse833ke</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">system.file</span>(
    <span class="cm-string">"extdata"</span>,
    <span class="cm-string">"defuse_833ke_results.filtered.tsv"</span>,
    <span class="cm-variable">package</span> <span class="cm-arg-is">=</span> <span class="cm-string">"chimeraviz"</span>)
<span class="cm-variable">fusions</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">importDefuse</span>(<span class="cm-variable">defuse833ke</span>, <span class="cm-string">"hg19"</span>, <span class="cm-number">1</span>)
<span class="cm-comment"># Choose a fusion object</span>
<span class="cm-variable">fusion</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">getFusionById</span>(<span class="cm-variable">fusions</span>, <span class="cm-number">5267</span>)
<span class="cm-comment"># Load edb</span>
<span class="cm-keyword">if</span>(<span class="cm-operator">!</span><span class="cm-variable">exists</span>(<span class="cm-string">"edbSqliteFile"</span>))
  <span class="cm-variable">edbSqliteFile</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">system.file</span>(
    <span class="cm-string">"extdata"</span>,
    <span class="cm-string">"Homo_sapiens.GRCh37.74.sqlite"</span>,
    <span class="cm-variable">package</span><span class="cm-arg-is">=</span><span class="cm-string">"chimeraviz"</span>)
<span class="cm-variable">edb</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">ensembldb</span><span class="cm-operator">::</span><span class="cm-variable">EnsDb</span>(<span class="cm-variable">edbSqliteFile</span>)
<span class="cm-comment"># bamfile with reads in the regions of this fusion event</span>
<span class="cm-keyword">if</span>(<span class="cm-operator">!</span><span class="cm-variable">exists</span>(<span class="cm-string">"fusion5267and11759reads"</span>))
  <span class="cm-variable">fusion5267and11759reads</span> <span class="cm-operator cm-arrow">&lt;-</span> <span class="cm-variable">system.file</span>(
    <span class="cm-string">"extdata"</span>,
    <span class="cm-string">"fusion5267and11759reads.bam"</span>,
    <span class="cm-variable">package</span> <span class="cm-arg-is">=</span> <span class="cm-string">"chimeraviz"</span>)
<span class="cm-comment"># Plot!</span>
<span class="cm-variable">plotFusion</span>(
  <span class="cm-variable">fusion</span> <span class="cm-arg-is">=</span> <span class="cm-variable">fusion</span>,
  <span class="cm-variable">bamfile</span> <span class="cm-arg-is">=</span> <span class="cm-variable">fusion5267and11759reads</span>,
  <span class="cm-variable">edb</span> <span class="cm-arg-is">=</span> <span class="cm-variable">edb</span>,
  <span class="cm-variable">nonUCSC</span> <span class="cm-arg-is">=</span> <span class="cm-variable">TRUE</span>)
​
<span class="cm-comment">## ---- echo = FALSE, message = FALSE, fig.height = 5, fig.width = 10, dev='png'----</span>
<span class="cm-comment"># Plot!</span>
<span class="cm-variable">plotFusion</span>(
  <span class="cm-variable">fusion</span> <span class="cm-arg-is">=</span> <span class="cm-variable">fusion</span>,
  <span class="cm-variable">bamfile</span> <span class="cm-arg-is">=</span> <span class="cm-variable">bamfile5267</span>,
  <span class="cm-variable">edb</span> <span class="cm-arg-is">=</span> <span class="cm-variable">edb</span>,
  <span class="cm-variable">nonUCSC</span> <span class="cm-arg-is">=</span> <span class="cm-variable">TRUE</span>,
  <span class="cm-variable">reduceTranscripts</span> <span class="cm-arg-is">=</span> <span class="cm-variable">TRUE</span>)
​</pre>
<p><span class="md-line md-end-block">这个可视化比较复杂一点，需要融合基因的事件详情，包含两个融合基因的bam片段文件，以及参考基因组的数据库信息。</span></p>
<p><span class="md-line md-end-block">然后有两种展现方式，一种是基于转录本的融合情况，一种是基于基因</span></p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2018/01/chimeraviz-fusion-plot.png"><img class="alignnone size-full wp-image-2958" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/01/chimeraviz-fusion-plot.png" alt="chimeraviz-fusion-plot" width="1310" height="1406" /></a></p>
<p><span class="md-line md-end-block">RCC1-HENMT1融合例子。</span></p>
<p><span class="md-line md-end-block md-focus">顶部：显示融合的染色体位置。支持断裂点（红色曲线）的discordant reads数10（其中split的6，spanning的4），注释的转录本及read数图。</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/2955.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>融合基因检测软件-soapfusion</title>
		<link>http://www.bio-info-trainee.com/1463.html</link>
		<comments>http://www.bio-info-trainee.com/1463.html#comments</comments>
		<pubDate>Tue, 15 Mar 2016 11:30:21 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[基础软件]]></category>
		<category><![CDATA[转录组软件]]></category>
		<category><![CDATA[soap]]></category>
		<category><![CDATA[soapfuse]]></category>
		<category><![CDATA[融合基因]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1463</guid>
		<description><![CDATA[开发单位：华大，SOAP系列软件套装！ 功能：检测合基因 优点：在现有的各种软件 &#8230; <a href="http://www.bio-info-trainee.com/1463.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>开发单位：华大，SOAP系列软件套装！</p>
<div>
<div>功能：检测合基因</div>
<div>优点：在现有的各种软件里面表现算是最好的</div>
<div>算法：是hash index，跟其它bwt算法不太一样</div>
<div>官网：<a href="http://soap.genomics.org.cn/soapfuse.html">http://soap.genomics.org.cn/soapfuse.html</a></div>
<div>paper：<a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12">https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12</a></div>
<div></div>
<div>其它软件有： FusionSeq [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR21">21</a></span>], deFuse [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR22">22</a></span>], TopHat-Fusion [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR23">23</a></span>], FusionHunter [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR24">24</a></span>], SnowShoes-FTD [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR25">25</a></span>], chimerascan [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR26">26</a></span>] and FusionMap [<span class=""><a href="https://genomebiology.biomedcentral.com/articles/10.1186/gb-2013-14-2-r12#CR27">27</a></span>]</div>
<div></div>
<div>具体的算法我没看，因为只是有需求，正好有一些RNA-seq数据又想看看样本融合基因情况。所以就测试这个软件，通俗点说，融合基因原理其实很简单，如果有足够多的reads一部分比对到一个基因，另一部分比对到另一个基因，就可以说明它们两个基因发生了融合现象！如果是PE测序，那么更方便，左右两端reads比对情况也可以考虑。我就不多说废话了，直接上教程吧！</div>
<div></div>
<div>
<div><span style="color: #ff0000;">一，软件安装</span></div>
<div>
<div>软件下载地址：<a href="https://sourceforge.net/projects/soapfuse/files/SOAPfuse_Package/SOAPfuse-v1.27.tar.gz">https://sourceforge.net/projects/soapfuse/files/SOAPfuse_Package/SOAPfuse-v1.27.tar.gz</a></div>
</div>
<div>下载压缩包，解压后即可使用！！！</div>
<div>推荐用最新版，然后看作者说明书的时候也要看清楚！</div>
<div>我反正好几次都搞糊涂了，最后联系了作者才搞明白，作者说他想更新到2.0版本，直接用HISAT的比对sam文件来做，但是还在筹备中，我觉得有点悬！</div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/03/12.png"><img class="alignnone size-full wp-image-1465" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/03/12.png" alt="1" width="655" height="177" /></a></div>
<div></div>
<div>解压后是一堆perl程序，都在source目录下，source目录下面还有bin下面附带了几个第三方软件，包括bwa，blast和soap，最后都用得着！</div>
<div>有个很重要的问题，一定要软件自带的perl模块添加到perl的环境变量。不然那些perl程序运行会报错！</div>
<div>配置文件需要修改，就把几个目录放进去即可</div>
<div></div>
<div></div>
<p><span style="color: #ff0000;">二，输入数据准备</span></p>
<div>这里最重要的就是制作数据库！！！</div>
<div>作者给了非常详细的制作过程，我觉得还是不够清楚，所以再讲一遍！</div>
<div>
<div><a href="https://sourceforge.net/p/soapfuse/blog/2013/07/strategy-for-recurrent-transcriptname-and-genename-in-ensembl-gtf-file">https://sourceforge.net/p/soapfuse/blog/2013/07/strategy-for-recurrent-transcriptname-and-genename-in-ensembl-gtf-file</a></div>
<div>首先下载5个文件：</div>
<div>
<blockquote>
<div>6.5K Jun 15  2009 cytoBand.txt.gz</div>
<div>3.0G Oct 12  2012 hg19.fa</div>
<div>2.5M Mar 15 10:30 HGNC_Gene_Family_dataset</div>
<div>38M Feb  8  2014 Homo_sapiens.GRCh37.75.gtf.gz</div>
<div>202 Jan 19 16:07 HumanRef_refseg_symbols_relationship.list</div>
</blockquote>
<p>文件下载地址，作者已经给出了！</p>
</div>
<div>我把这些文件都放在的当前文件夹下面的raw这个子文件夹，因为我要当前文件夹作为该软件的database文件夹！！！</div>
<div>然后运行命令！</div>
<div>
<div>我在SOAPfuse-v1.27文件下面运行：</div>
<div>perl ../SOAPfuse-v1.27/source/SOAPfuse-S00-Generate_SOAPfuse_database.pl  \</div>
<div>-wg raw/hg19.fa  -gtf raw/Homo_sapiens.GRCh37.75.gtf.gz  -cbd raw/cytoBand.txt.gz   -gf raw/HGNC_Gene_Family_dataset \</div>
<div>-rft raw/HumanRef_refseg_symbols_relationship.list \</div>
<div> -sd ../SOAPfuse-v1.27 -dd ./</div>
<p>这一步耗时很长，4~6小时，创造了transcript.fa和gene.fa，然后还对他们建立bwa和soap的index，所以有点慢！</p>
</div>
<div>构建成功会有提示：</div>
</div>
</div>
</div>
<blockquote>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">Congratulations!</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">You have constructed SOAPfuse database files successfully.</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">These database files are all stored in directory you supplied:</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">/home/jmzeng/biosoft/SOAPfuse/db_v1.27/</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">They are all generated based on public data files you supplied:</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">whole_genome_fasta_file:   /home/jmzeng/biosoft/SOAPfuse/db_v1.27/raw/hg19.fa</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">gtf_annotation_file:       /home/jmzeng/biosoft/SOAPfuse/db_v1.27/raw/Homo_sapiens.GRCh37.75.gtf.gz</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">Chr_Bandregion_file:       /home/jmzeng/biosoft/SOAPfuse/db_v1.27/raw/cytoBand.txt.gz</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">HGNC_gene_family_file:     /home/jmzeng/biosoft/SOAPfuse/db_v1.27/raw/HGNC_Gene_Family_dataset</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">gtf_segname2refseg_list:   /home/jmzeng/biosoft/SOAPfuse/db_v1.27/raw/HumanRef_refseg_symbols_relationship.list</span></div>
</blockquote>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">这些目录很重要，接下来制作配置文件会用得着！</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">To use these database files, just set the 'DB_db_dir' in config file as belowed:</span></div>
<div><span style="font-family: Monaco,Consolas,Courier,Lucida Console,monospace;">DB_db_dir  =   /home/jmzeng/biosoft/SOAPfuse/db_v1.27</span></div>
<div>
<div>
<div>
<div>配置文件需要修改下面5个</div>
</div>
</div>
</div>
<blockquote>
<pre>DB_db_dir = /DATABASE_DIR/</pre>
<pre>PG_pg_dir = /TOOL_DIR/source/bin</pre>
<pre>PS_ps_dir = /TOOL_DIR/source</pre>
<pre>PD_all_out = /out_directory/</pre>
<pre>PA_all_fq_postfix = PostFix</pre>
</blockquote>
<div>
<div>
<div>
<div></div>
<div>其实你仔细阅读了说明书，你就知道该修改成什么样子了！</div>
<div>最后制作sample list文件</div>
<div>我这里只有一个sample,所以文件就一句话即可</div>
<div>test test test 100</div>
<div>所以我的有下面两个文件，都是为了顺应作者的需求我才搞了test/test/test这么无聊的东西！！！</div>
<div>/home/jmzeng/test_for_soapfuse/test/test/test_1.fq.gz</div>
<div>/home/jmzeng/test_for_soapfuse/test/test/test_2.fq.gz</div>
<div>如果你有多个sample需要一起运行，你就要仔细读作者的readme了，它把这个配置文件搞得特别复杂！！！</div>
</div>
<p><span style="color: #ff0000;">三，运行命令</span></p>
<div>如果文件都准备好了，运行命令非常简单！！</div>
<div>
<div>
<pre>perl<span style="color: #ff00ff;"> SOAPfuse-RUN.pl</span> -c &lt;<strong>config_file</strong>&gt; -fd &lt;<strong>WHOLE_SEQ-DATA_DIR</strong>&gt; -l &lt;<strong>sample_list</strong>&gt; -o &lt;<strong>out_directory</strong>&gt; [Options]</pre>
<p>运行的非常慢！！！</p>
</div>
<div>因为需要重新比对，知道</div>
</div>
<p><span style="color: #ff0000;">四，数据结果解读</span></p>
<div>结果，作者已经说的很清楚了，我就不多说了！</div>
<div>
<div><a href="http://soap.genomics.org.cn/soapfuse.html">http://soap.genomics.org.cn/soapfuse.html</a></div>
</div>
<div></div>
<div></div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1463.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>居然还可以出售TCGA的数据，只有你稍微进行分析一下即可</title>
		<link>http://www.bio-info-trainee.com/1043.html</link>
		<comments>http://www.bio-info-trainee.com/1043.html#comments</comments>
		<pubDate>Fri, 16 Oct 2015 10:33:07 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[未分类]]></category>
		<category><![CDATA[FusionSCOUT]]></category>
		<category><![CDATA[融合基因]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1043</guid>
		<description><![CDATA[亮瞎了我的双眼，原来还可以这样挣钱。 这个数据库的作者在2011年发了一篇如何寻 &#8230; <a href="http://www.bio-info-trainee.com/1043.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div>亮瞎了我的双眼，原来还可以这样挣钱。</div>
<div>这个数据库的作者在2011年发了一篇如何寻找融合基因的文章：<em>*Edgren, Henrik, et al. "Identification of fusion genes in breast cancer by paired-end RNA-sequencing." Genome Biol 12.1 (2011): R6.</em></div>
<p>然后基于此，把TCGA计划里面的所有癌症样本数据都处理了，并且得到了融合基因数据集，然后就以此出售</p>
<div>
<div><a href="http://medisapiens.com/products/fusion-scout/fusionscout-cancer-datasets">http://medisapiens.com/products/fusion-scout/fusionscout-cancer-datasets</a>  （网站好像需要翻墙才能打开）</div>
<div>价格高达一万欧元，折合人民币七万多，一本万利，而且人家TCGA计划的数据的公开而且免费的，他做了二次处理就可以拿来挣钱，让我感觉很不爽。</div>
<div>到目前为止他们处理了TCGA计划里面的7652个癌症样本的数据，建立了一个囊括28种癌症的融合基因数据集，并且打包成了一个叫做FusionSCOUT 的产品来出售。</div>
<div>价格如下：</div>
<div>
<h3>Pricing of FusionSCOUT datasets:</h3>
<ul>
<li>Single gene in one cancer set                        490€    /  580$ per dataset</li>
<li>Single gene fusions across all cancers          4900€  /  5800$ dataset</li>
<li>Individual cancer set                                       990 €   /  1250 $ per dataset</li>
<li>Full TCGA dataset                                          9900€  /  12500$ per dataset</li>
</ul>
</div>
<div>该网站是这样介绍他们的产品的，号称有3500个研究团体已经使用了他们的数据，但是我感觉纯粹是吹牛，毕竟他这篇文献也就一百多的引用量，再说3500次购买，就这一个产品就能让他成为亿万富翁了，想想都觉得可怕。而且这网站这么烂，中国访问速度是渣渣，也就是相当于失去了中国的所有土豪客户了，怎么可能还有3500的销量，搞笑！</div>
<p>One of the latest therapeutics angles in the fight against cancer is fusion genes and their regulation. To aid in fusion gene research and reveal the multitude of gene fusion event in cancer samples MediSapiens has developed a proprietary FusionSCOUT pipeline for identifying fusion genes from RNA sequencing datasets.</p>
<p>Currently we have analysed 7625 tumour samples from the TCGA project building a fusion gene dataset covering 28 different cancers within the TCGA project which can be accessed through our FusionSCOUT product.</p>
<p>Using this pipeline, we have discovered 3930 samples with gene fusions with 9667 different fusion genes. We´ve discovered numerous novel gene fusions as well as new cancer types in which previously known fusions appear.</p>
<p>You can now purchase these gene fusions datasets with few mouse clicks and get the worlds most comprehensive gene fusions from cancer sets within days</p>
<div>
<div>
<h2>FusionSCOUT cancer Reports</h2>
<p>With FusionSCOUT you can access the full listings of all fusion genes in specific cancer datasets. Find new leads for possible cause of the cancer, examine the pathways that are affected by different fusions, stratify patients by shared fusion genes or search for potential target for drugs and companion diagnostics.</p>
<p>Once you purchase a FusionSCOUT dataset we will send you a detailed report with information on the fused genes, sample ID from the TCGA dataset, fusion frequencies across the dataset as well as fusion mRNA sequences and lists of protein domains present in the fusion transcripts.</p>
<p>By ordering the MediSapiens FusionSCOUT dataset, you´ll get:</p>
<ul>
<li>A list of all gene fusions that involve your gene of interest, across all TCGA cancer types</li>
<li>TCGA sample ID: s of the for the samples with fusions</li>
<li>Exact exon junctions for the fusions, including alternatively spliced variants and data on whether reading frame is retained</li>
<li>Detailed list of protein domains retained in the fusion genes</li>
<li>cDNA sequence for the fusion mRNAs</li>
</ul>
<p><em>Contact us to access the most up-to-date and comprehensive datasets of fusion gene events in different cancers!<a href="mailto:contact@medisapiens.com">contact@medisapiens.com</a></em></p>
<p><em>Check out also our <a href="http://medisapiens.com/services/fusion-pipeline/fusion-genes-general">Fusion Gene Detection pipeline service</a> for your samples!</em></p>
<p>Dataset missing? Email us and well add your favorite dataset to FusionSCOUT!</p>
</div>
<div>
<h3><strong>FusionSCOUT Cancer sets, March 2015</strong></h3>
<table border="1" width="607" cellspacing="0" cellpadding="2">
<tbody>
<tr>
<td><strong>Cancer type</strong></td>
<td><strong>Number of samples</strong></td>
<td><strong>Number of fusion genes</strong></td>
</tr>
<tr>
<td>Acute Myeloid Leukemia, LAML</td>
<td>153</td>
<td>69</td>
</tr>
<tr>
<td>Adrenocortical carcinoma, ACC</td>
<td>79</td>
<td>115</td>
</tr>
<tr>
<td>Bladder Urothelial Carcinoma, BLCA</td>
<td>273</td>
<td>473</td>
</tr>
<tr>
<td>Brain Lower Grade Glioma, LGG</td>
<td>467</td>
<td>309</td>
</tr>
<tr>
<td>Breast Invasive Carcinoma, BRCA</td>
<td>1029</td>
<td>3267</td>
</tr>
<tr>
<td>Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma, CESC</td>
<td>195</td>
<td>190</td>
</tr>
<tr>
<td>Colon Adenocarcinoma, COAD</td>
<td>287</td>
<td>212</td>
</tr>
<tr>
<td>Glioblastoma multiforme, GBM</td>
<td>170</td>
<td>379</td>
</tr>
<tr>
<td>Head and Neck Squamous Cell Carcinoma, HNSC</td>
<td>412</td>
<td>386</td>
</tr>
<tr>
<td>Kidney Chromophobe, KICH</td>
<td>66</td>
<td>19</td>
</tr>
<tr>
<td>Kidney Renal Clear Cell Carcinoma, KIRC</td>
<td>523</td>
<td>217</td>
</tr>
<tr>
<td>Kidney Renal Papillary Cell Carcinoma, KIRP</td>
<td>226</td>
<td>145</td>
</tr>
<tr>
<td>Liver Hepatocellular Carcinoma, LIHC</td>
<td>198</td>
<td>317</td>
</tr>
<tr>
<td>Lung Adenocarcinoma, LUAD</td>
<td>456</td>
<td>991</td>
</tr>
<tr>
<td>Lung Squamous Cell Carcinoma, LUSC</td>
<td>482</td>
<td>1374</td>
</tr>
<tr>
<td>Lymphoid Neoplasm Diffuse Large B-cell Lymphoma, DLBC</td>
<td>28</td>
<td>18</td>
</tr>
<tr>
<td>Mesothelioma, MESO</td>
<td>36</td>
<td>26</td>
</tr>
<tr>
<td>Ovarian Serous Cystadenocarcinoma, OV</td>
<td>420</td>
<td>1166</td>
</tr>
<tr>
<td>Pancreatic Adenocarcinoma, PAAD</td>
<td>84</td>
<td>46</td>
</tr>
<tr>
<td>Pheochromocytoma and Paraganglioma, PCPG</td>
<td>184</td>
<td>83</td>
</tr>
<tr>
<td>Prostate Adenocarcinoma, PRAD</td>
<td>336</td>
<td>859</td>
</tr>
<tr>
<td>Rectum Adenocarcinoma, READ</td>
<td>85</td>
<td>74</td>
</tr>
<tr>
<td>Sarcoma, SARC</td>
<td>161</td>
<td>799</td>
</tr>
<tr>
<td>Skin Cutaneous Melanoma, SKCM</td>
<td>355</td>
<td>620</td>
</tr>
<tr>
<td>Stomach Adenocarcinoma, STAD</td>
<td>190</td>
<td>311</td>
</tr>
<tr>
<td>Thyroid Carcinoma, THCA</td>
<td>506</td>
<td>195</td>
</tr>
<tr>
<td>Uterine Carcinosarcoma, UCS</td>
<td>57</td>
<td>229</td>
</tr>
<tr>
<td>Uterine Corpus Endometrial Carcinoma, UCEC</td>
<td>167</td>
<td>422</td>
</tr>
</tbody>
</table>
</div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1043.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
