<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; 批量</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/%e6%89%b9%e9%87%8f/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>批量运行GSEA，命令行版本</title>
		<link>http://www.bio-info-trainee.com/1334.html</link>
		<comments>http://www.bio-info-trainee.com/1334.html#comments</comments>
		<pubDate>Mon, 11 Jan 2016 13:12:48 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[基础软件]]></category>
		<category><![CDATA[GSEA]]></category>
		<category><![CDATA[批量]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1334</guid>
		<description><![CDATA[之前用过有界面的那种，那样非常方便，只需要做好数据即可，但是如果有非常多的数据， &#8230; <a href="http://www.bio-info-trainee.com/1334.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>之前用过有界面的那种，那样非常方便，只需要做好数据即可，但是如果有非常多的数据，每次都要点击文件，点击下一步，也很烦，不过，，它既然是java软件，就可以以命令行的形式来玩转它！</p>
<div>能够命令行运行了，就很容易批量啦</p>
<div><b>一、程序安装</b></div>
<p>直接在官网下载java版本软件即可：<b><a href="http://software.broadinstitute.org/gsea/downloads.jsp" target="_blank">http://software.broadinstitute.org/gsea/downloads.jsp</a></b></p>
<div><span style="color: #ff0000;"><b><b>二、输入数据</b></b></span></div>
<p>需要下载gmt文件，自己制作gct和cls文件，或者直接下载测试文件p53</p>
<div>见：<a href="http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats" target="_blank">http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats</a></p>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/12.png"><img class="alignnone size-full wp-image-1335" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/12.png" alt="1" width="514" height="361" /></a></div>
<div><span style="color: #ff0000;"><b><b>三、运行命令</b></b></span></div>
<p>hgu95av2的芯片数据，只有一万多探针，所以很快就可以出结果</p>
<div> java -cp gsea2-2.2.1.jar  -Xmx1024m xtools.gsea.Gsea   -gmx c2.cp.kegg.v5.0.symbols.gmt  \</div>
<div> -res P53_hgu95av2.gct  -cls P53.cls   -chip  chip/HG_U95Av2.chip  -out result -rpt_label p53_example</div>
<div>这个参数其实非常多，见文件<a href="http://software.broadinstitute.org/gsea/doc/linkedFiles/GSEAParameters.txt" target="_blank">http://software.broadinstitute.org/gsea/doc/linkedFiles/GSEAParameters.txt</a></div>
<div>但是一般我们都用默认的即可</p>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/21.png"><img class="alignnone size-full wp-image-1336" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/21.png" alt="2" width="723" height="405" /></a></div>
<div>里面报错说有些探针找不到，不要管它</p>
<div><span style="color: #ff0000;"><b><b>四、输出数据</b></b></span></div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/32.png"><img class="alignnone size-full wp-image-1337" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/32.png" alt="3" width="823" height="387" /></a></div>
<div>自己看官网去理解这些结果咯！</div>
<div><span style="color: #000000;">需要下载的数据如下：<br />
</span></p>
<div>首先需要下载 Molecular Signatures Database (MSigDB)，一般选择C2的kegg，BioCarta 还有Reactome</p>
<div><a href="http://software.broadinstitute.org/gsea/downloads.jsp" target="_blank">http://software.broadinstitute.org/gsea/downloads.jsp</a></div>
<div>都是gmt格式的文件！</div>
<div>
<table border="1" width="100%" cellspacing="0" cellpadding="2">
<tbody>
<tr>
<th>CP: Canonical pathways<br />
(<a href="http://software.broadinstitute.org/gsea/msigdb/genesets.jsp?collection=CP">browse 1330 gene sets</a>)</th>
<td>Gene sets from the pathway databases. Usually, these gene sets are canonical representations of a biological process compiled by domain experts. <a href="http://software.broadinstitute.org/gsea/msigdb/collection_details.jsp#CP">details</a></td>
<td>Download GMT Files<br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.v5.0.orig.gmt">original identifiers</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.v5.0.symbols.gmt">gene symbols</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.v5.0.entrez.gmt">entrez genes ids</a></td>
</tr>
<tr>
<th>CP:BIOCARTA: BioCarta gene sets<br />
(<a href="http://software.broadinstitute.org/gsea/msigdb/genesets.jsp?collection=CP:BIOCARTA">browse 217 gene sets</a>)</th>
<td>Gene sets derived from the BioCarta pathway database (<a href="http://www.biocarta.com/genes/index.asp">http://www.biocarta.com/genes/index.asp</a>).</td>
<td>Download GMT Files<br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.biocarta.v5.0.orig.gmt">original identifiers</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.biocarta.v5.0.symbols.gmt" target="_blank">gene symbols</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.biocarta.v5.0.entrez.gmt">entrez genes ids</a></td>
</tr>
<tr>
<th>CP:KEGG: KEGG gene sets<br />
(<a href="http://software.broadinstitute.org/gsea/msigdb/genesets.jsp?collection=CP:KEGG">browse 186 gene sets</a>)</th>
<td>Gene sets derived from the KEGG pathway database (<a href="http://www.genome.jp/kegg/pathway.html">http://www.genome.jp/kegg/pathway.html</a>).</td>
<td>Download GMT Files<br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.kegg.v5.0.orig.gmt">original identifiers</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.kegg.v5.0.symbols.gmt" target="_blank">gene symbols</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.kegg.v5.0.entrez.gmt">entrez genes ids</a></td>
</tr>
<tr>
<th>CP:REACTOME: Reactome gene sets<br />
(<a href="http://software.broadinstitute.org/gsea/msigdb/genesets.jsp?collection=CP:REACTOME">browse 674 gene sets</a>)</th>
<td>Gene sets derived from the Reactome pathway database (<a href="http://www.reactome.org/">http://www.reactome.org/</a>).</td>
<td>Download GMT Files<br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.reactome.v5.0.orig.gmt">original identifiers</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.reactome.v5.0.symbols.gmt" target="_blank">gene symbols</a><br />
<a href="http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/5.0/c2.cp.reactome.v5.0.entrez.gmt">entrez genes ids</a></td>
</tr>
</tbody>
</table>
</div>
<div>然后做出表达数据gct文件和cls表型文件~</div>
<div>见：<a href="http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats" target="_blank">http://www.broadinstitute.org/cancer/software/gsea/wiki/index.php/Data_formats</a></div>
<div>然后就可以直接运行了</div>
<div>如果是芯片数据，第一列是芯片探针，那么还需要下载chip数据：<a href="ftp://ftp.broadinstitute.org/pub/gsea/annotations" target="_blank">ftp://ftp.broadinstitute.org/pub/gsea/annotations</a></div>
</div>
</div>
</div>
</div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1334.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
