<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; biomart</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/biomart/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>Bioconductor系列之biomaRt</title>
		<link>http://www.bio-info-trainee.com/877.html</link>
		<comments>http://www.bio-info-trainee.com/877.html#comments</comments>
		<pubDate>Sat, 18 Jul 2015 13:26:43 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[R]]></category>
		<category><![CDATA[bioconductor]]></category>
		<category><![CDATA[biomart]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=877</guid>
		<description><![CDATA[biomaRt是一个超级网络资源库，里面的信息非常之多，就是网页版的biomaR &#8230; <a href="http://www.bio-info-trainee.com/877.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div class="markdown-here-wrapper" data-md-url="http://www.bio-info-trainee.com/wp-admin/post.php?post=877&amp;action=edit">
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">biomaRt是一个超级网络资源库，里面的信息非常之多，就是网页版的biomaRt的R语言接口。谷歌搜索 <strong>the biomart user’s guide filetype:pdf</strong> 这个关键词，就看到关于这个包的详细介绍以及例子，我这里简单总结一下它的用法。</p>
</blockquote>
<h4 style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;"><span style="color: #ff00ff;">这个包一直在更新，函数总是变化，而且需要联网才能使用，基本上等于废物一个，希望大家不要使用它~</span></h4>
<h2 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">包的安装</h2>
<p><span id="more-877"></span></p>
<p style="margin: 0px 0px 1.2em !important;">Bioconductor系列包的安装方法都一样</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;"><span class="hljs-keyword" style="color: #333333; font-weight: bold;">source</span>(<span class="hljs-string" style="color: #dd1144;">"http://bioconductor.org/biocLite.R"</span>)
biocLite(“biomaRt”)
</code></pre>
<h2 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">选择数据库</h2>
<p style="margin: 0px 0px 1.2em !important;">然后我们就可以加载这个包来看看它的一些性质啦，大家也可以根据这个包的说明书里面的代码一行行代码敲进去看看效果，这样学习最快也最容易记住。</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;"><span class="hljs-keyword" style="color: #333333; font-weight: bold;">library</span>(<span class="hljs-string" style="color: #dd1144;">"biomaRt"</span>)
listMarts() <span class="hljs-comment" style="color: #999988; font-style: italic;">#看看有多少数据库资源</span>
ensembl=useMart(<span class="hljs-string" style="color: #dd1144;">"ensembl"</span>)
listDatasets(ensembl) <span class="hljs-comment" style="color: #999988; font-style: italic;">#看看选择的数据库里面有多少数据表，这个跟物种相关</span>
ensembl = useDataset(<span class="hljs-string" style="color: #dd1144;">"hsapiens_gene_ensembl"</span>,mart=ensembl)
ensembl = useMart(<span class="hljs-string" style="color: #dd1144;">"ensembl"</span>,dataset=<span class="hljs-string" style="color: #dd1144;">"hsapiens_gene_ensembl"</span>)
<span class="hljs-comment" style="color: #999988; font-style: italic;"># 这是一步法选择人类的ensembl数据库代码。</span>
</code></pre>
<h2 id="-getbm-getsequence-getlds" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">三个主要函数getBM,getSequence,getLDS</h2>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">我们选择好了数据库就要开始干活啦，这个数据库的检索主要是三个函数getBM,getSequence,getLDS, 其中getBM这个函数可以部分用select语句替代。</p>
</blockquote>
<p style="margin: 0px 0px 1.2em !important;">首先我们讲讲getBM函数，它就四个参数。</p>
<ul style="margin: 1.2em 0px; padding-left: 2em;">
<li style="margin: 0.5em 0px;">filter来控制根据什么东西来过滤，可是不同数据库的ID，也可以是染色体定位系统坐标</li>
<li style="margin: 0.5em 0px;">Attributes来控制我们想获得什么，一般是不同数据库的ID</li>
<li style="margin: 0.5em 0px;">Values是我们用来检索的关键词向量</li>
<li style="margin: 0.5em 0px;">Mart是我们前面选择好的数据库，一般都是ensembl = useMart(“ensembl”,dataset=”hsapiens_gene_ensembl”)</li>
</ul>
<p style="margin: 0px 0px 1.2em !important;">getBM函数唯一的用处，就是做各种ID转换。</p>
<p style="margin: 0px 0px 1.2em !important;">首先我们可以查看filter和attribute有哪些东西。</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">filters = listFilters(ensembl)
filters[<span class="hljs-number" style="color: #008080;">1</span>:<span class="hljs-number" style="color: #008080;">5</span>,]
attributes = listAttributes(ensembl)
attributes[<span class="hljs-number" style="color: #008080;">1</span>:<span class="hljs-number" style="color: #008080;">5</span>,]
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">然后我们简单讲几个例子咯：<br />
Ps：这些都是在线注释，所以都是要网络的，网速慢的会非常坑</p>
<h2 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">几个实用的例子</h2>
<h3 id="-id-entrezid" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">一．对几个芯片探针的ID号，注释它所捕获的基因的entrezID</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">ensembl = useMart(<span class="hljs-string" style="color: #dd1144;">"ensembl"</span>,dataset=<span class="hljs-string" style="color: #dd1144;">"hsapiens_gene_ensembl"</span>)
affyids=c(<span class="hljs-string" style="color: #dd1144;">"202763_at"</span>,<span class="hljs-string" style="color: #dd1144;">"209310_s_at"</span>,<span class="hljs-string" style="color: #dd1144;">"207500_at"</span>)
getBM(attributes=c(<span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>, <span class="hljs-string" style="color: #dd1144;">'entrezgene'</span>), filters = <span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>, values = affyids, mart = ensembl)
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">结果如下：</p>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">affy_hg_u133_plus_2 entrezgene<br />
1 202763_at 836<br />
2 207500_at 838<br />
3 209310_s_at 837</p>
</blockquote>
<p style="margin: 0px 0px 1.2em !important;">成功的把affymetrix的芯片探针ID，转为了对应的基因的entrez ID</p>
<h3 id="-id-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">二．对刚才的那三个探针ID号进行多个内容注释，每个探针都对应着基因名已经染色体及起始终止坐标。</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">affyids=c(<span class="hljs-string" style="color: #dd1144;">"202763_at"</span>,<span class="hljs-string" style="color: #dd1144;">"209310_s_at"</span>,<span class="hljs-string" style="color: #dd1144;">"207500_at"</span>)
getBM(attributes=c(<span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>, <span class="hljs-string" style="color: #dd1144;">'hgnc_symbol'</span>, 
<span class="hljs-string" style="color: #dd1144;">'chromosome_name'</span>,<span class="hljs-string" style="color: #dd1144;">'start_position'</span>,<span class="hljs-string" style="color: #dd1144;">'end_position'</span>, <span class="hljs-string" style="color: #dd1144;">'band'</span>),
filters = <span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>, values = affyids, mart = ensembl
)
</code></pre>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">affy_hg_u133_plus_2 hgnc_symbol chromosome_name start_position end_position band<br />
1 202763_at CASP3 4 184627696 184649509 q35.1<br />
2 207500_at CASP5 11 104994235 105023168 q22.3<br />
3 209310_s_at CASP4 11 104942866 104969436 q22.3</p>
</blockquote>
<h3 id="-id-go-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">三．对给定的基因ID号进行GO注释</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;"><span class="hljs-keyword" style="color: #333333; font-weight: bold;">library</span>(<span class="hljs-string" style="color: #dd1144;">"biomaRt"</span>)
ensembl = useMart(<span class="hljs-string" style="color: #dd1144;">"ensembl"</span>,dataset=<span class="hljs-string" style="color: #dd1144;">"hsapiens_gene_ensembl"</span>)
entrez=c(<span class="hljs-string" style="color: #dd1144;">"673"</span>,<span class="hljs-string" style="color: #dd1144;">"837"</span>)
goids = getBM(attributes=c(<span class="hljs-string" style="color: #dd1144;">'entrezgene'</span>,<span class="hljs-string" style="color: #dd1144;">'go_id'</span>), filters=<span class="hljs-string" style="color: #dd1144;">'entrezgene'</span>, values=entrez, mart=ensembl)
head(goids)
</code></pre>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">entrezgene go_id<br />
1 673 GO:0004674<br />
2 673 GO:0005524<br />
3 673 GO:0006468<br />
4 673 GO:0006464<br />
5 673 GO:0008150<br />
6 673 GO:0003674</p>
</blockquote>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">四．通过染色体及起始终止坐标来挑选基因</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">getBM(c(<span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>,<span class="hljs-string" style="color: #dd1144;">'ensembl_gene_id'</span>), filters = c(<span class="hljs-string" style="color: #dd1144;">'chromosome_name'</span>,<span class="hljs-string" style="color: #dd1144;">'start'</span>,<span class="hljs-string" style="color: #dd1144;">'end'</span>),
 values=list(<span class="hljs-number" style="color: #008080;">16</span>,<span class="hljs-number" style="color: #008080;">1100000</span>,<span class="hljs-number" style="color: #008080;">1250000</span>), mart=ensembl)
</code></pre>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">affy_hg_u133_plus_2 ensembl_gene_id<br />
1 ENSG00000260702<br />
2 215502_at ENSG00000260532<br />
3 ENSG00000273551<br />
4 205845_at ENSG00000196557<br />
5 ENSG00000196557<br />
6 ENSG00000260403<br />
7 ENSG00000259910<br />
8 ENSG00000261294<br />
9 220339_s_at ENSG00000116176<br />
10 ENSG00000277010<br />
11 217023_x_at ENSG00000197253<br />
12 210084_x_at ENSG00000197253<br />
13 215382_x_at ENSG00000197253<br />
14 216474_x_at ENSG00000197253<br />
15 207134_x_at ENSG00000197253<br />
16 205683_x_at ENSG00000197253<br />
17 217023_x_at ENSG00000172236<br />
18 210084_x_at ENSG00000172236<br />
19 215382_x_at ENSG00000172236<br />
20 207741_x_at ENSG00000172236<br />
21 216474_x_at ENSG00000172236<br />
22 207134_x_at ENSG00000172236<br />
23 205683_x_at ENSG00000172236</p>
</blockquote>
<h3 id="-go-id-go-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">五．对特定的GO ID号来查询该go通路上面的基因是哪些。</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">getBM(c(<span class="hljs-string" style="color: #dd1144;">'entrezgene'</span>,<span class="hljs-string" style="color: #dd1144;">'hgnc_symbol'</span>), filters=<span class="hljs-string" style="color: #dd1144;">'go_id'</span>, values=<span class="hljs-string" style="color: #dd1144;">'GO:0004707'</span>, mart=ensembl)
</code></pre>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">entrezgene hgnc_symbol<br />
1 5596 MAPK4<br />
2 225689 MAPK15<br />
3 5601 MAPK9<br />
4 6300 MAPK12<br />
5 5594 MAPK1<br />
6 51701 NLK<br />
7 5597 MAPK6<br />
8 5599 MAPK8<br />
9 1432 MAPK14<br />
10 5603 MAPK13<br />
11 5595 MAPK3<br />
12 5600 MAPK11<br />
13 5602 MAPK10<br />
14 5598 MAPK7</p>
</blockquote>
<h3 id="-refseq-nm-id-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">六．根据refseq数据库的NM系列ID号来获取信息</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">refseqids = c(<span class="hljs-string" style="color: #dd1144;">"NM_005359"</span>,<span class="hljs-string" style="color: #dd1144;">"NM_000546"</span>)
ipro = getBM(attributes=c(<span class="hljs-string" style="color: #dd1144;">"refseq_dna"</span>,<span class="hljs-string" style="color: #dd1144;">"interpro"</span>,<span class="hljs-string" style="color: #dd1144;">"interpro_description"</span>), filters=<span class="hljs-string" style="color: #dd1144;">"refseq_dna"</span>,values=refseqids, mart=ensembl)
</code></pre>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">这个例子的代码有错误，因为refseq的信息没有refseq_dna<br />
Error in getBM(attributes = c(“refseq_dna”, “interpro”, “interpro_description”), :<br />
Invalid attribute(s): refseq_dna<br />
Please use the function ‘listAttributes’ to get valid attribute names<br />
我简单检查了一下，发现需要更正为refseq_mrna</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">tmp=listAttributes(ensembl)
grep(<span class="hljs-string" style="color: #dd1144;">"refseq"</span>,tmp[,<span class="hljs-number" style="color: #008080;">1</span>])
<span class="hljs-comment" style="color: #999988; font-style: italic;"># [1] 81 82 83 84 85 86</span>
tmp[grep(<span class="hljs-string" style="color: #dd1144;">"refseq"</span>,tmp[,<span class="hljs-number" style="color: #008080;">1</span>]),]
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">name description<br />
81 refseq_mrna RefSeq mRNA [e.g. NM_001195597]<br />
82 refseq_mrna_predicted RefSeq mRNA predicted [e.g. XM_001125684]<br />
83 refseq_ncrna RefSeq ncRNA [e.g. NR_002834]<br />
84 refseq_ncrna_predicted RefSeq ncRNA predicted [e.g. XR_108264]<br />
85 refseq_peptide RefSeq Protein ID [e.g. NP_001005353]<br />
86 refseq_peptide_predicted RefSeq Predicted Protein ID [e.g. XP_001720922]</p>
</blockquote>
<p style="margin: 0px 0px 1.2em !important;">然后我用了新的代码</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">refseqids = c(<span class="hljs-string" style="color: #dd1144;">"NM_005359"</span>,<span class="hljs-string" style="color: #dd1144;">"NM_000546"</span>)
ipro = getBM(attributes=c(<span class="hljs-string" style="color: #dd1144;">"refseq_mrna"</span>,<span class="hljs-string" style="color: #dd1144;">"interpro"</span>,<span class="hljs-string" style="color: #dd1144;">"interpro_description"</span>), filters=<span class="hljs-string" style="color: #dd1144;">"refseq_mrna"</span>,values=refseqids, mart=ensembl)
ipro
</code></pre>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">refseq_mrna interpro interpro_description<br />
1 NM_000546 IPR011615 p53, DNA-binding domain<br />
2 NM_000546 IPR010991 p53, tetramerisation domain<br />
3 NM_000546 IPR013872 p53 transactivation domain<br />
4 NM_000546 IPR002117 p53 tumour suppressor family<br />
5 NM_000546 IPR008967 p53-like transcription factor, DNA-binding<br />
6 NM_005359 IPR001132 SMAD domain, Dwarfin-type<br />
7 NM_005359 IPR003619 MAD homology 1, Dwarfin-type<br />
8 NM_005359 IPR013019 MAD homology, MH1<br />
9 NM_005359 IPR008984 SMAD/FHA domain</p>
</blockquote>
<h3 id="-entrez-id-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">七．根据基因的entrez ID号来挑选该基因的指定上下游区域信息或者蛋白序列</h3>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">entrez=c(<span class="hljs-string" style="color: #dd1144;">"673"</span>,<span class="hljs-string" style="color: #dd1144;">"7157"</span>,<span class="hljs-string" style="color: #dd1144;">"837"</span>)
getSequence(id = entrez, type=<span class="hljs-string" style="color: #dd1144;">"entrezgene"</span>,seqType=<span class="hljs-string" style="color: #dd1144;">"coding_gene_flank"</span>,upstream=<span class="hljs-number" style="color: #008080;">100</span>, mart=ensembl)
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">这样就得到了三条序列，是给定基因的上游100bp序列 coding_gene_flank entrezgene</p>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">1 CCTCCGCCTCCGCCTCCGCCTCCGCCTCCCCCAGCTCTCCGCCTCCCTTCCCCCTCCCCGCCCGACAGCGGCCGCTCGGGCCCCGGCTCTCGGTTATAAG 673<br />
2 TCCTTCTCTGCAGGCCCAGGTGACCCAGGGTTGGAAGTGTCTCATGCTGGATCCCCACTTTTCCTCTTGCAGCAGCCAGACTGCCTTCCGGGTCACTGCC 7157<br />
3 CACGTTTCCGCCCTTTGCAATAAGGAAATACATAGTTTACTTTCATTTTTGACTCTGAGGCTCTTTCCAACGCTGTAAAAAAGGACAGAGGCTGTTCCCT 837</p>
</blockquote>
<p style="margin: 0px 0px 1.2em !important;">其实这个getSequence函数还有非常多的用法，当然主要的变化在其readme上面可以看到，主要是seqType可以有多个选择。</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">utr5 = getSequence(chromosome=<span class="hljs-number" style="color: #008080;">3</span>, start=<span class="hljs-number" style="color: #008080;">185514033</span>, end=<span class="hljs-number" style="color: #008080;">185535839</span>,
 type=<span class="hljs-string" style="color: #dd1144;">"entrezgene"</span>,seqType=<span class="hljs-string" style="color: #dd1144;">"5utr"</span>, mart=ensembl)
utr5
<span class="hljs-comment" style="color: #999988; font-style: italic;">#这样就查询到了 chromosome=3, start=185514033, end=185535839,这个定位里面的所有utr5信息。</span>
protein = getSequence(id=c(<span class="hljs-number" style="color: #008080;">100</span>, <span class="hljs-number" style="color: #008080;">5728</span>),type=<span class="hljs-string" style="color: #dd1144;">"entrezgene"</span>,
 seqType=<span class="hljs-string" style="color: #dd1144;">"peptide"</span>, mart=ensembl)
protein
<span class="hljs-comment" style="color: #999988; font-style: italic;">#这样就查到了这两个基因转录本的所有蛋白序列，100这个基因有5条序列，5728这个基因有四条序列。</span>
</code></pre>
<h3 id="-snp-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">八，选择其它数据库来进行查询，比如snp数据库</h3>
<p style="margin: 0px 0px 1.2em !important;">当然还有一些数据库的小技巧，第一个是参数 archive = TRUE，设置只用能获取的数据库<br />
ensembl = useMart(“ensembl_mart_46”, dataset=”hsapiens_gene_ensembl”, archive = TRUE)<br />
然后是设置特定选取hg19对应的信息。</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">listMarts(host=<span class="hljs-string" style="color: #dd1144;">'may2009.archive.ensembl.org'</span>)
ensembl54=useMart(host=<span class="hljs-string" style="color: #dd1144;">'may2009.archive.ensembl.org'</span>, biomart=<span class="hljs-string" style="color: #dd1144;">'ENSEMBL_MART_ENSEMBL'</span>)
ensembl54=useMart(host=<span class="hljs-string" style="color: #dd1144;">'may2009.archive.ensembl.org'</span>, biomart=<span class="hljs-string" style="color: #dd1144;">'ENSEMBL_MART_ENSEMBL'</span>, dataset=<span class="hljs-string" style="color: #dd1144;">'hsapiens_gene_ensembl'</span>)
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">还可以选取其它物种，秀丽隐杆线虫</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">wormbase=useMart(<span class="hljs-string" style="color: #dd1144;">"WS220"</span>,dataset=<span class="hljs-string" style="color: #dd1144;">"wormbase_gene"</span>)
listFilters(wormbase)
listAttributes(wormbase)
getBM(attributes = c(<span class="hljs-string" style="color: #dd1144;">"public_name"</span>,<span class="hljs-string" style="color: #dd1144;">"rnai"</span>,<span class="hljs-string" style="color: #dd1144;">"rnai_phenotype_phenotype_label"</span>),
filters=<span class="hljs-string" style="color: #dd1144;">"gene_name"</span>, values=c(<span class="hljs-string" style="color: #dd1144;">"unc-26"</span>,<span class="hljs-string" style="color: #dd1144;">"his-33"</span>),
mart=wormbase)
</code></pre>
<h2 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">写在最后</h2>
<p style="margin: 0px 0px 1.2em !important;">最后我简单提一下select函数是如何部分替代getBM函数的，因为biomaRt是在线数据库，本来只能用它自己的getBM系列函数，但是为了对接其它bioconductor系列包，也可以用select函数来操作这个在线数据库。</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-R" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">mart&lt;-useMart(dataset=<span class="hljs-string" style="color: #dd1144;">"hsapiens_gene_ensembl"</span>,biomart=<span class="hljs-string" style="color: #dd1144;">'ensembl'</span>) head(keytypes(mart), n=<span class="hljs-number" style="color: #008080;">3</span>) head(columns(mart), n=<span class="hljs-number" style="color: #008080;">3</span>) k = keys(mart, keytype=<span class="hljs-string" style="color: #dd1144;">"chromosome_name"</span>) head(k, n=<span class="hljs-number" style="color: #008080;">3</span>) k = keys(mart, keytype=<span class="hljs-string" style="color: #dd1144;">"chromosome_name"</span>, pattern=<span class="hljs-string" style="color: #dd1144;">"LRG"</span>) head(k, n=<span class="hljs-number" style="color: #008080;">3</span>) affy=c(<span class="hljs-string" style="color: #dd1144;">"202763_at"</span>,<span class="hljs-string" style="color: #dd1144;">"209310_s_at"</span>,<span class="hljs-string" style="color: #dd1144;">"207500_at"</span>) select(mart, keys=affy, columns=c(<span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>,<span class="hljs-string" style="color: #dd1144;">'entrezgene'</span>), keytype=<span class="hljs-string" style="color: #dd1144;">'affy_hg_u133_plus_2'</span>)
</code></pre>
<div style="height: 0; width: 0; max-height: 0; max-width: 0; overflow: hidden; font-size: 0em; padding: 0; margin: 0;" title="MDH:PHA+Jmd0OyBiaW9tYVJ05piv5LiA5Liq6LaF57qn572R57uc6LWE5rqQ5bqT77yM6YeM6Z2i55qE
5L+h5oGv6Z2e5bi45LmL5aSa77yM5bCx5piv572R6aG154mI55qEYmlvbWFSdOeahFLor63oqIDm
jqXlj6PjgILosLfmrYzmkJzntKIgKip0aGUgYmlvbWFydCB1c2VyJ3MgZ3VpZGUgZmlsZXR5cGU6
cGRmKiog6L+Z5Liq5YWz6ZSu6K+N77yM5bCx55yL5Yiw5YWz5LqO6L+Z5Liq5YyF55qE6K+m57uG
5LuL57uN5Lul5Y+K5L6L5a2Q77yM5oiR6L+Z6YeM566A5Y2V5oC757uT5LiA5LiL5a6D55qE55So
5rOV44CCPC9wPjxwPiMjIOWMheeahOWuieijhTxicj5CaW9jb25kdWN0b3Lns7vliJfljIXnmoTl
ronoo4Xmlrnms5Xpg73kuIDmoLc8YnI+YGBgUjxicj5zb3VyY2UoImh0dHA6Ly9iaW9jb25kdWN0
b3Iub3JnL2Jpb2NMaXRlLlIiKTxicj5iaW9jTGl0ZSjigJxiaW9tYVJ04oCdKTxicj5gYGA8YnI+
IyMg6YCJ5oup5pWw5o2u5bqTPGJyPueEtuWQjuaIkeS7rOWwseWPr+S7peWKoOi9vei/meS4quWM
headpeeci+eci+Wug+eahOS4gOS6m+aAp+i0qOWVpu+8jOWkp+WutuS5n+WPr+S7peagueaNrui/
meS4quWMheeahOivtOaYjuS5pumHjOmdoueahOS7o+eggeS4gOihjOihjOS7o+eggeaVsui/m+WO
u+eci+eci+aViOaenO+8jOi/meagt+WtpuS5oOacgOW/q+S5n+acgOWuueaYk+iusOS9j+OAgjxi
cj5gYGBSPGJyPmxpYnJhcnkoImJpb21hUnQiKTxicj5saXN0TWFydHMoKSAj55yL55yL5pyJ5aSa
5bCR5pWw5o2u5bqT6LWE5rqQPGJyPmVuc2VtYmw9dXNlTWFydCgiZW5zZW1ibCIpPGJyPmxpc3RE
YXRhc2V0cyhlbnNlbWJsKSAj55yL55yL6YCJ5oup55qE5pWw5o2u5bqT6YeM6Z2i5pyJ5aSa5bCR
5pWw5o2u6KGo77yM6L+Z5Liq6Lef54mp56eN55u45YWzPGJyPmVuc2VtYmwgPSB1c2VEYXRhc2V0
KCJoc2FwaWVuc19nZW5lX2Vuc2VtYmwiLG1hcnQ9ZW5zZW1ibCk8YnI+ZW5zZW1ibCA9IHVzZU1h
cnQoImVuc2VtYmwiLGRhdGFzZXQ9ImhzYXBpZW5zX2dlbmVfZW5zZW1ibCIpPGJyPiMg6L+Z5piv
5LiA5q2l5rOV6YCJ5oup5Lq657G755qEZW5zZW1ibOaVsOaNruW6k+S7o+eggeOAgjxicj5gYGA8
L3A+PHA+IyMg5LiJ5Liq5Li76KaB5Ye95pWwZ2V0Qk0sZ2V0U2VxdWVuY2UsZ2V0TERTPGJyPiZn
dDsg5oiR5Lus6YCJ5oup5aW95LqG5pWw5o2u5bqT5bCx6KaB5byA5aeL5bmy5rS75ZWm77yM6L+Z
5Liq5pWw5o2u5bqT55qE5qOA57Si5Li76KaB5piv5LiJ5Liq5Ye95pWwZ2V0Qk0sZ2V0U2VxdWVu
Y2UsZ2V0TERTLCDlhbbkuK1nZXRCTei/meS4quWHveaVsOWPr+S7pemDqOWIhueUqHNlbGVjdOiv
reWPpeabv+S7o+OAgjwvcD48cD7pppblhYjmiJHku6zorrLorrJnZXRCTeWHveaVsO+8jOWug+Ww
seWbm+S4quWPguaVsOOAgjwvcD48cD4qIGZpbHRlcuadpeaOp+WItuagueaNruS7gOS5iOS4nOil
v+adpei/h+a7pO+8jOWPr+aYr+S4jeWQjOaVsOaNruW6k+eahElE77yM5Lmf5Y+v5Lul5piv5p+T
6Imy5L2T5a6a5L2N57O757uf5Z2Q5qCHPGJyPiogQXR0cmlidXRlc+adpeaOp+WItuaIkeS7rOaD
s+iOt+W+l+S7gOS5iO+8jOS4gOiIrOaYr+S4jeWQjOaVsOaNruW6k+eahElEPGJyPiogVmFsdWVz
5piv5oiR5Lus55So5p2l5qOA57Si55qE5YWz6ZSu6K+N5ZCR6YePIDxicj4qIE1hcnTmmK/miJHk
u6zliY3pnaLpgInmi6nlpb3nmoTmlbDmja7lupPvvIzkuIDoiKzpg73mmK9lbnNlbWJsID0gdXNl
TWFydCgiZW5zZW1ibCIsZGF0YXNldD0iaHNhcGllbnNfZ2VuZV9lbnNlbWJsIik8L3A+PHA+Z2V0
Qk3lh73mlbDllK/kuIDnmoTnlKjlpITvvIzlsLHmmK/lgZrlkITnp41JROi9rOaNouOAgjwvcD48
cD7pppblhYjmiJHku6zlj6/ku6Xmn6XnnItmaWx0ZXLlkoxhdHRyaWJ1dGXmnInlk6rkupvkuJzo
pb/jgII8YnI+YGBgUjxicj5maWx0ZXJzID0gbGlzdEZpbHRlcnMoZW5zZW1ibCk8YnI+ZmlsdGVy
c1sxOjUsXTxicj5hdHRyaWJ1dGVzID0gbGlzdEF0dHJpYnV0ZXMoZW5zZW1ibCk8YnI+YXR0cmli
dXRlc1sxOjUsXTxicj5gYGA8YnI+54S25ZCO5oiR5Lus566A5Y2V6K6y5Yeg5Liq5L6L5a2Q5ZKv
77yaPGJyPlBz77ya6L+Z5Lqb6YO95piv5Zyo57q/5rOo6YeK77yM5omA5Lul6YO95piv6KaB572R
57uc55qE77yM572R6YCf5oWi55qE5Lya6Z2e5bi45Z2RPC9wPjxwPiMjIOWHoOS4quWunueUqOea
hOS+i+WtkDwvcD48cD4jIyMg5LiA77yO5a+55Yeg5Liq6Iqv54mH5o6i6ZKI55qESUTlj7fvvIzm
s6jph4rlroPmiYDmjZXojrfnmoTln7rlm6DnmoRlbnRyZXpJRDxicj5gYGBSPGJyPmVuc2VtYmwg
PSB1c2VNYXJ0KCJlbnNlbWJsIixkYXRhc2V0PSJoc2FwaWVuc19nZW5lX2Vuc2VtYmwiKTxicj5h
ZmZ5aWRzPWMoIjIwMjc2M19hdCIsIjIwOTMxMF9zX2F0IiwiMjA3NTAwX2F0Iik8YnI+Z2V0Qk0o
YXR0cmlidXRlcz1jKCdhZmZ5X2hnX3UxMzNfcGx1c18yJywgJ2VudHJlemdlbmUnKSwgZmlsdGVy
cyA9ICdhZmZ5X2hnX3UxMzNfcGx1c18yJywgdmFsdWVzID0gYWZmeWlkcywgbWFydCA9IGVuc2Vt
YmwpPGJyPmBgYDxicj7nu5PmnpzlpoLkuIvvvJo8YnI+Jmd0OyBhZmZ5X2hnX3UxMzNfcGx1c18y
IGVudHJlemdlbmU8YnI+MSAyMDI3NjNfYXQgODM2PGJyPjIgMjA3NTAwX2F0IDgzODxicj4zIDIw
OTMxMF9zX2F0IDgzNzwvcD48cD7miJDlip/nmoTmiophZmZ5bWV0cml455qE6Iqv54mH5o6i6ZKI
SUTvvIzovazkuLrkuoblr7nlupTnmoTln7rlm6DnmoRlbnRyZXogSUQ8L3A+PHA+PGJyPiMjIyDk
uozvvI7lr7nliJrmiY3nmoTpgqPkuInkuKrmjqLpkohJROWPt+i/m+ihjOWkmuS4quWGheWuueaz
qOmHiu+8jOavj+S4quaOoumSiOmDveWvueW6lOedgOWfuuWboOWQjeW3sue7j+afk+iJsuS9k+WP
iui1t+Wni+e7iOatouWdkOagh+OAgjwvcD48cD5gYGBSPGJyPmFmZnlpZHM9YygiMjAyNzYzX2F0
IiwiMjA5MzEwX3NfYXQiLCIyMDc1MDBfYXQiKTxicj5nZXRCTShhdHRyaWJ1dGVzPWMoJ2FmZnlf
aGdfdTEzM19wbHVzXzInLCAnaGduY19zeW1ib2wnLCA8YnI+J2Nocm9tb3NvbWVfbmFtZScsJ3N0
YXJ0X3Bvc2l0aW9uJywnZW5kX3Bvc2l0aW9uJywgJ2JhbmQnKSw8YnI+ZmlsdGVycyA9ICdhZmZ5
X2hnX3UxMzNfcGx1c18yJywgdmFsdWVzID0gYWZmeWlkcywgbWFydCA9IGVuc2VtYmw8YnI+KTxi
cj5gYGA8L3A+PHA+Jmd0OyBhZmZ5X2hnX3UxMzNfcGx1c18yIGhnbmNfc3ltYm9sIGNocm9tb3Nv
bWVfbmFtZSBzdGFydF9wb3NpdGlvbiBlbmRfcG9zaXRpb24gYmFuZDxicj4xIDIwMjc2M19hdCBD
QVNQMyA0IDE4NDYyNzY5NiAxODQ2NDk1MDkgcTM1LjE8YnI+MiAyMDc1MDBfYXQgQ0FTUDUgMTEg
MTA0OTk0MjM1IDEwNTAyMzE2OCBxMjIuMzxicj4zIDIwOTMxMF9zX2F0IENBU1A0IDExIDEwNDk0
Mjg2NiAxMDQ5Njk0MzYgcTIyLjM8L3A+PHA+IyMjIOS4ie+8juWvuee7meWumueahOWfuuWboElE
5Y+36L+b6KGMR0/ms6jph4o8YnI+YGBgUjxicj5saWJyYXJ5KCJiaW9tYVJ0Iik8YnI+ZW5zZW1i
bCA9IHVzZU1hcnQoImVuc2VtYmwiLGRhdGFzZXQ9ImhzYXBpZW5zX2dlbmVfZW5zZW1ibCIpPGJy
PmVudHJlej1jKCI2NzMiLCI4MzciKTxicj5nb2lkcyA9IGdldEJNKGF0dHJpYnV0ZXM9YygnZW50
cmV6Z2VuZScsJ2dvX2lkJyksIGZpbHRlcnM9J2VudHJlemdlbmUnLCB2YWx1ZXM9ZW50cmV6LCBt
YXJ0PWVuc2VtYmwpPGJyPmhlYWQoZ29pZHMpPGJyPmBgYDxicj4mZ3Q7IGVudHJlemdlbmUgZ29f
aWQ8YnI+MSA2NzMgR086MDAwNDY3NDxicj4yIDY3MyBHTzowMDA1NTI0PGJyPjMgNjczIEdPOjAw
MDY0Njg8YnI+NCA2NzMgR086MDAwNjQ2NDxicj41IDY3MyBHTzowMDA4MTUwPGJyPjYgNjczIEdP
OjAwMDM2NzQ8L3A+PHA+IyMjIOWbm++8jumAmui/h+afk+iJsuS9k+WPiui1t+Wni+e7iOatouWd
kOagh+adpeaMkemAieWfuuWboDxicj5gYGBSPGJyPmdldEJNKGMoJ2FmZnlfaGdfdTEzM19wbHVz
XzInLCdlbnNlbWJsX2dlbmVfaWQnKSwgZmlsdGVycyA9IGMoJ2Nocm9tb3NvbWVfbmFtZScsJ3N0
YXJ0JywnZW5kJyksPGJyPiB2YWx1ZXM9bGlzdCgxNiwxMTAwMDAwLDEyNTAwMDApLCBtYXJ0PWVu
c2VtYmwpPGJyPmBgYDwvcD48cD4mZ3Q7IGFmZnlfaGdfdTEzM19wbHVzXzIgZW5zZW1ibF9nZW5l
X2lkPGJyPjEgRU5TRzAwMDAwMjYwNzAyPGJyPjIgMjE1NTAyX2F0IEVOU0cwMDAwMDI2MDUzMjxi
cj4zIEVOU0cwMDAwMDI3MzU1MTxicj40IDIwNTg0NV9hdCBFTlNHMDAwMDAxOTY1NTc8YnI+NSBF
TlNHMDAwMDAxOTY1NTc8YnI+NiBFTlNHMDAwMDAyNjA0MDM8YnI+NyBFTlNHMDAwMDAyNTk5MTA8
YnI+OCBFTlNHMDAwMDAyNjEyOTQ8YnI+OSAyMjAzMzlfc19hdCBFTlNHMDAwMDAxMTYxNzY8YnI+
MTAgRU5TRzAwMDAwMjc3MDEwPGJyPjExIDIxNzAyM194X2F0IEVOU0cwMDAwMDE5NzI1Mzxicj4x
MiAyMTAwODRfeF9hdCBFTlNHMDAwMDAxOTcyNTM8YnI+MTMgMjE1MzgyX3hfYXQgRU5TRzAwMDAw
MTk3MjUzPGJyPjE0IDIxNjQ3NF94X2F0IEVOU0cwMDAwMDE5NzI1Mzxicj4xNSAyMDcxMzRfeF9h
dCBFTlNHMDAwMDAxOTcyNTM8YnI+MTYgMjA1NjgzX3hfYXQgRU5TRzAwMDAwMTk3MjUzPGJyPjE3
IDIxNzAyM194X2F0IEVOU0cwMDAwMDE3MjIzNjxicj4xOCAyMTAwODRfeF9hdCBFTlNHMDAwMDAx
NzIyMzY8YnI+MTkgMjE1MzgyX3hfYXQgRU5TRzAwMDAwMTcyMjM2PGJyPjIwIDIwNzc0MV94X2F0
IEVOU0cwMDAwMDE3MjIzNjxicj4yMSAyMTY0NzRfeF9hdCBFTlNHMDAwMDAxNzIyMzY8YnI+MjIg
MjA3MTM0X3hfYXQgRU5TRzAwMDAwMTcyMjM2PGJyPjIzIDIwNTY4M194X2F0IEVOU0cwMDAwMDE3
MjIzNjwvcD48cD4jIyMg5LqU77yO5a+554m55a6a55qER08gSUTlj7fmnaXmn6Xor6Lor6Vnb+mA
mui3r+S4iumdoueahOWfuuWboOaYr+WTquS6m+OAgjxicj5gYGBSPGJyPmdldEJNKGMoJ2VudHJl
emdlbmUnLCdoZ25jX3N5bWJvbCcpLCBmaWx0ZXJzPSdnb19pZCcsIHZhbHVlcz0nR086MDAwNDcw
NycsIG1hcnQ9ZW5zZW1ibCk8YnI+YGBgPC9wPjxwPiZndDsgZW50cmV6Z2VuZSBoZ25jX3N5bWJv
bDxicj4xIDU1OTYgTUFQSzQ8YnI+MiAyMjU2ODkgTUFQSzE1PGJyPjMgNTYwMSBNQVBLOTxicj40
IDYzMDAgTUFQSzEyPGJyPjUgNTU5NCBNQVBLMTxicj42IDUxNzAxIE5MSzxicj43IDU1OTcgTUFQ
SzY8YnI+OCA1NTk5IE1BUEs4PGJyPjkgMTQzMiBNQVBLMTQ8YnI+MTAgNTYwMyBNQVBLMTM8YnI+
MTEgNTU5NSBNQVBLMzxicj4xMiA1NjAwIE1BUEsxMTxicj4xMyA1NjAyIE1BUEsxMDxicj4xNCA1
NTk4IE1BUEs3PC9wPjxwPiMjIyDlha3vvI7moLnmja5yZWZzZXHmlbDmja7lupPnmoROTeezu+WI
l0lE5Y+35p2l6I635Y+W5L+h5oGvPGJyPmBgYFI8YnI+cmVmc2VxaWRzID0gYygiTk1fMDA1MzU5
IiwiTk1fMDAwNTQ2Iik8YnI+aXBybyA9IGdldEJNKGF0dHJpYnV0ZXM9YygicmVmc2VxX2RuYSIs
ImludGVycHJvIiwiaW50ZXJwcm9fZGVzY3JpcHRpb24iKSwgZmlsdGVycz0icmVmc2VxX2RuYSIs
dmFsdWVzPXJlZnNlcWlkcywgbWFydD1lbnNlbWJsKTxicj5gYGA8L3A+PHA+Jmd0OyDov5nkuKrk
vovlrZDnmoTku6PnoIHmnInplJnor6/vvIzlm6DkuLpyZWZzZXHnmoTkv6Hmga/msqHmnIlyZWZz
ZXFfZG5hPGJyPkVycm9yIGluIGdldEJNKGF0dHJpYnV0ZXMgPSBjKCJyZWZzZXFfZG5hIiwgImlu
dGVycHJvIiwgImludGVycHJvX2Rlc2NyaXB0aW9uIiksIDo8YnI+SW52YWxpZCBhdHRyaWJ1dGUo
cyk6IHJlZnNlcV9kbmE8YnI+UGxlYXNlIHVzZSB0aGUgZnVuY3Rpb24gJ2xpc3RBdHRyaWJ1dGVz
JyB0byBnZXQgdmFsaWQgYXR0cmlidXRlIG5hbWVzPGJyPuaIkeeugOWNleajgOafpeS6huS4gOS4
i++8jOWPkeeOsOmcgOimgeabtOato+S4unJlZnNlcV9tcm5hPGJyPmBgYFI8YnI+dG1wPWxpc3RB
dHRyaWJ1dGVzKGVuc2VtYmwpPGJyPmdyZXAoInJlZnNlcSIsdG1wWywxXSk8YnI+IyBbMV0gODEg
ODIgODMgODQgODUgODY8YnI+dG1wW2dyZXAoInJlZnNlcSIsdG1wWywxXSksXTxicj5gYGAgPGJy
PiZndDsgbmFtZSBkZXNjcmlwdGlvbjxicj44MSByZWZzZXFfbXJuYSBSZWZTZXEgbVJOQSBbZS5n
LiBOTV8wMDExOTU1OTddPGJyPjgyIHJlZnNlcV9tcm5hX3ByZWRpY3RlZCBSZWZTZXEgbVJOQSBw
cmVkaWN0ZWQgW2UuZy4gWE1fMDAxMTI1Njg0XTxicj44MyByZWZzZXFfbmNybmEgUmVmU2VxIG5j
Uk5BIFtlLmcuIE5SXzAwMjgzNF08YnI+ODQgcmVmc2VxX25jcm5hX3ByZWRpY3RlZCBSZWZTZXEg
bmNSTkEgcHJlZGljdGVkIFtlLmcuIFhSXzEwODI2NF08YnI+ODUgcmVmc2VxX3BlcHRpZGUgUmVm
U2VxIFByb3RlaW4gSUQgW2UuZy4gTlBfMDAxMDA1MzUzXTxicj44NiByZWZzZXFfcGVwdGlkZV9w
cmVkaWN0ZWQgUmVmU2VxIFByZWRpY3RlZCBQcm90ZWluIElEIFtlLmcuIFhQXzAwMTcyMDkyMl08
L3A+PHA+54S25ZCO5oiR55So5LqG5paw55qE5Luj56CBPGJyPmBgYFI8YnI+cmVmc2VxaWRzID0g
YygiTk1fMDA1MzU5IiwiTk1fMDAwNTQ2Iik8YnI+aXBybyA9IGdldEJNKGF0dHJpYnV0ZXM9Yygi
cmVmc2VxX21ybmEiLCJpbnRlcnBybyIsImludGVycHJvX2Rlc2NyaXB0aW9uIiksIGZpbHRlcnM9
InJlZnNlcV9tcm5hIix2YWx1ZXM9cmVmc2VxaWRzLCBtYXJ0PWVuc2VtYmwpPGJyPmlwcm88YnI+
YGBgPC9wPjxwPiZndDsgcmVmc2VxX21ybmEgaW50ZXJwcm8gaW50ZXJwcm9fZGVzY3JpcHRpb248
YnI+MSBOTV8wMDA1NDYgSVBSMDExNjE1IHA1MywgRE5BLWJpbmRpbmcgZG9tYWluPGJyPjIgTk1f
MDAwNTQ2IElQUjAxMDk5MSBwNTMsIHRldHJhbWVyaXNhdGlvbiBkb21haW48YnI+MyBOTV8wMDA1
NDYgSVBSMDEzODcyIHA1MyB0cmFuc2FjdGl2YXRpb24gZG9tYWluPGJyPjQgTk1fMDAwNTQ2IElQ
UjAwMjExNyBwNTMgdHVtb3VyIHN1cHByZXNzb3IgZmFtaWx5PGJyPjUgTk1fMDAwNTQ2IElQUjAw
ODk2NyBwNTMtbGlrZSB0cmFuc2NyaXB0aW9uIGZhY3RvciwgRE5BLWJpbmRpbmc8YnI+NiBOTV8w
MDUzNTkgSVBSMDAxMTMyIFNNQUQgZG9tYWluLCBEd2FyZmluLXR5cGU8YnI+NyBOTV8wMDUzNTkg
SVBSMDAzNjE5IE1BRCBob21vbG9neSAxLCBEd2FyZmluLXR5cGU8YnI+OCBOTV8wMDUzNTkgSVBS
MDEzMDE5IE1BRCBob21vbG9neSwgTUgxPGJyPjkgTk1fMDA1MzU5IElQUjAwODk4NCBTTUFEL0ZI
QSBkb21haW48L3A+PHA+IyMjIOS4g++8juagueaNruWfuuWboOeahGVudHJleiBJROWPt+adpeaM
kemAieivpeWfuuWboOeahOaMh+WumuS4iuS4i+a4uOWMuuWfn+S/oeaBr+aIluiAheibi+eZveW6
j+WIlzwvcD48cD5gYGBSPGJyPmVudHJlej1jKCI2NzMiLCI3MTU3IiwiODM3Iik8YnI+Z2V0U2Vx
dWVuY2UoaWQgPSBlbnRyZXosIHR5cGU9ImVudHJlemdlbmUiLHNlcVR5cGU9ImNvZGluZ19nZW5l
X2ZsYW5rIix1cHN0cmVhbT0xMDAsIG1hcnQ9ZW5zZW1ibCk8YnI+YGBgPC9wPjxwPui/meagt+Ww
seW+l+WIsOS6huS4ieadoeW6j+WIl++8jOaYr+e7meWumuWfuuWboOeahOS4iua4uDEwMGJw5bqP
5YiXIGNvZGluZ19nZW5lX2ZsYW5rIGVudHJlemdlbmU8L3A+PHA+Jmd0OzEgQ0NUQ0NHQ0NUQ0NH
Q0NUQ0NHQ0NUQ0NHQ0NUQ0NDQ0NBR0NUQ1RDQ0dDQ1RDQ0NUVENDQ0NDVENDQ0NHQ0NDR0FDQUdD
R0dDQ0dDVENHR0dDQ0NDR0dDVENUQ0dHVFRBVEFBRyA2NzM8YnI+MiBUQ0NUVENUQ1RHQ0FHR0ND
Q0FHR1RHQUNDQ0FHR0dUVEdHQUFHVEdUQ1RDQVRHQ1RHR0FUQ0NDQ0FDVFRUVENDVENUVEdDQUdD
QUdDQ0FHQUNUR0NDVFRDQ0dHR1RDQUNUR0NDIDcxNTc8YnI+MyBDQUNHVFRUQ0NHQ0NDVFRUR0NB
QVRBQUdHQUFBVEFDQVRBR1RUVEFDVFRUQ0FUVFRUVEdBQ1RDVEdBR0dDVENUVFRDQ0FBQ0dDVEdU
QUFBQUFBR0dBQ0FHQUdHQ1RHVFRDQ0NUIDgzNzwvcD48cD48YnI+5YW25a6e6L+Z5LiqZ2V0U2Vx
dWVuY2Xlh73mlbDov5jmnInpnZ7luLjlpJrnmoTnlKjms5XvvIzlvZPnhLbkuLvopoHnmoTlj5jl
jJblnKjlhbZyZWFkbWXkuIrpnaLlj6/ku6XnnIvliLDvvIzkuLvopoHmmK9zZXFUeXBl5Y+v5Lul
5pyJ5aSa5Liq6YCJ5oup44CCPGJyPmBgYFI8YnI+dXRyNSA9IGdldFNlcXVlbmNlKGNocm9tb3Nv
bWU9Mywgc3RhcnQ9MTg1NTE0MDMzLCBlbmQ9MTg1NTM1ODM5LDxicj4gdHlwZT0iZW50cmV6Z2Vu
ZSIsc2VxVHlwZT0iNXV0ciIsIG1hcnQ9ZW5zZW1ibCk8YnI+dXRyNTxicj4j6L+Z5qC35bCx5p+l
6K+i5Yiw5LqGIGNocm9tb3NvbWU9Mywgc3RhcnQ9MTg1NTE0MDMzLCBlbmQ9MTg1NTM1ODM5LOi/
meS4quWumuS9jemHjOmdoueahOaJgOaciXV0cjXkv6Hmga/jgII8YnI+cHJvdGVpbiA9IGdldFNl
cXVlbmNlKGlkPWMoMTAwLCA1NzI4KSx0eXBlPSJlbnRyZXpnZW5lIiw8YnI+IHNlcVR5cGU9InBl
cHRpZGUiLCBtYXJ0PWVuc2VtYmwpPGJyPnByb3RlaW48YnI+I+i/meagt+WwseafpeWIsOS6hui/
meS4pOS4quWfuuWboOi9rOW9leacrOeahOaJgOacieibi+eZveW6j+WIl++8jDEwMOi/meS4quWf
uuWboOaciTXmnaHluo/liJfvvIw1NzI46L+Z5Liq5Z+65Zug5pyJ5Zub5p2h5bqP5YiX44CCPGJy
PmBgYDwvcD48cD48YnI+IyMjIOWFq++8jOmAieaLqeWFtuWug+aVsOaNruW6k+adpei/m+ihjOaf
peivou+8jOavlOWmgnNucOaVsOaNruW6kzwvcD48cD7lvZPnhLbov5jmnInkuIDkupvmlbDmja7l
upPnmoTlsI/mioDlt6fvvIznrKzkuIDkuKrmmK/lj4LmlbAgYXJjaGl2ZSA9IFRSVUXvvIzorr7n
va7lj6rnlKjog73ojrflj5bnmoTmlbDmja7lupM8YnI+ZW5zZW1ibCA9IHVzZU1hcnQoImVuc2Vt
YmxfbWFydF80NiIsIGRhdGFzZXQ9ImhzYXBpZW5zX2dlbmVfZW5zZW1ibCIsIGFyY2hpdmUgPSBU
UlVFKTxicj7nhLblkI7mmK/orr7nva7nibnlrprpgInlj5ZoZzE55a+55bqU55qE5L+h5oGv44CC
PGJyPmBgYFIgPGJyPmxpc3RNYXJ0cyhob3N0PSdtYXkyMDA5LmFyY2hpdmUuZW5zZW1ibC5vcmcn
KTxicj5lbnNlbWJsNTQ9dXNlTWFydChob3N0PSdtYXkyMDA5LmFyY2hpdmUuZW5zZW1ibC5vcmcn
LCBiaW9tYXJ0PSdFTlNFTUJMX01BUlRfRU5TRU1CTCcpPGJyPmVuc2VtYmw1ND11c2VNYXJ0KGhv
c3Q9J21heTIwMDkuYXJjaGl2ZS5lbnNlbWJsLm9yZycsIGJpb21hcnQ9J0VOU0VNQkxfTUFSVF9F
TlNFTUJMJywgZGF0YXNldD0naHNhcGllbnNfZ2VuZV9lbnNlbWJsJyk8YnI+YGBgIDxicj7ov5jl
j6/ku6XpgInlj5blhbblroPniannp43vvIznp4DkuL3pmpDmnYbnur/omas8YnI+YGBgUiA8YnI+
d29ybWJhc2U9dXNlTWFydCgiV1MyMjAiLGRhdGFzZXQ9Indvcm1iYXNlX2dlbmUiKTxicj5saXN0
RmlsdGVycyh3b3JtYmFzZSk8YnI+bGlzdEF0dHJpYnV0ZXMod29ybWJhc2UpPGJyPmdldEJNKGF0
dHJpYnV0ZXMgPSBjKCJwdWJsaWNfbmFtZSIsInJuYWkiLCJybmFpX3BoZW5vdHlwZV9waGVub3R5
cGVfbGFiZWwiKSw8YnI+ZmlsdGVycz0iZ2VuZV9uYW1lIiwgdmFsdWVzPWMoInVuYy0yNiIsImhp
cy0zMyIpLDxicj5tYXJ0PXdvcm1iYXNlKTxicj5gYGA8L3A+PHA+IyMg5YaZ5Zyo5pyA5ZCOPGJy
PuacgOWQjuaIkeeugOWNleaPkOS4gOS4i3NlbGVjdOWHveaVsOaYr+WmguS9lemDqOWIhuabv+S7
o2dldEJN5Ye95pWw55qE77yM5Zug5Li6YmlvbWFSdOaYr+WcqOe6v+aVsOaNruW6k++8jOacrOad
peWPquiDveeUqOWug+iHquW3seeahGdldEJN57O75YiX5Ye95pWw77yM5L2G5piv5Li65LqG5a+5
5o6l5YW25a6DYmlvY29uZHVjdG9y57O75YiX5YyF77yM5Lmf5Y+v5Lul55Soc2VsZWN05Ye95pWw
5p2l5pON5L2c6L+Z5Liq5Zyo57q/5pWw5o2u5bqT44CCPGJyPmBgYFIgPGJyPm1hcnQmbHQ7LXVz
ZU1hcnQoZGF0YXNldD0iaHNhcGllbnNfZ2VuZV9lbnNlbWJsIixiaW9tYXJ0PSdlbnNlbWJsJykg
aGVhZChrZXl0eXBlcyhtYXJ0KSwgbj0zKSBoZWFkKGNvbHVtbnMobWFydCksIG49MykgayA9IGtl
eXMobWFydCwga2V5dHlwZT0iY2hyb21vc29tZV9uYW1lIikgaGVhZChrLCBuPTMpIGsgPSBrZXlz
KG1hcnQsIGtleXR5cGU9ImNocm9tb3NvbWVfbmFtZSIsIHBhdHRlcm49IkxSRyIpIGhlYWQoaywg
bj0zKSBhZmZ5PWMoIjIwMjc2M19hdCIsIjIwOTMxMF9zX2F0IiwiMjA3NTAwX2F0Iikgc2VsZWN0
KG1hcnQsIGtleXM9YWZmeSwgY29sdW1ucz1jKCdhZmZ5X2hnX3UxMzNfcGx1c18yJywnZW50cmV6
Z2VuZScpLCBrZXl0eXBlPSdhZmZ5X2hnX3UxMzNfcGx1c18yJyk8YnI+YGBgPC9wPg==">​</div>
</div>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/877.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ensembl数据库在线网页工具biomart简单教程</title>
		<link>http://www.bio-info-trainee.com/545.html</link>
		<comments>http://www.bio-info-trainee.com/545.html#comments</comments>
		<pubDate>Tue, 31 Mar 2015 07:28:29 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[基础数据库]]></category>
		<category><![CDATA[biomart]]></category>
		<category><![CDATA[ENSEMBL]]></category>
		<category><![CDATA[ID]]></category>
		<category><![CDATA[数据转换]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=545</guid>
		<description><![CDATA[这个工具主要是针对不会bioperl不会API调取数据的生信纯菜鸟准备的，主要是 &#8230; <a href="http://www.bio-info-trainee.com/545.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>这个工具主要是针对不会bioperl不会API调取数据的生信纯菜鸟准备的，主要是方便大家批量研究某些感兴趣的基因，需要准备的数据就是基因名或者基因的ID号，能从该网站获取的资料非常多，可以是关于你的输入的基因名的各种数据库有的信息。</p>
<p><a href="http://www.ensembl.org/biomart/">http://www.ensembl.org/biomart/</a></p>
<p>第一步：选取数据库，我一般选取人的</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程243.png"><img class="alignnone size-full wp-image-546" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程243.png" alt="Ensembl数据库在线网页工具biomart简单教程243" width="554" height="240" /></a></p>
<p>第二步，选择上传数据的格式</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程259.png"><img class="alignnone size-full wp-image-547" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程259.png" alt="Ensembl数据库在线网页工具biomart简单教程259" width="554" height="206" /></a></p>
<p>这个下拉框里面可以选取很多种格式，你随便张贴进去哪一种格式的基因ID都可以，也可以把做好的ID文件上传进去，批量获取基因信息。</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程325.png"><img class="alignnone size-full wp-image-548" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程325.png" alt="Ensembl数据库在线网页工具biomart简单教程325" width="454" height="162" /></a></p>
<p>我这里输入的是几个免疫基因。</p>
<p>第三步，选择下载数据的格式</p>
<p>首先可以选择你上传的gene的可以转换的各种ID</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程356.png"><img class="alignnone size-full wp-image-549" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程356.png" alt="Ensembl数据库在线网页工具biomart简单教程356" width="553" height="173" /></a></p>
<p>然后可以选择你上传的gene的各种序列</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程358.png"><img class="alignnone size-full wp-image-550" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程358.png" alt="Ensembl数据库在线网页工具biomart简单教程358" width="553" height="183" /></a></p>
<p>可以选择的信息非常多，基本上可以想到的转换在这里都能做！！！</p>
<p>但是，始终没有脚本方便，只适合不太懂编程的菜鸟使用！</p>
<p>然后点击result即可，看到结果还可以导出成txt文档，点击右上角的GO即可</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程458.png"><img class="alignnone size-full wp-image-551" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/03/Ensembl数据库在线网页工具biomart简单教程458.png" alt="Ensembl数据库在线网页工具biomart简单教程458" width="553" height="261" /></a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/545.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
