<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; gemini</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/gemini/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>用GEMINI来探索vcf格式的突变数据</title>
		<link>http://www.bio-info-trainee.com/1603.html</link>
		<comments>http://www.bio-info-trainee.com/1603.html#comments</comments>
		<pubDate>Thu, 05 May 2016 11:39:37 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[基础数据库]]></category>
		<category><![CDATA[基础软件]]></category>
		<category><![CDATA[gemini]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[sneff]]></category>
		<category><![CDATA[vcf]]></category>
		<category><![CDATA[VEP]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1603</guid>
		<description><![CDATA[第一次听说这个软件，是一个香港朋友推荐的：http://davetang.org &#8230; <a href="http://www.bio-info-trainee.com/1603.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>第一次听说这个软件，是一个香港朋友推荐的：<a href="http://davetang.org/muse/2016/01/13/getting-started-with-gemini/">http://davetang.org/muse/2016/01/13/getting-started-with-gemini/</a> 他写的很棒，但是我当初以为是一个类似于SQLite的数据库浏览模式，所以没在意。实际上，我现在仍然觉得这个软件没什么用！</p>
<p>软件官网有详细的介绍：<a href="https://gemini.readthedocs.io/en/latest/">https://gemini.readthedocs.io/en/latest/</a></p>
<p>而且提供丰富的教程：</p>
<p>We recommend that you follow these tutorials in order, as they introduce concepts that build upon one another.</p>
<ul>
<li>Introduction to GEMINI, basic variant querying and data exploration. <a href="https://speakerdeck.com/arq5x/an-introduction-and-tutorial-for-variant-exploration-with-gemini">html</a> <a href="https://s3.amazonaws.com/gemini-tutorials/Intro-To-Gemini.pdf">pdf</a></li>
<li>Identifying de novo mutations underlying Mendelian disease <a href="https://speakerdeck.com/arq5x/identifying-de-novo-mutations-with-gemini">html</a> <a href="https://s3.amazonaws.com/gemini-tutorials/Gemini-DeNovo-Tutorial.pdf">pdf</a></li>
<li>Identifying autosomal recessive variants underlying Mendelian disease <a href="https://speakerdeck.com/arq5x/identifying-recessive-candidates-with-gemini">html</a> <a href="https://s3.amazonaws.com/gemini-tutorials/Gemini-Recessive-Tutorial.pdf">pdf</a></li>
<li>Identifying autosomal dominant variants underlying Mendelian disease <a href="https://speakerdeck.com/arq5x/identifying-dominant-candidates-with-gemini">html</a> <a href="https://s3.amazonaws.com/gemini-tutorials/Gemini-Dominant-Tutorial.pdf">pdf</a></li>
<li>Other GEMINI tools <a href="https://speakerdeck.com/arq5x/other-gemini-tools">html</a> <a href="https://s3.amazonaws.com/gemini-tutorials/GEMINI-Other-Tools.pdf">pdf</a></li>
</ul>
<p>软件本身并不提供注释，虽然它的功能的确包括注释，号称可以利用(ENCODE tracks, UCSC tracks, OMIM, dbSNP, KEGG, and HPRD.)对你的突变位点注释，比如你输入1       861389  .       C       T       ，它告诉你这个突变发生在哪个基因，对蛋白改变如何？是否会产生某些疾病？</p>
<p>虽然它本身没有注释功能，但是它会调用snpEFF或者VEP进行注释，你需要自己先学习它们。</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/05/12.png"><img class="alignnone size-full wp-image-1604" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/05/12.png" alt="1" width="523" height="305" /></a></p>
<h1><span style="color: #ff0000;">软件安装：</span></h1>
<p>GEMINI是用python写的，有一个小脚本可以自动完成安装过程：</p>
<p>7.3K May  4 14:44 gemini_install.py</p>
<p>下载这个脚本，然后安装即可</p>
<p>wget https://github.com/arq5x/gemini/raw/master/gemini/scripts/gemini_install.py</p>
<p>python gemini_install.py $tools $data</p>
<p>PATH=$tools/bin:$data/anaconda/bin:$PATH</p>
<p>where <em>$tools</em> and <em>$data</em> are paths writable on your system.</p>
<p>我把$tools用的就是当前文件夹，$data也是当前文件夹下面的gemini文件夹。</p>
<p>这样就会在当前文件夹下面生成两个文件夹，bin是存储程序，gemini是存储数据用的，而且注意要把bin目录的全路径添加到环境变量！</p>
<h1><span style="color: #ff0000;">输入数据：</span></h1>
<p>我们可以直接下载软件作者提供的测试数据</p>
<p>首先是22号染色体的所有突变位点经过WEP注释的文件</p>
<p>然后是一个三口直接的突变ped格式数据</p>
<p>数据存放在亚马逊云，所有的教程pdf也在</p>
<p><a href="http://s3.amazonaws.com/gemini-tutorials">http://s3.amazonaws.com/gemini-tutorials</a></p>
<p>如果是你自己的vcf文件，需要自己用VEP注释一下</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/05/13.png"><img class="alignnone size-full wp-image-1606" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/05/13.png" alt="1" width="271" height="56" /></a></p>
<h1><span style="color: #ff0000;">运行命令：</span></h1>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/05/2.png"><img class="alignnone size-full wp-image-1605" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/05/2.png" alt="2" width="419" height="154" /></a></p>
<h1><span style="color: #ff0000;">结果解读：</span></h1>
<p>产生是chr22.db就是一个数据库格式的文件，但是需要用gemini 来进行查询，个人认为，并没有多大意思！</p>
<p>你只要熟悉mySQL等SQL语言，完全可以自己来！</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1603.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
