<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; HLA分型</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/hla%e5%88%86%e5%9e%8b/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>使用Seq2HLA进行HLA分型</title>
		<link>http://www.bio-info-trainee.com/841.html</link>
		<comments>http://www.bio-info-trainee.com/841.html#comments</comments>
		<pubDate>Mon, 13 Jul 2015 08:00:35 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[基础软件]]></category>
		<category><![CDATA[HLA分型]]></category>
		<category><![CDATA[seq2HLA]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=841</guid>
		<description><![CDATA[基于高通量测序数据进行HLA分型的软件挺多的，比较老的有三个，作者分别是Boeg &#8230; <a href="http://www.bio-info-trainee.com/841.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><strong>基于高通量测序数据进行HLA分型的软件挺多的，比较老的有三个，作者分别是Boegel et al.Kim et al.Major et al.，然后他们都被OptiType这个软件的作者被批评了，我这里先介绍Kim et al的seq2HLA使用方法，以下是它的一些链接。<br />
</strong></p>
<ul>
<li><a href="http://tron-mainz.de/tron-facilities/computational-medicine/seq2hla/">Seq2HLA Homepage</a></li>
<li><a href="http://tron-mainz.de/tron-facilities/computational-medicine/seq2hla/">Seq2HLA Source code</a></li>
<li><a href="http://blogs.biomedcentral.com/bmcblog/2013/01/03/reverting-to-type/">Seq2HLA Description</a></li>
</ul>
<p>功能概述</p>
<p>seq2HLA is a computational tool to determine Human Leukocyte Antigen (HLA) directly from existing and future short RNA-Seq reads. It takes standard RNA-Seq sequence reads in fastq format as input, uses a bowtie index comprising known HLA alleles and outputs the most likely HLA class I and class II types, a p-value for each call, and the expression of each class.</p>
<p>软件简介</p>
<p>Type of tool     Program</p>
<p>Nature of tool          Standalone</p>
<p>Operating system   Unix/Linux, Mac OS X</p>
<p>Language        Python, R</p>
<p>Article     (Boegel et al., 2013) HLA typing from RNA-Seq sequence reads. Genome medicine.</p>
<p>PubMed <a href="http://www.ncbi.nlm.nih.gov/pubmed/23259685">http://www.ncbi.nlm.nih.gov/pubmed/23259685</a></p>
<p>URL          <a href="https://bitbucket.org/sebastian_boegel/seq2hla">https://bitbucket.org/sebastian_boegel/seq2hla</a></p>
<p>源代码，下载并安装</p>
<p><a href="https://bitbucket.org/sebastian_boegel/seq2hla/src">https://bitbucket.org/sebastian_boegel/seq2hla/src</a></p>
<p><a href="http://tron-mainz.de/tron-facilities/computational-medicine/seq2hla/">http://tron-mainz.de/tron-facilities/computational-medicine/seq2hla/</a></p>
<p>第一版是这样的</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image001.png"><img class="alignnone size-full wp-image-842" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image001.png" alt="image001" width="358" height="111" /></a></p>
<p>第二版是这样的</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image002.png"><img class="alignnone size-full wp-image-843" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image002.png" alt="image002" width="515" height="293" /></a></p>
<p>只有第二版才支持gz压缩包格式的fastq，而且不需要指定length了</p>
<p>其中reference文件夹下面的是发布这个软件的团体已经制备好来的HLA库文件</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image003.png"><img class="alignnone size-full wp-image-844" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image003.png" alt="image003" width="264" height="293" /></a></p>
<p>下载即可使用，前提是你的系统其它环境都OK</p>
<p>用法：</p>
<p>python seq2HLA.py -1 &lt;readfile1&gt; -2 &lt;readfile2&gt; -r "&lt;runname&gt;" [-p &lt;int&gt;]* [-3 &lt;int&gt;]**</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image004.png"><img class="alignnone size-full wp-image-845" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image004.png" alt="image004" width="733" height="107" /></a></p>
<p>很简单，-1和-2指定我们的双端测序数据即可，可以是压缩包格式的（自动调用gzip），-r的输出目录，会输出7个文件，需要一个个解读，-p指定线程数给bowtie用的，-3是指定需要trim几个低质量碱基。</p>
<p>但是运行这个软件的要求非常多，需要安装好python和R，而且还有版本限制，需要安装好biopython而且还必须是双端测序，而且当前文件夹下面的reference文件夹下面必须有参考基因组的bowtie索引，而且系统必须安装好了bowtie，还需要在快捷方式里面！</p>
<p>我这里用的是第二版的</p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image006.png"><img class="alignnone size-full wp-image-847" src="http://www.bio-info-trainee.com/wp-content/uploads/2015/07/image006.png" alt="image006" width="452" height="190" /></a></p>
<p>所以，我用的也是第二版改进的命令。非常好用,我这里用的是一个外显子测序数据，是hiseq2500测的PE100</p>
<p>python seq2HLA.py -1 ../../6-exon/PC3-1.read1_Clean.fastq.gz -2 ../../6-exon/PC3-1.read2_Clean.fastq.gz -r PC3</p>
<p>貌似输出文件太多了一点</p>
<p>#Output:#The results are output to stdout and to textfiles. Most important are:</p>
<p>#i) &lt;prefix&gt;-ClassI.HLAgenotype2digits =&gt; 2 digit result of Class I</p>
<p>#ii) &lt;prefix&gt;-ClassII.HLAgenotype2digits =&gt; 2 digit result of Class II</p>
<p>#iii) &lt;prefix&gt;-ClassI.HLAgenotype4digits =&gt; 4 digit result of Class I</p>
<p>#iv) &lt;prefix&gt;-ClassII.HLAgenotype4digits =&gt; 4 digit result of Class II</p>
<p>#v) &lt;prefix&gt;.ambiguity =&gt; reports typing ambuigities (more than one solution for an allele possible)</p>
<p>#vi) &lt;prefix&gt;-ClassI.expression =&gt; expression of Class I alleles</p>
<p>#vii) &lt;prefix&gt;-ClassII.expression =&gt; expression of Class II alleles</p>
<p>根据文献，我简单看了一下，文件的确好复杂，不过我们只需要看输出日志即可</p>
<p>-----------2 digit typing results-------------</p>
<p>#Locus Allele 1       Confidence     Allele 2       Confidence</p>
<p>A       A*68   7.287148e-05   A*24   0.03680272</p>
<p>B       B*52   0.1717737       B*53   0.3952319</p>
<p>C       C*12   0.03009331     hoz("C*14")     0.6783964</p>
<p>Calculation of locus-specific expression ...</p>
<p>BC1-1/BC1-1-ClassI.bowtielog</p>
<p>A: 7.93 RPKM</p>
<p>C: 9.75 RPKM</p>
<p>B: 8.35 RPKM</p>
<p>The digital haplotype is written into BC1-1/BC1-1-ClassI.digitalhaplotype3</p>
<p>-----------4 digit typing results-------------</p>
<p>#Locus Allele 1       Confidence     Allele 2       Confidence</p>
<p>!A     A*68:01 7.287148e-05   A*24:02 0.03680272</p>
<p>!B     B*52:01 0.1717737       B*53:01'       0.6542288</p>
<p>!C     C*12:02 0.03371717     C*12:02 0.6783964</p>
<p>上面的HLA的class I的数据结果</p>
<p>接下来是class II的数据结果，是不是很简单呀！</p>
<p>-----------2 digit typing results-------------</p>
<p>#Locus Allele 1       Confidence     Allele 2       Confidence</p>
<p>DQA     DQA1*01 0.1511134       DQA1*02 0</p>
<p>DQB     DQB1*02 0.02321615     DQB1*05 0.42202</p>
<p>DRB     DRB1*15 2.595144e-05   DRB1*07 0.321219</p>
<p>Calculation of locus-specific expression ...</p>
<p>BC1-1/BC1-1-ClassII.bowtielog</p>
<p>DQB1: 4.47 RPKM</p>
<p>DRB1: 5.59 RPKM</p>
<p>DQA1: 0.44 RPKM</p>
<p>-----------4 digit typing results-------------</p>
<p>#Locus Allele 1       Confidence     Allele 2       Confidence</p>
<p>!DQA   DQA1*01:02'     0.1511134       DQA1*02:01     0.0</p>
<p>!DQB   DQB1*02:01'     0.02321615     DQB1*05:01     0.42202</p>
<p>!DRB   DRB1*15:02'     2.595144e-05   DRB1*07:01     0.321219</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/841.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
