<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; bowtie2</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/bowtie2/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>自学CHIP-seq分析第五讲~测序数据比对</title>
		<link>http://www.bio-info-trainee.com/1742.html</link>
		<comments>http://www.bio-info-trainee.com/1742.html#comments</comments>
		<pubDate>Tue, 05 Jul 2016 00:42:39 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[tutorial]]></category>
		<category><![CDATA[alignment]]></category>
		<category><![CDATA[bowtie2]]></category>
		<category><![CDATA[CHIP-seq]]></category>
		<category><![CDATA[mapping]]></category>
		<category><![CDATA[unique]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1742</guid>
		<description><![CDATA[比对本质是是很简单的了，各种mapping工具层出不穷，我们一般常用的就是BWA &#8230; <a href="http://www.bio-info-trainee.com/1742.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>比对本质是是很简单的了，各种mapping工具层出不穷，我们一般常用的就是BWA和bowtie了，我这里就挑选bowtie2吧，反正别人已经做好了各种工具效果差异的比较，我们直接用就好了，代码如下：</p>
<blockquote><p>## step5 : alignment to hg19/ using bowtie2 to do alignment<br />
## ~/biosoft/bowtie/bowtie2-2.2.9/bowtie2-build ~/biosoft/bowtie/hg19_index /hg19.fa ~/biosoft/bowtie/hg19_index/hg19<br />
## cat &gt;run_bowtie2.sh<br />
ls *.fastq | while read id ;<br />
do<br />
echo $id<br />
#~/biosoft/bowtie/bowtie2-2.2.9/<span style="color: #ff0000;">bowtie2 -p 8 -x</span> ~/biosoft/bowtie/hg19_index/hg19 -U $id -S ${id%%.*}.sam 2&gt;${id%%.*}.align.log;<br />
<span style="color: #ff0000;">#samtools view -bhS -q 30</span> ${id%%.*}.sam &gt; ${id%%.*}.bam ## -F 1548 https://broadinstitute.github.io/picard/explain-flags.html<br />
# -F 0x4 remove the reads that didn't match<br />
samtools sort ${id%%.*}.bam ${id%%.*}.sort ## prefix for the output<br />
# samtools view -bhS a.sam | samtools sort -o - ./ &gt; a.bam<br />
samtools index ${id%%.*}.sorted.bam<br />
done</p></blockquote>
<p>这个索引~/biosoft/bowtie/hg19_index/hg19需要自己提取建立好，见前文</p>
<p>初步比对的sam文件到底该如何过滤，我查了很多文章都没有给出个子丑寅卯，各执一词，我也没办法给大家一个标准，反正我测试了好几种，看起来call peaks的差异不大，就是得不到文章给出的那些结果！！</p>
<p>一般来说，初步比对的sam文件只能选取unique mapping的结果，所以我用了#samtools view -bhS -q 30，但是结果并没什么改变，有人说是peak caller这些工具本身就会做这件事，所以取决于你下游分析所选择的工具。</p>
<p>给大家看比对的日志吧：</p>
<blockquote><p>SRR1042593.fastq<br />
16902907 reads; of these:<br />
16902907 (100.00%) were unpaired; of these:<br />
667998 (3.95%) aligned 0 times<br />
12467095 (73.76%) aligned exactly 1 time<br />
3767814 (22.29%) aligned &gt;1 times<br />
96.05% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042594.fastq<br />
60609833 reads; of these:<br />
60609833 (100.00%) were unpaired; of these:<br />
9165487 (15.12%) aligned 0 times<br />
39360173 (64.94%) aligned exactly 1 time<br />
12084173 (19.94%) aligned &gt;1 times<br />
84.88% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042595.fastq<br />
14603295 reads; of these:<br />
14603295 (100.00%) were unpaired; of these:<br />
918028 (6.29%) aligned 0 times<br />
10403045 (71.24%) aligned exactly 1 time<br />
3282222 (22.48%) aligned &gt;1 times<br />
93.71% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042596.fastq<br />
65911151 reads; of these:<br />
65911151 (100.00%) were unpaired; of these:<br />
10561790 (16.02%) aligned 0 times<br />
42271498 (64.13%) aligned exactly 1 time<br />
13077863 (19.84%) aligned &gt;1 times<br />
83.98% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042597.fastq<br />
22210858 reads; of these:<br />
22210858 (100.00%) were unpaired; of these:<br />
1779568 (8.01%) aligned 0 times<br />
15815218 (71.20%) aligned exactly 1 time<br />
4616072 (20.78%) aligned &gt;1 times<br />
91.99% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042598.fastq<br />
58068816 reads; of these:<br />
58068816 (100.00%) were unpaired; of these:<br />
8433671 (14.52%) aligned 0 times<br />
37527468 (64.63%) aligned exactly 1 time<br />
12107677 (20.85%) aligned &gt;1 times<br />
85.48% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042599.fastq<br />
24019489 reads; of these:<br />
24019489 (100.00%) were unpaired; of these:<br />
1411095 (5.87%) aligned 0 times<br />
17528479 (72.98%) aligned exactly 1 time<br />
5079915 (21.15%) aligned &gt;1 times<br />
94.13% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.<br />
SRR1042600.fastq<br />
76361026 reads; of these:<br />
76361026 (100.00%) were unpaired; of these:<br />
8442054 (11.06%) aligned 0 times<br />
50918615 (66.68%) aligned exactly 1 time<br />
17000357 (22.26%) aligned &gt;1 times<br />
88.94% overall alignment rate<br />
[samopen] SAM header is present: 93 sequences.</p></blockquote>
<p>可以看到比对非常成功！！！我这里就不用表格的形式来展现了，毕竟我又不是给客户写报告，大家就将就着看吧。</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1742.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
