<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; DESeq2</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/deseq2/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>我用rmarkdown写过的教程</title>
		<link>http://www.bio-info-trainee.com/2372.html</link>
		<comments>http://www.bio-info-trainee.com/2372.html#comments</comments>
		<pubDate>Wed, 15 Mar 2017 09:16:05 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[R]]></category>
		<category><![CDATA[DESeq2]]></category>
		<category><![CDATA[GEOquery]]></category>
		<category><![CDATA[limma]]></category>
		<category><![CDATA[rmarkdown]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=2372</guid>
		<description><![CDATA[用rmarkdown写教程真心非常方便，尤其是R语言相关的，比如一些R包的应用， &#8230; <a href="http://www.bio-info-trainee.com/2372.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div>用rmarkdown写教程真心非常方便，尤其是R语言相关的，比如一些R包的应用，或者一些可视化，或者一些统计，下面我简单列出一些我以前写过的，图文并茂，关键是还非常省心，不需要排版，不需要上传图片，整理图片。</div>
<p>一般来说看链接最后的文件名就知道这篇文章讲的是什么了：</p>
<div>首先是几个R包的讲解：<br />
<a href="http://www.bio-info-trainee.com/bioconductor_China/software/limma.html" target="_blank">http://www.bio-info-trainee.com/ ... software/limma.html</a><br />
<a href="http://www.bio-info-trainee.com/bioconductor_China/software/DESeq2.html" target="_blank">http://www.bio-info-trainee.com/ ... oftware/DESeq2.html</a><br />
<a href="http://www.bio-info-trainee.com/bioconductor_China/software/GEOquery.html" target="_blank">http://www.bio-info-trainee.com/ ... tware/GEOquery.html</a><br />
<a href="http://www.bio-info-trainee.com/bioconductor_China/software/limma_voom.html" target="_blank">http://www.bio-info-trainee.com/ ... are/limma_voom.html</a><br />
当然，一些并不是bioconductor的包我也会写教程， 偶尔：<br />
<a href="http://www.bio-info-trainee.com/bioconductor_China/software/GOplot.html" target="_blank">http://www.bio-info-trainee.com/ ... oftware/GOplot.html</a><br />
<a href="http://www.bio-info-trainee.com/bioconductor_China/software/Rcircos.html" target="_blank">http://www.bio-info-trainee.com/ ... ftware/Rcircos.html</a></div>
<p><span id="more-2372"></span></p>
<div></div>
<div>下面是一个统计学里面的逻辑分析的讲解</div>
<div><a href="http://www.bio-info-trainee.com/tmp/tutorial_for_logical_analysis.html">http://www.bio-info-trainee.com/tmp/tutorial_for_logical_analysis.html</a></div>
<div>下面是一个表达矩阵的15个常见的可视化图形的制作：</div>
<div><a href="http://bio-info-trainee.com/tmp/basic_visualization_for_expression_matrix.html">http://bio-info-trainee.com/tmp/basic_visualization_for_expression_matrix.html</a></div>
<div></div>
<div>
<h1 class="title toc-ignore">用deconstructSigs来做cosmic的mutation signature图</h1>
</div>
<div><a href="http://biotrainee.com/jmzeng/markdown/deconstuctSigs.html" target="_blank">http://biotrainee.com/jmzeng/markdown/deconstuctSigs.html</a></div>
<div></div>
<div>这个史上最全方差分析，不是我写的，但是写的很赞，我就不多此一举了：</div>
<div><a href="http://biotrainee.com/jmzeng/markdown/ANOVA.html" target="_blank">http://biotrainee.com/jmzeng/markdown/ANOVA.html  </a>推荐大家看看</div>
<div></div>
<div>
<h1 class="title toc-ignore">标准的基因检测报告目录  <a href="http://www.biotrainee.com/jmzeng/blogMyGenome/name_introduction.html" target="_blank">http://www.biotrainee.com/jmzeng/blogMyGenome/name_introduction.html</a></h1>
</div>
<div></div>
<div></div>
<div></div>
<h1><strong><span style="color: #ff0000;">下面是一堆高通量测序分析的结题报告：</span></strong></h1>
<div></div>
<div> 简单 <span style="color: #6e8b3d;">RNA-seq</span> 项目结题报告</div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Ref_RNAseq_result/index.html" target="_blank">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Ref_RNAseq_result/index.html</a></div>
<div></div>
<div>
<div>16s rDNA 高变区测序 项目结题报告</div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/16sRNA/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/16sRNA/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/16sRNA/index.html">示范 宏基因组分析 结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/MetaGenome_result/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/MetaGenome_result/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/MetaGenome_result/index.html">示范 细菌基因组分析 结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Pacbio_Genome_result/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Pacbio_Genome_result/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Pacbio_Genome_result/index.html">示范 小RNA 项目结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/SmallRNA_result/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/SmallRNA_result/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/SmallRNA_result/index.html">示范 lncRNA 项目结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/lncRNA_result/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/lncRNA_result/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/lncRNA_result/index.html">示范ChIP-Seq结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/chip-report/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/chip-report/index.html</a></div>
<div></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/chip-report/index.html">示范 转录组测序（De novo） 项目结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Denovo_transcriptome/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Denovo_transcriptome/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/Denovo_transcriptome/index.html">示范 WGCNA分析 结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/WGCNA_Traits_result/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/WGCNA_Traits_result/index.html</a></div>
<div></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/WGCNA_Traits_result/index.html">蛋白iTRAQ定量分析 项目结题报告</a></div>
<div><a href="http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/iTRAQ_Result/index.html">http://www.biotrainee.com/jmzeng/html_report/d/e/e/p/i/n/iTRAQ_Result/index.html</a></div>
<div></div>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/2372.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>自学miRNA-seq分析第六讲~miRNA表达量差异分析</title>
		<link>http://www.bio-info-trainee.com/1714.html</link>
		<comments>http://www.bio-info-trainee.com/1714.html#comments</comments>
		<pubDate>Fri, 01 Jul 2016 15:11:26 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[tutorial]]></category>
		<category><![CDATA[DESeq]]></category>
		<category><![CDATA[DESeq2]]></category>
		<category><![CDATA[miRNA-seq]]></category>
		<category><![CDATA[差异分析]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1714</guid>
		<description><![CDATA[这一讲是miRNA-seq数据分析的分水岭，前面的5讲说的是读文献下载数据比对然 &#8230; <a href="http://www.bio-info-trainee.com/1714.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>这一讲是miRNA-seq数据分析的分水岭，前面的5讲说的是读文献下载数据比对然后计算表达量，属于常规的流程分析，一般在公司测序之后都可以拿到分析结果，或者文献也会给出下载结果。但是单纯的分析一个样本意义不大，一般来说，我们做研究都是针对于不同状态下的miRNA表达量差异分析，然后做注释，功能分析，网络分析，这才是重点，也是难点。我这里就直接拿文献处理好的miRNA表达量来展示如何做下游分析，首先就是差异分析啦：<span id="more-1714"></span>根据文献，我们可以知道样本的分类情况是:</p>
<blockquote>
<div>GSM1470353: control-CM, experiment1; Homo sapiens; miRNA-Seq   SRR1542714</div>
<div>GSM1470354: ET1-CM, experiment1; Homo sapiens; miRNA-Seq  SRR1542715</div>
<div>GSM1470355: control-CM, experiment2; Homo sapiens; miRNA-SeqSRR1542716</div>
<div>GSM1470356: ET1-CM, experiment2; Homo sapiens; miRNA-Seq SRR1542717</div>
<div>GSM1470357: control-CM, experiment3; Homo sapiens; miRNA-Seq SRR1542718</div>
<div>GSM1470358: ET1-CM, experiment3; Homo sapiens; miRNA-Seq SRR1542719</div>
<div>可以看到是6个样本的测序数据，分成两组，就是ET1刺激了CM细胞系前后对比而已！</div>
</blockquote>
<div>同时，我们也拿到了这6个样本的表达矩阵，计量单位是counts的reads数，所以我们一般会选用DESeq2，edgeR这样的常用包来做差异分析，当然，做差异分析的工具还有十几个，我这里只是拿一根最顺手的举例子，就是DESeq2</div>
<div>下面的代码有点长，因为我在bioconductor系列教程里面多次提到了DESeq2使用方法，这里就只贴出代码，反正我要说的重点就是，我们进行了差异分析，然后得到差异miRNA列表</div>
<blockquote>
<div>### step8: differential expression analysis by R package for miRNA expression patterns:<br />
## 文章里面提到的结果是：<br />
MicroRNA sequencing revealed over 250 known and 34 predicted novel miRNAs to be differentially expressed between ET-1 stimulated and unstimulated control hiPSC-CMs.<br />
## (FDR &lt; 0.1 and 1.5 fold change)<br />
rm(list=ls())<br />
setwd('J:\\miRNA_test\\paper_results')  ##把从GEO里面下载的文献结果放在这里<br />
sampleIDs=c()<br />
groupList=c()<br />
allFiles=list.files(pattern = '.txt')<br />
i=allFiles[1]<br />
sampleID=strsplit(i,"_")[[1]][1]<br />
treat=strsplit(i,"_")[[1]][4]<br />
dat=read.table(i,stringsAsFactors = F)<br />
colnames(dat)=c('miRNA',sampleID)<br />
groupList=c(groupList,treat)<br />
for (i in allFiles[-1]){<br />
sampleID=strsplit(i,"_")[[1]][1]<br />
treat=strsplit(i,"_")[[1]][4]<br />
a=read.table(i,stringsAsFactors = F)<br />
colnames(a)=c('miRNA',sampleID)<br />
dat=merge(dat,a,by='miRNA')<br />
groupList=c(groupList,treat)<br />
}</div>
<div>### 上面的代码只是为了把6个独立的表达文件给合并成一个表达矩阵<br />
## we need to filter the low expression level miRNA<br />
exprSet=dat[,-1]<br />
rownames(exprSet)=dat[,1]<br />
suppressMessages(library(DESeq2))<br />
exprSet=ceiling(exprSet)<br />
(colData &lt;- data.frame(row.names=colnames(exprSet), groupList=groupList))</div>
<div>## DESeq2就是这么简单的用<br />
dds &lt;- DESeqDataSetFromMatrix(countData = exprSet,<br />
colData = colData,<br />
design = ~ groupList)<br />
dds &lt;- DESeq(dds)<br />
png("qc_dispersions.png", 1000, 1000, pointsize=20)<br />
plotDispEsts(dds, main="Dispersion plot")<br />
dev.off()<br />
res &lt;- results(dds)<br />
## 画一些图，相当于做QC吧<br />
png("RAWvsNORM.png")<br />
rld &lt;- rlogTransformation(dds)<br />
exprSet_new=assay(rld)<br />
par(cex = 0.7)<br />
n.sample=ncol(exprSet)<br />
if(n.sample&gt;40) par(cex = 0.5)<br />
cols &lt;- rainbow(n.sample*1.2)<br />
par(mfrow=c(2,2))<br />
boxplot(exprSet,  col = cols,main="expression value",las=2)<br />
boxplot(exprSet_new, col = cols,main="expression value",las=2)<br />
hist(exprSet[,1])<br />
hist(exprSet_new[,1])<br />
dev.off()library(RColorBrewer)<br />
(mycols &lt;- brewer.pal(8, "Dark2")[1:length(unique(groupList))])</p>
<p># Sample distance heatmap<br />
sampleDists &lt;- as.matrix(dist(t(exprSet_new)))<br />
#install.packages("gplots",repos = "http://cran.us.r-project.org")<br />
library(gplots)<br />
png("qc-heatmap-samples.png", w=1000, h=1000, pointsize=20)<br />
heatmap.2(as.matrix(sampleDists), key=F, trace="none",<br />
col=colorpanel(100, "black", "white"),<br />
ColSideColors=mycols[groupList], RowSideColors=mycols[groupList],<br />
margin=c(10, 10), main="Sample Distance Matrix")<br />
dev.off()</p>
<p>png("MA.png")<br />
DESeq2::plotMA(res, main="DESeq2", ylim=c(-2,2))<br />
dev.off()<br />
## 重点就是这里啦，得到了差异分析的结果<br />
resOrdered &lt;- res[order(res$padj),]<br />
resOrdered=as.data.frame(resOrdered)<br />
write.csv(resOrdered,"<span style="color: #ff0000;"><strong>deseq2.results.csv</strong></span>",quote = F)</p>
<p>##下面也是一些图，主要是看看样本之间的差异情况<br />
library(limma)<br />
plotMDS(log(counts(dds, normalized=TRUE) + 1))<br />
plotMDS(log(counts(dds, normalized=TRUE) + 1) - log(t( t(assays(dds)[["mu"]]) / sizeFactors(dds) ) + 1))<br />
plotMDS( assays(dds)[["counts"]] )  ## raw count<br />
plotMDS( assays(dds)[["mu"]] ) ##- fitted values.</p>
</div>
</blockquote>
<div>最后我们得到的差异分析结果：deseq2.results.csv，就可以跟进FDR和fold change来挑选符合要求的差异miRNA啦</div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1714.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>用R语言的DESeq2包来对RNA-seq数据做差异分析</title>
		<link>http://www.bio-info-trainee.com/1533.html</link>
		<comments>http://www.bio-info-trainee.com/1533.html#comments</comments>
		<pubDate>Mon, 11 Apr 2016 11:21:35 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[R]]></category>
		<category><![CDATA[基础软件]]></category>
		<category><![CDATA[bioconductor]]></category>
		<category><![CDATA[DESeq]]></category>
		<category><![CDATA[DESeq2]]></category>
		<category><![CDATA[RNA-seq]]></category>
		<category><![CDATA[差异分析]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1533</guid>
		<description><![CDATA[我以前写过DESeq，以及过时了：http://www.bio-info-tra &#8230; <a href="http://www.bio-info-trainee.com/1533.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>我以前写过DESeq，以及过时了：<a href="http://www.bio-info-trainee.com/867.html">http://www.bio-info-trainee.com/867.html</a></p>
<p>正好准备筹集bioconductor中文社区，我写简单讲一下DESeq2这个包如何用！</p>
<p><span id="more-1533"></span></p>
<blockquote><p>library(DESeq2)<br />
library(limma)<br />
library(pasilla)<br />
data(pasillaGenes)<br />
exprSet=counts(pasillaGenes)  ##做好表达矩阵<br />
group_list=pasillaGenes$condition##做好分组因子即可</p>
<p>(colData &lt;- data.frame(row.names=colnames(exprSet), group_list=group_list))<br />
dds &lt;- DESeqDataSetFromMatrix(countData = exprSet,<br />
colData = colData,<br />
design = ~ group_list)</p>
<p>##上面是第一步第一步，构建dds这个对象，<span style="color: #ff0000;">需要一个表达矩阵和分组矩阵！！！</span></p>
<div>
<blockquote>
<div>dds2 &lt;- DESeq(dds)  ##第二步，直接用DESeq函数即可</div>
<div>resultsNames(dds2)</div>
<div>res &lt;-  results(dds2, contrast=c("group_list","treated","untreated"))</div>
<div>## 提取你想要的差异分析结果，我们这里是treated组对untreated组进行比较</div>
<div>resOrdered &lt;- res[order(res$padj),]</div>
<div>resOrdered=as.data.frame(resOrdered)</div>
</blockquote>
<div>可以看到程序非常好用！</div>
<div>它只对RNA-seq的基因的reads的counts数进行分析，请不要用RPKM等经过了normlization的表达矩阵来分析。</div>
<div>值得一提的是DESeq2软件独有的normlization方法！</div>
<p>rld &lt;- rlogTransformation(dds2)  ## 得到经过DESeq2软件normlization的表达矩阵！<br />
exprSet_new=assay(rld)<br />
par(cex = 0.7)<br />
n.sample=ncol(exprSet)<br />
if(n.sample&gt;40) par(cex = 0.5)<br />
cols &lt;- rainbow(n.sample*1.2)<br />
par(mfrow=c(2,2))<br />
boxplot(exprSet, col = cols,main="expression value",las=2)<br />
boxplot(exprSet_new, col = cols,main="expression value",las=2)<br />
hist(exprSet)<br />
hist(exprSet_new)</p>
</div>
</blockquote>
<div></div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/04/QQ图片20160411191736.png"><img class="alignnone  wp-image-1534" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/04/QQ图片20160411191736.png" alt="QQ图片20160411191736" width="586" height="337" /></a></p>
<div>
<div>看这个图就知道了，它把本来应该是数据离散程度非常大的RNA-seq的基因的reads的counts矩阵经过normlization后变成了类似于芯片表达数据的表达矩阵，然后其实可以直接用T检验来找差异基因了！</div>
<div></div>
<div>但是，如果你的分组不只是两个，就复杂了，你需要再仔细研读说明书，甚至你可能需要咨询实验设计人员或者统计人员！</div>
<div></div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1533.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
