<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; bead</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/bead/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>illumina的bead 系列表达芯片扫盲</title>
		<link>http://www.bio-info-trainee.com/1937.html</link>
		<comments>http://www.bio-info-trainee.com/1937.html#comments</comments>
		<pubDate>Sat, 15 Oct 2016 11:54:38 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[芯片数据处理]]></category>
		<category><![CDATA[bead]]></category>
		<category><![CDATA[illumina]]></category>
		<category><![CDATA[芯片]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1937</guid>
		<description><![CDATA[表达芯片大家最熟悉的当然是affymetrix系列芯片啦，而且分析套路很简单，直 &#8230; <a href="http://www.bio-info-trainee.com/1937.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>表达芯片大家最熟悉的当然是affymetrix系列芯片啦，而且分析套路很简单，直接用R的affy包，就可以把cel文件经过RMA或者MAS5方法得到表达矩阵。illumina出厂的芯片略微有点不一样，它的原始数据有3个层级，一般拿到的是Processed data (<a href="ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE30nnn/GSE30669/suppl/GSE30669_HEK_Sample_Probe_Profile.txt.gz%20" target="_blank">示例</a>), 当仍然需要一系列的统计学方法才能提取到表达矩阵。接下来我们首先讲一讲illumina的bead 系列表达芯片基础知识吧：<span id="more-1937"></span></p>
<div>illumina是大厂家，所以芯片包括人类的，小鼠以及大鼠的，然后对于人来说，经历了V1~V4的进化过程，最新版是 V4。</div>
<div>GEO里面是这样介绍illumina bead V4这个芯片的：</div>
<div>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td>The HumanHT-12 v4 Expression BeadChip provides high throughput processing of 12 samples per BeadChip without the need for expensive, specialized automation. The BeadChip is designed to support flexible usage across a wide-spectrum of experiments.</td>
</tr>
</tbody>
</table>
</div>
<div>可以在官网下载芯片探针的详情，manifest数据文件：<a href="http://support.illumina.com/array/array_kits/humanht-12_v4_expression_beadchip_kit/downloads.html">http://support.illumina.com/array/array_kits/humanht-12_v4_expression_beadchip_kit/downloads.html</a> 这些文件写清楚了芯片用的探针的详情，包括使用了哪些control探针，主要是给它自己的BeadStudio 软件来使用的。</div>
<div>NCBI的GEO也提供大批量的公共数据：<a href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL10558">https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL10558</a></div>
<div>芯片厂家illumina本身提供数据<span style="color: #ff0000;">处理软件BeadStudio, GenomeStudio，在R/bioconductor上面也有开源的包做同样的事情，Illuminaio ，beadarray，lumi</span></div>
<div>数据的前期处理有3个层次，都在bioconductor有对应的包可以来处理</div>
<div>Data can be in raw form, where pixel-level data are available from TIFF images, allowing the complete data processing pipeline, including image analysis, to be carried out in R. 这种图片格式的数据，基本上没有人愿意去开始处理的，TIFF格式的图片压缩包，在BeadArrayUseCases包里面附带有一个测试数据</div>
<div>The next level, referred to as bead-level, refers to the availability of intensity and location information for individual beads. In this format, a given probe will have a variable number of replicate intensities per sample. Processed data, where replicate intensities have been summarized and outliers removed to give a mean, a measure of variability, and a number of observations per probe in each sample, is the most commonly available format.</div>
<div>数据处理流程如下：</div>
<div><img class="alignnone  wp-image-1939" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/10/illumina-microarray-data-3-step-processing.png" alt="illumina-microarray-data-3-step-processing" width="774" height="479" /></div>
<div>其实对芯片数据处理最重要的过程，就是如何做QC以及拿到表达量矩阵，后面的差异分析，功能富集分析其实是大同小异的。我比较喜欢用bioconductor包，会讲如何用 lumi包来处理这个芯片数据。</div>
<div>
<div>用bioconductor系列包来处理是最方便的，看这个教程就够了：<a href="https://bioconductor.org/packages/release/data/experiment/vignettes/BeadArrayUseCases/inst/doc/BeadArrayUseCases.pdf">https://bioconductor.org/packages/release/data/experiment/vignettes/BeadArrayUseCases/inst/doc/BeadArrayUseCases.pdf</a></div>
<div>数据处理流程还在plos one杂志上面发表过文章：<a href="http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002276">http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002276</a></div>
<div>BMC也有一篇：<a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4486126/">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4486126/</a> 他们团队做了一个网页版工具，直接可以上传illumina芯片的原始数据去做 全套分析：<a href="http://www.arrayanalysis.org/">http://www.arrayanalysis.org/</a></div>
<div>在R/bioconductor里面，跟人类相关的illumina beadseed芯片注释包如下：</div>
<div>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td><a href="http://www.bioconductor.org/packages/release/data/annotation/html/illuminaHumanv1.db.html" target="_blank">illuminaHumanv1.db</a></td>
<td>Mark Dunning</td>
<td>Illumina HumanWG6v1 annotation data (chip illuminaHumanv1)</td>
</tr>
<tr>
<td><a href="http://www.bioconductor.org/packages/release/data/annotation/html/illuminaHumanv2.db.html" target="_blank">illuminaHumanv2.db</a></td>
<td>Mark Dunning</td>
<td>Illumina HumanWG6v2 annotation data (chip illuminaHumanv2)</td>
</tr>
<tr>
<td><a href="http://www.bioconductor.org/packages/release/data/annotation/html/illuminaHumanv2BeadID.db.html" target="_blank">illuminaHumanv2BeadID.db</a></td>
<td>Mark Dunning</td>
<td>Illumina HumanWGv2 annotation data (chip illuminaHumanv2BeadID)</td>
</tr>
<tr>
<td><a href="http://www.bioconductor.org/packages/release/data/annotation/html/illuminaHumanv3.db.html" target="_blank">illuminaHumanv3.db</a></td>
<td>Mark Dunning</td>
<td>Illumina HumanHT12v3 annotation data (chip illuminaHumanv3)</td>
</tr>
<tr>
<td><a href="http://www.bioconductor.org/packages/release/data/annotation/html/illuminaHumanv4.db.html" target="_blank">illuminaHumanv4.db</a></td>
<td>Mark Dunning</td>
<td>Illumina HumanHT12v4 annotation data (chip illuminaHumanv4)</td>
</tr>
</tbody>
</table>
</div>
<div>详情可以去bioconductor官网搜索：<a href="http://www.bioconductor.org/packages/release/BiocViews.html#___AnnotationData">http://www.bioconductor.org/packages/release/BiocViews.html#___AnnotationData</a></div>
<div>芯片包装如下：</div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/10/illumina-microarray.png"><img class="alignnone  wp-image-1942" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/10/illumina-microarray.png" alt="illumina-microarray" width="694" height="511" /></a></div>
</div>
<div></div>
<div></div>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1937.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
