<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; snp6.0</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/snp6-0/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>picnic对拷贝数变异检测芯片数据进行分析</title>
		<link>http://www.bio-info-trainee.com/1299.html</link>
		<comments>http://www.bio-info-trainee.com/1299.html#comments</comments>
		<pubDate>Wed, 06 Jan 2016 01:03:34 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[基础软件]]></category>
		<category><![CDATA[picnic]]></category>
		<category><![CDATA[snp6.0]]></category>
		<category><![CDATA[拷贝数]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1299</guid>
		<description><![CDATA[这里的拷贝数变异检测芯片指的是Affymetrix Genome-Wide Hu &#8230; <a href="http://www.bio-info-trainee.com/1299.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>这里的拷贝数变异检测芯片指的是Affymetrix Genome-Wide Human SNP Array 6.0</p>
<div>cel数据，处理成segment及genotype数据</p>
<div><b>一、程序安装</b></div>
<div>这本来是一个matlab程序，但是有linux版本，需要安装matlab编译环境</div>
<div>下载地址是 <a href="ftp://ftp.sanger.ac.uk/pub/cancer/picnic_software/picnic_src/">ftp://ftp.sanger.ac.uk/pub/cancer/picnic_software/picnic_src/</a></div>
<div></div>
<div>   <a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/clipboard1.png"><img class="alignnone size-full wp-image-1304" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/clipboard1.png" alt="clipboard" width="764" height="214" /></a></div>
<div><b>下载解压之后首先安装matlab环境：</b></div>
<div><b>./MCRInstaller.bin -console</b></div>
<div>因为我的服务器没有界面，所以我用了-console执行安装程序，很简单就安装好了</div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/clipboard21.png"><img class="alignnone size-full wp-image-1305" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/clipboard21.png" alt="clipboard2" width="539" height="348" /></a></div>
<div>其中，cdf目录下面是我们从<a href="http://www.affymetrix.com/">http://www.affymetrix.com</a> 里面下载的<b>Library Files</b><a href="http://www.affymetrix.com/Auth/support/downloads/library_files/genomewidesnp6_libraryfile.zip">Genome-Wide Human SNP Array 6.0 (zip, 246 MB)</a></div>
<div>celConverter目录下面是一个java程序，可以把芯片出来的<b>cel文件转为flat file</b></div>
<div>其中Matlab_running 是我执行安装的matlab环境</div>
<div>其余两个脚本run_HMM.sh  run_preprocessing.sh是picnic程序的第二步和第三步</div>
<div><b><span style="color: #ff0000;">二、输入数据准备</span></b></div>
<div>我随便找了两个snp6.0芯片的raw data</div>
<div><span style="font-family: Calibri; font-size: medium;">-rw-rw-r-- 1 jmzeng jmzeng 66M Dec 30 06:30 GSM1949207.CEL</span></div>
<div><span style="font-family: Calibri; font-size: medium;">-rw-rw-r-- 1 jmzeng jmzeng 66M Sep  9 11:08 GSM887898.CEL</span></div>
<div>
<div></div>
<div><b><span style="color: #ff0000;">三、程序使用</span></b></p>
<div>程序主要分为三个步骤来实现</div>
<div><b>Step 1</b>: Convert the binary *.cel file to a flat file<br />
<b>Step 2</b>: Normalise and estimate the ploidy of the genome and the level of normal contamination<br />
<b>Step 3</b>: Segment the data and produce the genotype</div>
<div>第一步：</div>
<div>暂时还不需要matlab工作环境</div>
<div>示例的code是：<i>java -Xmx2G -jar CelFileConverter.jar -m Snp6FeatureMappings.csv -c 'cdf_file_including_path' -s 'directory name</i><br />
<i>of cel files' -t rootDir/outdir/raw</i></div>
<div>我的code是：</div>
</div>
</div>
<p><span style="font-family: Calibri; font-size: medium;">jmzeng@ubuntu:/home/jmzeng/bio-soft/picnic/c_code/celConverter$<span style="color: #ff0000;"> java -jar CelFileConverter.jar -m Snp6FeatureMappings.csv -c ../cdf/GenomeWideSNP_6.Full.cdf -s ../celFiles/ -t ../result/ </span><br />
Loading CDF file: ../cdf/GenomeWideSNP_6.Full.cdf<br />
Loading feature mapping file: Snp6FeatureMappings.csv<br />
Processing CEL file: /home/jmzeng/bio-soft/picnic/c_code/celConverter/../celFiles/GSM1949207.CEL<br />
Created feature intensity file: /home/jmzeng/bio-soft/picnic/c_code/celConverter/../result/GSM1949207.feature_intensity<br />
Processing CEL file: /home/jmzeng/bio-soft/picnic/c_code/celConverter/../celFiles/GSM887898.CEL<br />
Created feature intensity file: /home/jmzeng/bio-soft/picnic/c_code/celConverter/../result/GSM887898.feature_intensity</span></p>
<div><span style="font-family: Calibri; font-size: medium;">然后在输出目录就有了一个feature_intensity后缀的文本文件，约110M</span></div>
<div><span style="font-family: Calibri; font-size: medium;">-rw-rw-r-- 1 jmzeng jmzeng 105M Dec 30 06:35 GSM1949207.feature_intensity</span></div>
<div><span style="font-family: Calibri; font-size: medium;">-rw-rw-r-- 1 jmzeng jmzeng 109M Dec 30 06:35 GSM887898.feature_intensity</span></div>
<div>
<div>第二步：</div>
<div>这里需要用matlab啦，但是这里经常出现库的问题！！！</div>
<div>示例的code是</div>
<div><i><span style="font-size: medium;">sh run_preprocessing.sh mcr_dir cell_name info_dir feature_int_dir normalised_outdir outdir sample_type in_pi</span></i><br />
我的code是：</div>
<div><a href="mailto:jmzeng@ubuntu">jmzeng@ubuntu</a>:/home/jmzeng/bio-soft/picnic/c_code$ sh run_preprocessing.sh Matlab_running/v710/ GSM1949207 info/ result/ result/output result/<br />
------------------------------------------<br />
Setting up environment variables<br />
---<br />
LD_LIBRARY_PATH is .:Matlab_running/v710//runtime/glnxa64:Matlab_running/v710//bin/glnxa64:Matlab_running/v710//sys/os/glnxa64:Matlab_running/v710//sys/java/jre/glnxa64/jre/lib/amd64/native_threads:Matlab_running/v710//sys/java/jre/glnxa64/jre/lib/amd64/server:Matlab_running/v710//sys/java/jre/glnxa64/jre/lib/amd64/client:Matlab_running/v710//sys/java/jre/glnxa64/jre/lib/amd64<br />
My Own Exception: Fatal error loading library /home/jmzeng/bio-soft/picnic/c_code/Matlab_running/v710/bin/glnxa64/libmwmclmcr.so <span style="color: #ff0000;">Error: libXp.so.6: cannot open shared object file: No such file or directory</span></div>
<div>第三步：</div>
<div>也需要用matlab，库的问题必须解决！！！</div>
<div>我另外一台服务器的结局办法是用v714的matlab</div>
<div>
<div>示例的code是</div>
<div><i><span style="font-size: large;">sh run_HMM.sh /nfs/team78pc2/kwl_temp/segments/PICNIC/C/release/Matlab_Compiler_Runtime/v710 A01_CGP_PD3945a.feature_intensity '/nfs/team78pc3/KWL/segments/PICNIC/matlab/C/release/info/' '/nfs/team78pc2/kwl_temp/segments/PICNIC/data/normalized/' '/nfs/team78pc2/kwl_temp/segments/PICNIC/data/' '10' '0.33598' '1.9915' '0.40997'</span></i><br />
我的code是：无所谓了，这个服务器不知道怎么回事，总是出现库文件的问题，而这个问题需要root权限，我懒得弄了，我在其它的服务器上面都木有问题的！！！</div>
</div>
<div><b>所以我就换了MATLAB，毕竟，这个软件本来就是matlab版本的，第一步照旧，第二步在matlab里面运行！</b></div>
<div><b>我这里只用GSM1949207做测试吧！！！</b></div>
<div><span style="font-family: Times New Roman;">Genomic DNA was extracted from saliva, peripheral blood, or fibroblast<span class="Apple-converted-space"> </span><span style="color: #ff0000;"><b>cell lines</b></span><span class="Apple-converted-space"> </span>using the QIAamp DNA Blood Mini Kit or QIAamp DNA Mini Kit. DNA quality and quantity was assessed using a Nanodrop Spectrophotometer and agarose gel electrophoresis.<br />
</span></div>
<div><span style="font-family: Times New Roman;">打开matlab，进入picnic目录</span></div>
<div><span style="font-family: Times New Roman;">重复第二步，输入：</span></div>
<div>preprocessing('GSM1949207.feature_intensity','info\','result\raw\','result\output\','result\')</div>
<div>需要7个参数，我这个是cell lines数据，所以后面两个参数省略不写！info文件夹自己下载放在picnic目录，其中result\raw 存放你第一步的结果文件！</div>
<div><span style="font-family: Times New Roman;">这一步运行，会比较久！</span></div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/3.png"><img class="alignnone size-full wp-image-1300" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/3.png" alt="3" width="707" height="109" /></a></div>
<div>同时可以看到程序在我的result目录里面新增加了两个目录用来存放结果，不过这一步的结果还是中间文件，就不解释了！</div>
<div>然后再重复第三步，输入：</div>
</div>
<p><span style="font-family: Times New Roman;"> HMM('GSM1949207.feature_intensity','info\','result\output\','result\',10,0,2.0221,0.40997)</span></p>
<div>这个软件的参数很没规律，最好不要像说明书那些用output做文件夹名字，不然，很容易出错。</div>
<div><span style="font-family: Times New Roman;">这一步好像也很耗时间！</span></div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/4.png"><img class="alignnone size-full wp-image-1301" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/4.png" alt="4" width="746" height="541" /></a></p>
<div>
<div>重要的结果有两个：</div>
<div></div>
<div><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/5.png"><img class="alignnone size-full wp-image-1302" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/5.png" alt="5" width="938" height="413" /></a><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/6.png"><img class="alignnone size-full wp-image-1303" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/6.png" alt="6" width="995" height="589" /></a></div>
<div></div>
<div></div>
<div></div>
<div></div>
</div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1299.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
