<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; 细胞系</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/%e7%bb%86%e8%83%9e%e7%b3%bb/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>CCLE数据库几个知识点</title>
		<link>http://www.bio-info-trainee.com/1324.html</link>
		<comments>http://www.bio-info-trainee.com/1324.html#comments</comments>
		<pubDate>Mon, 11 Jan 2016 11:03:47 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[cancer]]></category>
		<category><![CDATA[CCLE]]></category>
		<category><![CDATA[数据库]]></category>
		<category><![CDATA[细胞系]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=1324</guid>
		<description><![CDATA[发表ccle的文献：http://www.ncbi.nlm.nih.gov/pm &#8230; <a href="http://www.bio-info-trainee.com/1324.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div><span style="font-family: Times New Roman;">发表ccle的文献：<a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3320027/">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3320027/</a></span></div>
<div><span style="font-family: Times New Roman;">Here we describe the Cancer Cell Line Encyclopedia (CCLE): a compilation of gene expression, chromosomal copy number, and massively parallel sequencing data from 947 human cancer cell lines. </span></div>
<div><span style="font-family: Times New Roman;">收集了三种数据：</span></div>
<div><span style="font-family: Times New Roman;">The mutational status of &gt;1,600 genes was determined by targeted massively parallel sequencing, followed by removal of variants likely to be germline events . </span></div>
<div><span style="font-family: Times New Roman;">Moreover, 392 recurrent mutations affecting 33 known cancer genes were assessed by mass spectrometric genotyping13 . </span></div>
<div><span style="font-family: Times New Roman;">DNA copy number was measured using high-density single nucleotide polymorphism arrays (Affymetrix SNP 6.0; Supplementary Methods). </span></div>
<div><span style="font-family: Times New Roman;">Finally, mRNA expression levels were obtained for each of the lines using Affymetrix U133 plus 2.0 arrays. </span></div>
<div><span style="font-family: Times New Roman;">These data were also used to confirm cell line identities .</span></div>
<div><span style="font-family: Times New Roman;">一般用得最多的就是表达数据，因为表达数据最简单，大多数生物信息学分析着只会用这个数据！</span></div>
<div><span style="font-family: Times New Roman;">而它的突变数据又不是通常意义的高通量测序得到的，snp6芯片数据很多人听都没听过</span></div>
<div><span style="font-family: Times New Roman;">文章的<b>附件</b>有对cell lines的具体描述。</span></div>
<div><span style="font-family: Times New Roman;"><a href="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/different_kinds_of_cancer_in_CCLE.png"><img class="alignnone size-full wp-image-1325" src="http://www.bio-info-trainee.com/wp-content/uploads/2016/01/different_kinds_of_cancer_in_CCLE.png" alt="different_kinds_of_cancer_in_CCLE" width="489" height="411" /></a></span></div>
<div><span style="font-family: Times New Roman;">CCLE的数据在broad institute里面可以下载，也放在GEO数据库里面，我比较喜欢GEO里面的数据</span></div>
<div><a href="http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE36139"><span style="font-family: Times New Roman;">http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE36139</span></a></div>
<div><span style="font-family: Times New Roman;">This SuperSeries is composed of the following SubSeries:</span></div>
<div><span style="font-family: Times New Roman;">GSE36133 Expression data from the Cancer Cell Line Encyclopedia (CCLE)</span></div>
<div><span style="font-family: Times New Roman;">GSE36138 SNP array data from the Cancer Cell Line Encyclopedia (CCLE)</span></div>
<div><span style="font-family: Times New Roman;">GSE36133这个study的<b>metadata</b>里面有对每个cellline来源的cancer进行描述！</span></div>
<div><span style="font-family: Times New Roman;">有人喜欢把这个<b>metadata叫做是clinical data。</b></span></div>
<div><span style="font-family: Times New Roman;">library(GEOquery)</span></div>
<div><span style="font-family: Times New Roman;">ccleFromGEO &lt;- getGEO("GSE36133")</span></div>
<div><span style="font-family: Times New Roman;">annotBlock1 &lt;-<span class="Apple-converted-space"> </span><b>pData</b>(<b>phenoData</b>(ccleFromGEO[[1]]))</span></div>
<div><span style="font-family: Times New Roman;">&gt;dim(annotBlock1)</span></div>
<div><span style="font-family: Times New Roman;">[1] 917  38</span></div>
<div><span style="font-family: Times New Roman;">exprSet=exprs(ccleFromGEO[[1]])</span></div>
<div><span style="font-family: Times New Roman;">&gt; dim(<b>exprSet</b>)</span></div>
<div><span style="font-family: Times New Roman;">[1] 18926   917</span></div>
<div><span style="font-family: Times New Roman;">##它的表达数据矩阵，包含了18926个基因，列名是917个细胞系的名字，行是基因的entrez ID</span></div>
<div><span style="font-family: Times New Roman;">keyColumns &lt;- c("title", "source_name_ch1", "characteristics_ch1", "characteristics_ch1.1", </span></div>
<div><span style="font-family: Times New Roman;">    "characteristics_ch1.2")</span></div>
<div><span style="font-family: Times New Roman;">options(stringsAsFactors = F)</span></div>
<div><span style="font-family: Times New Roman;">allAnnot=annotBlock1[,keyColumns]</span></div>
<div><span style="font-family: Times New Roman;">##这几列信息是比较重要的metadata，里面详细记录了细胞系的收集公司单位，tissue，癌症分类等信息</span></div>
<div>
<div><span style="font-family: Times New Roman;">Cell line （1035个细胞系简介）Gene Sets</span></div>
<div><span style="font-family: Times New Roman;">1035 sets of genes with high or low expression in each cell line relative to other cell lines from the CCLE Cell Line Gene Expression Profiles dataset.</span></div>
<div><a href="http://amp.pharm.mssm.edu/Harmonizome/dataset/CCLE+Cell+Line+Gene+Expression+Profiles"><span style="font-family: Times New Roman;">http://amp.pharm.mssm.edu/Harmonizome/dataset/CCLE+Cell+Line+Gene+Expression+Profiles</span></a></div>
<div><span style="font-family: Times New Roman;">一些关于CCLE数据库的文章：</span></div>
<div><a href="http://cancerres.aacrjournals.org/content/73/8_Supplement/2409.short"><span style="font-family: Times New Roman;">http://cancerres.aacrjournals.org/content/73/8_Supplement/2409.short</span></a></div>
<div><a href="http://cancerres.aacrjournals.org/content/74/22/6390.short"><span style="font-family: Times New Roman;">http://cancerres.aacrjournals.org/content/74/22/6390.short</span></a></div>
<div><a href="https://clincancerres.aacrjournals.org/content/19/19_Supplement/IA2.abstract"><span style="font-family: Times New Roman;">https://clincancerres.aacrjournals.org/content/19/19_Supplement/IA2.abstract</span></a></div>
<div><span style="font-family: Times New Roman;"><a href="http://onlinelibrary.wiley.com/doi/10.1002/cncy.21471/pdf">http://onlinelibrary.wiley.com/doi/10.1002/cncy.21471/pdf</a><span class="Apple-converted-space"> </span>介绍了几个类似的数据库资源</span></div>
<div><span style="font-family: Times New Roman;"><a href="http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0088557">http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0088557</a><span class="Apple-converted-space"> </span> 讲解了high/low的知识</span></div>
<div><span style="font-family: Times New Roman;"><a href="http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7060697">http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7060697</a><span class="Apple-converted-space"> </span> 药物相关</span></div>
<div><span style="font-family: Times New Roman;">Anticancer drug sensitivity analysis: An integrated approach applied to Erlotinib sensitivity prediction in the CCLE database</span></div>
<div><span style="font-family: Times New Roman;"><a href="http://biorxiv.org/content/biorxiv/early/2015/10/02/028159.full.pdf">http://biorxiv.org/content/biorxiv/early/2015/10/02/028159.full.pdf</a><span class="Apple-converted-space"> </span>比较了CCLE和TCGA的数据</span></div>
<div></div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/1324.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
