<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; 单细胞</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/%e5%8d%95%e7%bb%86%e8%83%9e/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>单细胞转录组数据分析CNV</title>
		<link>http://www.bio-info-trainee.com/3065.html</link>
		<comments>http://www.bio-info-trainee.com/3065.html#comments</comments>
		<pubDate>Sat, 17 Feb 2018 10:17:02 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[cancer]]></category>
		<category><![CDATA[cnv]]></category>
		<category><![CDATA[单细胞]]></category>
		<category><![CDATA[转录组]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=3065</guid>
		<description><![CDATA[单细胞转录组数据分析CNV 都来aviv Regev自于实验室，一系列文章都利用 &#8230; <a href="http://www.bio-info-trainee.com/3065.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<h1 class="md-end-block md-heading md-focus" contenteditable="true"><span class="">单细胞转录组数据分析CNV</span></h1>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">都来aviv Regev自于实验室，一系列文章都利用了单细胞转录组数据分析CNV。</span></span><span id="more-3065"></span></p>
<h3 class="md-end-block md-heading" contenteditable="true"><span class=""> 2014年关于GBM的science文章</span></h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">首先是2014年关于GBM的science文章；PMID: </span><span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/pubmed/24925914">24925914</a></span> ，提到了这个分析点，然后还用了CCLE数据库验证可靠性。</span></p>
<p><span class="md-line md-end-block" contenteditable="true">该文章自己的单细胞转录组数据建库选用了 SMART-seq 方法，公布在 <span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE57872">GSE57872</a></span></span></p>
<ul class="ul-list" data-mark="-">
<li><span class="md-line md-end-block" contenteditable="true">430(576) single glioblastoma cells isolated from 5 individual tumors</span></li>
<li><span class="md-line md-end-block" contenteditable="true">102(192) single cells from gliomasphere cells lines </span></li>
</ul>
<p><span class="md-line md-end-block" contenteditable="true">这个单细胞转录组建库方式有点落后了：</span></p>
<blockquote><p><span class="md-line md-end-block" contenteditable="true">SMART-seq protocol was implemented to generate single cell full length transcriptomes (modified from Shalek, et al Nature 2013) and sequenced using 25 bp paired end reads. Single cell cDNA libraries for <span class=""><strong>MGH30 were resequenced using 100 bp paired end reads</strong></span> to allow for isoform and splice junction reconstruction (96 samples, annotated MGH30L). </span></p></blockquote>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">所以作者过滤的比较严格，可以直接下载其分析好的表达矩阵，也可以下载原始测序数据自己走一波转录组流程。</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">第一次提出的公式如下：</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/RNA-SEQ-CNV-formula-1.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/RNA-SEQ-CNV-formula-1.png" alt="" /></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true"><span class=""> 2016年关于melanoma的science文章</span></h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">然后是2016年关于melanoma的science文章：PMID: </span><span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/pubmed/27124452">27124452</a></span> 也应用了单细胞转录组数据分析CNV，该文章的数据公布在 <span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE72056">GSE72056</a></span> 这次使用的Smart-seq2建库技术，共计 4645 个细胞，仅仅是表达矩阵就由71Mb，但是原始的测试数据在 dbGaP 数据库，需要申请才能下载。</span></p>
<figure class="md-table-fig" contenteditable="false">
<table class="md-table">
<thead>
<tr class="md-end-block">
<th><span class="td-span" contenteditable="true"><span class=""><strong>Supplementary file</strong></span></span></th>
<th><span class="td-span" contenteditable="true"><span class=""><strong>Size</strong></span></span></th>
<th><span class="td-span" contenteditable="true"><span class=""><strong>Download</strong></span></span></th>
<th><span class="td-span" contenteditable="true"><span class=""><strong>File type/resource</strong></span></span></th>
</tr>
</thead>
<tbody>
<tr class="md-end-block">
<td><span class="td-span" contenteditable="true">GSE72056_melanoma_single_cell_revised_v2.txt.gz</span></td>
<td><span class="td-span" contenteditable="true">71.6 Mb</span></td>
<td><span class="td-span" contenteditable="true"><span class=""><a spellcheck="false" href="ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE72nnn/GSE72056/suppl/GSE72056_melanoma_single_cell_revised_v2.txt.gz">(ftp)</a></span><span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/geo/download/?acc=GSE72056&amp;format=file&amp;file=GSE72056%5Fmelanoma%5Fsingle%5Fcell%5Frevised%5Fv2%2Etxt%2Egz">(http)</a></span></span></td>
<td><span class="td-span" contenteditable="true">TXT</span></td>
</tr>
</tbody>
</table>
</figure>
<blockquote><p><span class="md-line md-end-block" contenteditable="true"><span class="">we applied single-cell RNA sequencing (RNA-seq) to 4645 single cells isolated from 19 patients, profiling malignant, immune, stromal, and endothelial cells.</span></span></p></blockquote>
<p><span class="md-line md-end-block" contenteditable="true">值得注意的是作者还做了bulk的转录组测序，针对6个处理 RAF or RAF+MEK inhibitors 前后供12个数据，公布在 <span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE77940">GSE77940</a></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">这个时候的计算公式稍微有点变化了，如下：</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/rna-seq-cnv-formula2.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/rna-seq-cnv-formula2.png" alt="" /></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true"><span class=""> 2016年CELL杂志发表的关于头颈癌</span></h3>
<p><span class="md-line md-end-block" contenteditable="true">接着是2016年CELL杂志发表的关于头颈癌的文章：<span class=""><a spellcheck="false" href="https://www.sciencedirect.com/science/article/pii/S0092867417312709?via%3Dihub">Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer</a></span> 测序如下；</span></p>
<blockquote><p><span class="md-line md-end-block" contenteditable="true">We profiled <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/transcriptome">transcriptomes</a></span> of <span class=""><strong>∼6,000 single cells from 18 head and neck <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/squamous-epithelial-cell">squamous cell</a></span> carcinoma</strong></span> (HNSCC) patients, including five matched pairs of primary tumors and <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/lymph-node">lymph node metastases</a></span>.</span></p></blockquote>
<p><span class="md-line md-end-block" contenteditable="true">同时也对这些病人测了whole-exome sequencing (WES) and targeted <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/biochemistry-genetics-and-molecular-biology/genotyping">genotyping</a></span> (SNaPshot) data，但是这些数据公布在 <span spellcheck="false"><code>phs001474.v1.p1</code></span> ，不是很方便下载。</span></p>
<p><span class="md-line md-end-block" contenteditable="true">单细胞转录组建库用的<span spellcheck="false"><code>Smart-seq2</code></span>方法，所有的数据公布在 <span class=""><a spellcheck="false" href="http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE103322">GSE103322</a></span> ， 仅仅是表达矩阵都有近100Mb了。</span></p>
<div class="CodeMirror cm-s-inner CodeMirror-wrap">
<div></div>
<div class="CodeMirror-scroll" tabindex="-1">
<div class="CodeMirror-sizer">
<div>
<div class="CodeMirror-lines">
<div>
<div class="CodeMirror-measure"></div>
<div class="CodeMirror-measure"></div>
<div></div>
<div class="CodeMirror-cursors"></div>
<div class="CodeMirror-code">
<div class="CodeMirror-activeline">
<div class="CodeMirror-activeline-background CodeMirror-linebackground"></div>
<div class="CodeMirror-gutter-background CodeMirror-activeline-gutter"></div>
<pre class=" CodeMirror-line ">GSE103322_HNSCC_all_data.txt.gz | 86.0 Mb |</pre>
</div>
</div>
</div>
</div>
</div>
</div>
<div></div>
</div>
</div>
<p><span class="md-line md-end-block" contenteditable="true">下载地址是： <span class=""><a spellcheck="false" href="ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE103nnn/GSE103322/suppl/GSE103322%5FHNSCC%5Fall%5Fdata%2Etxt%2Egz">(ftp)</a></span><span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/geo/download/?acc=GSE103322&amp;format=file&amp;file=GSE103322%5FHNSCC%5Fall%5Fdata%2Etxt%2Egz">(http)</a></span> </span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/rna-seq-cnv-formula-3.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/rna-seq-cnv-formula-3.png" alt="" /></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">用CCLE数据做验证</h3>
<p><span class="md-line md-end-block" contenteditable="true">2014年关于GBM的science文章；PMID: <span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/pubmed/24925914">24925914</a></span><span class=""> ，文章提到：</span></span></p>
<blockquote><p><span class="md-line md-end-block" contenteditable="true">We downloaded the CCLE gene-centric RMA-normalized Affymetrix data (<span spellcheck="false"><a href="http://www.broadinstitute.org/ccle/">http://www.broadinstitute.org/ccle/</a></span>), and centered the expression of each gene across all cell lines at zero.</span></p></blockquote>
<p><span class="md-line md-end-block" contenteditable="true">需要简单注册后才能下载：<span spellcheck="false"><a href="https://portals.broadinstitute.org/ccle/users/sign_in">https://portals.broadinstitute.org/ccle/users/sign_in</a></span> </span></p>
<p><span class="md-line md-end-block" contenteditable="true">理论上要得到下面的图：</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/highly-correlated-CNV-by-SNP6array-and-RNA-seq.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/highly-correlated-CNV-by-SNP6array-and-RNA-seq.png" alt="" /></span>](<span spellcheck="false"><a href="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/highly-correlated-CNV-by-SNP6array-and-RNA-seq.png">http://www.bio-info-trainee.com/wp-content/uploads/2018/02/highly-correlated-CNV-by-SNP6array-and-RNA-seq.png</a></span><span class="">)</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-expand">说明使用转录组数据分析到的CNV情况和SNP6.0芯片的结果差异不大。</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">还有GTEx数据库的验证</h3>
<p><span class="md-line md-end-block" contenteditable="true">To compare these patterns to an external reference of normal cells we downloaded RNA-Seq data from the GTEX portal (<span spellcheck="false"><a href="http://www.gtexportal.org/">http://www.gtexportal.org/</a></span><span class="">; gene read counts file from Jan. 2013), and estimated CNV values as above: we normalized the read counts into log2(TPM+1), averaged all brain samples, restricted the data to the ~6,000 analyzed genes, subtracted for each gene the average normalized expression from the GBM single-cell data (this step is comparable to the centering of the single cell data) and then used a moving average of 100 genes over the genomically-ordered list of genes to define CNV-cont.</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">总结</h3>
<p><span class="md-line md-end-block" contenteditable="true">上述文章及数据都是有表达矩阵可以下载，所以仅仅是根据这些文章的补充材料公布的公式即可重复整个流程啦。</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/3065.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>单细胞转录组看头颈癌的原位癌和复发癌区别</title>
		<link>http://www.bio-info-trainee.com/3044.html</link>
		<comments>http://www.bio-info-trainee.com/3044.html#comments</comments>
		<pubDate>Sat, 17 Feb 2018 02:20:06 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[cancer]]></category>
		<category><![CDATA[单细胞]]></category>
		<category><![CDATA[原位癌]]></category>
		<category><![CDATA[复发]]></category>
		<category><![CDATA[头颈癌]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=3044</guid>
		<description><![CDATA[单细胞转录组探索头颈癌症的转移癌和原位癌区别 文章发表于2017年12月，在CE &#8230; <a href="http://www.bio-info-trainee.com/3044.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>单细胞转录组探索头颈癌症的转移癌和原位癌区别</p>
<p><span class="md-line md-end-block" contenteditable="true">文章发表于2017年12月，在CELL杂志：<span class=""><a spellcheck="false" href="https://www.sciencedirect.com/science/article/pii/S0092867417312709?via%3Dihub">Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer</a></span> 测序如下；</span></p>
<blockquote><p><span class="md-line md-end-block" contenteditable="true">We profiled <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/transcriptome">transcriptomes</a></span> of <span class=""><strong>∼6,000 single cells from 18 head and neck <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/squamous-epithelial-cell">squamous cell</a></span> carcinoma</strong></span> (HNSCC) patients, including five matched pairs of primary tumors and <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/lymph-node">lymph node metastases</a></span>.</span></p></blockquote>
<p><span class="md-line md-end-block" contenteditable="true">同时也对这些病人测了whole-exome sequencing (WES) and targeted <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/biochemistry-genetics-and-molecular-biology/genotyping">genotyping</a></span> (SNaPshot) data，但是这些数据公布在 <span spellcheck="false"><code>phs001474.v1.p1</code></span> ，不是很方便下载。</span><span id="more-3044"></span></p>
<p><span class="md-line md-end-block" contenteditable="true">单细胞转录组建库用的<span spellcheck="false"><code>Smart-seq2</code></span>方法，所有的数据公布在 <span class=""><a spellcheck="false" href="http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE103322">GSE103322</a></span> ， 仅仅是表达矩阵都有近100Mb了。</span></p>
<div class="CodeMirror cm-s-inner CodeMirror-wrap">
<div></div>
<div class="CodeMirror-scroll" tabindex="-1">
<div class="CodeMirror-sizer">
<div>
<div class="CodeMirror-lines">
<div>
<div class="CodeMirror-measure"></div>
<div class="CodeMirror-measure"></div>
<div></div>
<div class="CodeMirror-cursors"></div>
<div class="CodeMirror-code">
<div class="CodeMirror-activeline">
<div class="CodeMirror-activeline-background CodeMirror-linebackground"></div>
<div class="CodeMirror-gutter-background CodeMirror-activeline-gutter"></div>
<pre class=" CodeMirror-line ">GSE103322_HNSCC_all_data.txt.gz | 86.0 Mb |</pre>
</div>
</div>
</div>
</div>
</div>
</div>
<div></div>
</div>
</div>
<p><span class="md-line md-end-block" contenteditable="true">下载地址是： <span class=""><a spellcheck="false" href="ftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE103nnn/GSE103322/suppl/GSE103322%5FHNSCC%5Fall%5Fdata%2Etxt%2Egz">(ftp)</a></span><span class=""><a spellcheck="false" href="https://www.ncbi.nlm.nih.gov/geo/download/?acc=GSE103322&amp;format=file&amp;file=GSE103322%5FHNSCC%5Fall%5Fdata%2Etxt%2Egz">(http)</a></span> </span></p>
<p><span class="md-line md-end-block" contenteditable="true">实验验证用的是细胞系 Oral cavity HNSCC <span class=""><a spellcheck="false" href="https://www.sciencedirect.com/topics/neuroscience/cell-lines">cell lines</a></span> (Cal-27, SCC9, SCC4, SCC25, and JHU-006; all derived from male patients) 做了RNA-seq 数据。</span></p>
<p><span class="md-line md-end-block" contenteditable="true">肿瘤内异质性是肿瘤学的主要挑战。在新兴的技术中，scRNA-seq有助于确定与肿瘤生物学，诊断和治疗有关的发育等级，抗药性程序和免疫渗透模式。在这里，研究者应用这种方法来表征原发性HNSCC肿瘤和匹配的LN转移瘤。</span></p>
<h3 class="md-end-block md-heading" contenteditable="true">名词介绍</h3>
<ul class="ul-list" data-mark="-">
<li><span class="md-line md-end-block" contenteditable="true">头颈部鳞状细胞癌（HNSCC）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">淋巴结转移（LN）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">癌症基因组图谱（TCGA）</span></li>
<li class=""><span class="md-line md-end-block" contenteditable="true"><span class="">癌相关成纤维细胞（CAF）</span></span></li>
<li><span class="md-line md-end-block" contenteditable="true">细胞外基质（ECM）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">循环肿瘤细胞（CTC）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">单细胞RNA测序（scRNAseq）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">肿瘤微环境（TME）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">全外显子组测序（WES）和靶向基因分型（SNaPshot）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">Epithelial-to-mesenchymal transition (EMT)</span></li>
<li><span class="md-line md-end-block" contenteditable="true">PNI = perineural invasion; LVI = lymphovascular invasion; ECE = extracapsular extension</span></li>
</ul>
<p><span class="md-line md-end-block" contenteditable="true"> ​</span></p>
<h3 class="md-end-block md-heading" contenteditable="true">背景介绍</h3>
<p><span class="md-line md-end-block" contenteditable="true">HNSCC 头颈癌是最常见的十大癌症之一，每年有50万患者深受其害，其中，超过80%的患者为口腔鳞状细胞癌(OSCC)。尽管目前有手术、化疗、放疗等治疗手段，但5年存活率仅有50%，仍是存活率最低的癌症之一，且近30年没有改善。所以探寻新的治疗方式抑制OSCC生长和转移尤为重要。</span></p>
<p><span class="md-line md-end-block" contenteditable="true">头颈部鳞状细胞癌（HNSCC）是一种与酒精和烟草暴露密切相关的具有异质性的上皮肿瘤，患者往往在晚期出现的淋巴结转移（LN)。</span></p>
<h3 class="md-end-block md-heading" contenteditable="true">单细胞转录组数据分析CNV跟WES的对比</h3>
<p><span class="md-line md-end-block" contenteditable="true">首先把所有病人的近6000个细胞根据表达模式区分成恶性与否，分成两组进行CNV聚类，可以看到恶性细胞的CNV模式跟从WES数据分析得到的CNV模式比较类似，说明了单细胞转录组数据分析CNV是靠谱的。当然，本身该课题组前面的几篇文章就提到了这个方法以及证实了其可靠性。</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/all-patients-CNV-scRNAseq-vs-WES.png"><img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/all-patients-CNV-scRNAseq-vs-WES.png" alt="" /></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">病人MEEI5的CNV情况</h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">MEEI5 是一个69岁的女性，对来源于她的所有单细胞的转录组数据分析得到的CNV信息进行聚类可以看到比较清晰的patter，其中恶性与否比较容易区分，而且对于恶性细胞也可以看出原位癌和转移癌的区别。</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/patient-MEEI5-scRNAseq-CNV-heatmap.png"><img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/patient-MEEI5-scRNAseq-CNV-heatmap.png" alt="" /></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true"><span class="">首先区分肿瘤细胞的恶性与否</span></h3>
<p><span class="md-line md-end-block" contenteditable="true">用<span spellcheck="false"><code>Smart-seq2</code></span>建库方法得到的单细胞转录组数据经过QC后，留下了来自18名患者的5,902个细胞，首先可以分成2215个恶性细胞和3363个非恶性细胞。</span></p>
<p><span class="md-line md-end-block" contenteditable="true">首先，研究者根据跨染色体间隔的平均表达谱推断每个单细胞中的大规模染色体拷贝数变异（CNV）。这些推断的CNVs与WES一致，<span class=""><strong>通过推断的CNVs将恶性细胞从正常核型的非恶性细胞中分离出来。</strong></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">其次，研究者通过其上皮来源鉴别恶性细胞，其不同于TME中的基质和免疫细胞。<span class=""><strong>研究者发现在具有上皮标志物表达的细胞和具有异常核型的细胞之间具有显著的一致性。</strong></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">最后，研究者通过它们的全局表达模式将细胞划分到初始类。<span class=""><strong>基于CNV和上皮标志物分析，绝大多数细胞均被分到具有一致恶性或非恶性分类类中去。</strong></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">非恶性细胞的聚类没有个体差异</h3>
<p><span class="md-line md-end-block" contenteditable="true">如果只对已经被区分出来的<span spellcheck="false"><code>三千多个非恶性肿瘤细胞</code></span><span class="">进行聚类，采取SC3算法，效果如下图，虽然有14个类别，但是根据已知标记基因的表达，可以注释为</span><span class=""><strong>B细胞，巨噬细胞，树突状细胞，肥大细胞，内皮细胞，成纤维细胞和肌细胞</strong></span><span class="">这八个值得探究的类别。</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE-plot-of-non-malignant-cells.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE-plot-of-non-malignant-cells.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">值得注意的是，每个类含有来自不同患者的细胞，表明TME(这三千多个非恶性细胞就是肿瘤微环境)中的细胞类型和表达状态在HNSCC肿瘤中基本一致，并且没有患者特异性亚群或批处理效应，尽管它们的比例是不同的。</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/non-malignant-cells-characterastics.png"><img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/non-malignant-cells-characterastics.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">而且由于研究者的数据集中T细胞和成纤维细胞，即数量相对较多，研究者通过更精确的聚类发现了T细胞和成纤维细胞的多样性。如下：</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">单独查看成纤维细胞CAF</h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>rv1,500成纤维细胞</strong></span><span class="">分成两个大类，一个小类别。</span></span></p>
<ul class="ul-list" data-mark="-">
<li><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>第一个类表达肌成纤维细胞的经典标记，</strong></span>包括α平滑肌肌动蛋白（ACTA2）和肌球蛋白轻链蛋白（MYLK，MYL9）。肌成纤维细胞是TME的成熟组分，并与伤口愈合和挛缩有关。</span></li>
<li class=""><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>第二个类表达与癌相关成纤维细胞（CAF）相关的受体</strong></span>，配体和细胞外基质（ECM）基因，包括成纤维细胞活化蛋白（FAP），podoplanin（PDPN）和结缔组织生长因子（CTGF）。</span></li>
<li class=""><span class="md-line md-end-block" contenteditable="true">第三个类基本不包括肌成纤维细胞和CAF的标记物，<span class=""><strong>并可能代表处于静止状态的成纤维细胞。</strong></span> </span></li>
</ul>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">其中还可以把CAFs（第二个类）分为具有立即早期应答基因（例如JUN，FOS），间充质标志物（例如VIM，THY1），配体和受体（例如FGF7）差异表达的两种类型（CAF1和CAF2），TGFBR2 / 3）和ECM蛋白质（例如MMP11，CAV1）。这种瘤内CAF异质性与TMF中涉及复杂结构和旁分泌相互作用的观点一致。</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">单独查看T细胞</h3>
<p><span class="md-line md-end-block" contenteditable="true">主要T细胞类（rv1000个T细胞）可以分为四个亚群，研究者注释为</span></p>
<ul class="ul-list" data-mark="-">
<li><span class="md-line md-end-block" contenteditable="true">调节性T细胞（Treg）</span></li>
<li><span class="md-line md-end-block" contenteditable="true">常规CD4 + T辅助细胞（CD4 + Tconv）</span></li>
<li><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>两种</strong></span>细胞毒性CD8 + T细胞群（CD8 + T和CD8 + Texhausted）</span></li>
</ul>
<p><span class="md-line md-end-block" contenteditable="true">细胞毒素亚型在共抑制性受体（例如PD1，CTLA4）和与T细胞功能障碍和衰竭相关的其他基因的表达方面不同，<span class=""><strong>并由此定义HNSCC中推定的T细胞耗竭程序。</strong></span><span class="">耗竭CD8 + T细胞的部分在研究者的队列患者中显著变化。这些T细胞表达状态可以为理解和预测检查点免疫疗法的反应提供帮助。</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">恶性细胞聚类完全取决于患者个体</h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>与非恶性细胞相比，2215恶性细胞根据其起源的肿瘤聚类。</strong></span>超过2000个基因优先在个体肿瘤中表达。</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/malignant-cells-tumor-specific-clusters.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/malignant-cells-tumor-specific-clusters.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">差异表达的基因在肿瘤之间不同的CNV内富集。 </span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">最后，其他差异表达的基因与应激（例如JUNB，FOSL1）或免疫激活（例如IDO1，STAT1，TNF）有关，可能对不同的TME有反应。<span class=""><strong>因此，肿瘤间恶性细胞表达异质性反映了研究者队列中肿瘤之间遗传学，亚型和TME的差异。</strong></span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">恶性细胞的基因特征</h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">这里重点分析那些含有恶性细胞转录组最多数量的10对肿瘤样本。比如下面的病人MEEI25，一个76岁的女性：</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/patient-MEEI25-NNMF-cluster-6-signatures.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/patient-MEEI25-NNMF-cluster-6-signatures.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">研究者使用非负矩阵分解来<span class=""><strong>揭示在恶性细胞亚群中得到优先共同表达的一系列基因。</strong></span>例如，对于MEEI25恶性细胞，研究者定义了6个不同的基因特征。对10个肿瘤样本中的每一个都应用该方法，共定义了60个基因特征。</span></p>
<p><span class="md-line md-end-block" contenteditable="true">接下来，<span class=""><strong>研究者使用层次聚类来将这些60个特征提取成元特征，</strong></span><span class="">这些元特征反映了在多个肿瘤内变化的常见表达程序。来自不同肿瘤的特征之间的高一致性表明它们反映了肿瘤内表达异质性的共同模式。</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">LN转移与原发性肿瘤比较</h3>
<p><span class="md-line md-end-block" contenteditable="true">研究者将LN转移与原发性肿瘤（只有5个病人是取了配对样本）进行了比较。尽管WES和推测的CNV显示了原发性和匹配的LN样本之间的存在一些基因组差异，<span class=""><strong>但是可能是由于所研究的个体数量较少，他们没有鉴别出任何一致的区别。</strong></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/DEG-matched-primary-VS-lymphonodes.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/DEG-matched-primary-VS-lymphonodes.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">LN中恶性细胞的表达谱也与相应的原发肿瘤大致匹配。在每个配对样本中，都有较少的差异表达基因是显著差异的，但是它们在整个群体（cohort）中不一致。</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE-over-tumor-sites.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE-over-tumor-sites.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">研究者还观察到淋巴结和原发性肿瘤间质和免疫细胞的特征和表现的总体一致性，虽然有一些重要的区别！</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">EMT的部分状态或p-EMT</h3>
<p><span class="md-line md-end-block" contenteditable="true">EMT程序被广泛认为是耐药，侵袭和转移的潜在驱动因素，是一个连续和变化的过程。因此，研究者仔细检查了ECM计划中EMT的特征。除ECM基因如基质金属蛋白酶，层粘连蛋白和整联蛋白外，该程序还包括EMT标志物波形蛋白（VIM）和整联蛋白α-5（ITGA5）。此外，该方案中得分最高的基因之一是转化生长因子（TGF）-b诱导（TGFBI），<span class=""><strong>暗示经典的EMT调节剂TGF-b。</strong></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>虽然该程序具有经典EMT的关键特征，但缺乏其他标志。</strong></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">首先，虽然特征伴随着某些上皮基因的表达降低，但是上皮标记物的总体表达还是明显地保持下来。</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/pEMT-vs-canonical-EMT.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/pEMT-vs-canonical-EMT.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">其次，研究者没有检测到经典EMT TF，ZEB1 / 2，TWIST1 / 2和SNAIL1的表达。只有SNAIL2被检测到（在70％的HNSCC细胞中），尽管其表达与肿瘤的程序相关，但与肿瘤内个体细胞的程序并不相关。最近的研究表明SNAIL2比其他EMT TFs早。 SNAIL2也涉及伤口愈合中的p-EMT应答。</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/percentage-of-cycling-malignant-cells-VS-pEMT-scores.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/percentage-of-cycling-malignant-cells-VS-pEMT-scores.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">因此，研究者建议这里确定的体内程序反映了一个EMT的部分状态或p-EMT。</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/meta-signatures-VS-pEMT-in-all-patients.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/meta-signatures-VS-pEMT-in-all-patients.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">上图的分析结果表明，这个p-EMT程序不同于源自细胞系和肿瘤模型的完整EMT程序，以及源自肿瘤的肿瘤谱间充质特征。</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">把TCGA的分类应用于scRNA-seq数据</h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-expand">TCGA研究分析了数百个HNSCC肿瘤的表达谱，并将它们分为四个亚型：基础型，间充质型，经典型和非典型型。在TCGA的cohort里面各个类别的样本比例是：atypical(24％),mesenchymal(27％),basal(31％)和classical(18％)。尽管TCGA分型是从大量肿瘤细胞中获取的，但研究者推断单个细胞组分的表达程序可能使研究者能够提取更多的了解。具体而言，研究者从这些批量数据中定义的分子亚型判断是否能够反映恶性程序，恶性细胞组成和/或TME组成的差异。</span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>研究者首先确定了自己研究计划的十个HNSCC肿瘤病人的TCGA表达亚型。</strong></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE-over-TCGA-subtype.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE-over-TCGA-subtype.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true">研究者评估了每个肿瘤的恶性细胞与其亚型表达特征的对应关系。<span class=""><strong>引人注目的是，每个肿瘤清楚地映射到三个亚型之一：基本型（n = 7），经典型（n = 2），或非典型（n = 1）。</strong></span>没有一个恶性细胞映射到间充质亚型，即使它是口腔肿瘤中第二常见的亚型。然而，当研究者增大分析样本数目，当样本中包括基质和免疫细胞时，发现数百CAFs、肌成纤维细胞和肌细胞映射到间充质亚型。</span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class="md-image md-img-loaded" contenteditable="false" data-src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/TCGA-subtype-over-non-malignant-cells-6-meta-signatures.png"><img style="box-sizing: border-box; border-width: 0px 4px 0px 2px; border-right-style: solid; border-left-style: solid; border-right-color: transparent; border-left-color: transparent; vertical-align: middle; max-width: 100%; cursor: default;" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/TCGA-subtype-over-non-malignant-cells-6-meta-signatures.png" alt="" /></span></span></p>
<p><span class="md-line md-end-block" contenteditable="true"><span class=""><strong>这一发现提出了一种可能性，即TCGA间充质亚型反映大批量样品中的高基质表现而不是独特的恶性细胞程序。</strong></span><span class="">实际上，TCGA样品的分析鉴定到间充质亚型肿瘤高度表达对CAF和肌细胞特异性的基因。此外，当研究者检查TCGA的HNSCC肿瘤的组织学切片时，鉴定到间充质肿瘤的成纤维细胞比基础型肿瘤多大约2.7倍（t检验，p &lt;0.0001）。</span></span></p>
<h3 class="md-end-block md-heading" contenteditable="true">外显子数据分析somatic突变</h3>
<p><span class="md-line md-end-block" contenteditable="true"><span class="">因为外显子测序数据是无法下载的，这里就不过多介绍了。</span></span></p>
<p><a href="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/wes-mutation-primary-vs-metastatics.png"><img class="alignnone size-full wp-image-3059" src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/wes-mutation-primary-vs-metastatics.png" alt="wes-mutation-primary-vs-metastatics" width="954" height="692" /></a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/3044.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>时间序列单细胞转录组数据分析</title>
		<link>http://www.bio-info-trainee.com/3022.html</link>
		<comments>http://www.bio-info-trainee.com/3022.html#comments</comments>
		<pubDate>Sun, 11 Feb 2018 01:16:42 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[cancer]]></category>
		<category><![CDATA[iPSC]]></category>
		<category><![CDATA[单细胞]]></category>
		<category><![CDATA[发育]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=3022</guid>
		<description><![CDATA[时间序列单细胞转录组数据分析 文章是: Reconstruction of de &#8230; <a href="http://www.bio-info-trainee.com/3022.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div class="markdown-here-wrapper" data-md-url="http://www.bio-info-trainee.com/wp-admin/post.php?post=3022&amp;action=edit">
<h1 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.6em; border-bottom: 1px solid #dddddd;">时间序列单细胞转录组数据分析</h1>
<p style="margin: 0px 0px 1.2em !important;">文章是: <a href="https://www.biorxiv.org/content/early/2017/09/27/191056">Reconstruction of developmental landscapes by optimal-transport analysis of single-cell gene expression sheds light on cellular reprogramming.</a> 虽然于2017年9月公布在了bioRxiv上面，但是至今仍然没正式发表，包含了六万多个单细胞转录组数据，持续追踪了MEF细胞系诱导为IPSC细胞的动态变化过程，并且从发育的角度分析了这些数据</p>
<p style="margin: 0px 0px 1.2em !important;"><span id="more-3022"></span></p>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">We demonstrate the power of WADDINGTON-OT by applying the approach to study 65,781 scRNA-seq profiles collected at 10 time points over 16 days during reprogramming of fibroblasts to iPSCs.</p>
<h2 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">背景介绍</h2>
<h3 id="waddington-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">Waddington<em>提出的</em>发育景观</h3>
<p style="margin: 0px 0px 1.2em !important;">上世纪50年代，胚胎发育生物学家Conrad Hal Waddington提出的发育景观假说认为，分化成熟的细胞变回多能干细胞是个不可能发生的事件。但是日本京都大学教授山中伸弥于2006年却发现并验证，这种细胞可以发育成为身体各种组织细胞。iPS细胞的发现成就了目前轰轰烈烈的干细胞研究领域，山中伸弥教授也因此获得2012年诺贝尔生理或医学奖。<br />
iPS (诱导多潜能干细胞)重编程实验的涌现使人们重新重视了上个世纪50年代胚胎发育生物学家Waddington提出的发育景观。虽然它只是一个隐喻,但其形象地描述了细胞的自发的层次分叉过程并隐含了细胞类型之间转换的可能性,从而作为一个整体框架最近被广泛应用来解释细胞发育和重编程。<br />
详见：<a href="https://zhuanlan.zhihu.com/p/25333058">https://zhuanlan.zhihu.com/p/25333058</a><br />
Waddington 在两个时期提出的假说：</p>
<ul style="margin: 1.2em 0px; padding-left: 2em;">
<li style="margin: 0.5em 0px;">initially (1936) illustrated by railroad cars on switching tracks (A)</li>
<li style="margin: 0.5em 0px;">later (1957) by marbles rolling in a landscape (B), with trajectories shaped by hills and valleys.<br />
如图：<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/Waddington-classic-analogies.png" alt="发育景观" /></p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">最优传输理论</h3>
<p>最优传输理论(Optimal Transport)，也叫Monge-Kantorovich Problem。最早由法国数学家Monge提出，二战期间由俄国数学家Kantorovich推广后开始迅速发展，Kantorovich也因他在这个领域做出的贡献得了1975年的经济学诺贝尔奖。<br />
Monge最早提出的问题可以理解为，<strong>有一堆土在地点A，现在我们要将这堆土转移到地点B，但是我们运土是要费体力的，怎么搬这些土可以让我们的体力消耗降到最小。</strong>现在我们量化这个问题，将土在地点A的分布称之为”Initial Distribution”，在地点B的分布称为”Final Distribution”，我们称消费的体力为Cost，通过一个”Cost Function”计算得出，每种搬运方案为一个”Mapping”。我们现在要在所有Mapping中寻找Cost最低的那一个，这就是最优传输理论要解决的问题。<br />
可能看完这些，有的小伙伴还是不太懂搬运方案和Mapping是怎样一回事。这里解释一下，比如在地点A和地点B的时候，土堆的形状都要形成一个标准正态分布 N(0, 1)，我们”将A土堆中间的土先搬过去形成B土堆的尾巴”和”将A土堆的土直接放到B土堆对应位置”所消耗的体力大部分情况下是不一样的，这就是两种不同的方案对应着不同的Cost。<br />
这几年，由最优传输理论衍生出来的”Martingale Optimal Transport”在金融数学有不少应用，有不少人在研究。简单的说就是给这些”Mapping”加了个限制，要求他们必须是”Martingale”。<br />
如图:<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/transport-map.png" alt="最优传输" /></p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">发育生物学家感兴趣的基本问题</h3>
<p>如下：</li>
<li style="margin: 0.5em 0px;">What classes of cells are present at each stage?</li>
<li style="margin: 0.5em 0px;">For the cells in each class, what was their origin at earlier stages, what are their potential fates at later stages, and what is the actual outcome of a given cell?</li>
<li style="margin: 0.5em 0px;">To what extent are events along a path synchronous or asynchronous?</li>
<li style="margin: 0.5em 0px;">What are the genetic regulatory programs that control each path?</li>
<li style="margin: 0.5em 0px;">What are the intercellular interactions between classes of cells?</li>
<li style="margin: 0.5em 0px;">How deterministic or stochastic is the process—that is: if, and how early, does it become determined that a particular cell or an entire cell class is destined to a specific fate?</li>
<li style="margin: 0.5em 0px;">For a given origin and target fate, is there only a single path to the target, or are there multiple developmental paths?</li>
<li style="margin: 0.5em 0px;">To what extent is the process cell-intrinsic, driven by intracellular mechanisms that do not require ongoing external inputs, or externally regulated, being affected by other contemporaneous cells?</li>
<li style="margin: 0.5em 0px;">For artificial processes such as induced reprogramming, there are additional questions: What off-target cell classes arise?</li>
<li style="margin: 0.5em 0px;">To what extent do cells activate normal developmental programs vs. unnatural hybrid programs?</li>
<li style="margin: 0.5em 0px;">How can the efficiency of reprogramming be improved?<br />
示意图如下；<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/Differentiation-processes-3-models.png" alt="细胞发育的未解之谜" /><br />
然后列举一些前人在探索这些问题方面的研究成果，指出他们做的还不够。<strong>而单细胞转录组测序技术非常强大，适合解决这个问题。</strong><br />
单细胞转录组在探索发育轨迹这方面也有过一些应用了，主要的算法集中于3个：</li>
<li style="margin: 0.5em 0px;">k-nearest neighbor graphs</li>
<li style="margin: 0.5em 0px;">binary trees</li>
<li style="margin: 0.5em 0px;">diffusion maps<br />
他们的缺陷很明显，有3个：</li>
<li style="margin: 0.5em 0px;">首先为那些稳定的生物学过程设计的，比如cell cycle or adult stem cell differentiation</li>
<li style="margin: 0.5em 0px;">其次，单细胞本身也是多种生物学状态的叠加，比如cell proliferation and death就会影响那些算法的表现。</li>
<li style="margin: 0.5em 0px;">最后，大部分模型的假设限制很大，比如one-dimensional trajectories and zero-dimensional branch points<br />
所以作者把Optimal Transport (OT)的算法，应用到了时间序列的单细胞转录组数据来探索发育的过程。当然，表现很好的啦，揭示了重编程的分子机理。<br />
几大发现如下：</li>
<li style="margin: 0.5em 0px;">(1) identifying alternative cell fates, including senescence, apoptosis, neural identity, and placental identity;</li>
<li style="margin: 0.5em 0px;">(2) quantifying the portion of cells in each state at each time point;</li>
<li style="margin: 0.5em 0px;">(3) inferring the probable origin(s) and fate(s) of each cell and cell class at each time point;</li>
<li style="margin: 0.5em 0px;">(4) identifying early molecular markers associated with eventual fates;</li>
<li style="margin: 0.5em 0px;">(5) using trajectory information to identify transcription factors (TFs) associated with the activation of different expression programs.<br />
<h2 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.4em; border-bottom: 1px solid #eeeeee;">单细胞转录组数据处理</h2>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">首先得到表达矩阵</h3>
<p>因为是 10X Genomics数据，所以直接用官方工具CELLRANGER 即可，过滤后得到<br />
65,781 cells and G = 16, 339 genes 的表达矩阵</p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">然后降维</h3>
<p>先过滤掉那些在所有细胞表达没什么变化的基因，这一步利用的是R包SEURAT的MeanVarPlot函数，剩下2076个基因。<br />
然后使用 diffusion component embedding进行降维处理，，这一步利用的是R包 DESTINY。<br />
分析了top100 diffusion components的，发现只有top20是显著的富集到 developmental processes ，所以作者只选取了top 20 diffusion components</p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">可视化</h3>
<p>现在剩下了20*65781的矩阵，首先使用R语言的FNN包里面的 fast k-NN algorithm ，然后利用ForceAtlas2算法计算 force-directed layout on the k-NN graph</p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">单细胞聚类</h3>
<p>同样的剩下了20*65781的矩阵，使用了 Louvain-Jaccard community detection 算法，默认参数分成33类<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/t-SNE.png" alt="" /></p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">最优传输算法</h3>
<p>主要就是考量 proliferation score和growth rate，</p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">基因调控网络</h3>
<p>自己写Python脚本做的分析，公式有点多而且有点复杂，但是里面提到了Shannon diversity of the transport maps<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/similar-gene-expression-modules.png" alt="" /></p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">基因表达模块</h3>
<p>使用了Graphical Lasso算法，来自于R包glasso，还用了R包IGRAPH的Infomap community detection 算法看基因模块的网络结构。<br />
使用HOMER软件的findGO.pl测序对基因模块注释到biological signatures<br />
每个基因集合的特征分数算法就是它里面的所有基因的z-score的平均值。</p>
<h3 id="-3-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">与3个已有算法比较</h3>
<p>见文末</p>
<h3 id="-ipsc" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">实验环节：iPSC</h3>
<p>We obtained mouse embryonic fibroblasts (MEFs) from a single female embryo homozygous for ROSA26-M2rtTA, which constitutively expresses a reverse transactivator controlled by doxycycline (Dox)<br />
多西环素(<em>Doxycycline</em>)，具有抗炎作用,也称作是强力毒素(<em>doxycycline</em>)，a Dox-inducible polycistronic cassette carrying Pou5f1 (Oct4), Klf4, Sox2, and Myc (OKSM), and an EGFP reporter incorporated into the endogenous Oct4 locus (Oct4-IRES-EGFP).<br />
We plated MEFs in serum-containing induction medium, with Dox added on day 0 to induce the OKSM cassette (Phase-1(Dox)).<br />
第八天之后把添加的dox取出来，然后把细胞转移到serum-free N2B27 2i medium (Phase-2(2i)) 和serum (Phase-2(serum)).这两种培养条件下，直到细胞系表达出内源性的Oct4，认为是重编程成功。<br />
如图：<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/Experimental-design-of-scRNA-Seq-time-course-during-iPSC-reprogramming.png" alt="实验环节" /><br />
在各个时间段均测量了好几千个细胞的表达谱，总共65781个细胞。</p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">发育景观</h3>
<p>作者花了5大段在描述下面的图：<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/differentiation-in-2-conditions.png" alt="6万多细胞的发育全景图" /><br />
可以看到细胞发育始于第0天，很容易理解，而且绝大部分的0天细胞都能被聚成一个类，表现为强烈MEF identity的signature信号。但是第二天的Dox处理后，细胞被诱导高表达OKSM cassette，而且开始转变为3个不同的clusters，但总体来说这3类都表现很强的增殖信号。<br />
第4天后细胞很明显朝着两个不同的方向变化，这里定义为：Valley of Stress and the Horn of Transformation。<br />
Following Dox withdrawal and media replacement on day 8, <strong>the cells in the Horn</strong> adopt one of four alternative outcomes by day 12 (senescence, neuronal program, placental program, and pre-iPSCs).</p>
<h3 id="cluster-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">cluster之间的转移</h3>
<p>作者提到了we partitioned the 16,339 detected genes into 44 gene modules and the 65,781 cells into 33 cell clusters，那这33个cluster分属于不同的发育时间，它们之间的发育转移关系如下图：<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/cluster-to-cluster-transitions.png" alt="" /><br />
尽管属于同一个发育时间节点，但是仍然是有发育快慢等多样性，同一发育时间点的不同特性的cluster细胞接下来的命运也差异很大：<br />
By day 4, cells display a bimodal distribution of properties that is strongly correlated with their eventual descendants:</li>
<li style="margin: 0.5em 0px;">cells in cluster 8 (low proliferation, high MEF identity) have 95% of their descendants in the Valley,</li>
<li style="margin: 0.5em 0px;">while cells in cluster 18 (high proliferation, low MEF identity) have 94% of their descendants in the Horn</li>
<li style="margin: 0.5em 0px;">Cells in cluster 7 show intermediate properties and have roughly equal probabilities of each fate<br />
同时挑选了一系列已有的signature来检查它们在发育景观的表现：<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/differentiation-in-12-signatures.png" alt="12个signatures的动态变化" /><br />
当然，也检查了一下marker基因的表达变化情况，就不截图了。</p>
<h3 id="-5-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">重点关注5类细胞</h3>
<p>不同发育时期的细胞可以分成33类，写起了太麻烦，作者挑选了值得讲故事的5类细胞：</li>
<li style="margin: 0.5em 0px;">placental-like cells (clusters 24 and 25) at day 12</li>
<li style="margin: 0.5em 0px;">neural-like cells (clusters 26 and 27) at day 16.<br />
还有iPSCs,Senescent cells, Apoptotic cells.<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/Ancestral-trajectories.png" alt="5类细胞" /><br />
主要也就是提一下他们的特征，高表达哪些基因，它们的来源和去向问题。</p>
<h3 id="3-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">3个其它软件的效果</h3>
<p><img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/Wishbone-trajectories.png" alt="wishbone" /><br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/Monocle2-trajectories.png" alt="Molocle2" /><br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/diffusion-pseudo-time.png" alt="DPT" /><br />
这些软件之所以不适用于作者的这个实验设计出来的数据，因为没有考虑到发育时期这个已知的变量。<br />
虽然在作者写作的时候也已经出来了一款新的软件，但测试了，效果也不如作者自己开发的算法。</p>
<h3 id="-" style="margin: 1.3em 0px 1em; padding: 0px; font-weight: bold; font-size: 1.3em;">后记</h3>
<p>这篇文章做的数据实在是太大，而且分析要点太多，涉及到的算法也非常多，实在是没办法一一解读，估计得开一个讨论班，五六个人一起解读。<br />
比如下面这个课题组就讨论过；<br />
<img src="http://www.bio-info-trainee.com/wp-content/uploads/2018/02/MEF-To-iPSC.png" alt="课题组讨论" /></li>
</ul>
</blockquote>
<div style="height: 0; width: 0; max-height: 0; max-width: 0; overflow: hidden; font-size: 0em; padding: 0; margin: 0;" title="MDH:PHA+IyDml7bpl7Tluo/liJfljZXnu4bog57ovazlvZXnu4TmlbDmja7liIbmnpA8L3A+PHA+5paH
56ug5pivOiBbUmVjb25zdHJ1Y3Rpb24gb2YgZGV2ZWxvcG1lbnRhbCBsYW5kc2NhcGVzIGJ5IG9w
dGltYWwtdHJhbnNwb3J0IGFuYWx5c2lzIG9mIHNpbmdsZS1jZWxsIGdlbmUgZXhwcmVzc2lvbiBz
aGVkcyBsaWdodCBvbiBjZWxsdWxhciByZXByb2dyYW1taW5nLl0oaHR0cHM6Ly93d3cuYmlvcnhp
di5vcmcvY29udGVudC9lYXJseS8yMDE3LzA5LzI3LzE5MTA1Nikg6Jm954S25LqOMjAxN+W5tDnm
nIjlhazluIPlnKjkuoZiaW9SeGl25LiK6Z2i77yM5L2G5piv6Iez5LuK5LuN54S25rKh5q2j5byP
5Y+R6KGo77yM5YyF5ZCr5LqG5YWt5LiH5aSa5Liq5Y2V57uG6IOe6L2s5b2V57uE5pWw5o2u77yM
5oyB57ut6L+96Liq5LqGTUVG57uG6IOe57O76K+x5a+85Li6SVBTQ+e7huiDnueahOWKqOaAgeWP
mOWMlui/h+eoi++8jOW5tuS4lOS7juWPkeiCsueahOinkuW6puWIhuaekOS6hui/meS6m+aVsOaN
rjwvcD48cD4mZ3Q7IFdlIGRlbW9uc3RyYXRlIHRoZSBwb3dlciBvZiBXQURESU5HVE9OLU9UIGJ5
IGFwcGx5aW5nIHRoZSBhcHByb2FjaCB0byBzdHVkeSA2NSw3ODEgc2NSTkEtc2VxIHByb2ZpbGVz
IGNvbGxlY3RlZCBhdCAxMCB0aW1lIHBvaW50cyBvdmVyIDE2IGRheXMgZHVyaW5nIHJlcHJvZ3Jh
bW1pbmcgb2YgZmlicm9ibGFzdHMgdG8gaVBTQ3MuPC9wPjxwPiMjIOiDjOaZr+S7i+e7jTwvcD48
cD4jIyMgV2FkZGluZ3Rvbirmj5Dlh7rnmoQq5Y+R6IKy5pmv6KeCPC9wPjxwPuS4iuS4lue6qjUw
5bm05Luj77yM6IOa6IOO5Y+R6IKy55Sf54mp5a2m5a62Q29ucmFkIEhhbCBXYWRkaW5ndG9u5o+Q
5Ye655qE5Y+R6IKy5pmv6KeC5YGH6K+06K6k5Li677yM5YiG5YyW5oiQ54af55qE57uG6IOe5Y+Y
5Zue5aSa6IO95bmy57uG6IOe5piv5Liq5LiN5Y+v6IO95Y+R55Sf55qE5LqL5Lu244CC5L2G5piv
5pel5pys5Lqs6YO95aSn5a2m5pWZ5o6I5bGx5Lit5Ly45byl5LqOMjAwNuW5tOWNtOWPkeeOsOW5
tumqjOivge+8jOi/meenjee7huiDnuWPr+S7peWPkeiCsuaIkOS4uui6q+S9k+WQhOenjee7hOe7
h+e7huiDnuOAgmlQU+e7huiDnueahOWPkeeOsOaIkOWwseS6huebruWJjei9sOi9sOeDiOeDiOea
hOW5sue7huiDnueglOeptumihuWfn++8jOWxseS4reS8uOW8peaVmeaOiOS5n+WboOatpOiOt+W+
lzIwMTLlubTor7rotJ3lsJTnlJ/nkIbmiJbljLvlrablpZbjgII8L3A+PHA+aVBTICjor7Hlr7zl
pJrmvZzog73lubLnu4bog54p6YeN57yW56iL5a6e6aqM55qE5raM546w5L2/5Lq65Lus6YeN5paw
6YeN6KeG5LqG5LiK5Liq5LiW57qqNTDlubTku6Pog5rog47lj5HogrLnlJ/nianlrablrrZXYWRk
aW5ndG9u5o+Q5Ye655qE5Y+R6IKy5pmv6KeC44CC6Jm954S25a6D5Y+q5piv5LiA5Liq6ZqQ5Za7
LOS9huWFtuW9ouixoeWcsOaPj+i/sOS6hue7huiDnueahOiHquWPkeeahOWxguasoeWIhuWPiei/
h+eoi+W5tumakOWQq+S6hue7huiDnuexu+Wei+S5i+mXtOi9rOaNoueahOWPr+iDveaApyzku47o
gIzkvZzkuLrkuIDkuKrmlbTkvZPmoYbmnrbmnIDov5Hooqvlub/ms5vlupTnlKjmnaXop6Pph4rn
u4bog57lj5HogrLlkozph43nvJbnqIvjgII8L3A+PHA+6K+m6KeB77yaaHR0cHM6Ly96aHVhbmxh
bi56aGlodS5jb20vcC8yNTMzMzA1ODwvcD48cD5XYWRkaW5ndG9uIOWcqOS4pOS4quaXtuacn+aP
kOWHuueahOWBh+ivtO+8mjwvcD48cD4tIGluaXRpYWxseSAoMTkzNikgaWxsdXN0cmF0ZWQgYnkg
cmFpbHJvYWQgY2FycyBvbiBzd2l0Y2hpbmcgdHJhY2tzIChBKSA8YnI+LSBsYXRlciAoMTk1Nykg
YnkgbWFyYmxlcyByb2xsaW5nIGluIGEgbGFuZHNjYXBlIChCKSwgd2l0aCB0cmFqZWN0b3JpZXMg
c2hhcGVkIGJ5IGhpbGxzIGFuZCB2YWxsZXlzLjwvcD48cD7lpoLlm77vvJo8L3A+PHA+IVvlj5Ho
grLmma/op4JdKGh0dHA6Ly93d3cuYmlvLWluZm8tdHJhaW5lZS5jb20vd3AtY29udGVudC91cGxv
YWRzLzIwMTgvMDIvV2FkZGluZ3Rvbi1jbGFzc2ljLWFuYWxvZ2llcy5wbmcpPC9wPjxwPiMjIyDm
nIDkvJjkvKDovpPnkIborro8L3A+PHA+5pyA5LyY5Lyg6L6T55CG6K66KE9wdGltYWwgVHJhbnNw
b3J0Ke+8jOS5n+WPq01vbmdlLUthbnRvcm92aWNoIFByb2JsZW3jgILmnIDml6nnlLHms5Xlm73m
lbDlrablrrZNb25nZeaPkOWHuu+8jOS6jOaImOacn+mXtOeUseS/hOWbveaVsOWtpuWutkthbnRv
cm92aWNo5o6o5bm/5ZCO5byA5aeL6L+F6YCf5Y+R5bGV77yMS2FudG9yb3ZpY2jkuZ/lm6Dku5bl
nKjov5nkuKrpoobln5/lgZrlh7rnmoTotKHnjK7lvpfkuoYxOTc15bm055qE57uP5rWO5a2m6K+6
6LSd5bCU5aWW44CCPC9wPjxwPk1vbmdl5pyA5pep5o+Q5Ye655qE6Zeu6aKY5Y+v5Lul55CG6Kej
5Li677yMKirmnInkuIDloIblnJ/lnKjlnLDngrlB77yM546w5Zyo5oiR5Lus6KaB5bCG6L+Z5aCG
5Zyf6L2s56e75Yiw5Zyw54K5Qu+8jOS9huaYr+aIkeS7rOi/kOWcn+aYr+imgei0ueS9k+WKm+ea
hO+8jOaAjuS5iOaQrOi/meS6m+Wcn+WPr+S7peiuqeaIkeS7rOeahOS9k+WKm+a2iOiAl+mZjeWI
sOacgOWwj+OAgioq546w5Zyo5oiR5Lus6YeP5YyW6L+Z5Liq6Zeu6aKY77yM5bCG5Zyf5Zyo5Zyw
54K5QeeahOWIhuW4g+ensOS5i+S4uiJJbml0aWFsIERpc3RyaWJ1dGlvbiLvvIzlnKjlnLDngrlC
55qE5YiG5biD56ew5Li6IkZpbmFsIERpc3RyaWJ1dGlvbiLvvIzmiJHku6znp7DmtojotLnnmoTk
vZPlipvkuLpDb3N077yM6YCa6L+H5LiA5LiqIkNvc3QgRnVuY3Rpb24i6K6h566X5b6X5Ye677yM
5q+P56eN5pCs6L+Q5pa55qGI5Li65LiA5LiqIk1hcHBpbmci44CC5oiR5Lus546w5Zyo6KaB5Zyo
5omA5pyJTWFwcGluZ+S4reWvu+aJvkNvc3TmnIDkvY7nmoTpgqPkuIDkuKrvvIzov5nlsLHmmK/m
nIDkvJjkvKDovpPnkIborrropoHop6PlhrPnmoTpl67popjjgII8L3A+PHA+5Y+v6IO955yL5a6M
6L+Z5Lqb77yM5pyJ55qE5bCP5LyZ5Ly06L+Y5piv5LiN5aSq5oeC5pCs6L+Q5pa55qGI5ZKMTWFw
cGluZ+aYr+aAjuagt+S4gOWbnuS6i+OAgui/memHjOino+mHiuS4gOS4i++8jOavlOWmguWcqOWc
sOeCuUHlkozlnLDngrlC55qE5pe25YCZ77yM5Zyf5aCG55qE5b2i54q26YO96KaB5b2i5oiQ5LiA
5Liq5qCH5YeG5q2j5oCB5YiG5biDIE4oMCwgMSnvvIzmiJHku6wi5bCGQeWcn+WghuS4remXtOea
hOWcn+WFiOaQrOi/h+WOu+W9ouaIkELlnJ/loIbnmoTlsL7lt7Qi5ZKMIuWwhkHlnJ/loIbnmoTl
nJ/nm7TmjqXmlL7liLBC5Zyf5aCG5a+55bqU5L2N572uIuaJgOa2iOiAl+eahOS9k+WKm+Wkp+mD
qOWIhuaDheWGteS4i+aYr+S4jeS4gOagt+eahO+8jOi/meWwseaYr+S4pOenjeS4jeWQjOeahOaW
ueahiOWvueW6lOedgOS4jeWQjOeahENvc3TjgII8L3A+PHA+6L+Z5Yeg5bm077yM55Sx5pyA5LyY
5Lyg6L6T55CG6K666KGN55Sf5Ye65p2l55qEIk1hcnRpbmdhbGUgT3B0aW1hbCBUcmFuc3BvcnQi
5Zyo6YeR6J6N5pWw5a2m5pyJ5LiN5bCR5bqU55So77yM5pyJ5LiN5bCR5Lq65Zyo56CU56m244CC
566A5Y2V55qE6K+05bCx5piv57uZ6L+Z5LqbIk1hcHBpbmci5Yqg5LqG5Liq6ZmQ5Yi277yM6KaB
5rGC5LuW5Lus5b+F6aG75pivIk1hcnRpbmdhbGUi44CCPC9wPjxwPuWmguWbvjo8L3A+PHA+IVvm
nIDkvJjkvKDovpNdKGh0dHA6Ly93d3cuYmlvLWluZm8tdHJhaW5lZS5jb20vd3AtY29udGVudC91
cGxvYWRzLzIwMTgvMDIvdHJhbnNwb3J0LW1hcC5wbmcpPC9wPjxwPiMjIyDlj5HogrLnlJ/nianl
rablrrbmhJ/lhbTotqPnmoTln7rmnKzpl67popg8L3A+PHA+5aaC5LiL77yaPC9wPjxwPi0gV2hh
dCBjbGFzc2VzIG9mIGNlbGxzIGFyZSBwcmVzZW50IGF0IGVhY2ggc3RhZ2U/IDxicj4tIEZvciB0
aGUgY2VsbHMgaW4gZWFjaCBjbGFzcywgd2hhdCB3YXMgdGhlaXIgb3JpZ2luIGF0IGVhcmxpZXIg
c3RhZ2VzLCB3aGF0IGFyZSB0aGVpciBwb3RlbnRpYWwgZmF0ZXMgYXQgbGF0ZXIgc3RhZ2VzLCBh
bmQgd2hhdCBpcyB0aGUgYWN0dWFsIG91dGNvbWUgb2YgYSBnaXZlbiBjZWxsPyA8YnI+LSBUbyB3
aGF0IGV4dGVudCBhcmUgZXZlbnRzIGFsb25nIGEgcGF0aCBzeW5jaHJvbm91cyBvciBhc3luY2hy
b25vdXM/IDxicj4tIFdoYXQgYXJlIHRoZSBnZW5ldGljIHJlZ3VsYXRvcnkgcHJvZ3JhbXMgdGhh
dCBjb250cm9sIGVhY2ggcGF0aD8gPGJyPi0gV2hhdCBhcmUgdGhlIGludGVyY2VsbHVsYXIgaW50
ZXJhY3Rpb25zIGJldHdlZW4gY2xhc3NlcyBvZiBjZWxscz88YnI+LSBIb3cgZGV0ZXJtaW5pc3Rp
YyBvciBzdG9jaGFzdGljIGlzIHRoZSBwcm9jZXNz4oCUdGhhdCBpczogaWYsIGFuZCBob3cgZWFy
bHksIGRvZXMgaXQgYmVjb21lIGRldGVybWluZWQgdGhhdCBhIHBhcnRpY3VsYXIgY2VsbCBvciBh
biBlbnRpcmUgY2VsbCBjbGFzcyBpcyBkZXN0aW5lZCB0byBhIHNwZWNpZmljIGZhdGU/PGJyPi0g
Rm9yIGEgZ2l2ZW4gb3JpZ2luIGFuZCB0YXJnZXQgZmF0ZSwgaXMgdGhlcmUgb25seSBhIHNpbmds
ZSBwYXRoIHRvIHRoZSB0YXJnZXQsIG9yIGFyZSB0aGVyZSBtdWx0aXBsZSBkZXZlbG9wbWVudGFs
IHBhdGhzPyA8YnI+LSBUbyB3aGF0IGV4dGVudCBpcyB0aGUgcHJvY2VzcyBjZWxsLWludHJpbnNp
YywgZHJpdmVuIGJ5IGludHJhY2VsbHVsYXIgbWVjaGFuaXNtcyB0aGF0IGRvIG5vdCByZXF1aXJl
IG9uZ29pbmcgZXh0ZXJuYWwgaW5wdXRzLCBvciBleHRlcm5hbGx5IHJlZ3VsYXRlZCwgYmVpbmcg
YWZmZWN0ZWQgYnkgb3RoZXIgY29udGVtcG9yYW5lb3VzIGNlbGxzPyA8YnI+LSBGb3IgYXJ0aWZp
Y2lhbCBwcm9jZXNzZXMgc3VjaCBhcyBpbmR1Y2VkIHJlcHJvZ3JhbW1pbmcsIHRoZXJlIGFyZSBh
ZGRpdGlvbmFsIHF1ZXN0aW9uczogV2hhdCBvZmYtdGFyZ2V0IGNlbGwgY2xhc3NlcyBhcmlzZT8g
PGJyPi0gVG8gd2hhdCBleHRlbnQgZG8gY2VsbHMgYWN0aXZhdGUgbm9ybWFsIGRldmVsb3BtZW50
YWwgcHJvZ3JhbXMgdnMuIHVubmF0dXJhbCBoeWJyaWQgcHJvZ3JhbXM/IDxicj4tIEhvdyBjYW4g
dGhlIGVmZmljaWVuY3kgb2YgcmVwcm9ncmFtbWluZyBiZSBpbXByb3ZlZD88L3A+PHA+56S65oSP
5Zu+5aaC5LiL77ybPC9wPjxwPiFb57uG6IOe5Y+R6IKy55qE5pyq6Kej5LmL6LCcXShodHRwOi8v
d3d3LmJpby1pbmZvLXRyYWluZWUuY29tL3dwLWNvbnRlbnQvdXBsb2Fkcy8yMDE4LzAyL0RpZmZl
cmVudGlhdGlvbi1wcm9jZXNzZXMtMy1tb2RlbHMucG5nKTwvcD48cD7nhLblkI7liJfkuL7kuIDk
upvliY3kurrlnKjmjqLntKLov5nkupvpl67popjmlrnpnaLnmoTnoJTnqbbmiJDmnpzvvIzmjIfl
h7rku5bku6zlgZrnmoTov5jkuI3lpJ/jgIIqKuiAjOWNlee7huiDnui9rOW9lee7hOa1i+W6j+aK
gOacr+mdnuW4uOW8uuWkp++8jOmAguWQiOino+WGs+i/meS4qumXrumimOOAgioqPGJyPuWNlee7
huiDnui9rOW9lee7hOWcqOaOoue0ouWPkeiCsui9qOi/uei/meaWuemdouS5n+aciei/h+S4gOS6
m+W6lOeUqOS6hu+8jOS4u+imgeeahOeul+azlembhuS4reS6jjPkuKrvvJo8L3A+PHA+LSBrLW5l
YXJlc3QgbmVpZ2hib3IgZ3JhcGhzPGJyPi0gYmluYXJ5IHRyZWVzPGJyPi0gZGlmZnVzaW9uIG1h
cHM8L3A+PHA+5LuW5Lus55qE57y66Zm35b6I5piO5pi+77yM5pyJM+S4qu+8mjwvcD48cD4tIOmm
luWFiOS4uumCo+S6m+eos+WumueahOeUn+eJqeWtpui/h+eoi+iuvuiuoeeahO+8jOavlOWmgmNl
bGwgY3ljbGUgb3IgYWR1bHQgc3RlbSBjZWxsIGRpZmZlcmVudGlhdGlvbjxicj4tIOWFtuasoe+8
jOWNlee7huiDnuacrOi6q+S5n+aYr+WkmuenjeeUn+eJqeWtpueKtuaAgeeahOWPoOWKoO+8jOav
lOWmgmNlbGwgcHJvbGlmZXJhdGlvbiBhbmQgZGVhdGjlsLHkvJrlvbHlk43pgqPkupvnrpfms5Xn
moTooajnjrDjgII8YnI+LSDmnIDlkI7vvIzlpKfpg6jliIbmqKHlnovnmoTlgYforr7pmZDliLbl
vojlpKfvvIzmr5TlpoJvbmUtZGltZW5zaW9uYWwgdHJhamVjdG9yaWVzIGFuZCB6ZXJvLWRpbWVu
c2lvbmFsIGJyYW5jaCBwb2ludHM8L3A+PHA+5omA5Lul5L2c6ICF5oqKT3B0aW1hbCBUcmFuc3Bv
cnQgKE9UKeeahOeul+azle+8jOW6lOeUqOWIsOS6huaXtumXtOW6j+WIl+eahOWNlee7huiDnui9
rOW9lee7hOaVsOaNruadpeaOoue0ouWPkeiCsueahOi/h+eoi+OAguW9k+eEtu+8jOihqOeOsOW+
iOWlveeahOWVpu+8jOaPreekuuS6humHjee8lueoi+eahOWIhuWtkOacuueQhuOAgjxicj7lh6Dl
pKflj5HnjrDlpoLkuIvvvJo8L3A+PHA+LSAoMSkgaWRlbnRpZnlpbmcgYWx0ZXJuYXRpdmUgY2Vs
bCBmYXRlcywgaW5jbHVkaW5nIHNlbmVzY2VuY2UsIGFwb3B0b3NpcywgbmV1cmFsIGlkZW50aXR5
LCBhbmQgcGxhY2VudGFsIGlkZW50aXR5Ozxicj4tICgyKSBxdWFudGlmeWluZyB0aGUgcG9ydGlv
biBvZiBjZWxscyBpbiBlYWNoIHN0YXRlIGF0IGVhY2ggdGltZSBwb2ludDsgPGJyPi0gKDMpIGlu
ZmVycmluZyB0aGUgcHJvYmFibGUgb3JpZ2luKHMpIGFuZCBmYXRlKHMpIG9mIGVhY2ggY2VsbCBh
bmQgY2VsbCBjbGFzcyBhdCBlYWNoIHRpbWUgcG9pbnQ7IDxicj4tICg0KSBpZGVudGlmeWluZyBl
YXJseSBtb2xlY3VsYXIgbWFya2VycyBhc3NvY2lhdGVkIHdpdGggZXZlbnR1YWwgZmF0ZXM7IDxi
cj4tICg1KSB1c2luZyB0cmFqZWN0b3J5IGluZm9ybWF0aW9uIHRvIGlkZW50aWZ5IHRyYW5zY3Jp
cHRpb24gZmFjdG9ycyAoVEZzKSBhc3NvY2lhdGVkIHdpdGggdGhlIGFjdGl2YXRpb24gb2YgZGlm
ZmVyZW50IGV4cHJlc3Npb24gcHJvZ3JhbXMuPC9wPjxwPiMjIOWNlee7huiDnui9rOW9lee7hOaV
sOaNruWkhOeQhjwvcD48cD4jIyMg6aaW5YWI5b6X5Yiw6KGo6L6+55+p6Zi1PC9wPjxwPuWboOS4
uuaYryAxMFggR2Vub21pY3PmlbDmja7vvIzmiYDku6Xnm7TmjqXnlKjlrpjmlrnlt6XlhbdDRUxM
UkFOR0VSIOWNs+WPr++8jOi/h+a7pOWQjuW+l+WIsDxicj42NSw3ODEgY2VsbHMgYW5kIEcgPSAx
NiwgMzM5IGdlbmVzIOeahOihqOi+vuefqemYtTxicj4jIyMg54S25ZCO6ZmN57u0PC9wPjxwPuWF
iOi/h+a7pOaOiemCo+S6m+WcqOaJgOaciee7huiDnuihqOi+vuayoeS7gOS5iOWPmOWMlueahOWf
uuWboO+8jOi/meS4gOatpeWIqeeUqOeahOaYr1LljIVTRVVSQVTnmoRNZWFuVmFyUGxvdOWHveaV
sO+8jOWJqeS4izIwNzbkuKrln7rlm6DjgII8YnI+54S25ZCO5L2/55SoIGRpZmZ1c2lvbiBjb21w
b25lbnQgZW1iZWRkaW5n6L+b6KGM6ZmN57u05aSE55CG77yM77yM6L+Z5LiA5q2l5Yip55So55qE
5pivUuWMhSBERVNUSU5Z44CCPGJyPuWIhuaekOS6hnRvcDEwMCBkaWZmdXNpb24gY29tcG9uZW50
c+eahO+8jOWPkeeOsOWPquaciXRvcDIw5piv5pi+6JGX55qE5a+M6ZuG5YiwIGRldmVsb3BtZW50
YWwgcHJvY2Vzc2VzIO+8jOaJgOS7peS9nOiAheWPqumAieWPluS6hnRvcCAyMCBkaWZmdXNpb24g
Y29tcG9uZW50czxicj4jIyMg5Y+v6KeG5YyWPC9wPjxwPueOsOWcqOWJqeS4i+S6hjIwKjY1Nzgx
55qE55+p6Zi177yM6aaW5YWI5L2/55SoUuivreiogOeahEZOTuWMhemHjOmdoueahCBmYXN0IGst
Tk4gYWxnb3JpdGhtIO+8jOeEtuWQjuWIqeeUqEZvcmNlQXRsYXMy566X5rOV6K6h566XIGZvcmNl
LWRpcmVjdGVkIGxheW91dCBvbiB0aGUgay1OTiBncmFwaCA8YnI+IyMjIOWNlee7huiDnuiBmuex
uzwvcD48cD7lkIzmoLfnmoTliankuIvkuoYyMCo2NTc4MeeahOefqemYte+8jOS9v+eUqOS6hiBM
b3V2YWluLUphY2NhcmQgY29tbXVuaXR5IGRldGVjdGlvbiDnrpfms5XvvIzpu5jorqTlj4LmlbDl
iIbmiJAzM+exuzwvcD48cD4hW10oaHR0cDovL3d3dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1j
b250ZW50L3VwbG9hZHMvMjAxOC8wMi90LVNORS5wbmcpPC9wPjxwPiMjIyDmnIDkvJjkvKDovpPn
rpfms5U8L3A+PHA+5Li76KaB5bCx5piv6ICD6YePIHByb2xpZmVyYXRpb24gc2NvcmXlkoxncm93
dGggcmF0Ze+8jDxicj4jIyMg5Z+65Zug6LCD5o6n572R57ucPC9wPjxwPuiHquW3seWGmVB5dGhv
buiEmuacrOWBmueahOWIhuaekO+8jOWFrOW8j+acieeCueWkmuiAjOS4lOacieeCueWkjeadgu+8
jOS9huaYr+mHjOmdouaPkOWIsOS6hlNoYW5ub24gZGl2ZXJzaXR5IG9mIHRoZSB0cmFuc3BvcnQg
bWFwczwvcD48cD4hW10oaHR0cDovL3d3dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50
L3VwbG9hZHMvMjAxOC8wMi9zaW1pbGFyLWdlbmUtZXhwcmVzc2lvbi1tb2R1bGVzLnBuZyk8L3A+
PHA+IyMjIOWfuuWboOihqOi+vuaooeWdlzwvcD48cD7kvb/nlKjkuoZHcmFwaGljYWwgTGFzc2/n
rpfms5XvvIzmnaXoh6rkuo5S5YyFZ2xhc3Nv77yM6L+Y55So5LqGUuWMhUlHUkFQSOeahEluZm9t
YXAgY29tbXVuaXR5IGRldGVjdGlvbiDnrpfms5XnnIvln7rlm6DmqKHlnZfnmoTnvZHnu5znu5Pm
noTjgII8YnI+5L2/55SoSE9NRVLova/ku7bnmoRmaW5kR08ucGzmtYvluo/lr7nln7rlm6DmqKHl
nZfms6jph4rliLBiaW9sb2dpY2FsIHNpZ25hdHVyZXMgPGJyPuavj+S4quWfuuWboOmbhuWQiOea
hOeJueW+geWIhuaVsOeul+azleWwseaYr+Wug+mHjOmdoueahOaJgOacieWfuuWboOeahHotc2Nv
cmXnmoTlubPlnYflgLzjgII8L3A+PHA+IyMjIOS4jjPkuKrlt7LmnInnrpfms5Xmr5TovoM8L3A+
PHA+6KeB5paH5pyrPC9wPjxwPiMjIyDlrp7pqoznjq/oioLvvJppUFNDPC9wPjxwPldlIG9idGFp
bmVkIG1vdXNlIGVtYnJ5b25pYyBmaWJyb2JsYXN0cyAoTUVGcykgZnJvbSBhIHNpbmdsZSBmZW1h
bGUgZW1icnlvIGhvbW96eWdvdXMgZm9yIFJPU0EyNi1NMnJ0VEEsIHdoaWNoIGNvbnN0aXR1dGl2
ZWx5IGV4cHJlc3NlcyBhIHJldmVyc2UgdHJhbnNhY3RpdmF0b3IgY29udHJvbGxlZCBieSBkb3h5
Y3ljbGluZSAoRG94KTwvcD48cD7lpJropb/njq/ntKAoKkRveHljeWNsaW5lKinvvIzlhbfmnInm
ipfngo7kvZznlKgs5Lmf56ew5L2c5piv5by65Yqb5q+S57SgKCpkb3h5Y3ljbGluZSop77yMYSBE
b3gtaW5kdWNpYmxlIHBvbHljaXN0cm9uaWMgY2Fzc2V0dGUgY2FycnlpbmcgUG91NWYxIChPY3Q0
KSwgS2xmNCwgU294MiwgYW5kIE15YyAoT0tTTSksIGFuZCBhbiBFR0ZQIHJlcG9ydGVyIGluY29y
cG9yYXRlZCBpbnRvIHRoZSBlbmRvZ2Vub3VzIE9jdDQgbG9jdXMgKE9jdDQtSVJFUy1FR0ZQKS48
L3A+PHA+V2UgcGxhdGVkIE1FRnMgaW4gc2VydW0tY29udGFpbmluZyBpbmR1Y3Rpb24gbWVkaXVt
LCB3aXRoIERveCBhZGRlZCBvbiBkYXkgMCB0byBpbmR1Y2UgdGhlIE9LU00gY2Fzc2V0dGUgKFBo
YXNlLTEoRG94KSkuPC9wPjxwPuesrOWFq+WkqeS5i+WQjuaKiua3u+WKoOeahGRveOWPluWHuuad
pe+8jOeEtuWQjuaKiue7huiDnui9rOenu+WIsHNlcnVtLWZyZWUgTjJCMjcgMmkgbWVkaXVtIChQ
aGFzZS0yKDJpKSkg5ZKMc2VydW0gKFBoYXNlLTIoc2VydW0pKS7ov5nkuKTnp43ln7nlhbvmnaHk
u7bkuIvvvIznm7TliLDnu4bog57ns7vooajovr7lh7rlhoXmupDmgKfnmoRPY3Q077yM6K6k5Li6
5piv6YeN57yW56iL5oiQ5Yqf44CCPC9wPjxwPuWmguWbvu+8mjwvcD48cD4hW+WunumqjOeOr+iK
gl0oaHR0cDovL3d3dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50L3VwbG9hZHMvMjAx
OC8wMi9FeHBlcmltZW50YWwtZGVzaWduLW9mLXNjUk5BLVNlcS10aW1lLWNvdXJzZS1kdXJpbmct
aVBTQy1yZXByb2dyYW1taW5nLnBuZyk8L3A+PHA+5Zyo5ZCE5Liq5pe26Ze05q615Z2H5rWL6YeP
5LqG5aW95Yeg5Y2D5Liq57uG6IOe55qE6KGo6L6+6LCx77yM5oC75YWxNjU3ODHkuKrnu4bog57j
gII8L3A+PHA+IyMjIOWPkeiCsuaZr+ingjwvcD48cD7kvZzogIXoirHkuoY15aSn5q615Zyo5o+P
6L+w5LiL6Z2i55qE5Zu+77yaPC9wPjxwPiFbNuS4h+Wkmue7huiDnueahOWPkeiCsuWFqOaZr+Wb
vl0oaHR0cDovL3d3dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50L3VwbG9hZHMvMjAx
OC8wMi9kaWZmZXJlbnRpYXRpb24taW4tMi1jb25kaXRpb25zLnBuZyk8L3A+PHA+5Y+v5Lul55yL
5Yiw57uG6IOe5Y+R6IKy5aeL5LqO56ysMOWkqe+8jOW+iOWuueaYk+eQhuino++8jOiAjOS4lOe7
neWkp+mDqOWIhueahDDlpKnnu4bog57pg73og73ooqvogZrmiJDkuIDkuKrnsbvvvIzooajnjrDk
uLrlvLrng4hNRUYgaWRlbnRpdHnnmoRzaWduYXR1cmXkv6Hlj7fjgILkvYbmmK/nrKzkuozlpKnn
moREb3jlpITnkIblkI7vvIznu4bog57ooqvor7Hlr7zpq5jooajovr5PS1NNIGNhc3NldHRl77yM
6ICM5LiU5byA5aeL6L2s5Y+Y5Li6M+S4quS4jeWQjOeahGNsdXN0ZXJz77yM5L2G5oC75L2T5p2l
6K+06L+ZM+exu+mDveihqOeOsOW+iOW8uueahOWinuauluS/oeWPt+OAgjwvcD48cD7nrKw05aSp
5ZCO57uG6IOe5b6I5piO5pi+5pyd552A5Lik5Liq5LiN5ZCM55qE5pa55ZCR5Y+Y5YyW77yM6L+Z
6YeM5a6a5LmJ5Li677yaVmFsbGV5IG9mIFN0cmVzcyBhbmQgdGhlIEhvcm4gb2YgVHJhbnNmb3Jt
YXRpb27jgII8L3A+PHA+Rm9sbG93aW5nIERveCB3aXRoZHJhd2FsIGFuZCBtZWRpYSByZXBsYWNl
bWVudCBvbiBkYXkgOCwgKip0aGUgY2VsbHMgaW4gdGhlIEhvcm4qKiBhZG9wdCBvbmUgb2YgZm91
ciBhbHRlcm5hdGl2ZSBvdXRjb21lcyBieSBkYXkgMTIgKHNlbmVzY2VuY2UsIG5ldXJvbmFsIHBy
b2dyYW0sIHBsYWNlbnRhbCBwcm9ncmFtLCBhbmQgcHJlLWlQU0NzKS48L3A+PHA+IyMjIGNsdXN0
ZXLkuYvpl7TnmoTovaznp7s8L3A+PHA+5L2c6ICF5o+Q5Yiw5LqGd2UgcGFydGl0aW9uZWQgdGhl
IDE2LDMzOSBkZXRlY3RlZCBnZW5lcyBpbnRvIDQ0IGdlbmUgbW9kdWxlcyBhbmQgdGhlIDY1LDc4
MSBjZWxscyBpbnRvIDMzIGNlbGwgY2x1c3RlcnPvvIzpgqPov5kzM+S4qmNsdXN0ZXLliIblsZ7k
uo7kuI3lkIznmoTlj5HogrLml7bpl7TvvIzlroPku6zkuYvpl7TnmoTlj5HogrLovaznp7vlhbPn
s7vlpoLkuIvlm77vvJo8L3A+PHA+IVtdKGh0dHA6Ly93d3cuYmlvLWluZm8tdHJhaW5lZS5jb20v
d3AtY29udGVudC91cGxvYWRzLzIwMTgvMDIvY2x1c3Rlci10by1jbHVzdGVyLXRyYW5zaXRpb25z
LnBuZyk8L3A+PHA+5bC9566h5bGe5LqO5ZCM5LiA5Liq5Y+R6IKy5pe26Ze06IqC54K577yM5L2G
5piv5LuN54S25piv5pyJ5Y+R6IKy5b+r5oWi562J5aSa5qC35oCn77yM5ZCM5LiA5Y+R6IKy5pe2
6Ze054K555qE5LiN5ZCM54m55oCn55qEY2x1c3Rlcue7huiDnuaOpeS4i+adpeeahOWRvei/kOS5
n+W3ruW8guW+iOWkp++8mjwvcD48cD5CeSBkYXkgNCwgY2VsbHMgZGlzcGxheSBhIGJpbW9kYWwg
ZGlzdHJpYnV0aW9uIG9mIHByb3BlcnRpZXMgdGhhdCBpcyBzdHJvbmdseSBjb3JyZWxhdGVkIHdp
dGggdGhlaXIgZXZlbnR1YWwgZGVzY2VuZGFudHM6PC9wPjxwPi0gY2VsbHMgaW4gY2x1c3RlciA4
IChsb3cgcHJvbGlmZXJhdGlvbiwgaGlnaCBNRUYgaWRlbnRpdHkpIGhhdmUgOTUlIG9mIHRoZWly
IGRlc2NlbmRhbnRzIGluIHRoZSBWYWxsZXksIDxicj4tIHdoaWxlIGNlbGxzIGluIGNsdXN0ZXIg
MTggKGhpZ2ggcHJvbGlmZXJhdGlvbiwgbG93IE1FRiBpZGVudGl0eSkgaGF2ZSA5NCUgb2YgdGhl
aXIgZGVzY2VuZGFudHMgaW4gdGhlIEhvcm4gPGJyPi0gQ2VsbHMgaW4gY2x1c3RlciA3IHNob3cg
aW50ZXJtZWRpYXRlIHByb3BlcnRpZXMgYW5kIGhhdmUgcm91Z2hseSBlcXVhbCBwcm9iYWJpbGl0
aWVzIG9mIGVhY2ggZmF0ZTwvcD48cD7lkIzml7bmjJHpgInkuobkuIDns7vliJflt7LmnInnmoRz
aWduYXR1cmXmnaXmo4Dmn6XlroPku6zlnKjlj5HogrLmma/op4LnmoTooajnjrDvvJo8L3A+PHA+
IVsxMuS4qnNpZ25hdHVyZXPnmoTliqjmgIHlj5jljJZdKGh0dHA6Ly93d3cuYmlvLWluZm8tdHJh
aW5lZS5jb20vd3AtY29udGVudC91cGxvYWRzLzIwMTgvMDIvZGlmZmVyZW50aWF0aW9uLWluLTEy
LXNpZ25hdHVyZXMucG5nKTwvcD48cD7lvZPnhLbvvIzkuZ/mo4Dmn6XkuobkuIDkuIttYXJrZXLl
n7rlm6DnmoTooajovr7lj5jljJbmg4XlhrXvvIzlsLHkuI3miKrlm77kuobjgII8L3A+PHA+IyMj
IOmHjeeCueWFs+azqDXnsbvnu4bog548L3A+PHA+5LiN5ZCM5Y+R6IKy5pe25pyf55qE57uG6IOe
5Y+v5Lul5YiG5oiQMzPnsbvvvIzlhpnotbfkuoblpKrpurvng6bvvIzkvZzogIXmjJHpgInkuobl
gLzlvpforrLmlYXkuovnmoQ157G757uG6IOe77yaPC9wPjxwPi0gcGxhY2VudGFsLWxpa2UgY2Vs
bHMgKGNsdXN0ZXJzIDI0IGFuZCAyNSkgYXQgZGF5IDEyIDxicj4tIG5ldXJhbC1saWtlIGNlbGxz
IChjbHVzdGVycyAyNiBhbmQgMjcpIGF0IGRheSAxNi48L3A+PHA+6L+Y5pyJaVBTQ3MsU2VuZXNj
ZW50IGNlbGxzLCBBcG9wdG90aWMgY2VsbHMuPC9wPjxwPiFbNeexu+e7huiDnl0oaHR0cDovL3d3
dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50L3VwbG9hZHMvMjAxOC8wMi9BbmNlc3Ry
YWwtdHJhamVjdG9yaWVzLnBuZyk8L3A+PHA+5Li76KaB5Lmf5bCx5piv5o+Q5LiA5LiL5LuW5Lus
55qE54m55b6B77yM6auY6KGo6L6+5ZOq5Lqb5Z+65Zug77yM5a6D5Lus55qE5p2l5rqQ5ZKM5Y67
5ZCR6Zeu6aKY44CCPC9wPjxwPiMjIyAz5Liq5YW25a6D6L2v5Lu255qE5pWI5p6cPC9wPjxwPiFb
d2lzaGJvbmVdKGh0dHA6Ly93d3cuYmlvLWluZm8tdHJhaW5lZS5jb20vd3AtY29udGVudC91cGxv
YWRzLzIwMTgvMDIvV2lzaGJvbmUtdHJhamVjdG9yaWVzLnBuZyk8L3A+PHA+IVtNb2xvY2xlMl0o
aHR0cDovL3d3dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50L3VwbG9hZHMvMjAxOC8w
Mi9Nb25vY2xlMi10cmFqZWN0b3JpZXMucG5nKTwvcD48cD4hW0RQVF0oaHR0cDovL3d3dy5iaW8t
aW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50L3VwbG9hZHMvMjAxOC8wMi9kaWZmdXNpb24tcHNl
dWRvLXRpbWUucG5nKTwvcD48cD7ov5nkupvova/ku7bkuYvmiYDku6XkuI3pgILnlKjkuo7kvZzo
gIXnmoTov5nkuKrlrp7pqozorr7orqHlh7rmnaXnmoTmlbDmja7vvIzlm6DkuLrmsqHmnInogIPo
mZHliLDlj5HogrLml7bmnJ/ov5nkuKrlt7Lnn6XnmoTlj5jph4/jgII8L3A+PHA+6Jm954S25Zyo
5L2c6ICF5YaZ5L2c55qE5pe25YCZ5Lmf5bey57uP5Ye65p2l5LqG5LiA5qy+5paw55qE6L2v5Lu2
77yM5L2G5rWL6K+V5LqG77yM5pWI5p6c5Lmf5LiN5aaC5L2c6ICF6Ieq5bex5byA5Y+R55qE566X
5rOV44CCPC9wPjxwPiMjIyDlkI7orrA8L3A+PHA+6L+Z56+H5paH56ug5YGa55qE5pWw5o2u5a6e
5Zyo5piv5aSq5aSn77yM6ICM5LiU5YiG5p6Q6KaB54K55aSq5aSa77yM5raJ5Y+K5Yiw55qE566X
5rOV5Lmf6Z2e5bi45aSa77yM5a6e5Zyo5piv5rKh5Yqe5rOV5LiA5LiA6Kej6K+777yM5Lyw6K6h
5b6X5byA5LiA5Liq6K6o6K6654+t77yM5LqU5YWt5Liq5Lq65LiA6LW36Kej6K+744CCPC9wPjxw
PuavlOWmguS4i+mdoui/meS4quivvumimOe7hOWwseiuqOiuuui/h++8mzwvcD48cD4hW+ivvumi
mOe7hOiuqOiuul0oaHR0cDovL3d3dy5iaW8taW5mby10cmFpbmVlLmNvbS93cC1jb250ZW50L3Vw
bG9hZHMvMjAxOC8wMi9NRUYtVG8taVBTQy5wbmcpPC9wPg==">​</div>
</div>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/3022.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>3500个TNBC单细胞转录组数据重处理</title>
		<link>http://www.bio-info-trainee.com/3019.html</link>
		<comments>http://www.bio-info-trainee.com/3019.html#comments</comments>
		<pubDate>Tue, 06 Feb 2018 06:00:16 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[cancer]]></category>
		<category><![CDATA[10X]]></category>
		<category><![CDATA[TNBC]]></category>
		<category><![CDATA[单细胞]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=3019</guid>
		<description><![CDATA[文章：A Targetable EGFR-Dependent Tumor-Ini &#8230; <a href="http://www.bio-info-trainee.com/3019.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div class="markdown-here-wrapper" data-md-url="http://www.bio-info-trainee.com/wp-admin/post-new.php">
<p style="margin: 0px 0px 1.2em !important;">文章：<a href="https://www.sciencedirect.com/science/article/pii/S221112471731447X">A Targetable EGFR-Dependent Tumor-Initiating Program in Breast Cancer</a> , 因为bulk测序无法解决问题，所以作者选择了单细胞转录组测序策略：</p>
<p style="margin: 0px 0px 1.2em !important;"><span id="more-3019"></span></p>
<p style="margin: 0px 0px 1.2em !important;">To understand functional properties associated with heterogeneous EGFR expression in an unbiased manner, single cell <a href="https://www.sciencedirect.com/topics/biochemistry-genetics-and-molecular-biology/rna-seq">RNA-seq</a> was performed on freshly dissociated cells from the PDX <strong>(3,483 cells, with an average of 40,564 unique molecular identifiers (UMIs) and 5,146 genes detected per cell)</strong></p>
<p style="margin: 0px 0px 1.2em !important;">数据都在SRA数据库里面，如下：<a href="https://trace.ncbi.nlm.nih.gov/Traces/study/?acc=SRP110989">https://trace.ncbi.nlm.nih.gov/Traces/study/?acc=SRP110989</a></p>
<table style="margin: 1.2em 0px; padding: 0px; border-collapse: collapse; border-spacing: 0px; font-style: inherit; font-variant: inherit; font-weight: inherit; font-stretch: inherit; font-size: inherit; line-height: inherit; font-family: inherit; border: 0px;">
<thead>
<tr style="border-width: 1px 0px 0px; border-image: initial; background-color: white; margin: 0px; padding: 0px; border-color: #cccccc initial initial initial; border-style: solid initial initial initial;">
<th style="text-align: right; font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;"></th>
<th style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;">Run</th>
<th style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;">Library name</th>
<th style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;">MBases</th>
<th style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;">MBytes</th>
<th style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;">Experiment</th>
<th style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em; font-weight: bold; background-color: #f0f0f0;">Instrument</th>
</tr>
</thead>
<tbody style="margin: 0px; padding: 0px; border: 0px;">
<tr style="border-width: 1px 0px 0px; border-image: initial; background-color: white; margin: 0px; padding: 0px; border-color: #cccccc initial initial initial; border-style: solid initial initial initial;">
<td style="text-align: right; font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"><a href="https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR5799776">SRR5799776</a></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">PDX1735_run4105_lane006</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">33,751</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">18,184</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"><a href="https://www.ncbi.nlm.nih.gov/sra/SRX2979241">SRX2979241</a></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">Illumina HiSeq 4000</td>
</tr>
<tr style="border-width: 1px 0px 0px; border-image: initial; background-color: #f8f8f8; margin: 0px; padding: 0px; border-color: #cccccc initial initial initial; border-style: solid initial initial initial;">
<td style="text-align: right; font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"><a href="https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR5799775">SRR5799775</a></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">PDX1735_run4143_lane001</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">19,420</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">9,534</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"><a href="https://www.ncbi.nlm.nih.gov/sra/SRX2979242">SRX2979242</a></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">Illumina HiSeq 2500</td>
</tr>
<tr style="border-width: 1px 0px 0px; border-image: initial; background-color: white; margin: 0px; padding: 0px; border-color: #cccccc initial initial initial; border-style: solid initial initial initial;">
<td style="text-align: right; font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"><a href="https://trace.ncbi.nlm.nih.gov/Traces/sra/?run=SRR5799774">SRR5799774</a></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">PDX1735_run4143_lane002</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">19,408</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">9,548</td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;"><a href="https://www.ncbi.nlm.nih.gov/sra/SRX2979243">SRX2979243</a></td>
<td style="font-size: 1em; border: 1px solid #cccccc; margin: 0px; padding: 0.5em 1em;">Illumina HiSeq 2500</td>
</tr>
</tbody>
</table>
<p style="margin: 0px 0px 1.2em !important;">但是作者并没有给表达矩阵，所以只能自行下载原始数据进行单细胞转录组全流程处理。</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-Shell" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">mkdir -p ~/data/public/TNBC/
cd ~/data/public/TNBC/
nohup wget -c ftp://ftp-trace.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR579/SRR5799774/SRR5799774.sra &amp; 
nohup wget -c ftp://ftp-trace.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR579/SRR5799776/SRR5799776.sra &amp; 
nohup wget -c ftp://ftp-trace.ncbi.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR579/SRR5799775/SRR5799775.sra &amp;
</code></pre>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-shell" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">nohup ~/biosoft/sratoolkit/sratoolkit.2.6.3-centos_linux64/bin/fastq-dump --gzip --split-3 SRR5799774.sra &amp;
nohup ~/biosoft/sratoolkit/sratoolkit.2.6.3-centos_linux64/bin/fastq-dump --gzip --split-3 SRR5799775.sra &amp;
nohup ~/biosoft/sratoolkit/sratoolkit.2.6.3-centos_linux64/bin/fastq-dump --gzip --split-3 SRR5799776.sra &amp;
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">下载并且解压后是：</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em 0.7em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block !important; overflow: auto;">1.7G Jan 22 23:34 SRR5799774_1.fastq.gz
 13G Jan 22 23:34 SRR5799774_2.fastq.gz
9.4G Jan 22 17:40 SRR5799774.sra
1.7G Jan 22 23:33 SRR5799775_1.fastq.gz
 13G Jan 22 23:33 SRR5799775_2.fastq.gz
9.4G Jan 22 17:31 SRR5799775.sra
2.9G Jan 23 00:55 SRR5799776_1.fastq.gz
 24G Jan 23 00:55 SRR5799776_2.fastq.gz
 18G Jan 22 18:25 SRR5799776.sra
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">可以看到左右端数据文件大小差别很大，因为这个不是普通的双端测序。</p>
<p style="margin: 0px 0px 1.2em !important;">需要在作者的文章里面找到测序的描述，这篇文章的<a href="https://ars.els-cdn.com/content/image/1-s2.0-S221112471731447X-mmc1.pdf">补充材料</a>有介绍：</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em 0.7em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block !important; overflow: auto;">26 bp Read1, 8 bp I7 Index, 0 bp I5 Index and 98 bp Read2.
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">测序数据量是：a total of <code style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0px 0.3em; white-space: pre-wrap; border: 1px solid #eaeaea; background-color: #f8f8f8; border-radius: 3px; display: inline;">717,982,475</code> reads, and <code style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0px 0.3em; white-space: pre-wrap; border: 1px solid #eaeaea; background-color: #f8f8f8; border-radius: 3px; display: inline;">179,137</code> reads per single-cell</p>
<p style="margin: 0px 0px 1.2em !important;">因为是 <strong>10x Genomics方法</strong>做的单细胞转录组数据，所以需要使用他们发表的工具来处理：<a href="https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/installation">Cell Ranger</a> ，需要简单注册才能下载安装，我下载了一个测试数据，发现：</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em 0.7em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block !important; overflow: auto;">├── [237M] neurons_900_S1_L001_I1_001.fastq.gz
├── [642M] neurons_900_S1_L001_R1_001.fastq.gz
├── [1.8G] neurons_900_S1_L001_R2_001.fastq.gz
├── [238M] neurons_900_S1_L002_I1_001.fastq.gz
├── [646M] neurons_900_S1_L002_R1_001.fastq.gz
└── [1.8G] neurons_900_S1_L002_R2_001.fastq.gz
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">可以看到左右端测序数据大小不一致，<strong>而且每次测序是有3个数据</strong>，因为26bp read1 (16bp Chromium <strong>barcode</strong> and 10bp UMI), 98bp read2 (<strong>transcript</strong>), and 8bp I7 sample <strong>barcode</strong> ，只有reads2的fastq里面是真正的转录本序列，另外的两个文件都是barcode！可以直接用 <a href="https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/installation">Cell Ranger</a> 来做分析，代码如下：</p>
<pre style="font-size: 1em; font-family: Consolas, Inconsolata, Courier, monospace; line-height: 1.2em; margin: 1.2em 0px;"><code class="hljs language-Shell" style="font-size: 0.85em; font-family: Consolas, Inconsolata, Courier, monospace; margin: 0px 0.15em; padding: 0.5em; white-space: pre; border: 1px solid #cccccc; background-color: #f8f8f8; border-radius: 3px; display: block; overflow: auto; overflow-x: auto; color: #333333; background: #f8f8f8; text-size-adjust: none;">/home/jianmingzeng/biosoft/10xgenomic/cellranger-2.1.0/cellranger count --id=neurons \
--localcores 5 \
--transcriptome=/home/jianmingzeng/biosoft/10xgenomic/db/refdata-cellranger-mm10-1.2.0 \
--fastqs=/home/jianmingzeng/data/public/10x/neurons_900_fastqs \
--sample=neurons \
--expect-cells=900
</code></pre>
<p style="margin: 0px 0px 1.2em !important;">但是作者上传的数据缺失了关键信息，我写信给10x genomics公司的人咨询了这件事</p>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">I just read a paper: A Targetable EGFR-Dependent Tumor-Initiating Program in Breast Cancer<br />
and they choose 10x genomics for scRNA-seq, and upload the raw data into SRA database.</p>
<p style="margin: 0px 0px 1.2em !important;">While I’ve download them, there should be 26 bp Read1, 8 bp I7 Index, 0 bp I5 Index and 98 bp Read2.</p>
<p style="margin: 0px 0px 1.2em !important;">But I just found the 8 bp in fq1, and 98bp in fq2, the key information just lost , which means I can’t use the Cell Ranger to process them.</p>
<p style="margin: 0px 0px 1.2em !important;">Any help ?</p>
</blockquote>
<p style="margin: 0px 0px 1.2em !important;">公司回复我说，如果缺失barcode信息，这个数据是没办法处理的。</p>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;"><strong>Michael Campbell</strong> (10x Genomics)Jan 26, 07:03 PST Hi Jianming,</p>
<p style="margin: 0px 0px 1.2em !important;">That’s right if you don’t have the 26bp read with the 10x barcode and UMI in it you can’t use Cell Ranger, or any other tool for that matter because there is no way to related the second read to the cell it came from. I would contact the corresponding author to see what happened to the R1 read. If you want, you can send me the SRR number and I can have a look to see if the R1 read is buried somewhere.</p>
<p style="margin: 0px 0px 1.2em !important;">Best,<br />
Mike</p>
</blockquote>
<p style="margin: 0px 0px 1.2em !important;">然后我给出了文章以及SRA号，公司的任又检查了一遍，的确是作者的失误。</p>
<blockquote style="margin: 1.2em 0px; border-left: 4px solid #dddddd; padding: 0px 1em; color: #777777; quotes: none;">
<p style="margin: 0px 0px 1.2em !important;">Hi Jainming,</p>
<p style="margin: 0px 0px 1.2em !important;">It looks like they uploaded the index read as read 1 instead of the read with the barcode. It’s not analyzable in this format.</p>
<p style="margin: 0px 0px 1.2em !important;">Best,<br />
Mike</p>
</blockquote>
<div style="height: 0; width: 0; max-height: 0; max-width: 0; overflow: hidden; font-size: 0em; padding: 0; margin: 0;" title="MDH:PHA+5paH56ug77yaW0EgVGFyZ2V0YWJsZSBFR0ZSLURlcGVuZGVudCBUdW1vci1Jbml0aWF0aW5n
IFByb2dyYW0gaW4gQnJlYXN0IENhbmNlcl0oaHR0cHM6Ly93d3cuc2NpZW5jZWRpcmVjdC5jb20v
c2NpZW5jZS9hcnRpY2xlL3BpaS9TMjIxMTEyNDcxNzMxNDQ3WCkgLCDlm6DkuLpidWxr5rWL5bqP
5peg5rOV6Kej5Yaz6Zeu6aKY77yM5omA5Lul5L2c6ICF6YCJ5oup5LqG5Y2V57uG6IOe6L2s5b2V
57uE5rWL5bqP562W55Wl77yaPGJyPjwvcD48cD5UbyB1bmRlcnN0YW5kIGZ1bmN0aW9uYWwgcHJv
cGVydGllcyBhc3NvY2lhdGVkIHdpdGggaGV0ZXJvZ2VuZW91cyBFR0ZSIGV4cHJlc3Npb24gaW4g
YW4gdW5iaWFzZWQgbWFubmVyLCBzaW5nbGUgY2VsbCBbUk5BLXNlcV0oaHR0cHM6Ly93d3cuc2Np
ZW5jZWRpcmVjdC5jb20vdG9waWNzL2Jpb2NoZW1pc3RyeS1nZW5ldGljcy1hbmQtbW9sZWN1bGFy
LWJpb2xvZ3kvcm5hLXNlcSkgd2FzIHBlcmZvcm1lZCBvbiBmcmVzaGx5IGRpc3NvY2lhdGVkIGNl
bGxzIGZyb20gdGhlIFBEWCAqKigzLDQ4MyBjZWxscywgd2l0aCBhbiBhdmVyYWdlIG9mIDQwLDU2
NCB1bmlxdWUgbW9sZWN1bGFyIGlkZW50aWZpZXJzIChVTUlzKSBhbmQgNSwxNDYgZ2VuZXMgZGV0
ZWN0ZWQgcGVyIGNlbGwpKio8L3A+PHA+5pWw5o2u6YO95ZyoU1JB5pWw5o2u5bqT6YeM6Z2i77yM
5aaC5LiL77yaaHR0cHM6Ly90cmFjZS5uY2JpLm5sbS5uaWguZ292L1RyYWNlcy9zdHVkeS8/YWNj
PVNSUDExMDk4OTwvcD48cD58IHwgUnVuIHwgTGlicmFyeSBuYW1lIHwgTUJhc2VzIHwgTUJ5dGVz
IHwgRXhwZXJpbWVudCB8IEluc3RydW1lbnQgfDxicj58IC0tLTogfCAtLS0tLS0tLS0tLS0tLS0t
LS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tIHwgLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tIHwgLS0t
LS0tIHwgLS0tLS0tIHwgLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLSB8
IC0tLS0tLS0tLS0tLS0tLS0tLS0gfDxicj58IHwgW1NSUjU3OTk3NzZdKGh0dHBzOi8vdHJhY2Uu
bmNiaS5ubG0ubmloLmdvdi9UcmFjZXMvc3JhLz9ydW49U1JSNTc5OTc3NikgfCBQRFgxNzM1XF9y
dW40MTA1X2xhbmUwMDYgfCAzMyw3NTEgfCAxOCwxODQgfCBbU1JYMjk3OTI0MV0oaHR0cHM6Ly93
d3cubmNiaS5ubG0ubmloLmdvdi9zcmEvU1JYMjk3OTI0MSkgfCBJbGx1bWluYSBIaVNlcSA0MDAw
IHw8YnI+fCB8IFtTUlI1Nzk5Nzc1XShodHRwczovL3RyYWNlLm5jYmkubmxtLm5paC5nb3YvVHJh
Y2VzL3NyYS8/cnVuPVNSUjU3OTk3NzUpIHwgUERYMTczNVxfcnVuNDE0M19sYW5lMDAxIHwgMTks
NDIwIHwgOSw1MzQgfCBbU1JYMjk3OTI0Ml0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9z
cmEvU1JYMjk3OTI0MikgfCBJbGx1bWluYSBIaVNlcSAyNTAwIHw8YnI+fCB8IFtTUlI1Nzk5Nzc0
XShodHRwczovL3RyYWNlLm5jYmkubmxtLm5paC5nb3YvVHJhY2VzL3NyYS8/cnVuPVNSUjU3OTk3
NzQpIHwgUERYMTczNVxfcnVuNDE0M19sYW5lMDAyIHwgMTksNDA4IHwgOSw1NDggfCBbU1JYMjk3
OTI0M10oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9zcmEvU1JYMjk3OTI0MykgfCBJbGx1
bWluYSBIaVNlcSAyNTAwIHw8L3A+PHA+5L2G5piv5L2c6ICF5bm25rKh5pyJ57uZ6KGo6L6+55+p
6Zi177yM5omA5Lul5Y+q6IO96Ieq6KGM5LiL6L295Y6f5aeL5pWw5o2u6L+b6KGM5Y2V57uG6IOe
6L2s5b2V57uE5YWo5rWB56iL5aSE55CG44CCPC9wPjxwPmBgYFNoZWxsPGJyPm1rZGlyIC1wIH4v
ZGF0YS9wdWJsaWMvVE5CQy88YnI+Y2Qgfi9kYXRhL3B1YmxpYy9UTkJDLzxicj5ub2h1cCB3Z2V0
IC1jIGZ0cDovL2Z0cC10cmFjZS5uY2JpLm5paC5nb3Yvc3JhL3NyYS1pbnN0YW50L3JlYWRzL0J5
UnVuL3NyYS9TUlIvU1JSNTc5L1NSUjU3OTk3NzQvU1JSNTc5OTc3NC5zcmEgJmFtcDsgPGJyPm5v
aHVwIHdnZXQgLWMgZnRwOi8vZnRwLXRyYWNlLm5jYmkubmloLmdvdi9zcmEvc3JhLWluc3RhbnQv
cmVhZHMvQnlSdW4vc3JhL1NSUi9TUlI1NzkvU1JSNTc5OTc3Ni9TUlI1Nzk5Nzc2LnNyYSAmYW1w
OyA8YnI+bm9odXAgd2dldCAtYyBmdHA6Ly9mdHAtdHJhY2UubmNiaS5uaWguZ292L3NyYS9zcmEt
aW5zdGFudC9yZWFkcy9CeVJ1bi9zcmEvU1JSL1NSUjU3OS9TUlI1Nzk5Nzc1L1NSUjU3OTk3NzUu
c3JhICZhbXA7IDxicj5gYGA8L3A+PHA+YGBgc2hlbGw8YnI+bm9odXAgfi9iaW9zb2Z0L3NyYXRv
b2xraXQvc3JhdG9vbGtpdC4yLjYuMy1jZW50b3NfbGludXg2NC9iaW4vZmFzdHEtZHVtcCAtLWd6
aXAgLS1zcGxpdC0zIFNSUjU3OTk3NzQuc3JhICZhbXA7PGJyPm5vaHVwIH4vYmlvc29mdC9zcmF0
b29sa2l0L3NyYXRvb2xraXQuMi42LjMtY2VudG9zX2xpbnV4NjQvYmluL2Zhc3RxLWR1bXAgLS1n
emlwIC0tc3BsaXQtMyBTUlI1Nzk5Nzc1LnNyYSAmYW1wOzxicj5ub2h1cCB+L2Jpb3NvZnQvc3Jh
dG9vbGtpdC9zcmF0b29sa2l0LjIuNi4zLWNlbnRvc19saW51eDY0L2Jpbi9mYXN0cS1kdW1wIC0t
Z3ppcCAtLXNwbGl0LTMgU1JSNTc5OTc3Ni5zcmEgJmFtcDs8YnI+YGBgPGJyPuS4i+i9veW5tuS4
lOino+WOi+WQjuaYr++8mjwvcD48cD5gYGA8YnI+MS43RyBKYW4gMjIgMjM6MzQgU1JSNTc5OTc3
NF8xLmZhc3RxLmd6PGJyPiAxM0cgSmFuIDIyIDIzOjM0IFNSUjU3OTk3NzRfMi5mYXN0cS5nejxi
cj45LjRHIEphbiAyMiAxNzo0MCBTUlI1Nzk5Nzc0LnNyYTxicj4xLjdHIEphbiAyMiAyMzozMyBT
UlI1Nzk5Nzc1XzEuZmFzdHEuZ3o8YnI+IDEzRyBKYW4gMjIgMjM6MzMgU1JSNTc5OTc3NV8yLmZh
c3RxLmd6PGJyPjkuNEcgSmFuIDIyIDE3OjMxIFNSUjU3OTk3NzUuc3JhPGJyPjIuOUcgSmFuIDIz
IDAwOjU1IFNSUjU3OTk3NzZfMS5mYXN0cS5nejxicj4gMjRHIEphbiAyMyAwMDo1NSBTUlI1Nzk5
Nzc2XzIuZmFzdHEuZ3o8YnI+IDE4RyBKYW4gMjIgMTg6MjUgU1JSNTc5OTc3Ni5zcmE8YnI+YGBg
PC9wPjxwPuWPr+S7peeci+WIsOW3puWPs+err+aVsOaNruaWh+S7tuWkp+Wwj+W3ruWIq+W+iOWk
p++8jOWboOS4uui/meS4quS4jeaYr+aZrumAmueahOWPjOerr+a1i+W6j+OAgjwvcD48cD7pnIDo
poHlnKjkvZzogIXnmoTmlofnq6Dph4zpnaLmib7liLDmtYvluo/nmoTmj4/ov7DvvIzov5nnr4fm
lofnq6DnmoRb6KGl5YWF5p2Q5paZXShodHRwczovL2Fycy5lbHMtY2RuLmNvbS9jb250ZW50L2lt
YWdlLzEtczIuMC1TMjIxMTEyNDcxNzMxNDQ3WC1tbWMxLnBkZinmnInku4vnu43vvJo8L3A+PHA+
YGBgPGJyPjI2IGJwIFJlYWQxLCA4IGJwIEk3IEluZGV4LCAwIGJwIEk1IEluZGV4IGFuZCA5OCBi
cCBSZWFkMi48YnI+YGBgPC9wPjxwPua1i+W6j+aVsOaNrumHj+aYr++8mmEgdG90YWwgb2YgYDcx
Nyw5ODIsNDc1YCByZWFkcywgYW5kIGAxNzksMTM3YCByZWFkcyBwZXIgc2luZ2xlLWNlbGw8L3A+
PHA+5Zug5Li65pivICoqMTB4IEdlbm9taWNz5pa55rOVKirlgZrnmoTljZXnu4bog57ovazlvZXn
u4TmlbDmja7vvIzmiYDku6XpnIDopoHkvb/nlKjku5bku6zlj5HooajnmoTlt6XlhbfmnaXlpITn
kIbvvJpbQ2VsbCBSYW5nZXJdKGh0dHBzOi8vc3VwcG9ydC4xMHhnZW5vbWljcy5jb20vc2luZ2xl
LWNlbGwtZ2VuZS1leHByZXNzaW9uL3NvZnR3YXJlL3BpcGVsaW5lcy9sYXRlc3QvaW5zdGFsbGF0
aW9uKSDvvIzpnIDopoHnroDljZXms6jlhozmiY3og73kuIvovb3lronoo4XvvIzmiJHkuIvovb3k
uobkuIDkuKrmtYvor5XmlbDmja7vvIzlj5HnjrDvvJo8L3A+PHA+YGBgPGJyPuKUnOKUgOKUgCBb
MjM3TV0gbmV1cm9uc185MDBfUzFfTDAwMV9JMV8wMDEuZmFzdHEuZ3o8YnI+4pSc4pSA4pSAIFs2
NDJNXSBuZXVyb25zXzkwMF9TMV9MMDAxX1IxXzAwMS5mYXN0cS5nejxicj7ilJzilIDilIAgWzEu
OEddIG5ldXJvbnNfOTAwX1MxX0wwMDFfUjJfMDAxLmZhc3RxLmd6PGJyPuKUnOKUgOKUgCBbMjM4
TV0gbmV1cm9uc185MDBfUzFfTDAwMl9JMV8wMDEuZmFzdHEuZ3o8YnI+4pSc4pSA4pSAIFs2NDZN
XSBuZXVyb25zXzkwMF9TMV9MMDAyX1IxXzAwMS5mYXN0cS5nejxicj7ilJTilIDilIAgWzEuOEdd
IG5ldXJvbnNfOTAwX1MxX0wwMDJfUjJfMDAxLmZhc3RxLmd6PGJyPmBgYDwvcD48cD7lj6/ku6Xn
nIvliLDlt6blj7Pnq6/mtYvluo/mlbDmja7lpKflsI/kuI3kuIDoh7TvvIwqKuiAjOS4lOavj+as
oea1i+W6j+aYr+aciTPkuKrmlbDmja4qKu+8jOWboOS4ujI2YnAgcmVhZDEgKDE2YnAgQ2hyb21p
dW0gKipiYXJjb2RlKiogYW5kIDEwYnAgVU1JKSwgOThicCByZWFkMiAoKip0cmFuc2NyaXB0Kiop
LCBhbmQgOGJwIEk3IHNhbXBsZSAqKmJhcmNvZGUqKiDvvIzlj6rmnIlyZWFkczLnmoRmYXN0cemH
jOmdouaYr+ecn+ato+eahOi9rOW9leacrOW6j+WIl++8jOWPpuWklueahOS4pOS4quaWh+S7tumD
veaYr2JhcmNvZGXvvIHlj6/ku6Xnm7TmjqXnlKggW0NlbGwgUmFuZ2VyXShodHRwczovL3N1cHBv
cnQuMTB4Z2Vub21pY3MuY29tL3NpbmdsZS1jZWxsLWdlbmUtZXhwcmVzc2lvbi9zb2Z0d2FyZS9w
aXBlbGluZXMvbGF0ZXN0L2luc3RhbGxhdGlvbikg5p2l5YGa5YiG5p6Q77yM5Luj56CB5aaC5LiL
77yaPC9wPjxwPmBgYFNoZWxsPGJyPi9ob21lL2ppYW5taW5nemVuZy9iaW9zb2Z0LzEweGdlbm9t
aWMvY2VsbHJhbmdlci0yLjEuMC9jZWxscmFuZ2VyIGNvdW50IC0taWQ9bmV1cm9ucyBcPGJyPi0t
bG9jYWxjb3JlcyA1IFw8YnI+LS10cmFuc2NyaXB0b21lPS9ob21lL2ppYW5taW5nemVuZy9iaW9z
b2Z0LzEweGdlbm9taWMvZGIvcmVmZGF0YS1jZWxscmFuZ2VyLW1tMTAtMS4yLjAgXDxicj4tLWZh
c3Rxcz0vaG9tZS9qaWFubWluZ3plbmcvZGF0YS9wdWJsaWMvMTB4L25ldXJvbnNfOTAwX2Zhc3Rx
cyBcPGJyPi0tc2FtcGxlPW5ldXJvbnMgXDxicj4tLWV4cGVjdC1jZWxscz05MDA8YnI+YGBgPGJy
PuS9huaYr+S9nOiAheS4iuS8oOeahOaVsOaNrue8uuWkseS6huWFs+mUruS/oeaBr++8jOaIkeWG
meS/oee7mTEweCBnZW5vbWljc+WFrOWPuOeahOS6uuWSqOivouS6hui/meS7tuS6izwvcD48cD4m
Z3Q7IEkganVzdCByZWFkIGEgcGFwZXI6IEEgVGFyZ2V0YWJsZSBFR0ZSLURlcGVuZGVudCBUdW1v
ci1Jbml0aWF0aW5nIFByb2dyYW0gaW4gQnJlYXN0IENhbmNlcjxicj4mZ3Q7IGFuZCB0aGV5IGNo
b29zZSAxMHggZ2Vub21pY3MgZm9yIHNjUk5BLXNlcSwgYW5kIHVwbG9hZCB0aGUgcmF3IGRhdGEg
aW50byBTUkEgZGF0YWJhc2UuPGJyPiZndDs8YnI+Jmd0OyBXaGlsZSBJJ3ZlIGRvd25sb2FkIHRo
ZW0sIHRoZXJlIHNob3VsZCBiZSAyNiBicCBSZWFkMSwgOCBicCBJNyBJbmRleCwgMCBicCBJNSBJ
bmRleCBhbmQgOTggYnAgUmVhZDIuPGJyPiZndDs8YnI+Jmd0OyBCdXQgSSBqdXN0IGZvdW5kIHRo
ZSA4IGJwIGluIGZxMSwgYW5kIDk4YnAgaW4gZnEyLCB0aGUga2V5IGluZm9ybWF0aW9uIGp1c3Qg
bG9zdCAsIHdoaWNoIG1lYW5zIEkgY2FuJ3QgdXNlIHRoZSBDZWxsIFJhbmdlciB0byBwcm9jZXNz
IHRoZW0uPGJyPiZndDs8YnI+Jmd0OyBBbnkgaGVscCA/PC9wPjxwPuWFrOWPuOWbnuWkjeaIkeiv
tO+8jOWmguaenOe8uuWksWJhcmNvZGXkv6Hmga/vvIzov5nkuKrmlbDmja7mmK/msqHlip7ms5Xl
pITnkIbnmoTjgII8L3A+PHA+Jmd0OyAqKk1pY2hhZWwgQ2FtcGJlbGwqKiAoMTB4IEdlbm9taWNz
KUphbiAyNiwgMDc6MDMgUFNUIEhpIEppYW5taW5nLDxicj4mZ3Q7PGJyPiZndDsgVGhhdCdzIHJp
Z2h0IGlmIHlvdSBkb24ndCBoYXZlIHRoZSAyNmJwIHJlYWQgd2l0aCB0aGUgMTB4IGJhcmNvZGUg
YW5kIFVNSSBpbiBpdCB5b3UgY2FuJ3QgdXNlIENlbGwgUmFuZ2VyLCBvciBhbnkgb3RoZXIgdG9v
bCBmb3IgdGhhdCBtYXR0ZXIgYmVjYXVzZSB0aGVyZSBpcyBubyB3YXkgdG8gcmVsYXRlZCB0aGUg
c2Vjb25kIHJlYWQgdG8gdGhlIGNlbGwgaXQgY2FtZSBmcm9tLiBJIHdvdWxkIGNvbnRhY3QgdGhl
IGNvcnJlc3BvbmRpbmcgYXV0aG9yIHRvIHNlZSB3aGF0IGhhcHBlbmVkIHRvIHRoZSBSMSByZWFk
LiBJZiB5b3Ugd2FudCwgeW91IGNhbiBzZW5kIG1lIHRoZSBTUlIgbnVtYmVyIGFuZCBJIGNhbiBo
YXZlIGEgbG9vayB0byBzZWUgaWYgdGhlIFIxIHJlYWQgaXMgYnVyaWVkIHNvbWV3aGVyZS48YnI+
Jmd0Ozxicj4mZ3Q7IEJlc3QsPGJyPiZndDsgTWlrZTwvcD48cD7nhLblkI7miJHnu5nlh7rkuobm
lofnq6Dku6Xlj4pTUkHlj7fvvIzlhazlj7jnmoTku7vlj4jmo4Dmn6XkuobkuIDpgY3vvIznmoTn
oa7mmK/kvZzogIXnmoTlpLHor6/jgII8L3A+PHA+Jmd0O0hpIEphaW5taW5nLDxicj4mZ3Q7PGJy
PiZndDtJdCBsb29rcyBsaWtlIHRoZXkgdXBsb2FkZWQgdGhlIGluZGV4IHJlYWQgYXMgcmVhZCAx
IGluc3RlYWQgb2YgdGhlIHJlYWQgd2l0aCB0aGUgYmFyY29kZS4gSXQncyBub3QgYW5hbHl6YWJs
ZSBpbiB0aGlzIGZvcm1hdC48YnI+Jmd0Ozxicj4mZ3Q7QmVzdCw8YnI+Jmd0O01pa2U8L3A+">​</div>
</div>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/3019.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
