<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>生信菜鸟团 &#187; 视频教程</title>
	<atom:link href="http://www.bio-info-trainee.com/tag/%e8%a7%86%e9%a2%91%e6%95%99%e7%a8%8b/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bio-info-trainee.com</link>
	<description>欢迎去论坛biotrainee.com留言参与讨论，或者关注同名微信公众号biotrainee</description>
	<lastBuildDate>Sat, 28 Jun 2025 14:30:13 +0000</lastBuildDate>
	<language>zh-CN</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.33</generator>
	<item>
		<title>生物信息入门视频推荐-新一代测序数据分析</title>
		<link>http://www.bio-info-trainee.com/689.html</link>
		<comments>http://www.bio-info-trainee.com/689.html#comments</comments>
		<pubDate>Sun, 03 May 2015 14:18:59 +0000</pubDate>
		<dc:creator><![CDATA[ulwvfje]]></dc:creator>
				<category><![CDATA[生信基础]]></category>
		<category><![CDATA[视频教程]]></category>

		<guid isPermaLink="false">http://www.bio-info-trainee.com/?p=689</guid>
		<description><![CDATA[新一代测序数据分析-在优酷里面可以搜索到，一下是配套视频的讲义及下载地址！ Le &#8230; <a href="http://www.bio-info-trainee.com/689.html">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<ol style="list-style-type: upper-roman;">
<li>新一代测序数据分析-在优酷里面可以搜索到，一下是配套视频的讲义及下载地址！</li>
<li>Lecture Notes</li>
<li>Lectures will appear below as they are presented. Homeworks are specified in each handout.</li>
<li>Lecture 1 - <a href="./files/2013/lecture-1.pdf">slides</a>, <a href="./files/2013/lecture-1-handouts.pdf">handouts</a>. course information, homework and project information, introduction to computing, setting up you computer, basic unix command line usage, organizing your projects, homework 1.
<ul>
<li>Suggested reading: <a href="http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000424">A Quick Guide to Organizing Computational Biology Projects, Plos Comp Bio, 2009</a>,</li>
<li>Other blog posts on <a href="http://nicercode.github.io/blog/2013-04-05-projects/">organizing projects</a> and <a href="http://carlboettiger.info/2012/05/06/research-workflow.html">research workflow</a>.</li>
</ul>
</li>
<li>Lecture 2 - <a href="./files/2013/lecture-2.pdf">slides</a>, <a href="./files/2013/lecture-2-handouts.pdf">handouts</a>, The <a href="http://www.sequenceontology.org/gff3.shtml">GFF format</a>, <a href="http://www.sequenceontology.org/index.html">sequence ontologies</a>, basic Unix commands: wc, grep, cut, sort, redirecting input and output streams, piping commands, processing a tabular file with UNIX tools, homework 2
<ul>
<li>dataset for this lecture, the latest yeast genome feature file: <a href="http://downloads.yeastgenome.org/curation/chromosomal_feature/saccharomyces_cerevisiae.gff">saccharomyces_cerevisiae.gff</a></li>
<li>Unix flags explained: <a href="http://picard.sourceforge.net/explain-flags.html">Explain Shell</a></li>
<li>Suggested reading: <a href="http://korflab.ucdavis.edu/unix_and_Perl/">Unix for Biologists</a></li>
</ul>
</li>
<li>Lecture 3 - <a href="./files/2013/lecture-3.pdf">slides</a>, <a href="./files/2013/lecture-3-handouts.pdf">handouts</a>. programming languages, download and install an proper editor, introduction to the <a href="http://en.wikipedia.org/wiki/AWK">AWK</a> programming language, tabular file processing, filtering by feature types, <a href="http://www.catonmat.net/series/awk-one-liners-explained">Awk onliners explained</a>, another collections of <a href="http://www.pement.org/awk/awk1line.txt">AWK oneliners</a>, homework 3.
<ul>
<li>Recommended editors <a href="http://www.activestate.com/komodo-edit">Komodo Edit</a> or <a href="http://www.sublimetext.com/">Sublime Text</a></li>
<li>Suggested reading <a href="http://www.catb.org/esr/writings/taoup/html/">The Art of Unix Programming</a>,</li>
<li>Biostar question of the day, iterating through files: <a href="http://www.biostars.org/p/63336/">Very Bad Things</a></li>
</ul>
</li>
<li>Lecture 4 - <a href="./files/2013/lecture-4.pdf">slides</a>, <a href="./files/2013/lecture-4-handouts.pdf">handouts</a>, sequencing technologies, sequence representations, the FASTA format, processing FASTA files at the command line, homework 4.
<ul>
<li>dataset for this lecture, sequencing reads from a 454 instrument: <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/lec4.fa">lec4.fa (6Mb)</a></li>
<li>The <a href="http://en.wikipedia.org/wiki/FASTA_format">FASTA format</a> and the <a href="http://www.bioinformatics.org/sms2/iupac.html">IUPAC codes</a></li>
<li>Suggested reading: <a href="http://www.biostat.wisc.edu/~craven/hunter.pdf">Molecular Biology for Computer Scientists</a>,</li>
<li>Biostar question of the day: <a href="http://www.biostars.org/p/52150/">Sequencing Technology Reviews</a></li>
</ul>
</li>
<li>Lecture 5 - <a href="./files/2013/lecture-5.pdf">slides</a>, <a href="./files/2013/lecture-5-handouts.pdf">handouts</a>, string matching, edit distances, regular expressions, local and global alignments, homework 5.
<ul>
<li><a href="http://regexpal.com/">Regular Expression tester</a></li>
<li><a href="http://www.ebi.ac.uk/Tools/psa/">Pairwise sequence alignment tools at EMBL</a></li>
<li>Book: comprehensive discussion of basic bioinformatics concepts: <a href="http://www.amazon.com/Understanding-Bioinformatics-Marketa-Zvelebil/dp/0815340249">Understanding Bioinformatics</a> by <a href="http://www.icr.ac.uk/research/team_leaders/Zvelebil_Marketa/index.shtml">Dr. Marketa Zvelebil</a> and Jeremy Baum.</li>
</ul>
</li>
<li>Lecture 6 - <a href="./files/2013/lecture-6.pdf">slides</a>, <a href="./files/2013/lecture-6-handouts.pdf">handouts</a>, introduction to using blast, legacy blast and blast+, preparing blast databases, performing a blastn query, formatting blast output, homework 6.
<ul>
<li><a href="ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/">Download blast executables</a></li>
<li>Download: <a href="http://downloads.yeastgenome.org/curation/chromosomal_feature/saccharomyces_cerevisiae.gff">saccharomyces_cerevisiae.gff</a></li>
<li>Book: detailed information on "legacy" blast (published in 2003): <a href="http://www.amazon.com/Blast-Ian-Korf/dp/0596002998">BLAST</a> by Ian Korf, Mark Yandell and Joseph Bedell</li>
</ul>
</li>
<li>Lecture 7 - <a href="./files/2013/lecture-7.pdf">slides</a>, <a href="./files/2013/lecture-7-handouts.pdf">handouts</a>, using blast, formatting databases, using the blastdbcmd, extract sequences, batch operations, formatting blast queries, homework 7.
<ul>
<li>Official Blast documentation: <a href="http://www.ncbi.nlm.nih.gov/books/NBK1763/">BLAST Command Line Applications User Manual</a></li>
</ul>
</li>
<li>Lecture 8 - <a href="./files/2013/lecture-8.pdf">slides</a>, <a href="./files/2013/lecture-8-handouts.pdf">handouts</a>, blast score and E-values, search strategies, usage examples for blastn, blastp, blastx, tblastn, and tblastx, homework 8.
<ul>
<li>Download the <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/data.zip">required dataset</a></li>
<li>Optional: Join the <a href="http://www.getstudyroom.com/">Applied Bioinformatics Study Room</a> find the course called Applied Bioinformatics at Penn State</li>
<li>NCBI guide to BLAST E-Values: <a href="http://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-1.html">The Statistics of Sequence Similarity Scores</a></li>
</ul>
</li>
<li>Lecture 9 - <a href="./files/2013/lecture-9.pdf">slides</a>, <a href="./files/2013/lecture-9-handouts.pdf">handouts</a>, quality encodings, phred scales, the FASTQ format, homework 9.
<ul>
<li>The <a href="http://en.wikipedia.org/wiki/FASTQ_format">FASTQ format wiki</a> wiki page</li>
<li>The <a href="http://en.wikipedia.org/wiki/Phred_score">Phred quality score</a></li>
</ul>
</li>
<li>Lecture 10 - <a href="./files/2013/lecture-10.pdf">slides</a>, <a href="./files/2013/lecture-10-handouts.pdf">handouts</a>, file compression, gzip, zip, bz2, file archives, tarbombs, plotting fastq qualities homework 10.
<ul>
<li>Download the <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/lecture-10.tar.gz">lecture-10.tar.gz</a> dataset</li>
<li>A <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/tarbomb.tar.gz">tarbomb</a>, handle with care ...</li>
<li>The <a href="http://www.bioinformatics.babraham.ac.uk/projects/fastqc/">FastQC</a> toolkit</li>
</ul>
</li>
<li>Lecture 11 - <a href="./files/2013/lecture-11.pdf">slides</a>, <a href="./files/2013/lecture-11-handouts.pdf">handouts</a> installing tools, quality control, adapter trimming, error corrections
<ul>
<li><a href="https://github.com/lh3/seqtk">SeqTK gitHub page</a></li>
<li>Biostar Question of the Day: <a href="http://www.biostars.org/p/53528/">Fastq Quality Control Shootout</a></li>
<li>Package manager <a href="http://brew.sh/">Homebrew for Mac</a></li>
<li>Suggested reading: ${link('illumina')}.</li>
<li>Simple sequence related utilities in the <a href="http://www.bioinformatics.org/sms2/">Sequence Manipulation Suite</a></li>
<li>Software tools for adapter trimming:
<ul>
<li><a href="http://code.google.com/p/cutadapt/">CutAdapt</a> application note in <a href="http://journal.embnet.org/index.php/embnetjournal/article/view/200">Embnet Journal, 2011</a></li>
<li><a href="http://code.google.com/p/ea-utils/">fastq-mcf</a> published in <a href="http://benthamscience.com/open/openaccess.php?tobioij/articles/V007/1TOBIOIJ.htm">The Open Bioinformatics Journal, 2013</a></li>
<li><a href="http://prinseq.sourceforge.net/">PrinSeq</a> application note in <a href="http://www.ncbi.nlm.nih.gov/pubmed/21278185">Bioinformatics, 2011</a></li>
<li><a href="http://www.usadellab.org/cms/index.php?page=trimmomatic">Trimmomatic</a> application note in <a href="http://nar.oxfordjournals.org/content/40/W1.toc">Nucleic Acid Research, 2012, web server issue</a></li>
<li><a href="http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/">Trim Galore</a></li>
<li><a href="http://www.nipgr.res.in/ngsqctoolkit.html">NGS Toolkit</a> published in <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0030619">Plos One, 2012</a></li>
<li><a href="http://hannonlab.cshl.edu/fastx_toolkit/">Fastx Toolkit</a></li>
<li><a href="https://code.google.com/p/biopieces/">BioPieces</a> a suite of programs for sequence preprocessing</li>
<li><a href="https://github.com/vsbuffalo/scythe">Scythe</a> a bayesian adaptor trimmer</li>
<li><a href="http://sourceforge.net/projects/flexbar/">FlexBar, Flexible barcode and adapter removal</a> published in <a href="http://www.mdpi.com/2079-7737/1/3/895">Biology, 2012</a></li>
<li><a href="https://github.com/jstjohn/SeqPrep">SeqPrep</a></li>
<li><a href="http://genome.gsc.riken.jp/osc/english/software/">TagDust</a> published in <a href="http://www.ncbi.nlm.nih.gov/pubmed/19737799">Bioinformatics, 2009</a></li>
<li><a href="http://tagcleaner.sourceforge.net/">TagCleaner</a> published in <a href="http://www.biomedcentral.com/1471-2105/11/341">BMC Bioinformatics, 2010</a></li>
<li>Libraries via R (Bioconductor): <a href="http://www.bioinfo.uh.edu/PIQA/">PIQA</a>, <a href="http://www.bioconductor.org/packages/2.12/bioc/html/ShortRead.html">ShortRead</a></li>
</ul>
</li>
</ul>
</li>
<li>Lecture 12 - <a href="./files/2013/lecture-12.pdf">slides</a>, <a href="./files/2013/lecture-12-handouts.pdf">handouts</a> paired end sequencing, quality control for paired end sequencing, the bioawk language
<ul>
<li>download the dataset for lecture 12: <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/lec12.tar.gz">lec12.tar.gz (20Mb)</a></li>
<li><a href="https://github.com/lh3/bioawk">bioawk GitHub page</a></li>
<li>Paried end aware software tools for adapter trimming:
<ul>
<li><a href="http://www.usadellab.org/cms/index.php?page=trimmomatic">Trimmomatic</a> application note in <a href="http://nar.oxfordjournals.org/content/40/W1.toc">Nucleic Acid Research, 2012, web server issue</a></li>
<li><a href="http://code.google.com/p/ea-utils/">fastq-mcf</a> published in <a href="http://benthamscience.com/open/openaccess.php?tobioij/articles/V007/1TOBIOIJ.htm">The Open Bioinformatics Journal, 2013</a></li>
</ul>
</li>
</ul>
</li>
<li>Lecture 13 - <a href="./files/2013/lecture-13.pdf">slides</a>, <a href="./files/2013/lecture-13-handouts.pdf">handouts</a> paired end sequencing, read stiching, automating tasks with shell scripts
<ul>
<li>for the homework use the dataset for lecture 12: <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/lec12.tar.gz">lec12.tar.gz (20Mb)</a></li>
<li><a href="http://www.usadellab.org/cms/index.php?page=trimmomatic">Trimmomatic</a> application note in <a href="http://nar.oxfordjournals.org/content/40/W1.toc">Nucleic Acid Research, 2012, web server issue</a></li>
<li><a href="http://ccb.jhu.edu/software/FLASH/">Fast Length Adjustment of SHort reads</a></li>
<li><a href="http://musket.sourceforge.net/homepage.htm#latest">Musket - a multistage k-mer spectrum based corrector</a></li>
<li><a href="http://tldp.org/HOWTO/Bash-Prog-Intro-HOWTO.html">Bash programming HOWTO</a></li>
</ul>
</li>
<li>Lecture 14 - <a href="./files/2013/lecture-14.pdf">slides</a>, <a href="./files/2013/lecture-14-handouts.pdf">handouts</a> short read alignments, bwa, bowtie and other tools.
<ul>
<li>official <a href="http://bio-bwa.sourceforge.net/">bwa website</a></li>
<li>the <a href="http://arxiv.org/abs/1303.3997">bwa-mem paper</a> currently on arxiv</li>
<li><a href="https://gist.github.com/ctb/5663625">bwa-mem rejection from Bioinformatics</a> and <a href="http://figshare.com/articles/Review_of_bwa_mem/714096">bwa-mem open review</a></li>
<li><a href="http://en.wikipedia.org/wiki/Heng_Li">Heng Li</a> on wikipedia and on <a href="http://lh3lh3.users.sourceforge.net/">sourceforge</a></li>
<li>The <a href="http://bowtie-bio.sourceforge.net/index.shtml">bowtie1 aligner</a> and the <a href="http://bowtie-bio.sourceforge.net/bowtie2/index.shtml">bowtie2 aligner</a></li>
</ul>
</li>
<li>Lecture 15 - <a href="./files/2013/lecture-15.pdf">slides</a>, <a href="./files/2013/lecture-15-handouts.pdf">handouts</a> the sequence alignment map SAM format
<ul>
<li>dataset for the homework: <a href="http://www.personal.psu.edu/iua1/courses/files/2013/data/lec15.fq.gz">lec15.fq.gz (3.8Mb)</a></li>
<li>official <a href="http://samtools.sourceforge.net/SAMv1.pdf">SAM specification</a></li>
<li>What do flags mean: <a href="http://picard.sourceforge.net/explain-flags.html">Explain SAM flags</a></li>
</ul>
</li>
<li>Lecture 16 - <a href="./files/2013/lecture-16.pdf">slides</a>, <a href="./files/2013/lecture-16-handouts.pdf">handouts</a> the SAM/BAM format, sorting and indexing BAM files, using the samtools program
<ul>
<li>The <a href="http://samtools.sourceforge.net/">Samtools page</a></li>
<li>What do SAM flags mean: <a href="http://picard.sourceforge.net/explain-flags.html">Explain SAM flags</a></li>
</ul>
</li>
<li>Lecture 17 - <a href="./files/2013/lecture-17.pdf">slides</a>, <a href="./files/2013/lecture-17-handouts.pdf">handouts</a> aligning paired end reads, comparing and evaluating aligners, simulating sequencing reads with the wgsim tool
<ul>
<li>The <a href="https://github.com/lh3/wgsim">wgsim github</a> page</li>
<li>The <a href="http://sourceforge.net/apps/mediawiki/dnaa/index.php?title=Whole_Genome_Simulation">dwgsim sourceforge page</a></li>
<li>The <a href="http://www.ebi.ac.uk/goldman-srv/simNGS/">simNGS</a> tool</li>
<li><a href="http://www.ncbi.nlm.nih.gov/pubmed/22199392">ART: a next-generation sequencing read simulator</a> Bioinformatics, 2012</li>
<li><a href="http://www.ncbi.nlm.nih.gov/pubmed/23758764">Benchmarking short sequence mapping tools</a>, BMC Bioinformatics. 2013</li>
<li><a href="http://www.ncbi.nlm.nih.gov/pubmed/22199392">pIRS: Profile-based Illumina pair-end reads simulator</a>, Bioinformatics. 2012</li>
<li><a href="http://www.ncbi.nlm.nih.gov/pubmed/22336055">GemSIM: general, error-model based simulator of next-generation sequencing data</a>, BMC Genomics. 2012</li>
<li><a href="http://www.ncbi.nlm.nih.gov/pubmed/23922726">Optimizing information in Next-Generation-Sequencing (NGS) reads for improving de novo genome assembly</a></li>
</ul>
</li>
<li>Lecture 18 - <a href="./files/2013/lecture-18.pdf">slides</a>, <a href="./files/2013/lecture-18-handouts.pdf">handouts</a> read duplication, visualizing alignments with IGV and IGB
<ul>
<li><a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527383/">FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads</a>, PLoS One. 2012;</li>
<li>The old <a href="http://samtools.sourceforge.net/pileup.shtml">Pileup format</a></li>
<li><a href="http://samtools.sourceforge.net/tview.shtml">Samtools text alignment viewer</a></li>
<li><a href="https://www.broadinstitute.org/igv/">Integrative Genomics Viewer</a> by the Broad Institute, <a href="http://bioviz.org/igb/">Integrated Genome Browser</a> by UNC Charlotte, <a href="http://bioinf.scri.ac.uk/tablet/">Tablet - Next Generation Sequence Assembly Visualization</a></li>
<li><a href="http://www.ncbi.nlm.nih.gov/pubmed/20195257">Visualizing genomes, techniques and challenges</a>, Nat Methods. 2010</li>
<li>Neat visualizations via <a href="https://dnaskittle.com/">DNA Skittle</a></li>
<li>Biostar question of the day <a href="http://www.biostars.org/p/363/">What tools/libraries do you use to visualize genomic feature data?</a></li>
</ul>
</li>
<li>Lecture 19, guest lecture by Nicholas Stoler - <a href="./files/2013/lecture-19.pdf">slides</a>, the variant call format (VCF), calling variants with samtools mpileup
<ul>
<li>The <a href="http://samtools.sourceforge.net/mpileup.shtml">multi sample pileup format</a></li>
<li><a href="http://massgenomics.org/2012/03/5-things-to-know-about-samtools-mpileup.html">5 Things to Know About SAMtools Mpileup</a></li>
<li>The <a href="http://www.1000genomes.org/wiki/Analysis/Variant%20Call%20Format/vcf-variant-call-format-version-41">VCF format 4.1 specification</a></li>
<li>The <a href="http://vcftools.sourceforge.net/VCF-poster.pdf">The Variant Call Format and VCFtools Poster</a></li>
<li>Suggested reading: <a href="http://www.ncbi.nlm.nih.gov/pubmed/21903627">A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.</a>, Bioinformatics, 2011</li>
<li>Biostar question of the day <a href="http://www.biostars.org/p/12964/">How to distinguish heterozygotes and homozygotes from variants in VCF format</a></li>
</ul>
</li>
<li>Lecture 20,- <a href="./files/2013/lecture-20.pdf">slides</a>, <a href="./files/2013/lecture-20-handouts.pdf">handouts</a> origins of genome variations, more on SNP calling, successes and failures
<ul>
<li><a href="http://www.broadinstitute.org/gatk/">Genome Analysis Toolkit</a></li>
<li>Recommended reading: <a href="http://www.genomesunzipped.org/">Genomes Unzipped blog</a>:
<ul>
<li><a href="http://www.sciencemag.org/content/333/6038/53">Widespread RNA and DNA Sequence Differences in the Human Transcriptome</a>, Science, 2011</li>
<li><a href="http://www.genomesunzipped.org/2012/03/questioning-the-evidence-for-non-canonical-rna-editing-in-humans.php">Questioning the evidence for non-canonical RNA editing in humans, part</a></li>
<li><a href="http://www.genomesunzipped.org/2012/04/guest-post-accurate-identification-of-rna-editing-sites-from-high-throughput-sequencing-data.php">Accurate identification of RNA editing sites from high-throughput sequencing data</a></li>
<li><a href="http://www.genomesunzipped.org/2013/10/guest-post-the-perils-of-genetic-risk-prediction-in-autism.php">The perils of genetic risk prediction in autism</a></li>
</ul>
</li>
</ul>
</li>
<li>Lecture 21,- <a href="./files/2013/lecture-21.pdf">slides</a>, <a href="./files/2013/lecture-21-handouts.pdf">handouts</a> interval representation, BED and GFF formats, representing data
<ul>
<li><a href="http://genome.ucsc.edu/FAQ/FAQformat.html">USCS Format FAQ</a></li>
<li><a href="http://www.sequenceontology.org/gff3.shtml">The GFF3 format</a></li>
</ul>
</li>
<li>Lecture 22,- <a href="./files/2013/lecture-22.pdf">slides</a>, <a href="./files/2013/lecture-22-handouts.pdf">handouts</a> interval operations: complement, extension, flanking, Using the BedTools package
<ul>
<li><a href="https://code.google.com/p/bedtools/">Bedtools homepage</a> and the <a href="http://bedtools.readthedocs.org/en/latest/">latest documentation</a></li>
<li><a href="https://code.google.com/p/bedtools/downloads/detail?name=BEDTools-User-Manual.v4.pdf&amp;can=2&amp;q=">Bedtools old pdf documentation</a> contains many details that the new documentation does not</li>
<li>Biostar question of the day <a href="http://www.biostars.org/p/7126/">What are the most common stupid mistakes in bioinformatics?</a></li>
</ul>
</li>
<li>Lecture 23,- <a href="./files/2013/lecture-23.pdf">slides</a>, <a href="./files/2013/lecture-23-handouts.pdf">handouts</a> interval operations: intersect, window, selecting closest features
<ul>
<li><a href="https://code.google.com/p/bedtools/">Bedtools homepage</a> and the <a href="http://bedtools.readthedocs.org/en/latest/">latest documentation</a></li>
<li>The <a href="https://code.google.com/p/bedops/">bedops</a> tool and a <a href="http://www.biostars.org/p/73830/">nice usage example</a></li>
</ul>
</li>
<li>Lecture 24,- <a href="./files/2013/lecture-24.pdf">slides</a>, <a href="./files/2013/lecture-24-handouts.pdf">handouts</a> an introduction to genome assembly, using the velvet assembler, evaluating genome assemblies with QUAST
<ul>
<li>The <a href="http://www.ebi.ac.uk/~zerbino/velvet/">Velvet</a> assembler, see also: <a href="http://genome.cshlp.org/content/18/5/821.short">Velvet: Algorithms for de novo short read assembly using de Bruijn graphs</a></li>
<li>The <a href="http://minia.genouest.org/">Minia</a> assembler, see also <a href="http://minia.genouest.org/#article">Space-efficient and exact de Bruijn graph representation based on a Bloom filter</a>, WABI 2012</li>
<li>The <a href="https://github.com/lh3/fermi.git">Fermi</a> assembler, see also <a href="http://bioinformatics.oxfordjournals.org/content/28/14/1838">Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly</a>, Bioinformatics, 2012</li>
<li><a href="http://bioinf.spbau.ru/QUAST">Quast assembly evaluator</a>, see also <a href="http://bioinformatics.oxfordjournals.org/content/29/8/1072.abstract">QUAST: quality assessment tool for genome assemblies</a></li>
</ul>
</li>
<li>Lecture 25,- <a href="./files/2013/lecture-25.pdf">slides</a>, <a href="./files/2013/lecture-25-handouts.pdf">handouts</a>, <a href="./files/2013/data/meta.tar.gz">meta.tar.gz (25MB)</a> an introduction to metagenomics, software packages mothur, QIIME and MetaSim, online tools RDP, MG-RAST
<ul>
<li><a href="./files/2013/lecture-25-how-many-bacteria.pdf">So how many bacteria are in the human body</a></li>
<li><a href="http://www.mothur.org/wiki/Main_Page">the mothur package</a></li>
<li><a href="http://qiime.org/">QIIME: Quantitative Insights Into Microbial Ecology</a></li>
<li><a href="http://ab.inf.uni-tuebingen.de/software/megan/">MEGAN: the Metagenome Analyzer</a></li>
<li><a href="http://ab.inf.uni-tuebingen.de/software/metasim/">MetaSim: A Sequencing Simulator for Genomics and Metagenomics</a></li>
<li><a href="http://sourceforge.net/projects/rdp-classifier/">RDP: classifier</a></li>
</ul>
</li>
<li>Lecture 26,- <a href="./files/2013/lecture-26.pdf">slides</a>, <a href="./files/2013/lecture-26-handouts.pdf">handouts</a>, <a href="./files/2013/data/lec26.tar.gz">lec26.tar.gz (25MB)</a> an introduction to Chip-Seq technology, peak calling concepts, preprocessing and peak calling methods (part 1)
<ul>
<li>Search google for "Introduction to Chip-Seq", quite a few resources</li>
<li>Recommended reading <a href="http://www.nature.com/nrg/series/nextgeneration/index.html">Applications of next-generation sequencing</a> (Nature, resources)</li>
<li><a href="https://github.com/ialbert/bioawk-tools">the bioawk-tools</a> utilities</li>
</ul>
</li>
<li>Lecture 27,- <a href="./files/2013/lecture-27.pdf">slides</a>, <a href="./files/2013/lecture-27-handouts.pdf">handouts</a>, Chip-Seq peak calling sofware, preprocessing and peak calling methods (part 2)
<ul>
<li>The peak caller used in the Encode Project <a href="http://liulab.dfci.harvard.edu/MACS/">Model-based Analysis for ChIP-Seq</a> and <a href="http://www.ncbi.nlm.nih.gov/pubmed/22936215">Identifying ChIP-seq enrichment using MACS</a> in Nature Protocols, 2012</li>
<li><a href="http://sissrs.rajajothi.com/">Site Identification from Short Sequence Reads</a></li>
<li><a href="http://www.nature.com/nprot/journal/v7/n8/full/nprot.2012.088.html">A complete workflow for the analysis of full-size ChIP-seq (and similar) data sets using peak-motifs</a>, Nature Protocols, 2012</li>
</ul>
</li>
<li>Lecture 28,- <a href="./files/2013/lecture-28.pdf">slides</a>, <a href="./files/2013/lecture-28-handouts.pdf">handouts</a>, <a href="./files/2013/data/lec28.tar.gz">lec28.tar.gz</a> basic RNA-Seq data analysis concepts, split read mapping
<ul>
<li>The Tuxedo suite: <a href="http://bowtie-bio.sourceforge.net/index.shtml">Bowtie</a>, <a href="http://tophat.cbcb.umd.edu/">Tophat</a>, <a href="http://cufflinks.cbcb.umd.edu/">Cufflinks</a></li>
<li>The <a href="http://www.ebi.ac.uk/goldman-srv/rlsim/">rlsim simulation package</a> and related paper <a href="http://arxiv.org/abs/1308.3172">Realistic simulations reveal extensive sample-specificity of RNA-seq biases</a></li>
<li>The <a href="http://sammeth.net/confluence/display/SIM/Home">FLUX read simulator</a> and related paper: <a href="http://www.ncbi.nlm.nih.gov/pubmed/22962361">Modelling and simulating generic RNA-Seq experiments with the flux simulator</a> NAR, 2012</li>
</ul>
</li>
<li>Lecture 29, <a href="./files/2013/lecture-29.pdf">slides</a>, <a href="./files/2013/lecture-29-handouts.pdf">handouts</a>, <a href="./files/2013/data/lec29.tar.gz">lec29.tar.gz</a> RNA-Seq (part 2)
<ul>
<li><a href="http://blog.nextgenetics.net/?e=51">BLOG: RPKM measure is inconsistent among samples</a>, Damian Kao provides a simple and logical description of a paper that talks about the problems with RPKM</li>
<li><a href="http://www.youtube.com/watch?v=5NiFibnbE8o">CSHL Keynote 2013; Dr. Lior Pachter, UC Berkeley</a> TLDR: use TPM (Transcript per million) instead of RPKM, FPKM</li>
</ul>
</li>
<li>Lecture 30, <a href="./files/2013/lecture-30.pdf">slides</a>, <a href="./files/2013/lecture-30-handouts.pdf">handouts</a>, bioinformatics beyond the command line: using R for data analysis
<ul>
<li>The <a href="http://www.bioconductor.org/">Bioconductor package</a></li>
<li><a href="http://nsaunders.wordpress.com/2010/08/20/a-brief-introduction-to-apply-in-r/">A brief introduction to "apply" in R</a></li>
<li><a href="http://had.co.nz/">Hadley Wickham's homepage</a></li>
<li><a href="http://ggplot2.org/">The ggplot2 visualiztion library</a></li>
</ul>
</li>
<li>Final Project 30, <a href="./files/2013/my-little-pony.pdf">final-project</a>, data for final project <a href="./files/2013/data/pony.tar.gz">pony.tar.gz (17Mb)</a> BMB 597D: Final project, 50% of the final grade, due 5pm Saturday Dec 14th, 2013</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://www.bio-info-trainee.com/689.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
